Two extreme cases of electron charging effects can be solved: the first ignores
electron correlation, and the second treats only one-electron scattering. So far,
Coulomb blockade has been treated by the orthodox theory [Likharev, IBM J. Res.
Dev. 32, 144 (1988)], where the charging energy is expressed as $e^2/2C$ using the
capacitance $C$ of a tunnel junction. On the other hand, one-electron scattering
can be solved exactly by using the Schrödinger equation, but it is difficult to
solve a many-body system with the Schrödinger equation. In this article, stress is
placed on the case where the electric field can no longer be ignored. A treatment
of the charging energy is proposed using a screening model or an electromagnetic
field model that employs a cavity model of the kind used for photons. © 1996
American Vacuum Society.


I.  EXPERIMENTS

It is evident that the small central electrodes of a double tunnel junction
whose sizes are on the atomic scale show a Coulomb staircase, as discussed below.
The liquid crystal (LC) molecule 4'-n-alkyl-4-cyanobiphenyl (7CB) was used as the
central electrode of a double tunnel junction (Fig. 1), where a scanning tunneling
microscope (STM) tip acts as one outer electrode, and the junction shows Coulomb
blockade [Figs. 2(a) and 2(b)]. A fullerene molecule was also used as the central
electrode of another double tunnel junction, and it also shows Coulomb blockade
[Fig. 2(a)]. Other molecules, 1,7-(CH3)2-1,2-C2B10H9Tl(OCOCF3)2 in a
Langmuir-Blodgett (LB) film [Fig. 2(c)] and octasubstituted
metallophthalocyanine (PcPd) between electrodes [Fig. 2(d)], also show a Coulomb
staircase.


II.  ORTHODOX  COULOMB  BLOCKADE  MODEL 

Figures 2(a)-2(d) show the calculated I-V characteristics (solid lines). They
were calculated based on the orthodox theory, where the charging energy is
described as

$$E_c = \frac{e^2}{2C}, \tag{1}$$

where $C$ is the sum of the capacitances of a double tunnel junction. Coulomb
blockade is observed when the tunnel resistance $R$ is greater than the quantum
resistance $R_Q \sim h/e^2$ and $e^2/2C$ is also greater than the thermal energy
$k_B T$. The orthodox theory treats the case where the number of electrons in the
central electrode is so great that the electrons in the central electrode can be
regarded as noninteracting and the charging can be treated as that of a capacitor.
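The two conditions quoted above can be checked numerically. The following is a minimal sketch, not the authors' program: the function names are hypothetical, and the quantum resistance is assumed to be $h/e^2$ (other conventions differ by a numerical factor).

```python
# Physical constants in SI units.
E_CHARGE = 1.602176634e-19   # elementary charge, C
K_B = 1.380649e-23           # Boltzmann constant, J/K
H_PLANCK = 6.62607015e-34    # Planck constant, J s


def charging_energy_eV(capacitance_F):
    """Charging energy e^2/(2C) in eV (numerically equal to e/(2C) in volts)."""
    return E_CHARGE / (2.0 * capacitance_F)


def blockade_expected(capacitance_F, resistance_ohm, temperature_K):
    """Return True when R > R_Q and e^2/(2C) > k_B*T both hold.

    Assumption: R_Q is taken as h/e^2 (about 25.8 kOhm).
    """
    r_q = H_PLANCK / E_CHARGE**2
    e_c_joule = E_CHARGE**2 / (2.0 * capacitance_F)
    return resistance_ohm > r_q and e_c_joule > K_B * temperature_K


if __name__ == "__main__":
    c_total = 1e-19  # hypothetical total capacitance, F
    print(charging_energy_eV(c_total), blockade_expected(c_total, 1e9, 300.0))
```

With a hypothetical total capacitance of the order of $10^{-19}$ F the charging energy is a sizable fraction of an electron volt, which is why a staircase can survive at 300 K, whereas a junction in the femtofarad range fails the thermal condition at room temperature.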


III.  ONE-ELECTRON  SOLUTION  (RESONANT 
TUNNELING) 


Another extreme case treats only one electron; the scattering of one electron
by a potential can then be solved exactly with the Schrödinger equation. In this
approach, the one-particle one-dimensional Schrödinger equation is solved with
piecewise constant potentials $U(x)$ and $V(x)$ and effective mass $m(x)$ (for
example, see Fig. 3):

$$i\hbar\frac{\partial\Psi}{\partial t} = \left[-\frac{\hbar^2}{2}\frac{\partial}{\partial x}\left(\frac{1}{m}\frac{\partial}{\partial x}\right) + \left(U + V\cos\omega t\right)\right]\Psi. \tag{2}$$


The solution of this equation obeys the condition of time periodicity of
$|\Psi|^2$:

$$\Psi(\varepsilon, t+T) = \exp\left(-\frac{i\varepsilon T}{\hbar}\right)\Psi(\varepsilon, t), \qquad 0<\varepsilon<\hbar\omega, \tag{3}$$

where $\varepsilon$ is the quasienergy. In every interval of constancy of $U$,
$V$, and $m$, this solution looks like

(4)

where $\alpha$ is the index of the interval, $\varepsilon_m = \varepsilon + m\hbar\omega$,
and the wave vector

a) Electronic mail: nejo@nrim.go.jp


Fig.  1.  Schematic  configuration  of  a  7CB  LC  molecule  on  a  substrate  and 
under  an  STM  tip  which  forms  a  double  tunnel  junction. 


Figure 4 shows $|\Psi|^2$ at quasilevel $\varepsilon_1$, i.e., for incident
electron energy $E_0=\varepsilon_1$. It is seen that this quasilevel is the
second level in the well (in the limit of infinitely thick barriers). Figure 5
shows $|\Psi|^2$ at quasilevel $\varepsilon_1$ when electrons flow from the side
of the thinner barrier (horizontally flipped potential of Fig. 3). The
probability density in the well becomes so large that accounting for
electron-electron repulsion becomes important.
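To make the one-electron picture concrete, the sketch below computes the transmission through a piecewise-constant potential with a standard transfer-matrix method. It is an illustration under simplifying assumptions that are not the paper's full model: the static limit ($V=0$, no oscillating term), a constant mass, and units in which $\hbar^2/2m=1$. The barrier geometry in the usage part is hypothetical.

```python
import numpy as np


def interface_matrix(k, x):
    """Rows hold psi and psi' (up to a common factor i) for the basis
    exp(+ikx), exp(-ikx) evaluated at position x."""
    return np.array([[np.exp(1j * k * x), np.exp(-1j * k * x)],
                     [k * np.exp(1j * k * x), -k * np.exp(-1j * k * x)]])


def transmission(energy, edges, potentials):
    """Transmission probability through a piecewise-constant potential.

    edges      : positions of the N interfaces (increasing)
    potentials : N+1 potential values, one per region
    Assumes hbar^2/(2m) = 1, constant mass, and energy above the potential
    in the first and last regions (energy must not equal any potential).
    """
    k = np.sqrt(energy - np.asarray(potentials, dtype=complex))
    total = np.eye(2, dtype=complex)
    for j, x in enumerate(edges):
        # Continuity of psi and psi' at x links the amplitudes of region j
        # to those of region j+1.
        total = total @ np.linalg.solve(interface_matrix(k[j], x),
                                        interface_matrix(k[j + 1], x))
    # No wave is incident from the right, so t = 1 / total[0, 0].
    return (k[-1].real / k[0].real) / abs(total[0, 0]) ** 2


if __name__ == "__main__":
    # Hypothetical asymmetric double barrier: thick barrier, well, thin barrier.
    edges = [0.0, 1.0, 3.0, 3.5]
    potentials = [0.0, 2.0, 0.0, 2.0, 0.0]
    energies = np.linspace(0.05, 1.95, 400)
    t_values = np.array([transmission(e, edges, potentials) for e in energies])
    print("energy of maximum transmission:", energies[np.argmax(t_values)])
```

Peaks of the transmission as a function of energy mark the quasilevels of the well; the time-periodic case treated in the text additionally couples the sidebands $\varepsilon + m\hbar\omega$, which this sketch does not attempt.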
Fig. 3. One-dimensional potential profile to model an LC molecule sketched
in Fig. 1.

Fig. 2. I-V curves calculated within the orthodox Coulomb blockade model
(Ref. 7) with use of our programs (Ref. 10). (a) For a C60 molecule on Pd
(left axis, Coulomb gap 1 eV) and for a monolayer of 7CB LC molecules on
graphite (right axis, Coulomb gap 0.4 eV) under an STM tip at 300 K.
Experimental values are marked with circles [from Ref. 4, Fig. 1(a)] and
squares (from Ref. 2, Fig. 1). (b) For a monolayer of 7CB LC molecules on
a ZrO2/Pt substrate at 300 K with corresponding experimental values from
Ref. 3, Figs. 5A (circles) and 5D (squares). Curves are shifted for clarity.
(c) For 1,7-(CH3)2-1,2-C2B10H9Tl(OCOCF3)2 clusters in an LB film on graphite
under an STM tip at 300 K [from Ref. 5, Fig. 2, curves 1 (circles) and 2
(squares)]. (d) I-V and dI/dV curves for a monolayer of PcPd molecules
between two crossed gold stripes at 4.2 K. Corresponding experimental
values from Ref. 6, Figs. 5(a) and 5(b), are marked with empty (dI/dV) or
filled (I-V) circles.

Fig. 4. $|\Psi|^2$ at quasilevel $\varepsilon_1$, i.e., for incident electron
energy $E_0=\varepsilon_1$. It is seen that this quasilevel is the second level
in the well (in the limit of infinitely thick barriers).

Fig. 5. $|\Psi|^2$ at quasilevel $\varepsilon_1$, when electrons flow from the
side of the thinner barrier (horizontally flipped potential of Fig. 3). The
probability density in the well becomes so large that accounting for the
electron-electron repulsion becomes important.

IV.  SMALL  TUNNEL  JUNCTIONS

When the size of the central electrode becomes small, the charging energy of
a double tunnel junction can be described as the charging energy of the
central electrode:


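$$E = \frac{(n_1-n_2)^2 e^2}{2C} + \cdots, \tag{7}$$

[the remaining term of Eq. (7) is illegible in the source]

where $n_1$ and $n_2$ are the numbers of electrons that have passed through
tunnel junctions 1 and 2, respectively, and $C=C_1+C_2$, where $C_1$ and $C_2$
are the capacitances of tunnel junctions 1 and 2, respectively.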

When the battery supplies the energy difference between $n$ and $n+1$
electrons in the central electrode, the $(n+1)$th electron can tunnel to the
central electrode:

$$eV = \frac{(n+1)^2 e^2}{2C} - \frac{n^2 e^2}{2C}. \tag{8}$$
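Expanding the right-hand side of Eq. (8) gives $eV_n = (2n+1)e^2/2C$, so within the capacitor model successive thresholds are equally spaced by $e/C$. A small sketch of this arithmetic follows; the function names and the capacitance value are illustrative assumptions.

```python
E_CHARGE = 1.602176634e-19  # elementary charge, C


def threshold_voltage(n, capacitance_F):
    """Voltage at which the (n+1)th electron can enter, from Eq. (8):
    e*V_n = ((n+1)^2 - n^2) * e^2 / (2C) = (2n+1) * e^2 / (2C)."""
    return (2 * n + 1) * E_CHARGE / (2.0 * capacitance_F)


def step_spacing(capacitance_F):
    """Spacing between consecutive thresholds, e/C."""
    return E_CHARGE / capacitance_F


if __name__ == "__main__":
    c_total = 4e-19  # hypothetical total capacitance C1 + C2, in farads
    for n in range(3):
        print(n, threshold_voltage(n, c_total))
```

The constant spacing is specific to a fixed capacitance $C$; it is exactly this feature that the screening and electromagnetic-energy arguments of Sec. IV A call into question for very small central electrodes.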


A.  Screening  model 

When the central electrode consists of a molecule or a small metal droplet,
it is itself a dielectric material, or at least it is polarized under the high
electric field (typically 10^ V/m). The lines of electric force start from the
tip (tip positive); once they reach the negative pole of the molecule, they
start again from the positive pole and reach the substrate (substrate negative)
(Fig. 6). The electron tunnels from the substrate to the positive area of the
molecule. When the polarity of the tip and the substrate is reversed, the
electron again tunnels to the positive area of the molecule.


Fig.  6.  The  dielectric  material  in  the  tunnel  gap.  The  lines  of  electric  force 
start  from  the  tip  (tip  positive)  and  once  they  reach  the  negative  pole  of  the 
molecule,  they  start  from  the  positive  pole  again  and  reach  the  substrate 
(substrate  negative).  The  LC  molecular  surface  potential  image  (without 
including  the  electric  field  effect)  is  shown.  The  nitrogen  atom  and  the 
center  of  the  biphenyl  group  have  negative  potential,  and  hydrogen  atoms 
and carbon atoms of the biphenyl group have positive potential. The color
scale for the numerical potential value P is: green, P < -15.0 kcal/mol; blue,
-15.0 < P < -9.0; light blue, -9.0 < P < -3.0; yellow, -3.0 < P < 3.0; violet,
3.0 < P < 9.0; red, 9.0 < P.


The potential of $n$ electrons $\varphi_0(r)$ without any screening is
expressed as

$$\varphi_0(r) = \frac{ne}{r}. \tag{9}$$

In the positive gas, the potential of $n$ electrons is screened and the
effective potential $\varphi(r)$ is written as

$$\nabla^2\varphi(r) = -4\pi n e\,\delta(r) + 4\pi e\langle\delta\rho(r)\rangle, \tag{10}$$

where $\delta(r)$ is a delta function and $-e\langle\delta\rho(r)\rangle$ is the
induced charge density. Then the charging energy $(n+1)^2e^2/2C$ of the
capacitor model should be replaced by $\varphi(r)e/2$, and this quantity
reflects the screening effect, so the energy may change with the number of
electrons in the central electrode.
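To illustrate how an induced charge density changes the potential in Eq. (10), the sketch below assumes a linear Thomas-Fermi response, $4\pi e\langle\delta\rho(r)\rangle = k_{TF}^2\varphi(r)$, for which the solution is the familiar screened form $\varphi(r) = (ne/r)\exp(-k_{TF}r)$. This closure is an assumption made here for illustration only; the source does not specify the response, and Sec. IV B argues that Thomas-Fermi screening fails for small electrodes. Units are Gaussian with $e=1$, and the function names are hypothetical.

```python
import numpy as np


def bare_potential(r, n):
    """Unscreened potential of n electrons, Eq. (9), with e = 1."""
    return n / np.asarray(r, dtype=float)


def screened_potential(r, n, k_tf):
    """Screened potential under the ASSUMED linear Thomas-Fermi response:
    phi(r) = (n / r) * exp(-k_tf * r), with e = 1."""
    r = np.asarray(r, dtype=float)
    return n / r * np.exp(-k_tf * r)


if __name__ == "__main__":
    radii = np.array([0.5, 1.0, 2.0])   # distances in arbitrary length units
    k_tf = 1.0                          # hypothetical inverse screening length
    print(bare_potential(radii, 1))
    print(screened_potential(radii, 1, k_tf))
```

The point of the comparison is qualitative: once the potential is screened, the energy $\varphi(r)e/2$ no longer grows as $(n+1)^2$ the way the capacitor expression does.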

Alternatively, the charging energy can be expressed as the electromagnetic
energy of the tunnel junctions as

$$\int_V \mathbf{E}\cdot d\mathbf{D}, \tag{11}$$

where E is the electric field, D is the electric displacement, and the
integral is taken over the tunnel gap space V. From the viewpoint of
electromagnetic energy, the next electron cannot enter the central electrode
until the electromagnetic field is charged up. Also, the preceding electron
should not escape the central electrode before the electromagnetic field is
charged up. The former condition is equivalent to the condition $e^2/2C>eV$
and the latter to $R>R_Q$ for observing Coulomb blockade.

This electromagnetic energy varies with increasing number of electrons in the
central electrode, but it does not increase in units of $e/C$, since the
electromagnetic field is not quantized in units of $e/C$.
B.  Hartree-Fock  calculation

Furthermore, the exact solution of a many-body system is obtained by ab
initio calculation. By ignoring electron correlation, the total energy increase
of the central electrode caused by adding one electron can be approximated
using the Hartree-Fock method, and the condition for the $(n+1)$th electron to
be able to enter is

$$eV = E(n+1) - E(n). \tag{12}$$

But the Hartree-Fock calculation of an isolated molecule gives the energy
difference $E(n+1)-E(n)$ as 10 eV for 7CB, which is much larger than the
experimental values of 0.1-0.2 eV. This brings us to the consideration of
correlation energy.

The number of electrons ($N_{TF}$) within a radius of the Thomas-Fermi
length is

$$N_{TF} = \frac{4\pi n}{3k_{TF}^3} = \frac{\pi}{12\,r_s^{3/2}}, \tag{13}$$

where $n$ is the number density of electrons, $k_{TF}$ is the Thomas-Fermi
wave number, and

$$r_s = \left(\frac{3}{4\pi n}\right)^{1/3}\frac{1}{a_0}, \tag{14}$$

with $a_0$ the Bohr radius. Since $r_s>1$ for a small metallic central
electrode, Thomas-Fermi screening does not work anymore. Now, we have to
consider the strong correlation case. But since the extent of correlation
energy is 1 eV at most, the calculated energy with correlation energy still
does not match the experimental values. This would show that the most
important factor for increasing the charging of a molecule between two outer
electrodes is the screening as described in Sec. IV A.

V.  CONCLUSION

We and other groups have shown that a Coulomb staircase can be observed even
for atomic scale central electrodes or molecules. Application of the orthodox
theory is too crude for such a small number of electrons. On the other hand,
one-electron scattering by potentials can be solved exactly, but it cannot
include the charging effects. Up to now, one of the suitable approximations
considers the screening effect. This way of treatment should be further
improved.

ACKNOWLEDGMENT

The authors thank H. Hori for useful discussions.
The operation of the scanning tunneling microscope (STM) in the near field
emission regime provides a direct, noninvasive approach for investigating
surfaces at the nanometer scale. Here, we present a study of the current
characteristics in a near field emission STM. The influence of the tip geometry
is analyzed. We show that the electron field emission from the sample is stable
against tip-shape changes due to adsorbate diffusion or atomic rearrangements.
© 1996 American Vacuum Society.


I.  INTRODUCTION 

The scanning tunneling microscope (STM) has become the most versatile tool
for imaging and manipulation of surfaces of conducting materials. In the most
common operation mode, the STM tip is scanned at a constant current of the
order of nA and at applied voltages well below the tip and sample work
functions. This implies typical tip-sample distances of the order of, or
smaller than, 1 nm. While this has the advantage of high lateral resolution,
the close proximity of probe and sample may induce unwanted modifications of
the interface. This phenomenon is particularly severe when working with soft
materials, such as molecular films or biomolecules. One way to overcome these
problems is to operate the STM in the near field emission regime. By using
relatively high voltages and very small currents, the working distance can be
of the order of a few nanometers. This minimizes the probe-sample interaction
forces. Biological membranes adsorbed on conducting surfaces have been
successfully observed following this approach. On the other hand, low voltage
and high current conditions can be used to modify and manipulate biological
material.

The performance of a near field emission STM as an imaging tool has been
discussed recently. By using a simple model for the probe-sample interface, we
showed that it is possible to obtain subatomic vertical resolution and a
lateral resolution of a few nanometers at working intensities of the order of
pA. The best performance is obtained when the electrons are emitted from the
sample. This is in contrast with earlier field emission approaches like the
"topografiner," where electrons were always emitted from a sharp tip. The high
energy of the electrons impinging on the sample could modify it. While the
sample damage induced by electron impact is a severe drawback in the imaging
mode, it has, in contrast, important technological applications. The low
energy electron beam generated in the near field emission regime can be used
for high resolution lithographic patterning in radiation sensitive materials.
On the other hand, atomic-scale desorption induced by electrons field emitted
from an STM tip has been reported recently.

Three regimes can be defined in the current ($I$) versus voltage ($V$)
curves of a metal-insulator-metal junction: tunneling, when the applied voltage
is lower than the work function $\varphi$; the intermediate region of voltages
($eV\gtrsim\varphi$), called the near field emission regime; and the standard
field emission or Fowler-Nordheim regime for high voltages ($eV\gg\varphi$).

In this work, we describe the current characteristics in near field emission
STM. We study the dependence of the distance-voltage (S-V) curves on the tip
geometry. The stability of the current is also analyzed. When the emission
comes from the tip, adsorbates that may appear on the tip would produce
instabilities in the emission characteristics. This is due to changes in the
work function and in the field very close to the protrusion. However, the
changes in the field decay very fast with the distance from the protrusion (at
least in the far field regime). It has been argued that these have a negligible
effect on the field at the sample surface. As we will see, electric field
calculations show that this is indeed the case. The instabilities could then be
avoided if the polarity is reversed, i.e., if the electrons are field emitted
from the sample.

II.  DISTANCE  VERSUS  VOLTAGE  IN  THE 
CONSTANT  CURRENT  MODE 

To calculate the total current flowing between tip and sample, it is
necessary to determine the current densities on the surface of the tip. For
this we need an accurate evaluation of the potential and field distributions
around the emitter surface. Sphere-plane and hyperboloidal analytical models
have been used to estimate the electrostatic field in the near field emission
regime. The numerical solution of the Laplace equation for different tip shapes
and tip-sample distances shows that these estimations are only valid for
tip-sample distances smaller than ≈1/2 the tip radius. To develop a more
realistic approach, we have modeled the tip as a truncated cone with a
spherical apex characterized by the curvature radius $R$ at the tip apex and
the cone half angle $\theta_c$ (see Fig. 1). For each tip-sample distance $S$,
the electrostatic potential $V(\mathbf{r})$ as well as the classical image
potential are calculated numerically (typical equipotential lines are shown in
Fig. 1).

Fig. 1. Schematic view of the potential energy configuration. The upper part
shows typical electrostatic equipotential lines. The shadowed region
corresponds to the three-dimensional tunneling barrier. The corresponding
energy diagram along the tip axis is also shown.

Once the potential distribution is known, we calculate the total current
intensity $I$ by using the following semiclassical approach. The current
density $J(\mathbf{r}_s)$ at each point $\mathbf{r}_s$ of the emitter surface
is obtained from a one-dimensional model,

$$J(\mathbf{r}_s) = \frac{em}{2\pi^2\hbar^3}\left[eV_0\int_0^{E_F-eV_0} T(E_n,\mathbf{r}_s)\,dE_n + \int_{E_F-eV_0}^{E_F}(E_F-E_n)\,T(E_n,\mathbf{r}_s)\,dE_n\right], \tag{1}$$

where $E_n$ is the normal energy in the tunneling direction, $E_F$ is the
Fermi energy, $V_0$ is the applied voltage, and $T(E_n,\mathbf{r}_s)$ is the
tunneling probability. For simplicity, we assume that both tip and sample have
the same Fermi energy $E_F$ and work function $\varphi$ ($E_F=8$ eV and
$\varphi=4.7$ eV). The $T(E_n,\mathbf{r}_s)$ associated with each emitting
point is calculated within the Wentzel-Kramers-Brillouin approximation

$$T(E_n,\mathbf{r}_s) \approx \exp\left\{-2\int_{\eta_1}^{\eta_2}\left(\frac{2m}{\hbar^2}\right)^{1/2}\left[U(\eta)-E_n\right]^{1/2}d\eta\right\}, \tag{2}$$

where $d\eta$ is the differential length along the field line normal to the
surface at $\mathbf{r}_s$ and the tunneling barrier $U(\eta)$ is the exact
potential calculated along this field line.

Fig. 2. Tip-sample distance vs applied voltage at constant current I for
different I values. In the field emission regime the current is controlled by
the electric field. Consequently, the current increases with decreasing tip
radius if the emission is from the tip (positive bias). The opposite happens
when the polarity is reversed. The intensity range is between 0.1 μA and
0.1 pA.

$\eta_1(E_n)$ and $\eta_2(E_n)$ are the classical turning points at a given
energy $E_n$. In Fig. 1, we show schematically the potential energy
configuration used in the calculations. In a previous work, we had used a mean
barrier approximation for the tunneling probability. However, this is only
valid for relatively thick barriers. Here we have calculated numerically
$T(E_n)$ given by Eq. (2). Finally, the total intensity is obtained after
integration of $J(\mathbf{r}_s)$ over the emitter surface.
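The numerical evaluation of the WKB tunneling probability can be sketched as follows. This is a minimal illustration, not the authors' code: the barrier is sampled on a one-dimensional grid along a field line, the classically allowed parts are dropped by clipping (which locates the turning points implicitly), and the example barrier is a hypothetical image-free triangular barrier in the planar limit rather than the exact potential used in the paper.

```python
import numpy as np

HBAR = 1.054571817e-34   # reduced Planck constant, J s
M_E = 9.1093837015e-31   # electron mass, kg
EV = 1.602176634e-19     # one electron volt in joules


def wkb_transmission(e_n_eV, barrier_eV, eta_m):
    """WKB tunneling probability exp(-2 * integral of kappa d_eta).

    e_n_eV     : normal energy E_n in eV
    barrier_eV : samples of the barrier U(eta) in eV
    eta_m      : positions along the field line in metres (same length)
    Only points with U > E_n contribute to the integral.
    """
    excess_joule = np.clip(np.asarray(barrier_eV) - e_n_eV, 0.0, None) * EV
    kappa = np.sqrt(2.0 * M_E * excess_joule) / HBAR
    # Trapezoidal rule written out explicitly.
    integral = np.sum(0.5 * (kappa[1:] + kappa[:-1]) * np.diff(eta_m))
    return np.exp(-2.0 * integral)


if __name__ == "__main__":
    e_f, phi = 8.0, 4.7          # values quoted in the text, in eV
    gap, bias = 3e-9, 10.0       # hypothetical gap (m) and applied voltage (V)
    eta = np.linspace(0.0, gap, 20001)
    barrier = e_f + phi - bias * eta / gap   # assumed planar, image-free barrier
    print(wkb_transmission(e_f, barrier, eta))
```

For the triangular barrier the result can be compared with the closed-form Fowler-Nordheim exponent, $(4/3)\sqrt{2m}\,\varphi^{3/2}/(\hbar eF)$, which is a convenient sanity check before moving to a numerically computed potential.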

The dependence of the tip-sample distance $S$ on the applied voltage $V$ for
two different tip radii and different set currents is presented in Fig. 2.
Positive bias corresponds to electron emission from the tip. The qualitative
behavior of the S-V plot at constant current can be understood by using a
simple argument. A constant current experiment in near field emission
($eV>\varphi$) implies an approximately constant field at the emitter surface.
When the gap distance becomes comparable to the tip radius, there is a clear
asymmetry in the curves because the field is always higher on the tip than on
the sample [see, e.g., the results for R = 5 nm, Fig. 2(a)]. When the distance
is much smaller than the tip radius, the potential drops linearly in the gap
region. As a consequence, the results are almost independent of the polarity
[see Fig. 2(b)]. It is worth noticing that, because of the semiclassical
approach used, these results do not take into account the influence of field
emission resonances. These should appear as small quantum oscillations
superimposed on the semiclassical result.

Fig. 3. S/R vs the k factor, k = V/(FR), corresponding to the emission from
the tip and from the sample. Full symbols correspond to a smooth tip. Open
symbols reflect the changes in k due to the appearance of a small
hemispherical protrusion ($R_b = 0.05R$) on the tip apex. The inset shows the
influence of a bump on the equipotential lines for a distance of 0.6R. As can
be seen, the equipotentials close to the sample are almost unaffected by the
change of the tip apex geometry.

III.  CURRENT  STABILITY  IN  NEAR  FIELD 
EMISSION  STM 

In this section, we discuss the current stability against adsorbates or
atomic rearrangements on the tip apex. The electric field $F$ at the tip apex
can be written as $F=V/(kR)$, where $R$ is the radius of curvature of the tip
and $k$ is a geometric factor. Since $k=V/(FR)$ is proportional to the applied
voltage, the S vs k plot gives the dependence of the tip-sample distance on the
applied voltage in a constant current (constant field) near field emission
experiment. In Fig. 3, we present a comparison between the S-k curves for
emission from the tip (full triangles) and from the sample (full circles)
surfaces. For distances S/R<1, the voltage drops linearly within the
tip-sample gap, and the k factor is linear with S. At distances of the order of
≈0.2R, k already deviates from the linear behavior. This is the same behavior
we have obtained in Fig. 2.
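The relation $F=V/(kR)$ can be turned into a few lines of arithmetic. The sketch below covers only the planar limit, where the linear voltage drop gives $F=V/S$ and therefore $k=S/R$; this limit is an assumption valid for gaps much smaller than the tip radius, and the function names and numbers are hypothetical. Beyond that range $k$ has to come from a numerical field calculation such as the one behind Fig. 3.

```python
def apex_field(voltage_V, k_factor, tip_radius_m):
    """Field at the emitter from F = V / (k R), in V/m."""
    return voltage_V / (k_factor * tip_radius_m)


def k_planar_limit(gap_m, tip_radius_m):
    """Geometric factor in the planar limit S << R, where F = V / S."""
    return gap_m / tip_radius_m


def voltage_for_field(field_V_per_m, k_factor, tip_radius_m):
    """Voltage needed to hold a given field: V = F k R."""
    return field_V_per_m * k_factor * tip_radius_m


if __name__ == "__main__":
    radius, gap = 10e-9, 1e-9            # hypothetical tip radius and gap, m
    k = k_planar_limit(gap, radius)
    print(k, apex_field(5.0, k, radius))  # field equals V/S in this limit
```

At constant field, a sudden change of $k$ at fixed voltage has to be compensated by the feedback loop through the gap $S$, which is the mechanism behind the stability argument that follows.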

The presence of a small bump on the tip can produce a dramatic change in the
current characteristics. In Fig. 3, we present the changes in k on the tip
(open circles) and on the sample (open triangles) due to the sudden appearance
of a small protrusion of radius $R_b = 0.05R$. The field is strongly enhanced
in the vicinity of the bump while it is almost unaffected at the sample surface
(see inset in Fig. 3). This effect has important consequences in the
performance of the STM in the near field emission regime. In an STM experiment
in the constant current mode at fixed applied voltage, the feedback loop would
try to keep the field constant by changing the tip-sample distance. Let us
assume a typical STM tip of radius, say, ≈10 nm and a working distance of
≈3 nm. When electrons are field emitted from the tip, the appearance of an
atomic-size bump ($R_b \approx 0.5$ nm) on the tip apex would lead the tip to
retract more than 10 nm in order to keep constant field and voltage (i.e.,
constant k). On the other hand, the emission from the sample is almost
unaffected.


IV.  CONCLUSIONS 

We have studied the current characteristics in near field emission STM by
using a realistic model of the tip-sample interface. We have also studied the
effects due to the presence of a small protrusion on the tip apex. We have
shown that the field on the sample surface is almost unaffected by the presence
of a small protrusion on the tip apex. As a consequence, the electron emission
from the sample is stable against tip-shape changes due to adsorbate diffusion
or atomic rearrangements. We think that one of the main applications of our
calculations is the optimization of the experimental conditions for imaging and
modification of molecular films and thin oxide layers.
Due to the finite size of the probe tip, atomic force microscopy (AFM) images
of biomolecules, and other structures similar in size, are laterally enlarged.
We use mathematical morphology, a non-linear image processing method, to model
the interaction between probe tip and sample. In a typical imaging situation,
baseline dimensions are most affected by the probe and widths can be 80% tip
and 20% molecule. Using the morphological model and a known tip, we can restore
the image so that it more closely resembles the actual surface. Morphological
restoration is ideal in some regions, giving the exact sample surface, and
improved in others. In the case of a carbon probe, restoration increases the
perfectly obtained surface area by as much as 160 times. Following restoration,
lateral widths at fixed heights are improved by as much as 75%. Restoration
greatly improves image resolution even if one uses probes consisting of very
small candidate structures, e.g., nanotubes and Bucky balls. The tip imaging
process is also modeled, and we find that calibration spheres should be larger
than the molecules of interest and that for many tips, there is little or no
advantage to using smaller spheres. A blood plasma protein, fibrinogen, is
modeled, and AFM and restored images of single molecules are computed. © 1996
American Vacuum Society.


I.  INTRODUCTION 

The atomic force microscope (AFM) is a valuable tool in the
three-dimensional analysis of biopolymers. Using AFM, heights are measured
accurately, but lateral dimensions are enlarged due to the finite size of the
tip. This is particularly problematic with most biopolymers, which fall in the
size range of 4 to 10 nm in lateral dimensions by 1 to 5 nm in height. A more
recent application of AFM is the measurement of intermolecular forces. It has
become increasingly obvious that accurate information on tip and sample size
and shape is needed to equate theory with measured forces. These issues require
one to understand and to possibly correct the distortion in AFM images.

In contact and tapping mode imaging of structures such as globular proteins
with relatively high, rounded sides (a "high" object can contact the probe at a
point above the apex where the width of the probe is comparable to the lateral
size of the object), there is normally "contact" between the sample and an
extended region of the probe as it travels up, over, and down the structure.
The interaction is non-linear: contact is either all or nothing. Unlike a
linear imaging system, AFM can exactly measure the height of a very small
object but greatly enlarge its lateral dimensions. With the assumption of a
hard-surface, sliding contact, we and others model the interaction between
sample and probe and propose restoration techniques to remove the tip-induced
distortion and improve image resolution. The interaction between tip and sample
is not properly modeled by convolution, and AFM restoration is not
deconvolution, although some refer to the non-linear process by this name.

In a previous report, we used mathematical morphology to model the
probe-sample interaction, developed the AFM morphological restoration
technique, measured tip surfaces by imaging known objects, and created restored
images with improved resolution. In the restored images, some regions were
equal to the true sample surface before restoration, some were equal following
restoration, and others were improved by restoration but not equal to the
sample surface. Although these three regions were identified, their relative
sizes were unknown.

In a recent paper, Westra and Thompson modeled AFM imaging of thin film
surfaces. They measured errors in elevation, related them to what they
considered a perceivable change in grey-scale, and determined an image
perception measure of distortion. In simulated AFM images without restoration,
they found that the radius of curvature of the probe should be 1/10 the size of
the imaged structures in order to have no perceivable difference in 90% of the
pixels. In the case of biomolecules, adoption of this rule-of-thumb would
create an impossible situation requiring probe tips with an apex one or two
atoms wide by ten atoms high. We will investigate the issue of probe size and
include an analysis of morphological restoration.

In the present report, we use morphological modeling to explore the
characteristics and limitations of AFM imaging. Using spherical models, we
quantify the amount of distortion before and after restoration. We analyze the
resolution limit of AFM using very small candidate probe tips such as nanotubes
and Bucky balls. Using models of single fibrinogen molecules, we predict the
distortion introduced when one images a biomolecule. Although fibrinogen is
used as an example, the results will aid interpretation of AFM measurements
from all molecules.


II.  THEORY 

We  briefly  describe  the  theory  of  morphological  modeling 
and restoration that is presented in more detail by Wilson
et al.

Morphological image processing is processing with regard
to shape. It is helpful to consider a grey-scale image as
a two-dimensional surface with positive (white) grey values
corresponding to hills or ridges in the surface terrain. The
structuring element (SE) incorporates the shape of interest,
and, in our application, it is a two-dimensional grey-scale
surface equal to the AFM probe tip. There are two elementary
operations, erosion ($\ominus$) and dilation ($\oplus$). An opening
($\circ$) is obtained by eroding the input image and then dilating
the resultant image using the same SE in both operations,
$M \circ L = (M \ominus L) \oplus L$. If the input image is first dilated and then
eroded, a closing ($\bullet$) is obtained, $M \bullet L = (M \oplus L) \ominus L$. An
opening can be interpreted physically as taking an object
whose top surface is described by the SE, pressing it up
against the underside intensity surface of the image, and
moving it about while marking the highest point reached by
the object at each pixel in the image. Positive excursions
smaller than the SE are eliminated with an opening. In a
closing, the SE is reflected left-right and flipped upside
down. The resulting structure is then pressed down from the
top onto the grey-scale image surface and moved around.
The minimum grey-scale values reached by the reflected,
upside down SE are recorded and become $M \bullet L$. Negative
excursions smaller than the SE are removed by a closing.
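
As a minimal sketch of the four operations just described (an illustration in Python/NumPy, not the IDL or DIPStation code used in this work), the fragment below implements one-dimensional grey-scale erosion and dilation and builds the opening and closing from them. The function names, the toy signal, and the flat three-sample structuring element are assumptions made for the example; samples outside the signal are simply ignored.

```python
import numpy as np

def dilate(f, b):
    """Grey-scale dilation: (f (+) b)(x) = max_u [f(x - u) + b(u)].

    b must have odd length; its middle sample is taken as the origin.
    """
    n, h = len(f), len(b) // 2
    padded = np.pad(np.asarray(f, dtype=float), h, constant_values=-np.inf)
    out = np.full(n, -np.inf)
    for k, b_u in enumerate(b):              # offset u = k - h
        start = 2 * h - k                    # padded index of f(x - u) at x = 0
        out = np.maximum(out, padded[start:start + n] + b_u)
    return out

def erode(f, b):
    """Grey-scale erosion: (f (-) b)(x) = min_u [f(x + u) - b(u)]."""
    n, h = len(f), len(b) // 2
    padded = np.pad(np.asarray(f, dtype=float), h, constant_values=np.inf)
    out = np.full(n, np.inf)
    for k, b_u in enumerate(b):              # offset u = k - h
        out = np.minimum(out, padded[k:k + n] - b_u)
    return out

def opening(f, b):
    """Erode, then dilate with the same SE."""
    return dilate(erode(f, b), b)

def closing(f, b):
    """Dilate, then erode with the same SE."""
    return erode(dilate(f, b), b)

# Toy signal (assumed): a one-sample spike, a three-sample plateau, a one-sample pit.
signal = np.array([0, 0, 5, 0, 0, 3, 3, 3, 0, 0, -4, 0, 0], dtype=float)
flat_se = np.zeros(3)                        # flat SE, three samples wide

opened = opening(signal, flat_se)            # removes the narrow spike
closed = closing(signal, flat_se)            # fills the narrow pit
```

In this toy case the opening never rises above the input and the closing never falls below it, which is the behaviour the restoration in Section II relies on.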

Between the probe and sample, we assume a hard-surface
interaction with no lateral forces. In Figure 1, the original
sample surface is $s$; the AFM image surface is $z$; and the
restored surface with improved resolution is $r$. Given a tip,
$t$, and a model sample, $s$, one can calculate the AFM image
surface, $z$,

$z = s \oplus t$.  (1)
Fig. 1. Morphological restoration is illustrated. The sample surface is imaged
by the probe tip. As compared to the measured AFM image, $z$, the
restored image, $r$, more closely follows the sample surface, $s$, and has
improved lateral dimension. For the morphological operations, the probe tip
surface is considered to point upward with the tip center placed at the origin
(not shown). In region <1>, $r = z = s$; in region <2>, $r = s \neq z$; and in
region <3>, $r \neq s$ and $r \neq z$. In regions <1> and <2>, ideal restoration is
obtained and the restored surface equals the sample surface. In region <3>,
restoration gives an image surface having improved resolution. In region
<4>, $r = z \neq s$, and there is no improvement with restoration.


Given the AFM image surface and a tip, one can obtain a
restored image having improved resolution,

$r = z \ominus t$.  (2)

Presently, we model interactions and use a known sample
surface, $s$; hence, we can calculate $r$ from a closing
($r = (s \oplus t) \ominus t = s \bullet t$). In most situations, one restores a measured
image of an unknown sample with a known tip. In this case,
we compute $r = z \ominus t$.

We can analytically investigate the tip-sample interaction
using simplified, one-dimensional structures consisting of a
parabolic tip and a circular sample. A closing with a parabola
removes small negative excursions and bounds the image
from above. We can analyze the interaction in terms of
derivatives. The equation of a tip-down parabola is
$y = Bx^2$ with $B > 0$, and this gives a constant second derivative,
$2B$, that bounds the second derivative of a closed image.
In the case of the top of a circle ($x^2 + y^2 = R^2$; $y > 0$,
$-R \le x \le R$), we obtain

$d^2y/dx^2 = (-1/R) / [1 - (x/R)^2]^{3/2}$

and a second derivative that is
always negative. Hence, with regard to second derivatives,
the parabola always exceeds the circle, and we anticipate that
a closing of the circle top with a parabola will leave the
circle top unchanged. A closing also bounds the first derivative.
The first derivative of a circle approaches $\pm\infty$ as
$x \to \mp R$, i.e., tangents at the edges of the circle are vertical.
Closing with a parabola will bound this limit. From these
analyses of continuous functions, we anticipate that with the
exception of the left and right edges, closing with a parabola
of any size will not change the surface of the sample and the
circle will be perfectly obtained. In practice, we are concerned
with discrete computations using digitally sampled
AFM images of tips and samples. More important, the
spherical object always rests on a planar substrate, and this
greatly limits the closing. These issues will be investigated
later.
Fig. 2. One can switch sample and tip surfaces and obtain identical AFM
images. In A, there is a rounded sample, $s$, and a pointed tip, $t$. In B, there
is a pointed sample and a rounded tip. In both cases, identical image surfaces,
$z$, are obtained in C.

Dilation is commutative, $M \oplus L = L \oplus M$; hence, identical
AFM images are obtained by dilating the sample by the
tip, or the tip by the sample:

$z = s \oplus t = t \oplus s$.  (3)

It  follows  that  we  obtain  an  identical  AFM  image  surface  by 
interchanging  the  tip  and  sample  surfaces.^^  In  Figures  2A 
and  2B,  very  different  samples  are  used  yet  identical  AFM 
images  are  obtained.  Obviously,  an  AFM  image  can  very 
much  depend  upon  the  shape  of  the  probe  tip. 

Using a known, “calibration” sample such as a gold or
polystyrene sphere, one can exploit the commutative property
to obtain an image of the probe tip. We can substantially
improve the resolution of the tip image by restoration. Recall
that $r = z \ominus t$; switching $s$ and $t$ gives a restored tip surface,

$r_{tip} = z \ominus s$.  (4)
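
Equations (1) through (4) can be exercised numerically. The sketch below is an illustrative Python/NumPy reimplementation, not the code used in this work. It images a circle resting on a flat substrate with a parabolic tip, restores the image, checks that swapping sample and tip gives the same image, and recovers a tip estimate from the known sample. The helper names `dilate_full` and `erode_valid`, the 0.5 nm grid, the 10 nm circle radius, and B = 0.015 nm^-1 are assumptions chosen to resemble the carbon-probe case discussed in the text.

```python
import numpy as np

def dilate_full(f, g):
    """z[i] = max over j + k = i of f[j] + g[k]; length len(f) + len(g) - 1."""
    out = np.full(len(f) + len(g) - 1, -np.inf)
    for k, g_k in enumerate(g):
        out[k:k + len(f)] = np.maximum(out[k:k + len(f)], f + g_k)
    return out

def erode_valid(z, g):
    """r[j] = min over k of z[j + k] - g[k]; length len(z) - len(g) + 1."""
    n = len(z) - len(g) + 1
    out = np.full(n, np.inf)
    for k, g_k in enumerate(g):
        out = np.minimum(out, z[k:k + n] - g_k)
    return out

pixel = 0.5    # nm per sample (assumed grid spacing)
R = 10.0       # nm, radius of the circular sample (assumed)
B = 0.015      # nm^-1, parabola parameter of the tip (assumed)

# Sample: top surface of a circle of radius R resting on a flat substrate.
x = np.arange(-200, 201) * pixel
s = np.where(np.abs(x) <= R,
             R + np.sqrt(np.clip(R**2 - x**2, 0.0, None)),
             0.0)

# Tip: apex-up parabola with its apex at the origin (middle sample).
u = np.arange(-100, 101) * pixel
t = -B * u**2

z = dilate_full(s, t)            # Eq. (1): AFM image
r = erode_valid(z, t)            # Eq. (2): restored image, same length as s
z_swapped = dilate_full(t, s)    # Eq. (3): tip dilated by the sample
r_tip = erode_valid(z, s)        # Eq. (4): restored tip, same length as t

# AFM image resampled onto the sample grid (tip origin is its middle sample).
half = len(t) // 2
z_on_grid = z[half:half + len(s)]
```

With these definitions the restored curve lies between the sample and the raw image at every sample, and the restored tip never falls below the true tip, which is how the closing is expected to behave.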

The  generated  tip  surface  can  then  be  used  to  restore  an 
image of unknown objects such as proteins and nanostructures.

With regard to restoration, a potential complication is that
in a closing the dilation and erosion must be done with the
same SE. In the case of AFM imaging, we have a good, but
not ideal, estimate of the probe tip, and this may limit the
ability to create a restored image. That is, we image with a
real tip and restore with a slightly different, estimated tip.
This creates an interesting error propagation problem. Even
if we measure the probe tip exactly, we are uncertain about
its effective origin. Fortunately, a change in origin of the tip
in the restoration (erosion) process gives a true restored image
that is simply shifted spatially; that is,
$z \ominus t_{(x_0, y_0)} = (z \ominus t)_{(-x_0, -y_0)} = r_{(-x_0, -y_0)}$, where
$t_{(x_0, y_0)}$ is $t$ translated
by $(x_0, y_0)$. Spatial translation of the restored image
should not affect interpretation.

There are at least four important regions associated with a
restoration which are shown in Figure 1. In region <1>,
both the raw AFM image and the restored AFM image are
equal to the sample surface ($r = z = s$). In region <2>, the
restoration recovers the sample surface ($r = s$), and the raw
AFM image is degraded ($z \neq s$). In region <3>, the restoration
improves image resolution but does not exactly recover
the sample surface ($r \neq s$). In region <3>, restoration
error in the lateral dimension progressively increases as one
approaches the baseline, and the restored shape in region
<3> reflects the probe shape. In region <4>, the probe is
so large that no portion of it enters a negative excursion in
the sample. In this case, the restoration does not improve
resolution and $r = z$ but $r \neq s$. Restoration in regions <1>
and <2> is ideal; it obtains the sample surface. In region
<3>, restoration gives an improved version of the image.
Finally, it should be understood that restoration does not allow
one to image a region on the undersurface of a sample
not reached by AFM.

III.  METHODS 

A.  Computations  and  visualizations 

In IDL (Interactive Data Language, Research Systems
Inc., Boulder, CO), we construct mathematical models of
probe tips and of sample objects including fibrinogen molecules.
The models are written to image files and morphologically
processed using IDL and/or DIPStation (Perceptive
Systems Inc., Boulder, CO). In the case of measurements,
DIPStation is used to properly prepare the data. For example,
we scan images and cut out regions of interest for further
processing. Grey-scale and surface visualizations are done
in DIPStation and IDL.

B.  Models  of  AFM  probe  tips 

We use a paraboloid to model the probe tip. A paraboloid
well describes the carbon probes that we have imaged; it is
easily parametrized; and it has desirable mathematical properties
with regard to second derivatives as described in Section
II. The probe becomes a structuring element in the morphological
calculations, and it is defined to be apex-up at the
origin,

$z = -B_{II}(x^2 + y^2)$.  (5)

For ease of analysis, some describe the probe tip as a sphere,
and it may reasonably describe the region very near the
apex. However, in the case of biomolecules with high sides,
a paraboloid will better describe contacts between the probe
and sample. It may also be a more accurate model for future
force calculations where interactions with some layers above
the apex of the probe are required.

We are interested in the limits of imaging obtained with
very narrow probes. For the case of a one-dimensional (1D)
parabolic tip ($y = Bx^2$), the radius of curvature is $1/(2B)$ at
the apex. We use this to compute the parabola size that will
just enclose various objects of interest. One possibility is a
single atom with diameter of 2 Å giving $B = 5\ \mathrm{nm}^{-1}$. Another
structure is the mono-atomic tip described by Fink.
Field ion micrograph tip images show that the first three
layers consist of one, three, and seven atoms. Assuming 2 Å
diameter atoms and hexagonal close packing, we create a
model of the first three layers (7 Å) of the probe tip. We
find that a paraboloid with $B_{II} = 5\ \mathrm{nm}^{-1}$ encloses all but a
very small portion of the third layer of this model. Other
potential probe tips are carbon Bucky balls and nanotubes. A
nanotube consists of a cylinder with spherical caps resembling
Bucky balls at the ends; nanotubes are significantly
stiffer than any other presently known materials, making
them ideal candidates for AFM probes. The diameter of a
nanotube cap is $\approx 20$ Å giving $B = 0.5\ \mathrm{nm}^{-1}$. Recently
reported measurements on currently used tips yield radius
of curvature measurements that range from 30 nm down to
3-6 nm, and this corresponds to $0.017 < B < 0.17\ \mathrm{nm}^{-1}$.
For measured electron-beam deposited carbon probes, we
determined $B \approx 0.015\ \mathrm{nm}^{-1}$.
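
The conversion between tip radius and parabola size used above follows from the curvature of $y = Bx^2$ at its apex, which is $2B$, so the radius of curvature there is $1/(2B)$. The short Python sketch below applies that relation to the radii quoted in the text; the function name and the dictionary labels are illustrative assumptions.

```python
def parabola_parameter(radius_nm):
    """Return B (nm^-1) for y = B x^2 with apex radius of curvature radius_nm."""
    return 1.0 / (2.0 * radius_nm)

# Apex radii in nm taken from the sizes quoted in the text (labels are illustrative).
probe_radii_nm = {
    "single atom, 2 A diameter": 0.1,
    "nanotube cap, 20 A diameter": 1.0,
    "conventional tip, sharp end of range": 3.0,
    "conventional tip, blunt end of range": 30.0,
}

for label, radius in probe_radii_nm.items():
    b = parabola_parameter(radius)
    print(f"{label}: radius {radius} nm -> B = {b:.3f} nm^-1")
```

The same relation run in reverse gives an apex radius of roughly 33 nm for the measured carbon probes at B of about 0.015 nm^-1.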

These structures give us some probe sizes. On graphs, we
place marks for a carbon probe ($B = 0.015\ \mathrm{nm}^{-1}$), a Bucky
ball ($B = 0.5\ \mathrm{nm}^{-1}$), and a single atom ($B = 5\ \mathrm{nm}^{-1}$). It
should be understood that we model a parabola which envelops
either the Bucky ball or the single atom. Different results
may be obtained if one models a cylindrical nanotube with a
Bucky ball cap. In the case of fibrinogen imaging, we use
three realistically dimensioned carbon tips: narrow
($B_{II} = 0.030\ \mathrm{nm}^{-1}$), medium ($B_{II} = 0.015\ \mathrm{nm}^{-1}$), and wide
($B_{II} = 0.0075\ \mathrm{nm}^{-1}$). The medium probe mimics our measurements
of carbon probes.


C.  Fibrinogen  model 


We create a molecular model of the blood plasma glycoprotein
fibrinogen using lateral dimensions from electron
microscopy studies and heights from our own preliminary
AFM measurements. The fibrinogen model consists of two
globular D domains (ellipsoids with equal lateral diameters
of 6.5 nm and a height of 1.28 nm) and a central E domain
(an ellipsoid with equal lateral diameters of 5.0 nm and a
height of 1.0 nm). The globular domains are linked by
coiled-coil structures which we model as cylinders. We link
the three ellipsoids with two cylinders of 14.75 nm length
and 1.2 nm diameter. The height of the E domain is set so
that the ratio of E to D heights is the same as the ratio of E
to D lateral dimensions.

We use simple structures to create the fibrinogen model,
and equations are listed below where $x$ and $y$ refer to positions
along the planar substrate and $z$ refers to height. The
top of an ellipsoidal volume (Equation 6) provides a model
of a globular domain, and a cylinder of length $l$ oriented
along the $x$ axis (Equation 7) provides connections between
globular domains. Here $(x_0, y_0, z_0)$ is the center of each
structure and $R_c$ is the cylinder radius.

$\frac{(x-x_0)^2}{E_x^2} + \frac{(y-y_0)^2}{E_y^2} + \frac{(z-z_0)^2}{E_z^2} = 1$,  (6)

$(y-y_0)^2 + (z-z_0)^2 = R_c^2$.  (7)



To construct a molecular model, we link ellipsoids with cylinders.
Ellipsoids always rest on a planar substrate. Cylinders
are parallel to the substrate and attached at the center of the
facing surface of the smaller, central ellipsoid. In the case of
AFM imaging, only the top of a three-dimensional object is
measured, and we create a two-dimensional surface using the
top of all objects. Each globular domain has lateral dimensions
($E_x$ and $E_y$) and a height ($E_z$).

Fig. 3. Effect of tip size on restoration. The sample, dashed curve, is a
circle, and the two restored, solid curves, are plotted. The raw AFM image is
not shown. As $B$ increases, the tip gets sharper and the restored image
improves.
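
The construction described above can be sketched as a height map. The Python/NumPy fragment below is an illustration only, built from the dimensions quoted in Section III C; the 0.25 nm grid, the function names, and three geometric choices are assumptions not stated in the text: each ellipsoid is centered at half its height, each cylinder axis sits at half the E-domain height, and each cylinder runs from the lateral edge of the E domain to the lateral edge of a D domain.

```python
import numpy as np

def ellipsoid_top(X, Y, x0, diameter, height):
    """Top surface of an ellipsoid resting on the substrate (z = 0), centered at (x0, 0)."""
    a, c = diameter / 2.0, height / 2.0          # lateral and vertical semi-axes
    q = ((X - x0) / a) ** 2 + (Y / a) ** 2
    top = c + c * np.sqrt(np.clip(1.0 - q, 0.0, None))
    return np.where(q <= 1.0, top, 0.0)

def cylinder_top(X, Y, x_start, length, diameter, axis_height):
    """Top surface of a cylinder lying along x, from x_start to x_start + length."""
    rad = diameter / 2.0
    inside = (X >= x_start) & (X <= x_start + length) & (np.abs(Y) <= rad)
    top = axis_height + np.sqrt(np.clip(rad**2 - Y**2, 0.0, None))
    return np.where(inside, top, 0.0)

pixel = 0.25                                     # nm per pixel (assumed)
x = np.arange(-120, 121) * pixel                 # -30 nm .. 30 nm
y = np.arange(-20, 21) * pixel                   # -5 nm .. 5 nm
X, Y = np.meshgrid(x, y)

E_DIAM, E_HEIGHT = 5.0, 1.0                      # central E domain, nm
D_DIAM, D_HEIGHT = 6.5, 1.28                     # outer D domains, nm
LINK_LEN, LINK_DIAM = 14.75, 1.2                 # coiled-coil cylinders, nm

link_start = E_DIAM / 2.0                        # cylinder begins at the E-domain edge
d_center = link_start + LINK_LEN + D_DIAM / 2.0  # D-domain center along x

surface = ellipsoid_top(X, Y, 0.0, E_DIAM, E_HEIGHT)
for sign in (+1, -1):
    surface = np.maximum(
        surface, ellipsoid_top(X, Y, sign * d_center, D_DIAM, D_HEIGHT))
    start = link_start if sign > 0 else -(link_start + LINK_LEN)
    surface = np.maximum(
        surface, cylinder_top(X, Y, start, LINK_LEN, LINK_DIAM, E_HEIGHT / 2.0))

center_row = surface[len(y) // 2]                # profile along the molecular axis
molecule_length = x[center_row > 0].max() - x[center_row > 0].min()
```

Under these assumptions the end-to-end length of the model works out to 5.0 + 2(14.75) + 2(6.5) = 47.5 nm, and taking the pointwise maximum of the component surfaces mirrors the statement that only the top of all objects is kept.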

IV.  RESULTS 

We first investigate the limits of AFM imaging and the
properties of AFM restoration using simplified, one-dimensional
models. Probe models vary from hypothetical,
synthetic nanostructure probes to conventional carbon and
silicon nitride probes. Later, we investigate three-dimensional
models of fibrinogen.

A.  Properties  of  morphological  restoration 

Using discrete calculations, we investigate the interaction
of a parabolic tip with a circular sample.

To test the analysis in Section II, we investigate
a simplified arrangement where a circle is raised on a narrow,
arbitrarily high pedestal above the substrate in order to eliminate
the effect of floor contact. Using a range of parabolas
($0.01 < B < 10$) and circles ($1 < R < 100$), we determine that
the restored curve ($r$) exactly equals the sample circle ($s$) at
all points on the top surface. These results agree with the
analytical, continuous analysis in Section II. Extending to a
two-dimensional analysis and ignoring the substrate, a
sphere can be restored exactly with any reasonably sized
paraboloid tip.

With  the  substrate  present,  we  obtain  restored  images  of  a 
circle  using  two  different  parabolas  (Figure  3).  In  both  cases, 
the  restored  curve  is  much  improved  as  compared  to  the 
AFM  curve,  z  (not  shown),  and  it  is  exactly  equal  to  the 
sample  circle  as  long  as  the  tip  does  not  contact  the  substrate. 
On  either  side  of  the  circle,  we  obtain  an  image  of  the  tip 
parabola  where  the  parabola  contacts  the  planar  substrate  and 
the  edge  of  the  circle.  The  more  narrow  probe  yields  a  better 
restoration. 

For continuous analytical functions, we derive an expression
for the point of contact on a circle when the tip rests on
the baseline substrate (see the Appendix). Following restoration,
above this contact, the circle is imaged perfectly. We
compute the angle of perfect imaging using Equation A6,
and in Figure 4A, we plot the result as a function of the
parabola parameter, $B$, on the x-axis and the circle radius,
$R$. As expected, as $B$ increases and the tip “sharpens,” the
initial parabola-circle contact moves down the side of the
circle and the angle approaches the ideal result of 180°. The
angle also depends upon the size of the sphere, and in the
case of the smallest sphere ($R = 1$ nm), it is still well below
180° at $B = 10\ \mathrm{nm}^{-1}$. Extending to a two-dimensional surface,
the angle measurement is proportional to surface area.
Hence, for the case of a 10 nm sphere and a carbon probe,
the restored angle is 90° out of 180° and 50% of the surface
area is restored exactly. Prior to restoration, only a single
infinitesimal point at the top of the sphere is imaged perfectly.

Fig. 4. The angle of perfect imaging is investigated for continuous (A) and
discrete (B) computations. In A, we use the expression derived in the Appendix
for continuous functions (Eq. (A6)) and plot the angle of perfect
imaging as a function of independent variables, $R$, the radius of the imaged
circle, and $B$, the parabola size. Prior to restoration, only an infinitesimal
point is perfectly imaged. Using discrete calculations, a much different result
is obtained, and the angle of perfect imaging is shown before and after
restoration (B). Vertical lines mark $B$ values obtained from a carbon probe,
a Bucky ball, and a single atom. In the discrete calculations, the pixel size is
0.5 nm and the elevation calibration is 0.076 nm/gray-value, typical values
used in our biomolecule imaging experiments.
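
The trend described for the continuous case can be cross-checked from tangency alone. The sketch below is derived here and is not claimed to have the same form as Equation A6, which is not reproduced in this section. Assume a circle of radius $R$ resting on the substrate and a tip-down parabola $y = B(x - x_a)^2$ with its apex on the substrate. At a contact point at angle $\theta$ from the top of the circle, equal heights and equal slopes give $\tan^2\theta/(4B) = R(1 + \cos\theta)$, which reduces to $4BR\cos^2\theta + \cos\theta - 1 = 0$. The function name below is an illustrative assumption.

```python
import math

def perfect_imaging_angle(B, R):
    """Full angle (degrees) of the circle top lying above the tip-circle contact.

    B: parabola parameter in nm^-1; R: circle radius in nm.
    Solves k*c**2 + c - 1 = 0 for c = cos(theta), with k = 4*B*R.
    """
    k = 4.0 * B * R
    cos_theta = (-1.0 + math.sqrt(1.0 + 4.0 * k)) / (2.0 * k)
    return 2.0 * math.degrees(math.acos(cos_theta))

carbon_probe = perfect_imaging_angle(B=0.015, R=10.0)   # carbon probe, 10 nm circle
sharp_small = perfect_imaging_angle(B=10.0, R=1.0)      # B = 10 nm^-1, 1 nm circle
```

Under these assumptions the carbon-probe case comes out close to the 90° quoted in the text, and the 1 nm circle at $B = 10\ \mathrm{nm}^{-1}$ stays below 180°, in line with the behaviour described for Figure 4A.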

Images consist of pixels which are discrete samples in
space and grey level, and discrete results will sometimes
differ from analytical results on continuous functions. For
this reason, in discrete simulations, we compute the angle of
perfect imaging with and without restoration (Figure 4B). We
determine the number of pixels that equal the sample pixels
and convert this number to an angle using basic geometry.
The angle of perfect imaging is obtained as a function of
parabola size, $B$, and the radius of the circle, $R$. In this
discrete simulation, the pixel size is 0.5 nm and the elevation
calibration is 0.076 nm/gray-value, typical values used in our
biomolecule imaging experiments. For the case of the carbon
probe and a circle ($R = 10$ nm), a single pixel is perfectly
obtained prior to restoration, and this corresponds to an angle
of <5.6°, the angle contained within ±1 discrete samples at
the top. After restoration, we get 90°, or greater than a 16-fold
increase in the exactly restored surface area. Prior to
restoration, in order to obtain more than one perfect pixel, a
much narrower probe must be used ($B \geq 0.05\ \mathrm{nm}^{-1}$, as observed
in Figure 4B). Results prior to restoration depend
greatly on pixel size, and we also investigate a much smaller
pixel size of 0.05 nm (not shown). In this case, the “not
restored” curve is shifted down the y-axis. Once again, for a
carbon probe, we get 90° following restoration and one pixel
before restoration. In this case of finer sampling, the angle
contained within one pixel is reduced by an order of magnitude
to <0.56° and the perfectly obtained surface area is
increased at least 160 times by restoration.

Fig. 5. Given a circle of diameter 14 nm, we compute the widths of the
AFM curve, $z$ (upper curve in each panel) and the restored curve, $r$ (lower
curve in each panel) at different heights above the planar substrate where
1.5R refers to a height of 1.5 times the radius of 7 nm. Widths are plotted as a
function of the parabola parameter, $B$. As $B$ increases, the curves converge
to the actual width of the sample. Vertical lines mark $B$ values obtained from
a carbon probe, a Bucky ball, and a single atom.

The improvement in lateral measurements following restoration
is shown in Figure 5. For both the AFM and restored
curves, we plot the width as a function of $B$ on the x-axis and
the height above the planar substrate. There is significant
improvement with restoration at all heights shown. As expected,
as $B$ increases and the tip becomes sharper, the
curves converge to the actual width. For the case of a carbon
probe and a circle of 14 nm in diameter, at baseline, the AFM
and restored widths are both 69 nm (not shown). Hence, at
baseline, the width measurement is 80% probe and 20%
sample. Half-way up at a height of 1.0R, the widths are 50.5
nm and 25 nm before and after restoration, respectively.
These values give 72% probe and 28% sample prior to restoration
and 44% probe and 56% sample following restoration.
At a height of 1.5R, there is more improvement. In this
case, the widths are 37.5 nm and 15.6 nm before and after
restoration, respectively. At this height of 1.5R, the width of
the circle is 12 nm, and the widths give 68% probe and 32%
sample before restoration and 22% probe and 78% sample
following restoration.
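
The probe and sample percentages quoted above follow from dividing the true width of the top surface by the measured width. The Python sketch below reproduces that arithmetic for the 14 nm circle; the function names are illustrative, and treating the top-surface width as the full diameter at or below mid-height is an assumption made here to match how the baseline and 1.0R values are reported.

```python
import math

def top_surface_width(height, radius):
    """Width (nm) of the top surface of a circle on a substrate at a given height."""
    if height <= radius:
        # At or below mid-height the widest part of the circle overhangs the lower part.
        return 2.0 * radius
    return 2.0 * math.sqrt(radius**2 - (height - radius) ** 2)

def sample_percent(height, measured_width, radius=7.0):
    """Percentage of a measured width that is due to the sample rather than the probe."""
    return 100.0 * top_surface_width(height, radius) / measured_width

# Measured widths (nm) quoted in the text for the carbon probe.
baseline = sample_percent(0.0, 69.0)
mid_before = sample_percent(7.0, 50.5)
mid_after = sample_percent(7.0, 25.0)
upper_before = sample_percent(10.5, 37.5)
upper_after = sample_percent(10.5, 15.6)
```

The probe contribution is the complement of each value, so a sample share of 28% corresponds to the 72% probe share quoted for the unrestored width at 1.0R.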

We have seen that restoration greatly improves AFM resolution
when realistic probes and sample objects are used.
However, with some unusual tip shapes, there is no improvement
following restoration (Figure 6). Over a broad range of
sample surfaces, following restoration, images from a convex
tip (A) improve but a straight-edge tip (B) and a concave
tip (C) do not. The requirement for improvement is that the
tip be convex; for a point-down tip this means that the second
derivative is positive.

B.  Tip  imaging 

As  described  in  Section  II,  one  can  create  a  restored  tip 
surface using a known sample object. A parabolic tip is imaged
using circle samples of various sizes. In Figure 7, we
plot  the  restored  curve  as  an  overlay  on  the  actual  tip  curve. 
We  find  that  the  tip  is  perfectly  imaged  until  the  probe 
reaches  the  floor,  at  which  point,  an  image  of  the  circle  is 
obtained.  If  one  creates  a  restored  tip  image  using  a  small 
sphere  of  radius  R,  one  obtains  a  truncated  tip  image  which 
flattens  at  a  distance  2R  from  the  apex. 

C.  Application  to  fibrinogen 

In Figure 8A, the fibrinogen model (center) is imaged
using a carbon probe (left) and a silicon nitride probe (right).
As determined previously, silicon nitride tips are significantly
more lumpy and asymmetrical than carbon probes.
As a result, the image obtained with the silicon nitride probe
is quite lumpy. Both AFM images are significantly enlarged.

The fibrinogen model (Figure 8B, middle) is dilated to
create the AFM image (top) using the medium paraboloid tip
model ($B_{II} = 0.015\ \mathrm{nm}^{-1}$). The AFM image is morphologically
restored (bottom). The restored image more closely approximates
the model; that is, the globular domain linkages
and the globular domains are more clearly defined. We get
the correct height at the center of the domains both in the
case of the AFM and restored images.

Next, we will quantify the improvement of the restored
image over the AFM image. In Figure 8C, a fibrinogen
model is imaged and restored using the wide (top), medium
(middle), and narrow (bottom) model tips. Using grey-scales,
we label regions defined previously. They are: white (<1>),
black (<2>), and grey (<3>). Note that the relatively
uninteresting region at the baseline is white (<1>). Because
of the success of the restoration process, the area of
region <2> is much greater than region <1>. In addition,
following restoration, the area of region <2> is substantially
increased with the smaller probe. The size of region
<3> is decreased substantially when we use the smaller
probe because this region reflecting the probe image at the
boundary is smaller with the narrow tip.

Fig. 6. The effect of morphological processing is investigated using some
unusual tip shapes. In panel A, a realistic, convex tip is shown (inset). The
sample curve (solid), AFM curve (outer dashed), and restored curve (inner
dashed) are plotted. In this case, there is significant improvement. With a
straight-edge tip (B) and with a concave tip (C), there is no improvement
with restoration. The restored curve and AFM curve are identical. Tips in the
insets are shown point down for illustration purposes, but in the morphological
restoration process, they are defined to be point up.

We count labeled pixels, and report them as triplets
(<1>, <2>, <3>). We neglect the uninteresting region
<1> pixels at the baseline. The numbers of pixels are as
follows: wide (3, 84, 10066), medium (19, 134, 6122), and
narrow (33, 210, 3806). Ratios of perfectly obtained pixels
(restored/not-restored) are wide (29), medium (8.05), and
narrow (7.36). Note that these ratios are greatly affected by
the pixels in the cylindrical linkages. Region <3> pixels are
greatly affected by the size of the tip. One way to compare
the size of region <3> is to compute ratios of numbers of
pixels. The ratios are 0.37 (narrow/wide) and 0.6 (medium/wide).
Yet another way to compare the effect of probe is the
ratio of perfectly obtained pixels following restoration. The
ratios are 1.7 (medium/wide) and 2.8 (narrow/wide). Even
with restoration, significant improvement is obtained using
the narrow probe tip.

Fig. 7. A parabolic tip is imaged using various circular samples. The original
tip is shown as a dashed curve. Other curves are the restored imaged tip
using samples having radii from 5 to 50 nm as indicated in the figure. The
restoration flattens out on either side because the tip contacts the floor.
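
The ratios in this paragraph can be reproduced from the pixel triplets. In the Python sketch below, the reading that "perfectly obtained" means region <1> alone before restoration and regions <1> plus <2> after restoration is an inference from the reported numbers, not a definition given in the text; the variable and function names are illustrative.

```python
# Pixel counts per tip as (region 1, region 2, region 3), taken from the text.
counts = {
    "wide": (3, 84, 10066),
    "medium": (19, 134, 6122),
    "narrow": (33, 210, 3806),
}

def perfect_after(tip):
    """Perfectly obtained pixels after restoration (inferred: regions 1 and 2)."""
    region1, region2, _ = counts[tip]
    return region1 + region2

def restoration_gain(tip):
    """Ratio of perfectly obtained pixels, restored over not restored."""
    return perfect_after(tip) / counts[tip][0]

def region3_ratio(tip, reference="wide"):
    """Size of region 3 for one tip relative to a reference tip."""
    return counts[tip][2] / counts[reference][2]

def restored_ratio(tip, reference="wide"):
    """Perfectly obtained pixels after restoration, relative to a reference tip."""
    return perfect_after(tip) / perfect_after(reference)

for tip in counts:
    print(tip, round(restoration_gain(tip), 2),
          round(region3_ratio(tip), 3), round(restored_ratio(tip), 2))
```

Under this reading the restored/not-restored ratios match the reported 29, 8.05, and 7.36, and the remaining ratios agree with the reported values to within rounding or truncation in the second digit.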

V.  DISCUSSION 

A.  Properties  of  morphological  restoration 

The one-dimensional modeling studies illustrate the importance
of the restoration process. In the case of continuous
functions, an infinitesimal point is perfectly obtained prior to
restoration. For a carbon probe and a 10 nm radius circle,
90° is perfectly obtained following restoration (Figure 4A)
giving an infinite improvement. In the case of discrete computations,
more than one pixel can be perfectly sampled prior
to restoration especially for very sharp tips, wide samples,
and/or large pixels (Figure 4B). However, in the case of a
realistic carbon probe, only one pixel is obtained perfectly
prior to restoration. In the case of a carbon probe and a 0.05
nm pixel, the angle contained within ±1 pixel is 0.56°, and
the perfectly obtained surface area is increased by over 160
times following restoration. To put it another way, prior to
restoration, the perfectly obtained surface area is 0.3% of the
total; following restoration, it is 50%. Restoration also improves
measured lateral widths by as much as 75% (Figure
5).

We find that both in the case of AFM and restored surfaces,
the width at the bottom of an object is most affected
by the presence of the tip. In the case of currently realistic
tips such as carbon probes used to image spherical objects of
approximately 14 nm diameter, a width at baseline is approximately
80% probe and 20% molecule. Thus, the common
practice of reporting the width at baseline very much
depends upon the size of a tip. By reporting other widths
above the substrate, AFM researchers can better compare
results which are less dependent on tip widths and shape
(Figure 5). For example, at 1/4 below the maximum height,
before restoration, the width measurement consists of 68%
probe and 32% molecule. Following restoration, the numbers
improve to 22% probe and 78% molecule.

As described in Section I, Westra and Thompson modeled
AFM imaging of thin film surfaces and determined a rule-of-thumb
that the radius of curvature of the probe should be
1/10 the size of the imaged structures. We have extended
their analysis to include a range of object sizes (1 to 100 nm
radius), a range of tip sizes (radii of curvature 0.1 nm to 50
$\mu$m), and morphological restoration. We require pixels to be
perfectly obtained following restoration; Westra and Thompson
use a relaxed condition requiring a grey-scale error less
than that perceived by the human eye on 90% of pixels. If
one requires 90% of the pixels to be perfectly obtained, our
restoration analysis also shows that one needs a tip radius
of about 1/10 the radius of the object (Figure 4A). However, if one
relaxes the condition of 90% to 50% perfectly restored area,
then even objects as small as 10 nm in radius can be obtained
adequately in a restored image using a conventional carbon
tip. In addition, objects as small as 1 nm in radius can be
imaged with potentially obtainable tips larger than a Bucky
ball. These are conservative analyses, and an even larger
proportion of pixels have relatively little elevation distortion.
Also, we find that many features in our fibrinogen model are
plainly seen following restoration. We can determine the
number of globular domains and their spacing. Hence, for
globular biomolecules, useful information is obtained with
relatively large probes.

Given  the  tremendous  potential  improvement,  we  recom¬ 
mend  restoration  of  all  AFM  images.  This  is  especially  im¬ 
portant  for  biomolecules  consisting  of  globular  domains  in 
the  range  of  1-10  nm.  Remarkably,  even  with  very  narrow, 
candidate  probes  such  as  a  Bucky  ball,  there  is  significant 
improvement  with  restoration  (Figure  4B).  In  fact,  there  is 
almost  a  nine-fold  improvement  in  the  correctly  obtained 
surface  area. 
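
The restoration discussed above can be illustrated with a minimal one-dimensional sketch. It assumes a symmetric parabolic tip, a circular object resting on a flat substrate, and a simple grey-scale dilation followed by erosion; the function names, grid spacing, and tip radius are illustrative choices, not values or code from this work.

```python
import numpy as np

def parabolic_tip(half_width, dx, radius_of_curvature):
    """Tip height above its apex, t(d) = d^2 / (2 r), sampled on offsets -half_width..+half_width."""
    m = int(round(half_width / dx))
    d = np.arange(-m, m + 1) * dx
    return d**2 / (2.0 * radius_of_curvature)

def dilate(surface, tip):
    """Simulated AFM trace: apex height i(x) = max_u [s(u) - t(u - x)]."""
    m = len(tip) // 2
    padded = np.pad(surface.astype(float), m, constant_values=-np.inf)
    return np.array([np.max(padded[x:x + 2 * m + 1] - tip)
                     for x in range(len(surface))])

def erode(image, tip):
    """Restoration: r(u) = min_x [i(x) + t(u - x)], the lower envelope of all tip positions."""
    m = len(tip) // 2
    padded = np.pad(image.astype(float), m, constant_values=np.inf)
    return np.array([np.min(padded[u:u + 2 * m + 1] + tip[::-1])
                     for u in range(len(image))])

# Hypothetical example: a 14 nm diameter circle on a flat substrate, 10 nm radius tip.
x = np.linspace(-40.0, 40.0, 801)   # lateral position in nm
dx = x[1] - x[0]
R = 7.0                             # object radius in nm
surface = np.where(np.abs(x) <= R,
                   R + np.sqrt(np.clip(R**2 - x**2, 0.0, None)),
                   0.0)

tip = parabolic_tip(half_width=30.0, dx=dx, radius_of_curvature=10.0)
image = dilate(surface, tip)        # what the AFM would record
restored = erode(image, tip)        # morphological restoration

tol = 1e-6
exact_before = int(np.sum(np.abs(image - surface) < tol))
exact_after = int(np.sum(np.abs(restored - surface) < tol))
print("perfectly obtained pixels before/after restoration:", exact_before, exact_after)
```

The restored profile never drops below the true surface and never rises above the raw image, so counting pixels where it coincides with the surface gives a count of "perfectly obtained" pixels in the sense used here.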

B.  Tip  shape  and  calibration 

With some unusual, non-convex probe tip shapes, there is
no improvement with restoration (Figure 6). All carbon
probes that we have measured using AFM are convex within
15 nm of the apex. Also, high resolution, scanning electron
microscopy images of these tips show them to be convex
within several hundred nanometers of the apex. On the other
hand, silicon nitride tips can show local concavities within
15 nm of the apex. A convex shape is implicit in reports of
tips having a radius from 30 nm down to 3-6 nm.
Atomic-scale tips are desirable, and in the case of carefully
constructed tips containing a single atom at the end, the first
three layers contain one, three, and seven atoms. An enveloping
surface over an atomic model consisting of these three
layers has fairly straight surfaces with local convexities and
concavities. However, we have never seen atomic scale features
resulting from atoms on the side of our tips riding over
a sharp edge (e.g., a step on mica or highly oriented pyrolytic
graphite). Thus, we do not expect atomic scale convexities
and concavities to greatly limit the usefulness of restoration.

Fig. 8. Fibrinogen models are investigated. In A, a fibrinogen model (center)
is imaged using a carbon probe (left) and a silicon nitride probe (right). As
described in Section III C, the model is the top surface of a structure consisting
of ellipses and cylinders. Surfaces are rendered using shading. The
image obtained with the silicon nitride probe is quite lumpy due to the shape
of the probe tip. In B, we demonstrate the restoration process. The fibrinogen
model (center) is dilated with a paraboloid tip (top) and this is eroded to
create the restored image (bottom). Heights are rendered as a grey-scale
image with light values being high elevations. In C, the fibrinogen model is
imaged and restored using a wide (top), medium (middle), and wide (bottom)
tip. Regions obtained following restoration are identified in shades of
gray. They are: white (region <1>), black (region <2>), and grey (region
<3>).

Fig. 9. A parabola tip will contact a circle as shown. Both rest on the planar
substrate which is the top of the hatched area. Mathematically, there are five
potential contact points that are marked with crosses and enumerated. In the
text, we derive an expression for the physically meaningful contact points
(1) and (5) in terms of variables defined in the figure. The region above the
contact points marked by the angle θ is obtained perfectly following restoration.

One  of  our  interests  is  to  determine  optimal  calibration 
objects  for  imaging  tips.  Contrary  to  our  initial  expectation, 
with  smooth,  paraboloid-shaped  tips,  we  find  no  need  to  use 
small  spheres  to  get  an  accurate  representation  of  the  tip 
(Figure 7), and this is a demonstration of our arguments in
Section II. Moreover, the downside with a small sphere is
that  one  obtains  a  truncated  tip  image  (Figure  9).  If  one  uses 
a  truncated  tip  to  restore  larger  objects,  unacceptable  distor¬ 
tions  will  result.  One  rational,  but  imperfect,  solution  is  to 
terminate  the  tip  image  near  the  “baseline”  by  placing  the 
useful  portion  of  the  tip  on  a  pedestal  or  marking  SE  pixels 
in  the  unusable  region  as  ones  not  to  be  computed.  A  better 
solution  is  to  measure  the  tip  surface  with  a  bigger  calibra¬ 
tion  sphere.  Thus,  the  results  show  that  there  is  little  benefit 
to  measuring  a  smooth  tip  with  a  small  sphere  and  that  the 
sphere  size  should  be  larger  than  any  unknown  object  of 
interest.  It  is  understood  that  irregularly  shaped  tips  with 
small  protrusions  and  indentations  must  be  sampled  with  a 
smaller  sphere. 

Accumulation  or  removal  of  debris  on  the  probe  tip  dur¬ 
ing  AFM  scanning  is  a  potential  source  of  error.  By  changing 
the  size  of  the  SE,  one  can  model  the  effect  of  this  phenom¬ 
enon  on  AFM  and/or  restored  images.  Obviously,  if  one  uses 
an  improper  tip  for  restoration,  this  will  propagate  to  an  error 
in the restored image. Vesenka et al. suggest dispersing
calibration  standards  on  the  surface  containing  the  objects  of 
interest.  In  this  way,  the  accumulation  and  removal  of  debris 


J. Vac. Sci. Technol. B, Vol. 14, No. 4, Jul/Aug 1996

Wilson et al.: Morphological modeling of AFM imaging

from  the  tip  can  be  monitored  within  an  image.  In  principle, 
morphological  modeling  can  then  be  used  to  dynamically 
alter  the  tip  shape  to  reflect  contaminants. 

C.  Imaging  of  biomolecules 

Figure  8B  illustrates  the  dramatic  improvement  of  fibrino¬ 
gen  images  following  restoration.  With  restoration,  conven¬ 
tional  probes  can  be  used  to  obtain  useful  images  containing 
recognizable  domain  structures.  Molecular  modeling  also 
helps  identify  the  limitations  imposed  by  the  probe  tip.  AFM 
images  obtained  with  a  silicon  nitride  tip  are  very  distorted 
and  asymmetrical  (Figure  8A).  This  effect  is  reduced,  but  not 
eliminated  by  morphological  restoration  (not  shown).  In  the 
case  of  carbon  probe  tips,  there  are  fewer  small  excursion 
distortions,  but  there  is  significant  lateral  enlargement.  Fol¬ 
lowing  restoration  of  the  medium  tip,  the  perfectly  obtained 
pixels  are  increased  by  a  factor  of  8.  Note  that  with  molecu¬ 
lar  modeling  we  count  pixels  rather  than  measure  surface 
area.  This  helps  explain  the  somewhat  less  dramatic  effects 
than  obtained  with  one-dimensional  modeling.  Comparing 
results  with  different  probe  sizes  in  Figure  8C,  we  determine 
that  small  probes  are  desirable.  Even  in  this  case,  restoration 
is  extremely  important  for  imaging  biomolecules. 

VI. CONCLUSION

We  have  quantified  the  improvement  obtained  when  using 
morphological  restoration  on  AFM  images.  Using  currently 
available  tips  to  image  molecularly  sized  objects,  the  number 
of  perfectly  obtained  pixels  is  increased  at  least  7  times  by 
restoration.  For  optimal  imaging,  restoration  is  not  enough, 
and  significant  improvement  is  obtained  with  sharper  probe 
tips.  However,  even  with  the  sharpest  tips  that  are  likely  to 
be  available  in  the  foreseeable  future,  morphological  restora¬ 
tion  provides  significant  improvement  in  AFM  imaging.  This 
is  especially  true  when  imaging  globular  structures  like  bio¬ 
molecules.  We  conclude  that  morphological  modeling  and 
processing  greatly  aid  interpretation  of  AFM  measurements. 
APPENDIX: ANGLE OF IDEAL IMAGING FOR A
SPHERICAL  SAMPLE  AND  A  PARABOLOID  TIP 

We  derive  equations  describing  the  contact  point  between 
a  paraboloid  tip  and  a  spherical  sample  both  of  which  are 
resting  on  a  floor  produced  by  the  planar  substrate.  Because 
of  the  spherical  symmetry,  we  can  simplify  the  geometry  and 
investigate  the  problem  of  a  parabola  and  circle  (Figure  9). 
In  the  figure,  the  top  of  the  hatched  region  indicates  the  floor, 
and  the  origin  is  judiciously  chosen  in  the  center  of  the 
circle.  The  contacts  (x,y)  must  lie  on  both  curves  at  a  point 
where  the  tangents  are  equal.  Under  these  conditions,  there 


are  mathematically  five  possible  solutions  which  are  marked 
with  crosses  and  enumerated  on  the  circle  (Figure  9).  Only 
points  (1)  and  (5)  are  physically  meaningful  in  our  case. 

Given  the  coordinate  system  in  the  figure,  the  equation  of 
a  circle  of  radius  R  is 

$$x^2 + y^2 = R^2 \tag{A1}$$

and the parabola is given in terms of a parameter $B$ and an
offset, $x_0$,

$$y + R = B(x - x_0)^2. \tag{A2}$$

The tangent of the circle is $dy/dx = -x/y$ and the tangent of
the parabola is $dy/dx = 2B(x - x_0)$. Equating tangents and
rearranging for $x_0$, we obtain the equation below.

$$x_0 = x + \frac{x}{2By} \tag{A3}$$

Substituting into Equation (A2) and further eliminating $x$
using Equation (A1), we obtain an equation containing $y$ to
the third power.

$$y + R = \frac{R^2 - y^2}{4By^2} \tag{A4}$$


There are three solutions: $y_1 = (-1 + \sqrt{1 + 16BR})/8B$,
$y_2 = (-1 - \sqrt{1 + 16BR})/8B$, and $y_3 = -R$. In Figure 9, these
three solutions correspond to points (1) and (5); (2) and (4);
and (3), respectively. We use $y_1$, which corresponds to the
physically interesting solutions at (1) and (5). Substituting
$y_1$ into Equation (A1), we obtain two $x$ values corresponding
to points (1) and (5), respectively.

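We are interested in the angle of perfect restoration ($\theta$)
above the contact points (Figure 9). Using the $y_1$ solution,
we obtain the final result below.

$$\theta = 180^\circ - 2\sin^{-1}\left(\frac{y_1}{R}\right) \tag{A5}$$

$$\theta = 180^\circ - 2\sin^{-1}\left(\frac{-1 + \sqrt{1 + 16BR}}{8BR}\right) \tag{A6}$$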
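
As a numerical check of the appendix result, the sketch below evaluates the contact height and the angle of perfect restoration, and verifies that the contact point satisfies both the circle and the parabola conditions. The function names are illustrative, and relating $B$ to a tip radius of curvature through $B = 1/(2 r_{tip})$ (the apex curvature of a parabola) is an assumption made only for this example.

```python
import math

def contact_height(B, R):
    """Positive root y1 of 4*B*y**2 + y - R = 0, i.e. contact points (1) and (5)."""
    return (-1.0 + math.sqrt(1.0 + 16.0 * B * R)) / (8.0 * B)

def restoration_angle_deg(B, R):
    """Angle of perfect restoration, theta = 180 deg - 2*asin(y1 / R)."""
    y1 = contact_height(B, R)
    return 180.0 - 2.0 * math.degrees(math.asin(y1 / R))

# Hypothetical values: 7 nm radius sphere, tip with 10 nm radius of curvature.
R = 7.0
tip_radius = 10.0
B = 1.0 / (2.0 * tip_radius)   # assumed link between B and the apex radius

y1 = contact_height(B, R)
x1 = math.sqrt(R**2 - y1**2)           # contact point on the circle
x0 = x1 + x1 / (2.0 * B * y1)          # tip offset from equal tangents
residual = (y1 + R) - B * (x1 - x0) ** 2   # parabola equation; should vanish

theta = restoration_angle_deg(B, R)
print(f"y1 = {y1:.3f} nm, theta = {theta:.1f} deg, residual = {residual:.2e}")
```

Sharper tips (larger $B$) push the contact points toward the equator of the sphere, so the angle grows toward 180°; blunt tips shrink it toward zero.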
A recently developed optical-scatter instrument, the scanning scattering microscope (SSM), produces
two-dimensional images of very small surface features and of variations in surface topography. In
its present configuration, the lateral resolution is better than 10 μm and its sensitivity to surface
roughness is in the angstrom (rms) range. The performance of the SSM has been demonstrated using
calibration gratings, Si(100) surfaces, and Ge thin films grown on silicon; intercomparison also was
made  with  atomic  force  microscope  measurements.  These  results  indicate  that  this  scanned  optical 
technique  is  a  very  useful  noncontact  method  for  evaluating  surface  microtopography. 


I.  INTRODUCTION 

The  continuing  impact  of  miniaturization  and  shrinking 
feature  size  on  semiconductor  device  performance  has 
greatly  increased  the  need  for  noncontact  methods  for  mea¬ 
suring  the  topography  and  defect  structure  of  surfaces  and 
buried  interfaces.  In  the  development  of  ultrathin  (~5  nm) 
gate insulators, for example, where microroughness and defects
directly affect electrical characteristics, it is critical to
monitor  and  control  both  the  surface  and  interface  topogra¬ 
phy  as  well  as  defect  size  and  density.  Precise  on-line  mea¬ 
surement  methods  are  needed  not  only  of  the  average  surface 
roughness  but,  even  more  so,  of  the  lateral  changes  in  topog¬ 
raphy,  often  with  a  lateral  resolution  of  less  than  one  mi¬ 
crometer.  The  scanning  scattering  microscope  (SSM),  which 
we  describe  here,  has  the  capability  of  filling  many  of  these 
needs;  we  have  used  it  to  produce  two-dimensional  maps  of 
surface  microroughness  and  defect  structure  with  subnanom¬ 
eter  roughness  sensitivity  and  with  micrometer  lateral  reso¬ 
lution.  The  images  generated  by  the  SSM  allow  us  to  visu¬ 
alize  very  subtle  variations  in  surface  microroughness  and 
defect  structure. 

The  SSM  is  a  recent  development  of  one  of  the  earliest 
optical  techniques  for  the  surface  roughness  measurement  of 
finely finished surfaces, the total integrating scattering (TIS)
method. This method is based on the concept that rough
surfaces  scatter  more  light  (into  off-specular  directions)  than 
smooth  surfaces.  The  information  content  of  the  off-specular 
scattered  light,  however,  is  quite  different  from  that  of  the 
specular  beam  so  that  optical  techniques  based  on  scattered 
light  measurement  (such  as  TIS)  are  characterized  by  the  rms 
roughness  of  the  surface  and  not  by  the  shape  or  actual 
height  of  surface  features.  In  its  original  implementation,  TIS 
measurements  are  done  in  a  static  mode  using  a  relatively 
large (~1 mm²) beam of light to obtain an average value of
surface roughness for the illuminated area. The concept behind
the SSM technique is to make use of the information
contained in the scattered light (TIS) but to use a microfocused,
scanned light beam in order to produce two-dimensional
maps of surface roughness and defect distributions.

The usefulness of the SSM technique for imaging lateral
variations in surface roughness was initially demonstrated
with a prototype instrument operating at atmospheric
pressure. Earlier measurements obtained in a static, unscanned
mode (i.e., the TIS method) showed that sensitive
microroughness measurements also could be made, in situ,
under ultrahigh vacuum conditions. In this article we describe
a newly designed, substantially improved version of
the original SSM concept.

The general form of a TIS scatterometer is shown in Fig.
1. The laser beam impinges on the sample at normal incidence;
the specular beam is reflected back to a photodetector
to measure $I_0$, the specular-light intensity. The scattered
off-specular light $I_s$ is focused by a hemispherical mirror
(sometimes called a Coblentz sphere) onto a photodetector. This
photodetector is located in the basal plane of the mirror,
at the same distance from the center of the mirror as the
illuminated spot on the sample, i.e., the photodetector and
the illuminated spot are at conjugate focal points of the
hemisphere. The ratio of the scattered-light intensity to the
specularly reflected light intensity is defined as the TIS.
Quantitatively, this technique is based on an approximate
theory which, in the smooth-surface limit ($R_q < 0.1\lambda$),
directly relates TIS to the rms roughness $R_q$ (in optics also
known as $\sigma$):

$$\mathrm{TIS} = I_s/I_0 = (4\pi R_q/\lambda)^2, \tag{1}$$

where $I_s$ is the off-specular light intensity, $I_0$ the specularly
reflected intensity, and $\lambda$ the wavelength of the laser light
source. With certain limitations, TIS measurements are
now widely used for surface roughness metrology and are
also used to identify surface defects as well as monitor thin-film
fabrication. The TIS method has been adopted by the
American Society for Testing and Materials as a standard test
method (F1048-87) for surface roughness.

Fig. 1. Optical scattering concept of TIS and the SSM.


The SSM technique which we are describing in this article
combines the features of TIS with those of the scanned
probe techniques. This has resulted in two major modifications
to the general TIS setup in Fig. 1.

(1) XY raster scanning of the sample, in a point-by-point
manner, so that TIS [or rms roughness calculated using
Eq. (1)] can be plotted as a function of sample position
to create two-dimensional roughness maps.

(2) Use of a focused laser beam to produce an illumination
spot size of micrometer dimensions for improved lateral
resolution. As a result, the power density of the microfocused
spot is much higher and, therefore, provides an
increased sensitivity to surface microroughness.


II. THE SSM APPARATUS

A  schematic  diagram  of  the  improved  SSM  is  shown  in 
Fig.  2.  It  is  composed  of  four  basic  elements:  an  illumination 
system,  a  beam  analyzer  system,  a  detection  system,  and  a 
sample  scanning  system. 

The illumination system consists of a highly stabilized
He-Ne laser (3 mW, single mode, polarized, power stability
better than 0.1%), a Faraday isolator, a spatial filter with
beam expander, an illumination lens, a cube beam splitter, a
quarter-wave plate, and a microfocusing lens (f = 110 mm).
The sensitivity of the stabilized laser to retroreflectance necessitated
the use of a Faraday isolator. The choice of spatial
filter, beam expander, and focusing lens was made so as to
minimize the size of the beam spot on the sample.

Fig. 2. Experimental configuration of the SSM.

The  beam  analysis  system  consists  of  a  telemicroscope,  a 
CCD  camera,  a  plate  beam  splitter,  and  a  beam  profile  ana¬ 
lyzer.  A  part  of  the  specular  beam  is  reflected  by  the  plate 
beam  splitter  into  the  telemicroscope  and  CCD  camera  of  the 
beam  analysis  system;  a  magnified  image  of  the  beam  can 
then  be  observed  and  its  beam  profile  analyzed. 

The detection system consists of a hemispherical mirror
(inner diameter = 132 mm), the detector for the scattered light
(PD1), the detector for the specularly reflected light (PD2), a
quarter-wave plate, and a focusing lens for PD2. The relative
sensitivity of PD1 and PD2 was calibrated. The hemispherical
mirror has a hole (diameter = 8.0 mm) positioned 8.0 mm
off axis. The beam enters the hole parallel to the axis of the
mirror. The hemispherical mirror focuses the scattered light
onto the photodetector PD1 placed at the conjugate focal
point of the mirror (Fig. 1). For collection of the off-specular
light, this geometry imposes a lower limit of ~3.5° on the
scattering angles. From the geometry of this instrument, the
upper-limit scattering angle would be close to 90°. Using a
one-dimensional grating equation, we estimate the bandwidth
to be between 0.095 and 1.6 μm⁻¹ (which corresponds
to surface wavelengths between 0.6 and 10.5 μm). It is difficult,
however, to define the upper bandwidth limit (for a
90° scattering angle) from the geometry alone, as the light
scattered to large angles may be reflected by the photodetector
surface and, therefore, would not contribute to the electronic
signal for $I_s$; the upper bandwidth limit may thus be
less than we have estimated. The quarter-wave plate changes
the polarization of the specular beam from circular to linear p
polarization so that the cube beam splitter is nearly transparent
to it. The specular light is then focused by the PD2 lens onto
the second photodetector (PD2). The signals from both photodetectors
are fed into an A/D converter and processed by
the computer. For the scattered light, there is an option to
amplify the signal 50× so that there is adequate dynamic
range for measuring weak signals.
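
The bandwidth quoted above can be reproduced approximately with a short calculation. This is a sketch that assumes the first-order grating equation at normal incidence, sin θ = λ·f, which the text cites but does not write out; reading the ~3.5° lower limit as the half-angle subtended by the 8.0 mm hole at the 66 mm mirror radius is also an assumption made here for illustration.

```python
import math

WAVELENGTH_UM = 0.6328  # He-Ne laser wavelength in micrometers

def spatial_frequency(theta_deg, wavelength_um=WAVELENGTH_UM):
    """Assumed first-order grating equation at normal incidence: f = sin(theta) / wavelength."""
    return math.sin(math.radians(theta_deg)) / wavelength_um

# Assumed geometric reading: hole radius (4.0 mm) seen from a distance of one mirror radius (66 mm).
hole_half_angle_deg = math.degrees(math.atan((8.0 / 2.0) / (132.0 / 2.0)))

f_low = spatial_frequency(3.5)    # lower scattering-angle limit
f_high = spatial_frequency(90.0)  # idealized upper limit

print(f"hole half-angle      : {hole_half_angle_deg:.2f} deg")
print(f"spatial frequencies  : {f_low:.3f} to {f_high:.2f} per micrometer")
print(f"surface wavelengths  : {1.0 / f_high:.2f} to {1.0 / f_low:.1f} micrometers")
```

The values come out close to the 0.095 to 1.6 μm⁻¹ range given in the text; small differences are expected from rounding of the limiting angle.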

The scanning system consists of XY-piezo drivers, a linear
position encoder of submicrometer resolution, and the
scanner holder with tilt and rotation. The piezo drivers and
the encoder operate in an open loop setup for the y direction
and in a closed loop for the x direction. They, as well as the
whole SSM system, are controlled by the SUPERSCOPE II
software* installed and customized on our computer. We
have found the scan linearity to be better than 1 μm over a
512×512 μm² scan.

The illumination lens (f = 1 m) (Fig. 2), when inserted in
the light path, serves to illuminate an area of about 1 mm² of
the specimen so that the specimen can be viewed directly on
the  beam  analyzer  display  and  regions  of  interest  located. 
The  specimen  can  be  moved  by  the  joystick  in  both  x  and  y 
directions  to  the  selected  region. 

An  alignment  procedure  has  been  devised  which,  once  it 


is  initially  established,  allows  for  routine  specimen  installa¬ 
tion  and  focusing.  Once  the  SSM  is  aligned  (this  includes 
focusing  the  telemicroscope  on  the  image  of  the  microfocus¬ 
ing  lens),  optimal  focusing  of  the  laser  beam  on  the  sample 
can  be  achieved  by  moving  the  sample  in  the  z  direction,  so 
as  to  obtain  the  smallest  round  spot  on  the  beam  analyzer 
display. 

Data acquisition is controlled by the SUPERSCOPE II
system, customized for our SSM. The maximum data acquisition
rate is 30 pixels per second. If, for each pixel, we
average 100 voltage measurements from each photodetector,
the acquisition time for taking parallel specular and off-specular
images (256×256 pixels with 1 μm step size) is
about 1 h. After collecting the data, some processing, statistics,
and wavefront manipulation can be immediately done
within the SUPERSCOPE II software. The data are then
loaded into an image analysis program and further processed;
"rms roughness maps" also can be created by our
application programs.

III.  RESULTS  AND  DISCUSSION 
A. Lateral resolution test

To evaluate the lateral resolution of the SSM (for the laser
focusing optics described here), we used a VLSI Standards
STM180A* calibration material. This specimen has
three gratings with 18 nm high steps made by VLSI technology.
The gratings are platinum-coated silicon-dioxide patterns
on a silicon die. We used gratings with periodicities of
20 and 10 μm. In Fig. 3(a), we present the scattered light
image of the 20 μm grating, i.e., 10 μm square pits (18 nm
deep) separated from each other by a 10 μm wide step. In
Fig. 3(b) an AFM image of the same area is presented; the
darker regions are deeper than the lighter ones by 18 nm.
This scan was taken in the contact mode using an Ultralever*
tip from Park Scientific.* All of the AFM measurements in
this work were done in air with a Park Scientific Autoprobe
LS system.*

If we overlay the two pictures in Fig. 3, we can clearly
interpret the SSM image. The laser light is scattered mainly
from the edges of the square pits in the VLSI pattern in Fig.
3(b). If we assume the same reflectivity from the bottom of
the pits as from the top of the pattern and that the profile of the
beam is Gaussian, then the contrast obtained from the square
features with the darker dots in their centers [Fig. 3(a)] corresponds
to a beam diameter [full width at half-maximum
(FWHM) of the intensity profile] of 8-10 μm. The
beam radius at the focal point of the microfocusing lens can
also be estimated from

$$r = A\lambda f/(\pi r_0), \tag{2}$$

where $\lambda$ (632.8 nm) is the laser light wavelength, $f$ (110 mm)
is the focal length of the microfocusing lens, $r_0$ (7.5 mm) is
the radius of the expanded beam, and $A$ is a constant correction
factor (≥1.00) balancing spherical aberration and diffraction
of the lens. Equation (2) gives a beam spot diameter
2r = 6-8 μm, which is in reasonable agreement with our
estimates from Fig. 3(a). The brighter spots on the edge of
the grating in Fig. 3(a) are due to the close proximity (~2
μm) of two or more scattering features (edges) that contribute
additional scattered light to the edge regions of the grating.

Fig. 3. Images of a part of a 20 μm grating (VLSI Standards STM180A*).
(a) The SSM scattered-light image (scan size is 86×86 μm²). The brighter
areas are from light scattered by the edges of the VLSI pattern. (b) An AFM
image of the same area as taken with the SSM. The dark areas are deeper
than the brighter ones. The small white spots are from particulates which
remain on the surface after cleaning.
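
As a quick check of Eq. (2), the sketch below evaluates the focused beam radius for the stated optics. The function name and the unit handling are illustrative; the correction factor A is treated as a free parameter because the text gives only its lower bound.

```python
import math

def focused_beam_radius_um(wavelength_nm, focal_length_mm, input_radius_mm, A=1.0):
    """Beam radius at focus from Eq. (2), r = A * lambda * f / (pi * r0), in micrometers."""
    wavelength_m = wavelength_nm * 1e-9
    focal_length_m = focal_length_mm * 1e-3
    input_radius_m = input_radius_mm * 1e-3
    radius_m = A * wavelength_m * focal_length_m / (math.pi * input_radius_m)
    return radius_m * 1e6

# Values stated in the text: 632.8 nm, f = 110 mm, r0 = 7.5 mm.
diameter_ideal = 2.0 * focused_beam_radius_um(632.8, 110.0, 7.5, A=1.0)

# Correction factor that would be needed to reach the upper quoted diameter of 8 micrometers.
A_for_8_um = 8.0 / diameter_ideal

print(f"2r for A = 1.00: {diameter_ideal:.2f} micrometers")
print(f"A giving 2r = 8 micrometers: {A_for_8_um:.2f}")
```

With A = 1 the diameter is just under 6 μm, consistent with the lower end of the quoted 6-8 μm range; larger A values account for the upper end.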

The  small  bright  spots  in  the  AFM  image  in  Fig.  3(b)  are 
due  to  particulates  on  the  sample  surface.  Many  of  these 
particulates,  however,  are  not  detected  by  the  SSM;  they  are 
not  evident  in  Fig.  3(a).  One  possible  explanation  as  to  why 
these  particles  are  not  strong  optical  scatterers  may  be  related 
to  their  optical  constants — they  may  be  essentially  transpar¬ 
ent.  Both  the  SSM  and  the  AFM  now  operate  in  an  air  envi¬ 
ronment  containing  many  dust  particles.  In  the  former  case, 
the  sample  is  mounted  in  a  vertical  plane,  but  in  the  latter 
case  it  is  horizontal  and  is  more  prone  to  dust  contamination. 
It  is  also  possible  that  the  sample  was  contaminated  with 
dust  particles  during  the  sample  transfer  from  the  SSM  to  the 
AFM  and  also  during  the  actual  AFM  measurements. 

Figure 4(a) is a scattered light image of the 10 μm grating,
i.e., 5 μm square pits separated from each other by 5 μm
terraces that are 18 nm high. For comparison and direct interpretation,
an AFM image of the same area is displayed in
Fig. 4(b). The bright areas in Fig. 4(a) come from the superposed
scattering of the four edges of the square pits, i.e., we
cannot resolve two strongly scattering features (the edges of
the pits) 5 μm apart or closer.

Fig. 4. Images of a part of a 10 μm grating (VLSI Standards STM180A*).
(a) The SSM scattered-light image (scan size is 51×51 μm²). The brighter
areas are from light scattered by the edges of the VLSI pattern. (b) An AFM
image of the same area as taken with the SSM. The dark areas are deeper
than the brighter ones. The small white spots are from particulates.

From the SSM measurements of the VLSI Standards*
sample and the above comments, we conclude that the
present lateral resolution of our instrument is between 5 and
10 μm and is mainly due to the imaging properties (long
focal length, small numerical aperture) of our microfocusing
lens.

B.  Test  of  surface  roughness  sensitivity 

It is possible to convert the scattered light intensity data
into rms roughness by using the familiar relationship for a
perpendicular incident beam

$$R_q = \frac{\lambda}{4\pi}\sqrt{I_s/I_0}, \tag{3}$$

which follows from Eq. (1).

From Eq. (3) and measurements of the noise level of the
SSM data acquisition system, we estimate the lower limit of
detectable rms roughness to be ~0.1 nm.
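
The conversion between TIS and rms roughness in Eqs. (1) and (3) can be sketched as below. The function names are illustrative, and the He-Ne wavelength of 632.8 nm is taken from the description of the apparatus; the relation applies only in the smooth-surface limit.

```python
import math

WAVELENGTH_NM = 632.8  # He-Ne laser wavelength

def tis_from_rq(rq_nm, wavelength_nm=WAVELENGTH_NM):
    """Eq. (1): TIS = I_s / I_0 = (4 * pi * R_q / lambda)^2."""
    return (4.0 * math.pi * rq_nm / wavelength_nm) ** 2

def rq_from_tis(tis, wavelength_nm=WAVELENGTH_NM):
    """Eq. (3): R_q = (lambda / (4 * pi)) * sqrt(I_s / I_0)."""
    return wavelength_nm / (4.0 * math.pi) * math.sqrt(tis)

# Scatter ratio corresponding to the estimated detection limit of about 0.1 nm rms.
tis_at_limit = tis_from_rq(0.1)
print(f"TIS at R_q = 0.1 nm: {tis_at_limit:.2e}")

# Round trip for a 0.5 nm surface.
print(f"R_q recovered: {rq_from_tis(tis_from_rq(0.5)):.3f} nm")
```

A roughness of 0.1 nm corresponds to a scattered-to-specular ratio of only a few parts per million, which illustrates why stray specular light and detector noise matter at this level.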
Fig. 5. An rms roughness map of the Si(100) surface in a three-dimensional
representation (scan size is 256×256 μm²). This image was constructed
from the ratio of the scattered light to the specular light, both taken simultaneously
by the SSM; the value of $R_q$ for each pixel was calculated using
Eq. (3). The calculated rms roughness of the Si surface is 0.5 nm. The five
peaks are due to surface particles.


We tested the surface roughness sensitivity of our SSM
experimentally with a very smooth Si(100) sample. The rms
roughness ($R_q$) of this sample as measured by our AFM was
0.2-0.3 nm. This rms roughness was obtained by scanning
over an area of the surface which was free of particulates.

In Fig. 5, we present an rms roughness map of the Si(100)
surface created from the ratio of the scattered off-specular
light intensity and the corresponding specular intensity using
Eq. (3); the size of the measured area is 256×256 μm². The
$R_q$ value for the dark regions in Fig. 5 is 0.5 nm. The five
bright spots most probably come from particulates sticking
to the silicon surface. The assumptions under which Eq. (3)
was derived are not applicable to the brighter areas, so in
this case the $R_q$ values cannot be interpreted in the same way
as for the dark area.

We remeasured the same area as in Fig. 5 with the scattered
light signal amplified 50 times and obtained an identical
image. The $R_q$ value, however, was found to be 0.8 nm
for the dark area.

Both $R_q$ values for the Si(100) specimen obtained by the
SSM are larger than those obtained by the AFM. There are
several possible explanations for this difference.

(1) These two instruments, basically, measure different
characteristics associated with surface roughness. For the
AFM, the tip radius and geometry greatly affect the range of
surface-wavelength features that can be accurately determined.
Surface wavelengths less than four tip radii are not
accurately traced and thus affect the AFM "rms roughness"
results. Similar restrictions also apply to the SSM, where the
laser light wavelength $\lambda$ affects the roughness sensitivity for
surface wavelengths shorter than $\lambda$. In essence, these two
techniques sample the roughness features of a surface
differently (i.e., the bandwidths of the two techniques are different)
and thus there is no a priori reason to expect that the roughness
data will be the same.

(2) A small fraction of the specular beam (after reflection
from the sample) may be scattered from the interfaces of the
optical elements, primarily from the first interface of the microfocusing
lens (i.e., back towards the sample and PD1),
and thereby could increase the signal collected by PD1.
Since the specular light intensity is typically three orders of
magnitude greater than that of the scattered light, even a very
small portion of the specular beam reflected back towards the
sample could account for the above difference in $R_q$ values.
This effect might be reduced by using antireflection coatings
on the microfocusing lens.

(3) PD2 in Fig. 2 may not provide an accurate value of I_0,
since the specular light intensity is attenuated by each optical
interface on its path away from the sample. We have measured
this attenuation and obtained an attenuation coefficient,
which was used to correct the intensity measured by PD2 in
order to obtain I_0. This procedure was complicated by
the fact that the laser beam is polarized, and thus the
transmission characteristics of the optical components depend on
the polarization state. Since different specimens, or different
areas of one specimen, can depolarize the incoming light
differently, the attenuation coefficient for I_0 may be specimen
dependent. This dependence of the attenuation factor on
specimen characteristics may account for part of the difference
between R_q values obtained with the SSM and the
AFM. A more accurate technique for determining I_0 is under
development.

(4) There may have been submicrometer low-scattering
particulates (too close to each other to be resolved with our
laser beam), which can increase the rms value over the
scanned area.
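The stray-light argument in explanation (2) can be made concrete with a short calculation. The sketch below assumes that Eq. (3) has the standard smooth-surface form for normal incidence, rms roughness = (λ/4π)·sqrt(I_s/I_0), where I_s is the scattered and I_0 the specular intensity; that form, the function names, and the use of 0.5 nm and 0.8 nm as example inputs are assumptions of this illustration rather than details taken from the text.

```python
import math

WAVELENGTH_NM = 632.8  # He-Ne laser wavelength used in the SSM


def rms_from_intensities(i_scattered, i_specular, wavelength_nm=WAVELENGTH_NM):
    """rms roughness (nm) from the assumed smooth-surface scatter relation."""
    return wavelength_nm / (4.0 * math.pi) * math.sqrt(i_scattered / i_specular)


def scatter_ratio_from_rms(rms_nm, wavelength_nm=WAVELENGTH_NM):
    """Inverse relation: the ratio I_s / I_0 that corresponds to a given rms roughness."""
    return (4.0 * math.pi * rms_nm / wavelength_nm) ** 2


def stray_fraction(rms_true_nm, rms_apparent_nm, wavelength_nm=WAVELENGTH_NM):
    """Fraction of I_0 that must be added to the scattered signal
    to raise the apparent rms roughness from rms_true_nm to rms_apparent_nm."""
    return (scatter_ratio_from_rms(rms_apparent_nm, wavelength_nm)
            - scatter_ratio_from_rms(rms_true_nm, wavelength_nm))


# Illustrative inputs: apparent roughness raised from 0.5 nm to 0.8 nm.
extra = stray_fraction(0.5, 0.8)
print(f"required stray fraction of I_0: {extra:.2e}")
```

Under these assumptions a back-reflected fraction of roughly 10^-4 of the specular beam is enough to shift the result by a few tenths of a nanometer, which is in line with the remark that even a very small portion of the specular beam matters.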

From the above measurements, we conclude that the
sensitivity of the SSM for quantitative measurements of rms
surface roughness is in the angstrom range. Additional
measurements on ultrasmooth silicon surfaces are in progress
and should provide us with a better estimate of the rms
roughness sensitivity.


C. Defects in a Ge thin film evaporated on SiO2/Si

A germanium thin film evaporated onto the native silicon
dioxide layer on silicon was measured by both the SSM and
the AFM. Due to the difference in lattice parameters, the thin
Ge films grown on SiO2/Si were expected to have many
structural defects that could be seen with the SSM. The same
areas of these samples were measured with the SSM and the
AFM in order to compare the two techniques directly.

We first looked at a 24.8 nm germanium thin film evaporated
on SiO2/Si. For direct comparison with the AFM, a
sample navigation procedure was devised:

(1)  A  mark  was  made  on  the  specimen  of  ~1  mm  length 
with  the  tip  of  a  scalpel. 

(2) We then examined the specimen with the auxiliary optical
microscope of the AFM to find a unique feature of
~100 μm size (a scratch, microcrack, dot pattern,
etc.) and measured its coordinates with respect to the
manually made mark.

(3) This unique feature was imaged with the telemicroscope
of the SSM, and SSM scans were made over an area
close to this feature. Figure 6(a) is an SSM scan over an


JVST  B  -  Microelectronics  and  Nanometer  Structures 




Lorincik  et  al.:  SSM  for  surface  microtopography  and  defect  imaging 




Fig. 6. Images of the surface of a 24.8 nm Ge thin film evaporated onto a
SiO2/Si substrate. The arrows, labeled by capital letters, point to defects
which give a strong scattered-light signal. (a) The SSM scattered-light
image (scan size is 75×65 μm²). The brighter areas are from light scattered by
defects in the Ge thin film. (b) An AFM image of the same area taken after
the SSM measurements.

area of 75×65 μm² close to a "natural" scratch (a
unique feature) on the specimen. A small fraction of
the scattered light signal from this scratch is on the right
side of Fig. 6(a); the actual scratch is about 10-20 μm
further to the right. For comparison with the AFM, we
chose three spots marked by arrows and labeled by
capital letters in Fig. 6(a) and measured their positions
with respect to the natural scratch.

(4) We were then able to precisely locate the region of
interest in the AFM. Measurements were made in both
contact and noncontact modes; reasonable agreement
was found between the two modes, so only the
contact mode AFM images will be presented here.
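The navigation procedure above amounts to a change of coordinates anchored on a shared reference feature. The following is a minimal sketch of that bookkeeping, assuming the two instruments differ only by a translation and an optional in-plane rotation; the function name, the rotation parameter, and all numerical values are hypothetical and serve only to illustrate the idea.

```python
import math


def to_afm_frame(offset_ssm_um, ref_afm_um, rotation_deg=0.0):
    """Convert a defect position measured relative to the reference feature
    in the SSM image into AFM stage coordinates.

    offset_ssm_um: (dx, dy) of the defect relative to the reference, SSM axes, in um
    ref_afm_um:    (x, y) of the same reference feature in the AFM frame, in um
    rotation_deg:  counterclockwise rotation of the SSM axes relative to the AFM axes
    """
    theta = math.radians(rotation_deg)
    dx, dy = offset_ssm_um
    x = ref_afm_um[0] + dx * math.cos(theta) - dy * math.sin(theta)
    y = ref_afm_um[1] + dx * math.sin(theta) + dy * math.cos(theta)
    return (x, y)


# Hypothetical offsets of three spots from the reference scratch (um).
spots = {"A": (-40.0, 12.0), "B": (-55.0, -8.0), "C": (-30.0, 25.0)}
reference_in_afm = (500.0, 300.0)  # hypothetical AFM stage coordinates (um)

for label, offset in spots.items():
    print(label, to_afm_frame(offset, reference_in_afm))
```

In practice the precision of the result is limited by how well the reference feature itself can be located in each instrument.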

Figure 6(b) is an AFM image of the same area as that
shown in the SSM measurement of Fig. 6(a). There are more
features (bright spots and dark ones) than in Fig. 6(a). Since
we located the position of the bright spot centers with respect
to the natural scratch in Fig. 6(a) (with a precision of ~1
μm), we were able to correlate unambiguously the morphological
defects in Fig. 6(b) with the scattering centers in Fig.
6(a). The features in Fig. 6(b) corresponding to the bright
spots in Fig. 6(a) are marked identically in both images. The
A defect and the C defect (the dark ones) are clearly
identifiable. The B defect (the bright one) looks like a feature


Fig.  7.  Small  area  scan  over  the  (A)  defect  in  Fig.  6(b).  Scans  (a)  and  (b)  are 
the  results  of  line  measurements  of  defect  (A). 

on top of the surface, as opposed to A and C. To get a more
detailed picture of the scattering centers, we took small-area
scans over them. As an example, we present a picture of the
A defect and the corresponding line scans in Fig. 7. This
defect is a pit ~40 nm deep and ~1.5 μm in diameter. The C
defect is a pit of the same depth but ~0.5 μm in diameter. The
B defect is a pit ~1 μm in diameter, but we could not measure
its depth since there was a particle sticking to the edge and
covering ~75% of its opening. As a result, this defect looks
like a bright spot in Fig. 6(b), and in this case the intense
light scattering is from a feature that is underneath the
particle, i.e., a pit or other subsurface defect. It seems clear that
the three features labeled in Fig. 6(a) are due to submicrometer
pits at the surface; the features responsible for the bright
spots seen by the AFM [Fig. 6(b)] but not by the SSM [Fig.
6(a)], however, do not seem to be strong optical scatterers,
probably because they are nearly transparent.

The SSM measurements of Ge/SiO2/Si samples indicate
that this technique is very sensitive to surface defects such as
pits and cracks and may, therefore, be very useful for
solving technological problems associated with thin film
growth.

IV.  CONCLUSION 

An optical scanning instrument, the SSM, has been developed
specifically for imaging lateral microroughness variations
at surfaces and interfaces. The capabilities of this instrument
were evaluated using three different test materials:
a VLSI Standards* patterned sample, a very smooth Si(100)
sample, and a deposited Ge thin film containing various surface
defects. Results obtained with these materials demonstrate
the very high sensitivity of this technique for evaluating
changes in microroughness at the subnanometer level.
This sensitivity is especially useful since it can be realized in
an instrument that operates at a large working distance (cm)
from the sample surface. A summary of the performance
parameters of the SSM is given in Table I. The SSM has a
unique capability for noncontact measurements of variations
in surface microroughness; in its current configuration the
spatial resolution is about 8 μm and the rms roughness
sensitivity is approximately a few tenths of a nanometer.

Table I. Scanning scattering microscope performance parameters.

  Parameter                               Value
  Light source                            He-Ne laser, single-mode, intensity
                                          stabilized, power 3 mW, λ = 632.8 nm
  Laser beam stability (intensity)        ~0.1%
  Laser beam angle of incidence           0°
    (from the surface normal)
  Focused laser beam spot size (FWHM)     ~10 μm
  Lateral resolution                      ≈10 μm (limited by beam spot size)
  Surface roughness sensitivity           in the range of 0.1-0.8 nm
    (minimal detectable roughness)
  rms roughness range (estimated)         0.1-30 nm
  Data acquisition speed                  256×256 pixels/h
  Scanning linearity in x and y           ~1.0 μm
  Working distance                        centimeters
  Bandwidth (estimated from the           0.095-1.6 μm⁻¹ (0.06/λ-1/λ)
    geometry of the SSM)
  Surface wavelength range (estimated)    0.6 μm-10.5 μm
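The bandwidth and surface-wavelength entries of Table I follow directly from the laser wavelength. The short sketch below reproduces them, taking the limits 0.06/λ and 1/λ from the table and treating the surface wavelength as the reciprocal of the spatial frequency; the function names are illustrative only.

```python
WAVELENGTH_UM = 0.6328  # He-Ne laser wavelength in micrometers


def ssm_bandwidth(wavelength_um=WAVELENGTH_UM, low_factor=0.06, high_factor=1.0):
    """Spatial-frequency limits (1/um) of the SSM, expressed as multiples of 1/wavelength."""
    return low_factor / wavelength_um, high_factor / wavelength_um


def surface_wavelength_range(f_min, f_max):
    """Shortest and longest surface wavelengths (um) covered by a frequency band."""
    return 1.0 / f_max, 1.0 / f_min


f_min, f_max = ssm_bandwidth()
shortest, longest = surface_wavelength_range(f_min, f_max)
print(f"bandwidth: {f_min:.3f} to {f_max:.1f} 1/um")
print(f"surface wavelengths: {shortest:.1f} to {longest:.1f} um")
```

The AFM band, by contrast, is bounded at short wavelengths by about four tip radii, so the two instruments integrate the roughness spectrum over different ranges.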
*In this article certain commercial equipment, instruments or materials are
As-terminated Si surfaces are model semiconductor interface systems with a wide range of
technological applications. We have studied As-terminated Si(001) surfaces with scanning
tunneling microscopy, which reveals that it is possible to produce a well-ordered surface. Some
characteristic types of defects do occur on this surface, however, which are found to be of particular
interest. These include antiphase domain boundaries, step edges, and long, atomically straight
trenches running perpendicular to the dimer rows across the surface. The nature and origin of these
defects are discussed. © 1996 American Vacuum Society.


I.  INTRODUCTION 

Arsenic termination of the Si(001)2×1 surface forms a
well-ordered passivating layer. With increasing miniaturization
of semiconductor devices, the depletion zones associated
with the surface become a significant problem. A passivating
layer such as As has the potential to remove such a
depletion region and enable further miniaturization. It is also
desirable to integrate the optical properties of GaAs with the
industrial capabilities of Si growth by growing GaAs structures
on Si substrates. Device structures including bipolar
transistors,^ solar cells,^ and heterojunction dipole lasers^
have been grown successfully, but unfortunately a high density
of dislocation defects, which may hinder device operation,
is found in the GaAs and is attributed to
strain arising from the lattice mismatch.^"^ In
these studies, it was found necessary to grow a thick buffer
layer to reduce the strain from the lattice mismatch. These
buffer layers can constitute over 50% of the thickness of the
device and, with the drive for smaller devices, the origin of
this strain is a matter of considerable interest. Growth of
GaAs is usually performed on a Si surface after formation of
a monolayer of As,"^ and it is therefore crucial to the
understanding of the heteroepitaxial growth of GaAs on Si to
understand the structure of the As-terminated Si surface, paying
particular attention to the defect structure on this surface,
which may seed defects in crystal growth on top.

II.  As  TERMINATED  Si(001)  SURFACE 

The (001) surface of Si forms a (2×1) reconstruction by
producing Si dimers in the [110] direction, which form rows
in the [1̄10] direction. Initial scanning tunneling microscopy
(STM) studies suggested symmetric dimers^’^ with the
dimers arranged in rows which are observed to be of the
order of 50 Å long. The surface has quite a high level of
defects, and buckled dimers were observed with the STM
near vacancy-type defects on the surface,^’^ which gave rise
to areas of local c(4×2) and p(2×2) symmetry. Later,
low-temperature STM studies supported asymmetric dimers,^
while photoemission studies have supported both symmetric,
covalent dimers^ and asymmetric, ionic dimers.^ Much work
has been done since to answer the question of whether the
dimers are symmetric or asymmetric.

When the surface is terminated with arsenic, the As atoms
form symmetric As dimers on the surface, as demonstrated by
angle-resolved photoemission studies,^®’^^ grazing incidence
x-ray diffraction investigations,^^ and core-level spectroscopy,
which indicates one site, i.e., symmetric As dimers.^^ The
reconstruction is shown in Fig. 1 and, because the As atoms
are fully coordinated, they create a stable, passivating
layer.^®’^^’^"^ The lowered surface energy of arsenic termination
of Si, therefore, inhibits interdiffusion and produces a
well-ordered, chemically inert, model surface.

An As-terminated surface can be produced by molecular
beam epitaxy (MBE) growth of arsenic on MBE-grown
Si(001)(2×1). For approximately monolayer (ML) coverages, STM
studies have reported very well-ordered rows of symmetric As
dimers with few defects. The vicinal surface has also
been observed with STM."^’^^"^^ These authors^^’^^ report
well-ordered rows of As dimers perpendicular to step edges
that are not smooth for samples grown in conditions similar
to ours, although it is possible to produce surfaces with the
dimer rows parallel to step edges by using specific growth
conditions. Total energy calculations of GaAs growth on
this surface^ ^ indicate that growth of one or two epitaxial
layers of GaAs on ideal, atomically flat Si(001) would be
hindered by rehybridisation of Ga and As in the first layer;
indeed, high-resolution transmission electron microscope
(HRTEM) and STM observations indicate that GaAs growth
occurs on the Si(001):As surface by nucleation and island
growth. The fact that growth is observed to be easier on the
vicinal surface suggests that step edges play an important
role in growth, and this, in turn, suggests that defects on the
flat surface might be of importance in the nucleation and
growth process.
Fig. 1. Ideal reconstructed surface of Si(001)2×1:As; the arsenic dimerizes
and forms rows.


III.  EXPERIMENT 

The details of the sample growth have been specified
elsewhere.^^ The n-type (0.006-0.015 Ω cm, Sb doped)
Si(001) substrates were thoroughly cleaned at 850 °C before
being allowed to cool under an As4 flux to 120 °C. At
this point, reflection high-energy electron diffraction
(RHEED) observations indicated a (2×1) pattern characteristic
of the As-terminated Si(001) surface. At a temperature
of 120 °C the As4 flux was shuttered and the sample was
allowed to cool. Arsenic was then deposited on the sample to
cap it so that it might be transported in air between laboratories.
Post-growth RHEED observations confirmed the As
overlayer to be of an amorphous nature, and Auger analysis
later confirmed the cap to be greater than 280 Å thick.^^

The STM measurements were carried out in an Omicron
STM instrument in UHV, operating at a base pressure of
<10^^ mbar. The system is fitted with a preparation chamber
and a fast entry port for rapid introduction of samples. The
main chamber is equipped with retractable four-grid rear-view
low-energy electron diffraction (LEED) optics. The
STM is designed for room-temperature measurements with
the tungsten tip held at ground potential and the sample
biased. Imaging was carried out in constant current mode at
positive and negative bias (tunneling into empty/out of filled
states in the As atoms, respectively).

Following introduction to the STM apparatus, the sample
was prepared by thermally desorbing the protective As cap
by heating to 400 °C for 60 min in ultrahigh vacuum (UHV).
A very sharp 2×1 LEED pattern was observed; this process
is known to leave a single ML of As terminating the
surface. The sample was then placed in the STM chamber
and imaging proceeded.

IV.  OBSERVATIONS  AND  DISCUSSION 

Figure 2 shows a large 1000×700 Å area of the sample
surface, with rows of symmetric As dimers that are
often atomically straight for over a hundred angstroms across
the surface. There are islands of undesorbed arsenic left on
the surface but very few vacancy or interstitial defects, which
represents much better ordering on this surface than on the clean
Si(001)(2×1) surface, in agreement with previous work.^^
The As islands cover about 20% of the surface, which is
considerably more than in other work^^’^^ where As was
deposited on the vicinal surface using a K-cell in situ. We
attribute our larger island coverage to our different preparation
technique, in which there is a lot of As present on the
surface to start. The islands might have been desorbed more
completely if a longer anneal had been used. Step edges are
observed to be ragged, and the dimer rows are ordered
perpendicular to each other on adjacent terraces, indicating single
steps.

Fig. 2. Area of sample 1000×700 Å imaged with bias +2 V and tunneling
current of 1 nA. Image shows long straight dimer rows, a few undesorbed
arsenic islands, and trench defects. Most of the length of the step edge that
is visible can be seen to be type A, like our model (Fig. 3) discussed in the
text, with only a few areas of type B step.

Following Chadi’s^^ classification of steps on the clean
Si(001)2×1 surface as types A and B, we see, in Fig. 2,
many examples of steps which are similar to type A, in that
rows of arsenic dimers are perpendicular to steps up and
dimer rows run parallel to the edge of steps down. We
measure the steps to be 1.8 Å high on average, which agrees with
a single-step model, and find that the horizontal distance,
perpendicular to a step, between the step edge and the start of
a row of dimers on the lower terrace is about 11.4 Å. These
dimensions agree with a simple model of a single step with
missing dimers at the base of the step, which we show in Fig.
3 with the appropriate experimentally measured distances
indicated. In contrast, we see proportionally far fewer
examples of steps that are like B-type steps on the surface and


Fig. 3. Suggested model of the single steps found predominantly on this
Si(001)2×1:As surface, shown in plan view and side view. The experimentally
determined distances discussed in the text are indicated in their appropriate
places. (Legend: top layer As, next layer As, 1st layer Si, 2nd layer Si.)



Jackson et al.: STM study of As-terminated Si(001) surfaces




Fig.  4.  (a)  STM  image  of  step  edge  taken  with  bias  -2  V  and  tunneling 
current  of  1  nA.  Line  profiles  taken  across  the  dimers  on  the  (b)  upper  and 
(c)  lower  terrace,  shown  to  the  same  vertical  scale.  The  dimers  appear  to  be 
asymmetric  on  the  upper  terrace  but  appear  symmetric  on  the  lower. 


Fig. 5. (a) Ball-and-stick model of an in-phase missing-dimer trench,
assuming that the Si atoms in the trench dimerize to reduce the energy of
their dangling bonds. The experimentally determined width of the trench is
shown. (b) Ball-and-stick model of an antiphase domain boundary or scar-type
defect. (Legend: top layer As, 1st layer Si, 2nd layer Si.)


find that they are often accompanied by nearby trench
defects or As islands. Our results thus indicate that this surface
preferentially forms steps that are similar to A-type single
steps on the clean Si(001)2×1 surface, which is in agreement
with previous work.^^’^^

We observe a low density of steps overall on the surface,
which leads to very large, atomically flat areas. These
terraces are generally about 1500 nm² in extent,
with some terraces being over 2100 nm². The step edges
allow us to make an interesting observation about the STM
as a tool. Figure 4 shows a poor-resolution image of a step
edge; the image is so poor that the dimer rows on the lower
terrace are difficult to distinguish, but they are seen more clearly
when line profiles are taken. The line profiles show apparently
asymmetric dimers on the upper terrace, on the left,
while on the adjacent terrace, separated by one step down, the
dimers exhibit no asymmetry. This occurred only on a small
set of scans, and we attribute it to an asymmetry in the tip.
Although similar effects, seen in an STM study of Si(001)
2×1,^^ were attributed to asymmetric and symmetric dimers
existing on the surface due to the near-degeneracy of their
energies, we do not believe that to be the explanation here,
owing to the strong theoretical and experimental evidence that
the As dimers are symmetric.

The defects that are most obvious on the surfaces are the
atomically straight trenches, which are perpendicular to the
dimer rows and are extremely long, extending over 850 Å in
some cases. Similar trenches have been observed before on
the vicinal surface where the dimer rows are parallel to the
step edges. In our experiment, the absence of steps on the
samples allows us to see the extraordinary length of the
trenches. We observed the trenches to be about 0.6 Å deep
and about 10.9 Å wide, dimensions which are consistent
with a single missing-dimer type defect [Fig. 5(a)]. In Fig.
6, we show an example of a trench, on the right, along with
an antiphase domain boundary on the left, which is consistent
with the simple model shown in Fig. 5(b). In the trench shown
in Fig. 6, the dimer rows are in-phase on either side of the
trench, but we also observe examples where the rows are
out-of-phase on either side of the trench. Since we find both
in-phase and out-of-phase trenches on the surface in equal
amounts, and we see simple antiphase domain boundaries
such as that in Fig. 6, we conclude that the trenches do not
form to accommodate antiphase domains. Nor are these
trench defects simply steps down and steps up close together
since, as we have demonstrated above, the surface has a
preference for the dimer rows to be aligned along the step edges,
which is the opposite of what is seen at the trench edges. Furthermore, the


Fig. 6. STM image of area approximately 300×150 Å taken at sample bias
of -^2 V with tunneling current of 1 nA. Image shows two types of defect:
an antiphase domain boundary on the left and an in-phase trench on the
right.
trenches do not resemble the disordered dimer channels
induced by Ni on Si(001)^'^ by virtue of their extreme order.

Similar trench-like structures have been reported on
N-terminated Cu(001) surfaces^^ and on other semiconductor
systems. Trenches up to 200 Å long have been observed on
the Ge(111):As surface, trenches were reported on the
Si(001)2×1:P surface, well-ordered trenches separated
by 8 to 15 dimers have been reported on Si(001):Ge,^^ and in
the study of Si(001):Bi^^ it was also found that very well
ordered trenches form. In that study, the samples gave LEED
patterns of (2×n); examination by STM revealed trenches
separated by n dimers, and n was found to vary from 5 to 12
depending on the annealing temperature.^^ We note that in
our specimens, the unequal separation of the trenches seen in
Fig. 2 is confirmed by the (2×1) LEED pattern. We also note that
the As-terminated surface only forms trenches perpendicular
to the dimer rows, in contrast to the P-terminated surface
where both perpendicular and parallel trenches are seen.^^
Furthermore, on the P-terminated surface the trenches were
often observed to continue over step edges for short
distances.^^ This seems to happen only occasionally on the
As-terminated surface, with just a couple of examples to be
seen in Fig. 2. The proposed missing-dimer model for the
perpendicular trenches on the Si(001)2×1:P surface
suggested two missing dimers,^^ whereas our measurements have
indicated that a single row of missing dimers is most likely.
Any differences between the P- and As-terminated surfaces
are probably due to the different atomic sizes of phosphorus
and arsenic.

In all these previous examples, the trenches were believed
to be strain relief mechanisms. We therefore conclude that
the trenches seen here form as a mechanism for strain relief
due to the incommensurate nature of the As overlayer. We
note that no such defects have been reported in studies of 4°
vicinal Si(001) where the dimer rows run perpendicular to
the step edges, and attribute this to the terraces on this
surface being only 40-50 Å wide, allowing strain to be
relieved on the terrace by relaxation over the edge of a step. In
our study, the separation of the trenches varies from 10 to 35
dimers. This is larger than reported on the other systems,^^’^^
showing that the trench separation depends on preparation
and not only on the extent of lattice mismatch.
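The length scales in this argument can be compared with a few lines of geometry. The sketch below uses the bulk Si lattice constant (5.431 Å) and assumes one dimer per surface lattice spacing along a dimer row and double-height steps on the 4° vicinal surface; the step-height assumption and the function names are not taken from the text and are used only for illustration.

```python
import math

SI_LATTICE_A = 5.431                             # bulk Si lattice constant (angstrom)
SURFACE_SPACING_A = SI_LATTICE_A / math.sqrt(2)  # ~3.84 A, spacing of dimers along a row
SINGLE_STEP_A = SI_LATTICE_A / 4                 # ~1.36 A, ideal single-layer step height


def terrace_width(miscut_deg, step_height_a):
    """Mean terrace width (angstrom) of a vicinal surface with regularly spaced steps."""
    return step_height_a / math.tan(math.radians(miscut_deg))


def trench_separation(n_dimers):
    """Distance (angstrom) along a dimer row spanned by n_dimers dimers."""
    return n_dimers * SURFACE_SPACING_A


# Assumed double-height steps on the 4 degree vicinal surface.
print(f"terrace width: {terrace_width(4.0, 2 * SINGLE_STEP_A):.1f} A")
print(f"trench separation: {trench_separation(10):.1f} to {trench_separation(35):.1f} A")
```

With these assumptions the vicinal terraces come out near 39 Å wide, close to the 40-50 Å quoted, while a trench spacing of 10 to 35 dimers corresponds to roughly 38 to 134 Å. A step edge is therefore typically reached before a trench would be needed, which is consistent with the explanation given above.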

Since the trenches are found only in the [110] direction,
the structure must be strained only in the [1̄10]
direction. This is physically reasonable since, to dimerize the
surface layer, atoms have to move towards each other to
achieve a bonding distance, the energy of repulsion of the
sp³ bonds on the Si atoms being offset by the free-energy
loss of the dimer atoms' dangling bonds being removed.
Thus, when the silicon surface has to accommodate the
relatively large arsenic atom, it is quite easy for it to relax
towards the preferred bond angle of 109° in the direction along
the bond, i.e., the [110] direction (see Fig. 1). In the [1̄10]
direction, however, no such relaxation is possible, which
means that after a certain number of dimers have been added
to a row, the surface must relieve the strain by the formation
of trenches. This hypothesis also explains the missing As
dimers along the bottom of the step in the model shown in
Fig. 3, there being both strain and steric hindrance effects
preventing arsenic adatoms from sticking at the base of the
step.

V.  CONCLUSION 

We have shown a number of defects that occur on the
otherwise well-ordered Si(001)2×1:As surface. We suggest
models of step edges, antiphase boundaries, and trenches
from our STM images of these defects, and we propose that
the trenches are formed as a means of strain relief due to the
incommensurate nature of the arsenic overlayer.
Simultaneous imaging of Si(111) 7×7 with atomic resolution in scanning
tunneling microscopy, atomic force microscopy, and atomic force
microscopy noncontact mode
The reconstructed Si(111) 7×7 surface was imaged in several operation modes of the combined
ultrahigh vacuum atomic force microscope/scanning tunneling microscope. By imaging single-atom
defects on the sample surface, a clear proof of the atomic resolution in noncontact mode of the force
microscope was possible. By simultaneous measurements of several interaction parameters and by
the investigation of force-distance curves, it was possible to explain the origin of the
interaction. © 1996 American Vacuum Society.


I.  INTRODUCTION 

The 7×7 reconstruction of the Si(111) surface was analyzed
by scanning tunneling microscope (STM) measurements
in ultrahigh vacuum (UHV).^ This structure often
shows single-atom defects. By imaging these defects, the true
atomic resolution of a microscope can be proven. In the STM
mode, atomic resolution on Si(111) can be achieved
easily. With an atomic force microscope (AFM) it is much
more difficult to get similar resolution. Most "atomic resolution
images" by AFM do not show single-atom defects or
clearly resolved steps, so they represent pseudo-atomic resolution
produced by multitip imaging. AFM images of a monoatomic cleavage
step on NaF^ showed a lateral step width of 1 nm, although
the regular lattice pattern could be imaged on the terraces.
This step width indicates that no true atomic resolution was
achieved.

The first AFM measurements on Si(111) were done in contact
mode. The adhesion force of the point contact between the
clean Si surface and the silicon tip is up to 10^ nN. ^ This
large force destroys the surface reconstruction. The adhesion
forces could be reduced by coating the cantilever with
polytetrafluoroethylene (PTFE). ^ With such a tip, it was possible
to image the atomic periodicity and the corner holes of the Si
surface, but, due to friction effects, no true atomic resolution
was possible.

The first measurements in AFM noncontact mode ^ on the
Si(111) surface were able to resolve the 7×7 reconstruction for
a very short period of time, ^ but the quality of the image was
much worse than that of typical STM images. AFM noncontact
mode also made it possible to image the InP(110) surface
with atomic resolution. ^

II.  EXPERIMENTAL  SETUP 

The  Si  (111)  7X7  surface  can  only  be  prepared  in  UHV. 
Therefore,  a  two  chamber  UHV  system  was  used  (pumped 
by  turbo  pump,  ion  getter  pump,  and  titanium  sublimation 
pump).  The  pressure  in  the  analysis  chamber  was  below 
5X10“^^  mbar  during  the  experiments.  For  sample  and  tip 
preparation, the system was equipped with a sputter ion source
and a sample heater for direct or resistive heating. A low-energy
electron diffraction (LEED) system was available to check the
sample preparation.

The experiments were made with a standard commercial instrument
(OMICRON UHV AFM/STM), which is based on a design of Howald and
co-workers. It uses the widely acknowledged beam deflection
technique. The crucial alignment of the light beam onto the
cantilever and the position sensitive detector (PSD) is achieved
using two mirrors mounted on ball motors. In this setup, the
sample is scanned and the force detection is fixed. For noncontact
mode operation, an oscillator circuit drives the cantilever at its
resonance frequency, which is detected by a frequency modulation
(FM) detector. The principle of frequency detection was described
by Albrecht and co-workers. A snap-in protection measures the
damping of the cantilever and retracts the tip before it comes too
close to the surface.

The control electronics (OMICRON Scala System) allows a choice of
several operation modes. The tunneling current, the cantilever
deflection, or the frequency shift can be selected as the input
source for the distance regulation. A topography image of the
sample surface is recorded by measuring the z position of the tip
with active feedback. If a slow feedback is selected, the tip
cannot follow the atomic corrugation and the z position of the tip
stays nearly constant. In both operation modes, several input
signals such as tunneling current, cantilever deflection, and/or
frequency shift can be recorded simultaneously. In spectroscopy
mode, the scanning is stopped at each scan point and a
spectroscopy curve is measured.

For the experiments, single-crystal silicon cantilevers were used.
They have a force constant in the range of 10 N/m and a resonance
frequency of about 300 kHz. The doping of the silicon is
sufficient to use the cantilevers for STM experiments. The
cantilevers were cleaned in situ by sputtering with an argon ion
beam.
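
The frequency modulation detection described above senses the
tip-sample interaction through a shift of the cantilever resonance.
The following Python sketch illustrates the usual small-amplitude
estimate, delta_f = -(f0 / 2k) dF/dz, which is an assumption added
here and is not stated in the article. It uses the nominal cantilever
values quoted above (k = 10 N/m, f0 = 300 kHz); the function names
and the example shift of -100 Hz are hypothetical. For the large
oscillation amplitudes used later in the article the relation is
only a qualitative guide.

```python
def frequency_shift(f0_hz, k_n_per_m, force_gradient_n_per_m):
    """Small-amplitude estimate of the resonance shift in Hz.

    force_gradient_n_per_m is dF/dz, with z the tip-sample distance
    and F counted positive when repulsive. An attractive force that
    grows on approach has dF/dz > 0 and lowers the frequency.
    """
    return -f0_hz / (2.0 * k_n_per_m) * force_gradient_n_per_m


def force_gradient_from_shift(f0_hz, k_n_per_m, delta_f_hz):
    """Invert the estimate above: force gradient (N/m) for a given shift."""
    return -2.0 * k_n_per_m * delta_f_hz / f0_hz


if __name__ == "__main__":
    # Nominal cantilever values from the text; the shift is illustrative.
    gradient = force_gradient_from_shift(300e3, 10.0, -100.0)
    print(f"force gradient for a -100 Hz shift: {gradient:.4f} N/m")
```

With a stiff lever (10 N/m) and a high resonance frequency, a shift
of some tens of Hz corresponds to a force gradient that is a small
fraction of the cantilever stiffness, which is why a sensitive FM
detector is needed.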

Fig. 1. Left: STM image (10×10 nm) of a Si(111) 7×7 surface measured with a conductive cantilever. Right: simultaneous measurement of the interaction
force between tip and surface during the tunneling process (white = lower attractive force).


III.  RESULTS 

The combined AFM/STM was calibrated by STM measurements with a
tungsten tip on Si(111) 7×7. This measurement was also used to
verify the quality of the surface preparation. For all other
experiments described in this article, conductive silicon
cantilevers were used. STM measurements on Si(111) using these
conductive cantilevers as tunneling tips resolved the atomic
structure of the 7×7 surface reconstruction. These images showed
an atomic corrugation of up to 0.5 nm, which is much larger than
that known from normal STM images with tungsten tips. This large
corrugation can be explained by the interaction force bending the
cantilever. To verify this effect, the deflection of the
cantilever was recorded simultaneously with a cantilever STM
image. The STM image was recorded in constant current mode (active
feedback) at a tunneling current of 0.5 nA and a gap voltage of
2 V (sample positive). The cantilever was bent towards the
surface, which indicates that the interaction force was
attractive. At the positions of the atoms the interaction force
was about 0.8 nN smaller than at the positions of the holes. Both
images are shown in Fig. 1.
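
To put the measured force contrast on a length scale, the sketch
below converts a force difference into a cantilever deflection with
Hooke's law. It assumes a linear spring loaded at the tip and uses
the nominal force constant of 10 N/m; the exact constant of the
lever used for Fig. 1 is not given in the article, and the function
name is hypothetical.

```python
def deflection_m(force_n, k_n_per_m):
    """Static deflection (m) of a linear spring under a given force."""
    return force_n / k_n_per_m


if __name__ == "__main__":
    # 0.8 nN force difference between atoms and holes, nominal k = 10 N/m.
    dz = deflection_m(0.8e-9, 10.0)
    print(f"deflection difference: {dz * 1e9:.2f} nm")
```

Under these assumptions the atom-to-hole force difference bends the
lever by a distance that is a sizable fraction of typical atomic
corrugations, so the bending has to be considered when cantilever
STM images are interpreted.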

The same silicon sample was later imaged in AFM noncontact mode.
It was possible to image the atomic reconstruction with
measurement parameters similar to those described in the
literature. For these experiments, a cantilever with a force
constant of 10 N/m and a resonance frequency of 290 kHz was used.
The feedback setpoint was adjusted to -95 Hz and the peak-to-peak
oscillation was approximately 20 nm. The atomic structure can be
seen over the entire image but, because of the large tip-surface
separation with these parameters, the lateral resolution of such
images was not satisfactory.

Another set of experiments combined STM and AFM noncontact mode.
The Si(111) surface was first imaged with a conducting cantilever
in STM mode. It was possible to detect the resonance frequency of
the cantilever during the tunneling process although the
oscillation was not excited by the piezo. This thermal oscillation
of the cantilever, with an amplitude of approximately 0.03 nm, was
not visible in the force signal, but the high sensitivity of the
FM detector makes it possible to measure the frequency of this
small vibration (see Fig. 2). The interaction force was
attractive, resulting in a decrease of the resonance frequency. At
the position of the atoms the attractive force was stronger
(Δf = -145 Hz) than at the position of the holes (Δf = -12 Hz).
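
The quoted thermal amplitude of about 0.03 nm can be checked against
the equipartition theorem. The sketch below treats the cantilever as
a single harmonic oscillator, so that (1/2) k <z^2> = (1/2) k_B T.
The temperature of 300 K and the use of the nominal force constant
of 10 N/m are assumptions, and the function names are hypothetical.

```python
import math

K_BOLTZMANN = 1.380649e-23  # J/K


def thermal_rms_m(k_n_per_m, temperature_k):
    """RMS thermal displacement (m) of a harmonic oscillator."""
    return math.sqrt(K_BOLTZMANN * temperature_k / k_n_per_m)


def thermal_amplitude_m(k_n_per_m, temperature_k):
    """Amplitude (m) of a sinusoid with the same RMS displacement."""
    return math.sqrt(2.0) * thermal_rms_m(k_n_per_m, temperature_k)


if __name__ == "__main__":
    amp = thermal_amplitude_m(10.0, 300.0)
    print(f"thermal amplitude: {amp * 1e9:.3f} nm")
```

For k = 10 N/m at room temperature this estimate gives an amplitude
between 0.02 and 0.03 nm, of the same order as the value reported
above.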

The cantilever was then excited to an oscillation of about 20 nm
peak to peak at the resonance frequency. If the tip is close
enough to the surface and a voltage is applied between tip and
sample, a current can be detected. This current can be explained
by the tip touching the surface at the closest point of the
oscillation. The current increases for smaller distances, which
indicates that the tip is touching for a longer fraction of the
oscillation period. The expected periodic maxima in the tunneling
current were not detected; only the average current was measured
because of the limited bandwidth (30 kHz) of the current
amplifier. Figure 3 shows the distance dependence of the point
contact current and the simultaneously measured shift of the
resonance frequency of the cantilever. The data were taken during
a tip approach.

A voltage of 2 V (sample positive) was applied. The current
increases linearly up to 50 nA, which is the saturation level of
the current amplifier. The resonance frequency first decreases
because of the attractive van der Waals and electrostatic forces.
However, as soon as the tip starts to touch the surface, the
frequency increases again because of the repulsive van der Waals
forces.

It is possible to image the Si(111) 7×7 surface reconstruction by
measuring this point contact current. Furthermore, the spatial
display of the frequency shift leads to a high resolution image if
the tunneling current feedback is virtually zero. The frequency is
shifted to lower values by the attractive van der Waals force and
the electrostatic force. Figure 4 shows a smaller decrease of the
frequency at the position of the atoms than at the position of the
corner holes. The difference in resonance frequency between atoms
and holes is 20 Hz; the average frequency shift was -680 Hz. This
effect can be explained by a repulsive van der Waals force between
the surface atom underneath the tip and the very end of the tip.
This force is added to the attractive van der Waals forces and the
Coulomb force, which show no or only very small local variations.
The repulsive van der Waals forces indicate that the tip is very
close to the surface, and therefore the quality of the image is
similar to that of STM images of the Si(111) 7×7 surface.

Fig. 2. Left: STM image (11×11 nm) of a Si(111) 7×7 surface measured with a conducting cantilever. Right: simultaneous measurement of the frequency
shift of the thermal oscillation of the cantilever (white = lower attractive force).

A high quality image of the Si(111) 7×7 reconstruction in AFM
noncontact mode is possible with parameters similar to those used
for the image described above. Again, a voltage (1.9 V) was
applied between tip and sample, but now the frequency shift was
used as the input to the distance regulation. The setpoint was
adjusted to a frequency shift of -870 Hz and the feedback was too
slow to follow the atomic corrugation. An image taken with these
parameters is shown in Fig. 5. At the position of the atoms
(Δf = -862 Hz) the attractive force is again lower than at the
position of the holes (Δf = -872 Hz).

The tip should follow the atomic structure if the feedback speed
is increased. However, because of the repulsive forces at the
position of the atoms, this is not the case. If the feedback loop
tries to keep the frequency shift constant, the tip has to come
closer to the surface at the position of the atoms. This effect
inverts the image of the atomic structure, as could be seen in the
measurements. Because of the inversion of the slope in the Δf(z)
curve (see Fig. 3), it was not possible to get a stable image in
constant frequency shift mode. The instability of the distance
regulation often leads to a snap-in of the cantilever, which then
destroys the tip.

Fig. 3. Point contact current and frequency shift as a function of tip height
(x = frequency shift, square = point contact current).

IV.  CONCLUSION 

For the first time, it was possible to image the Si(111) 7×7
surface reconstruction simultaneously in several operation modes
of the combined AFM/STM, such as STM combined with measurement of
cantilever deflection or resonance frequency. The images
demonstrate that the atomic force microscope is capable of
achieving a resolution similar to that of an STM.

Fig. 4. AFM noncontact mode image (9×9 nm) of Si(111) 7×7 with slow
STM feedback (white = lower attractive force).

Fig. 5. AFM noncontact mode image (10×10 nm) of Si(111) 7×7 with
slow feedback on frequency shift (white = lower attractive force).

The quality of the images was very reproducible once the right
parameters had been found. It was possible to image the Si(111)
surface several times and with several tips with similar image
quality. The tip preparation is important to get these results.
Even more crucial is the small gap distance between tip and sample
surface, which can be controlled perfectly by the tunneling
current. It was already possible to repeat the experiment with the
same tip preparation and the same imaging parameters in another
laboratory.

The results can be explained qualitatively by superimposing van
der Waals forces and electrostatic forces. The AFM noncontact mode
(constant frequency shift imaging) is only possible if the
resonance frequency decreases with decreasing gap distance. For
typical interaction potentials this is only true for larger
distances; for smaller distances the resonance frequency increases
again because of repulsive forces. If an attractive electrostatic
force is superimposed, the turnover point is moved to smaller
distances. Noncontact mode images are therefore possible for
smaller gap widths than without an applied voltage, which leads to
better lateral resolution.
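
The shift of the turnover point can be illustrated with a simple
model. The Python sketch below is not the authors' calculation: it
assumes a sphere-plane van der Waals attraction, a power-law
short-range repulsion, a sphere-plane electrostatic attraction, and
the small-amplitude relation delta_f = -(f0 / 2k) dF/dz. The tip
radius, Hamaker constant, and zero-bias turnover distance are
hypothetical values chosen only to show the trend.

```python
import numpy as np

EPS0 = 8.854e-12  # vacuum permittivity, F/m


def force_gradient(z, voltage, radius=10e-9, hamaker=1e-19, z0=0.3e-9):
    """dF/dz (N/m) of the assumed model forces, F positive = repulsive.

    F_vdw = -hamaker * radius / (6 z^2)     (sphere-plane attraction)
    F_rep = +b / z^8                        (assumed short-range repulsion)
    F_el  = -pi * EPS0 * radius * V^2 / z   (sphere-plane, z << radius)
    b is chosen so that the turnover lies at z0 when voltage = 0.
    """
    b = hamaker * radius * z0**6 / 72.0
    vdw = hamaker * radius / (3.0 * z**3)
    repulsion = -8.0 * b / z**9
    electrostatic = np.pi * EPS0 * radius * voltage**2 / z**2
    return vdw + repulsion + electrostatic


def turnover_distance(voltage, f0=300e3, k=10.0):
    """Distance (m) at which the frequency shift is most negative."""
    z = np.linspace(0.15e-9, 2.0e-9, 20001)
    delta_f = -f0 / (2.0 * k) * force_gradient(z, voltage)
    return z[np.argmin(delta_f)]


if __name__ == "__main__":
    for v in (0.0, 2.0):
        print(f"V = {v:.1f} V: turnover at {turnover_distance(v) * 1e9:.3f} nm")
```

In this model the applied voltage adds a force gradient that keeps
growing on approach, so the minimum of the frequency shift curve
moves closer to the surface, in line with the qualitative argument
above.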

The electrostatic forces have a long interaction range, so that
this interaction is not modulated on an atomic scale. The local
variations of the resonance frequency are caused only by van der
Waals forces. By varying the tip-sample distance, it is possible
to get a higher frequency on top of the atoms if the tip is very
close, or a lower frequency on top of the atoms if the tip is
further away. A more quantitative explanation of the results can
be made by measuring spectroscopic data with spatial resolution,
such as I(z), I(V), F(z), F(V), Δf(z), or Δf(V) curves. These
experiments are planned for the future. To understand the
interaction in more detail, a theoretical simulation of the
measurements would be very helpful.

The complex pattern of the nanowire skeletons of different light emitting porous silicon structures
is investigated by transmission electron microscopy (TEM) and atomic force microscopy (AFM).
Diffraction lines and dark field images are used to identify and determine the long-range order of
the crystallites. TEM images give the size and particle orientation, and AFM images show a
three-dimensional pattern formed by an interconnecting skeleton of particles. Near infrared
photoluminescent porous silicon (0.006 Ω cm) structures show a skeleton of nanosized silicon
aggregates which form domains of spatially oriented crystallites. For red photoluminescent samples
(4.9 Ω cm) the electron diffraction spots are discontinuously split into tiny intensity maxima. The
diameter of the wire structure forming porous silicon as measured by TEM allows us to estimate the
distortion of the AFM images due to the finite size of the tip radius. A critical angle
$\alpha_0 = 2\arctan[K/(1-K)]$, where $K$ is the ratio of the height of the structure to the tip
diameter, was defined and it was shown that for structure walls steeper than $\alpha_0$ the
distortion may be substantial.

I. INTRODUCTION

A debate exists on the precise processes responsible for porous
silicon (PS) photoluminescence. A better understanding of the
complex structure of skeletons consisting of nanosized particles
in different light emitting PS species is required. Increased
control over formation morphologies is also of fundamental
importance for many applications proposed for these materials. The
most adequate techniques to investigate the atomic structure of
metal and semiconductor surfaces are provided by scanning
tunneling microscopy (STM) and TEM. STM measurements on films
thicker than 500 nm revealed instabilities in the tunnel current,
so that the surface topography could not be definitely determined.
In a recent paper, we reported TEM results which reveal the
nanostructure of luminescent PS. PS surfaces have previously been
investigated by AFM and STM. Theories of image formation have been
reported, the most prevalent being tip-sample convolution, which
is based on purely geometrical considerations and assumes a
strictly stiff cantilever with only one degree of freedom, in the
vertical direction. The essence of the theory is that each
observed image contains some information about the tip in addition
to the information about the sample.

In the experiments described below, we exploit the capabilities of
AFM combined with TEM to examine the nature of the structure of PS
layers. Moreover, in order to gain a better understanding of the
image formation mechanism in AFM, a comparison of AFM and TEM
images is also performed. To this end, we report AFM studies of
the three-dimensional pattern formed by the PS layer, while TEM
studies give the size and orientation of the nanoparticles.

Electronic mail: ross@ifi.unicamp.br

II.  EXPERIMENT 

Single-crystal, polished (100) and (111) wafers of doped (p- and
n-type) silicon with resistivities of 0.006, 0.4, and 4.9 Ω cm
were cut into rectangles with areas of approximately 1.0 cm². The
etching bath was a 50:50 (by volume) solution of 48% HF (Merck)
and 95% ethanol (Merck). The samples were etched using a model
273A potentiostat/galvanostat (Princeton Applied Research). After
anodization the edge of the silicon slab was inserted into the TEM
for study, as previously described. TEM examinations were carried
out in a Zeiss CEM902 microscope using 80 keV electrons and
equipped with an electron energy loss spectrometer (EELS) and an
image intensifying camera.

The silicon samples for AFM observations were removed from the
electrolyte without removing the liquid adhering to their
surfaces, placed in a glass container, and immersed in liquid
nitrogen. The frozen samples were then placed under vacuum
(10“^ Torr), and their glass containers were immersed in liquid
nitrogen. Sublimation of the solution from the samples took about
2 h. AFM observations were carried out in a TopoMetrix TMX 2000
AFM instrument, using the variable force mode in the repulsive
force region. Images were obtained in air on freshly produced
surfaces. The room temperature photoluminescence system has been
described previously.

III.  RESULTS 

Images obtained by TEM from a 0.009 Ω cm, n-type, (100) oriented
silicon sample, anodized at a current density of 100 mA/cm², are
displayed in Figs. 1(a) and 1(b). This sample shows a near
infrared photoluminescence peak at 1.45 eV. Two distinct types of
structures can be seen in Fig. 1(a): a pattern of columnar dark
spots of nanometer-scale diameter over the entire PS surface, and
an array of holes (whitish areas) ~25 nm across, forming a
structure of pores perpendicular to the plane of the silicon slab
and exhibiting wall dimensions of the order of ~30 nm. In
addition, Fig. 1(a) shows diffraction contrast lines on the
surface of the PS films. These contrast lines arise from the
etching action when the crystallites forming the crystal are
slightly tilted. It is also evident that when the misorientation
of the crystallites distorts the original grid into curved lines,
the dark fringes are curved, as shown in Fig. 1(b).

Fig. 1. Transmission electron micrographs of the porous silicon structure
formed on a (100) oriented, n-type, 0.009 Ω cm silicon sample. (a) Inset
shows a region with 4× magnification. X indicates a 20-nm-diam particle.
Diffraction contrast lines form a pattern of almost straight lines. (b) Curved
diffraction contrast lines and microscopic structure.

Fig. 2. Dark field images obtained by TEM in the {311} directions of a
porous silicon structure formed in (100) oriented, n-type, 0.009 Ω cm
silicon. The long-range ordering resulting from the original crystalline
structure is given by the width of the brightest region (~500 nm).

In order to check the spatially variable orientation of the
crystallites in the etched samples, dark field images, as shown in
Fig. 2, were obtained by TEM. Spots corresponding to the {111},
{220}, {311}, {400}, {331}, and {422} diffraction were observed.
The dark field image for the {311} direction (Fig. 2) clearly
shows that the nanocrystals in the brightest area of the
photograph are preferentially aligned along this orientation. The
long-range ordering resulting from the original crystalline
structure in the sample is of the order of ~500 nm.

Figure 3(a) shows the TEM image of a 4.9 Ω cm, p-type, (100)
oriented silicon sample, anodized at a current density of
5 mA/cm². This sample shows a photoluminescence peak at 1.8 eV.
The shape of the etched parts of the sample is emphasized by a
whitish color. The structure of the anodized silicon is seen to be
composed of highly isolated columns [marked Y in Fig. 3(a)] with
cross-section diameters as small as 15 Å. Figure 3(b) shows that
the asterism of the electron diffraction spots is discontinuously
split into tiny intensity maxima. Spots corresponding to the
{311}, {331}, and {533} diffraction were observed [indicated by 1,
3, and 4 in Fig. 3(b)]. This observation indicates that segmented
crystallites are produced, which are relatively undistorted among
themselves although differing slightly in orientation from their
neighbors. The etched lattice is formed by an aggregate of
comparatively strain-free subgrains separated from one another and
having nearly the same orientation as the original crystal.

Fig. 3. (a) TEM image of the porous silicon structure formed by anodizing a
4.9 Ω cm, (100) oriented, p-type silicon sample. Y indicates a particle with
a diameter of ~15 Å. (b) Electron diffraction pattern of anodized porous
silicon obtained by TEM, showing spots corresponding to the (311), (331),
and (533) diffraction, indicated by 1, 3, and 4.


The surface topography of PS samples was then investigated by AFM.
An image corresponding to a central position in the 1 cm diameter,
0.009 Ω cm samples, anodized at a current density of 100 mA/cm²
for 20 min, is shown in Fig. 4. Etched holes are emphasized by a
dark color. A skeleton structure is observed, which is formed by a
pattern of large holes having diameters of ~1 μm (dark regions,
indicated by Y). The walls of these pores have dimensions of this
order as well.

Silicon tips from Park Instruments [Ultralever™ (Si) and
Microlever™ (SiN)] were used to obtain improved spatial
resolution. An image obtained by AFM (Microlever™ tip) from a
p-type, 4.9 Ω cm, (100) oriented silicon sample, anodized for
30 min at a current density of 5 mA/cm², is displayed in Fig. 5.
These samples show a red photoluminescence peak at 1.8 eV. The
image shows a pattern of columns (light isolated spots) over the
entire PS surface. A more detailed mapping of the PS structure is
given by its profile. Typical profiles of this sample observed by
AFM show valleys and hills as small as 55 Å wide [see Fig. 6(a),
obtained with the Microlever™ tip, and Fig. 6(b), obtained with
the Ultralever™ tip].

IV.  DISCUSSION 

A. TEM observations, diffraction lines, and dark field images

The results above demonstrate that PS layers of different light
emitting species, formed during silicon anodization, also have
different structures (see Figs. 1-3). Diffraction lines and dark
field images are used to identify and define the long-range order
of the crystallites. The long-range order shown in Fig. 2 is
~500 nm. Figure 3(b) shows that PS formed on 4.9 Ω cm samples is
composed of small, loosely connected crystallites, as previously
reported. PS layers are therefore formed by slightly disoriented
lattice segments or subgrains, separated by etched holes. TEM
images give the sizes and particle orientations (Figs. 1-3).

B.  Observation  by  AFM  and  TEM 

A comparison of TEM and AFM images of the samples was then
performed. Figures 1(a) and 1(b) are TEM images and Fig. 4 is an
AFM image of the 0.009 Ω cm, n-type, (100) oriented silicon
substrate, each showing different aspects of the porous nanosized
structure. The AFM images give a three-dimensional view of the
skeleton structure of PS which cannot be observed by TEM. This
three-dimensionality allows the visualization of the
interconnecting structure.

An AFM image corresponding to a red-photoluminescence sample is
presented in Figs. 5 and 6. Note that it was not possible to
resolve features smaller than those shown in these figures. The
discrepancy between the PS surface shown by TEM and the one shown
by AFM is due to the response of the probe tip to the force
exerted by the surface structure. AFM images are a reconstruction
from digital data obtained from the response of a probe tip to a
sample's topography as the tip is scanned across the surface.
Studies involving highly three-dimensional structures, such as PS
layers, can be complicated by the fact that although the vertical
resolution is usually excellent, the AFM produces distorted
lateral features on the steeply inclined sides of nanoparticles.
The effect of the finite curvature of the tip in AFM imaging has
been noted by several authors. Therefore a knowledge of the tip
geometry is of fundamental importance.

C. AFM image distortions and geometrical resolution limitations

The distortion of an AFM image is illustrated by the response of a
conical tip with an apex tip radius $R_T$ to various structures,
as shown in Figs. 7(a) and 7(b). Two situations can be
distinguished, namely $\alpha < \alpha_0$ or
$\alpha \ge \alpha_0$, where $\alpha_0 = 2\arctan[K/(1-K)]$,
$K = h/2R_T$, $h$ is the height of the object, $R_T$ is the apex
radius, and $\alpha$ is the angle of rise or fall of the
structure. Figure 7(b) shows that for $\alpha < \alpha_0$, the
increase in the width of a structure in the sample is given by
$R_T \sin(\alpha/2)$ for each rise or fall of the sample
structure. This value does not depend on the height $h$ of the
structure.

For $\alpha \ge \alpha_0$ and small values of $h$
($h/R_T < 0.3$), only the spherical apex surveys the structure.
Figure 7(a) shows the observed profiles for this situation, while
Fig. 8(a) displays the unobserved region $d$, or "hole region,"
and Fig. 8(b) the critical angle $\alpha_0$ as a function of the
object height ($h$) for tip apex radii of 200, 50, 30, and 15 Å.
The distortions introduced do not affect the height but only the
width, and the increase in the width depends on the height ($h$)
and angles ($\alpha$'s) of the structure. Table I gives the values
of the calculated full width at half height for objects of various
heights $h$ and apex tip radii $R_T$. The object is a symmetrical
trapezoid with a flat horizontal top of 15 Å width, and $\alpha$
is the angle of rise (or fall) of the structure. Observe that the
measured particle width depends critically on both the particle
height and the tip radius.

Fig. 4. An AFM image of porous silicon etched in (100) oriented, n-type, 0.009 Ω cm silicon. It shows a skeleton structure of holes and walls formed by
a substructure of ~40-nm-thick walls, indicated by Z.

Fig. 5. An AFM image of a p-type, 4.9 Ω cm, (100) oriented silicon sample
anodized for 30 min at a current density of 5 mA/cm². It shows a pattern of
columns over the entire surface.

Fig. 6. Typical profiles of the sample of Fig. 5 with (a) a Microlever™ SiN
tip; (b) an Ultralever™ Si tip.

Fig. 7. Distortion of an AFM image illustrated by the response of a tip of
radius $R_T$ to a step structure which rises at an angle $\alpha$ to a vertical height $h$.
(a) $\alpha$ larger than the critical angle $\alpha_0$; (b) $\alpha$ smaller than the critical angle $\alpha_0$.

D. Porous silicon AFM images

The spatial resolution of the TEM images is 15 Å, while that of
the AFM depends critically on the profile of the probe tip and the
sample. When the tip apex radius is of a magnitude similar to that
of the sample structure, the distances between adjacent peaks in
the samples are well reproduced in the observed image. Also, there
is less distortion of peaks than of valleys. The depth of valleys
in the image is strongly dependent on the radius of the tip. The
radii of the tips were determined from the profiles shown in
Fig. 6. For the Microlever™ tips the estimated radius obtained
from the profiles shown in Fig. 6(a) is 48±10 Å, and for the
Ultralever™ tips [Fig. 6(b)] the radius is 30±6 Å.

Fig. 8. (a) Value of the unobserved region length as a function of the
vertical height for various tip radii $R_T$. (b) Critical angle value $\alpha_0$ as a
function of the vertical height for various tip radii $R_T$.

The porous silicon layers are formed by a skeleton structure of
~15 Å diameter cylinders, as shown in Fig. 3(a). The profiles
present peaks with heights around 10 Å. Using this height value
and the estimated radii of the tips, the critical angles for the
Ultralever™ and Microlever™ tips are equal to 60° and 42°,
respectively, which are larger than the observed $\alpha$'s of 10°
to 15°, as illustrated in Fig. 7(b). Thus the true width is the
measured width minus $R_T \sin(\alpha/2)$ for each of the rise and
the fall. Isolated peak profiles shown in Figs. 6(a) and 6(b) were
measured, and the ascending and descending angles were calculated
as well as the radii of the peaks. The smallest particle width was
estimated to be 15±4 Å (for the Microlever™ tip) and 15±8 Å (for
the Ultralever™ tip).
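
The width correction described above can be written as a short
calculation. The Python sketch below subtracts $R_T \sin(\alpha/2)$
once for the rise and once for the fall, which is the rule stated in
the text for $\alpha < \alpha_0$. The function name and the measured
width of 20 Å in the example are hypothetical; the tip radius of
30 Å and the angle of 10° are taken from the ranges quoted in the
article.

```python
import math


def true_width(measured_width, tip_radius, alpha_rise_deg, alpha_fall_deg):
    """Correct a measured AFM width for tip broadening.

    Valid only for wall angles below the critical angle. Lengths may
    be in any unit, as long as width and radius use the same one.
    """
    broadening = tip_radius * (
        math.sin(math.radians(alpha_rise_deg) / 2.0)
        + math.sin(math.radians(alpha_fall_deg) / 2.0)
    )
    return measured_width - broadening


if __name__ == "__main__":
    # Hypothetical 20 A wide feature imaged with a 30 A radius tip.
    width = true_width(20.0, 30.0, 10.0, 10.0)
    print(f"corrected width: {width:.1f} A")
```

Because the correction depends on the wall angles but not on the
feature height, the ascending and descending angles of each peak
have to be measured from the profile before the correction is
applied.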
Table I. The observed full widths at half-maximum (FWHM) of structures
with α > α₀, where the width is 15 Å at the top for various heights from 2 to
15 Å and for tips with apex radii of 200, 100, 50, 30, and 15 Å. No
specification is made of the half angle of the cone, so that no values of FWHM
are given for h > 0.3R_T, where this parameter would be necessary.

V. CONCLUSIONS

In  conclusion,  this  work  has  shown  that  different  struc¬ 
tures  are  responsible  for  near  infrared  and  red  luminescent 
PS.  Diffraction  lines  and  dark  field  images  are  used  to  iden¬ 
tify  and  define  the  crystallite  specimen  long-range  order. 
TEM  images  give  the  size  and  particle  orientation,  and  AFM 
images  show  a  three-dimensional  pattern  formed  by  an  inter¬ 
connecting  skeleton  of  particles.  The  spatial  resolution  of  the 
TEM images is 15 Å, while that of the AFM depends critically
on the profile of the probe tip and the height and slope
of  the  observed  structure. 

The diameter of the wire structure forming porous silicon,
measured by AFM and TEM, allows us to calculate the
distortion of the AFM images due to the finite size of the tip
radius.  A  critical  angle  ao=2  arctan[^/(l  —  where  K 

is  the  ratio  of  the  height  of  the  structure  to  the  tip  diameter, 
was  defined  and  substantial  distortions  were  observed  for 
structure  walls  that  are  steeper  than  the  critical  angle. 

We present a systematic study of the performance of scanning tunneling microscope (STM)-based,
low  energy  electron  beam  lithography,  using  simulations  of  field  emission  from  STM  tips, 
emphasizing  realistic  conditions  of  tip  geometry  and  operation.  We  calculate  the  potentials  and 
electric  field  for  a  hemispherical  model  emitter  in  an  axially  symmetric  system.  Emission  current 
density  at  the  tip  is  calculated  using  the  Fowler-Nordheim  equation,  and  current  density  at  the 
sample  is  obtained  by  calculating  trajectories  of  emitted  electrons.  We  characterize  the  beam 
diameter  at  the  sample  as  a  function  of  emitter  radius,  tip-sample  bias,  emission  current,  resist 
thickness,  and  tip  work  function.  The  beam  diameter  is  primarily  affected  by  the  tip-sample  gap, 
increasing  at  larger  gaps,  characteristic  of  high  bias  and  large  tip  curvature.  For  optimal  tip  radius 
the  beam  diameter  increases  linearly  with  bias  from  approximately  2  nm  at  5  V  to  25  nm  at  50  V. 

Beam  diameter  is  nearly  independent  of  emission  current  over  the  range  0.05-50  nA.  Dielectric 
resist  films  cause  an  increase  in  beam  diameter  due  to  increased  tip-substrate  gap.  Beam  diameter 
is  very  sensitive  to  tip  work  function,  increasing  dramatically  for  low  work  function  tips.  Tips 
comprised  of  asperities  on  flat  surfaces  produce  significantly  smaller  beams  compared  to  “standard” 
tips  of  the  same  emitter  radius.  However,  for  low  bias  (<15  V)  beam  diameter  becomes  insensitive 
to  tip  geometry.  We  compare  these  simulations  to  selected  experimental  results  to  evaluate  the 
limitations to performance and assess the feasibility of routine sub-10 nm structure fabrication using
STM-based  low  energy  electron  beam  lithography.  ©  1996  American  Vacuum  Society. 


I.  INTRODUCTION 

Low energy electron beam lithography is an attractive option
for nanometer-scale device fabrication because of the
virtual elimination of scattering effects which limit resolution
in conventional electron beam lithography. The scanning
tunneling microscope (STM) operating in field emission
mode is widely employed to produce electron beams with
energy <100 eV for this purpose. The relative simplicity of
the system, small beam size, and very high current density at
low energy make the STM an ideal research tool, and a
potentially useful manufacturing tool for nanometer-scale
structures.

There are a number of possible approaches to nanostructure
fabrication using the STM, but the most promising for
widespread utility is low energy electron exposure of an
imaging material (resist), which is used to transfer a pattern into
a substrate or film. A variety of resist materials has been
investigated using STM-based lithography, including PMMA
and other conventional resists, thin siloxane films,
molecular films such as Langmuir-Blodgett and self-assembled
monolayer films, thin inorganic materials, such as
CaF2, and adsorbed atomic layers, particularly hydrogen on
silicon.

The  limitations  to  feature  size  and  resolution  obtainable 
by  this  process  are  functions  of  both  the  resist  material  per¬ 
formance  and  the  ability  to  form  a  small  diameter  beam  of 
low  energy  electrons.  However,  there  have  been  few  inves¬ 
tigations  of  the  performance  of  a  STM  tip  as  a  low  energy 
electron beam source. McCord and Pease performed
simulations of electron emission from an isolated sphere in the
field emission regime (5 eV < E < 100 eV), and identified
many  of  the  major  features  of  STM-based  lithography.  Beam 
diameter  at  the  sample  was  shown  to  be  a  sensitive  function 
of  the  tip  radius,  gap,  and  tip-sample  bias. 

Since  this  early  modeling  effort,  many  experimental  dem¬ 
onstrations  of  nanometer-scale  lithography  have  appeared.  In 
particular,  we  have  shown  that  it  is  possible  to  directly  mea¬ 
sure  the  current  distribution  from  STM  tips  used  in  litho¬ 
graphic  processes.  These  results  give  impetus  to  more  ex¬ 
tensive  modeling  using  realistic  emitter  structures  and 
operating  conditions  that  may  be  compared  directly  to  ex¬ 
periments.  In  this  article  we  present  results  of  field  emission 
simulations  of  STM  tips  as  low  energy  (<50  eV)  electron 
beam  sources  for  nanolithography.  We  emphasize  compari¬ 
son  to  realistic  conditions  where  a  tip  is  operated  in  constant 
voltage  and  current  mode,  using  feedback  control  of  the  gap 
to  maintain  a  fixed  current.  We  investigate  the  effects  of  bias 
voltage  and  tip  radius  on  beam  diameter  for  idealized  hemi¬ 
spherical  tips.  We  also  investigate  the  dependence  of  beam 
diameter  on  total  emission  current,  tip  work  function,  and  the 
presence  of  a  dielectric  (resist)  layer  between  the  tip  and 
substrate.  Alternate  tip  configurations  in  which  the  electric 
field  at  the  tip  is  tailored  to  confine  the  trajectories  of  emitted 
electrons  into  a  small  beam  diameter  are  evaluated.  These 
results  help  establish  reasonable  expectations  for  perfor¬ 
mance  of  STM-based  nanolithography. 

Finally, we critically examine selected experimental systems
to evaluate their performance compared to the expected
emitter characteristics. We conclude that routine sub-10 nm
lithography is feasible using low energy (<20 V) exposures
and very thin, atomic or molecular layer resists.

Fig. 1. Geometry of the field emission calculation. The cathode (tip) and
anode (sample) are assumed to be conducting.


II.  FIELD  EMISSION  SIMULATIONS 


The field emission simulations are carried out in the
geometry shown in Fig. 1. We define an axially symmetric
system, in which the cathode (emitter) is a conducting
hemisphere of radius r_c. The emitter is attached to a shank with a
variable cone angle, γ. The anode (sample) forms a conducting
plane at some distance z_a, where the gap between emitter
and sample is g = (z_a − r_c). A dielectric film (resist) can be
interposed between the tip and sample.

The  potential  in  the  space  between  the  anode  and  cathode 
obeys  Laplace’s  equation: 

$$\nabla \cdot (\epsilon \nabla V) = 0, \qquad (1)$$

where V is the potential at any point in the vacuum or resist
film and ε is the dielectric constant of the medium. In
spherical polar coordinates Eq. (1) is
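
For reference, a standard way to write the divergence form of Laplace's equation for an axially symmetric potential V(r, θ), with no dependence on the azimuthal angle, is sketched below. This is the textbook spherical-polar expression, offered as an assumption about the intended form; ε is kept inside the derivatives because it differs between the vacuum and the resist.

```latex
\frac{1}{r^{2}}\frac{\partial}{\partial r}
  \left(\epsilon\, r^{2}\,\frac{\partial V}{\partial r}\right)
+\frac{1}{r^{2}\sin\theta}\frac{\partial}{\partial\theta}
  \left(\epsilon\,\sin\theta\,\frac{\partial V}{\partial\theta}\right)=0
```

Within a region of uniform ε the factor ε cancels and the expression reduces to the ordinary Laplace equation.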
After specifying the emitter geometry and the emitter-sample
gap, we define a set of grid points in r and θ and
solve Eq. (2) for V(r, θ) using a finite difference method
with the boundary conditions that V = 0 at the cathode (tip)
and V = V₀ at the anode (sample).
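
The finite difference step can be illustrated in a reduced setting. The sketch below is not the two-dimensional (r, θ) calculation described above: it assumes an isolated spherical emitter of radius r_c facing a concentric spherical anode at radius r_a in vacuum, so that V depends on r only, and it solves the radial equation with a flux-conserving difference scheme and a tridiagonal (Thomas) solve. The function names, the grid size, and the concentric-sphere geometry are illustrative assumptions. The result can be checked against the closed form V(r) = V₀(1/r_c − 1/r)/(1/r_c − 1/r_a).

```python
def solve_radial_laplace(r_c, r_a, v0, n=200):
    """Finite-difference solution of (1/r^2) d/dr (r^2 dV/dr) = 0
    on [r_c, r_a] with V(r_c) = 0 and V(r_a) = v0 (assumed geometry)."""
    h = (r_a - r_c) / n
    r = [r_c + i * h for i in range(n + 1)]
    m = n - 1  # number of interior unknowns V_1 .. V_{n-1}
    lower = [0.0] * m
    diag = [0.0] * m
    upper = [0.0] * m
    rhs = [0.0] * m
    for k in range(m):
        i = k + 1
        w_minus = (r[i] - 0.5 * h) ** 2  # r^2 at the face i - 1/2
        w_plus = (r[i] + 0.5 * h) ** 2   # r^2 at the face i + 1/2
        lower[k] = w_minus
        diag[k] = -(w_minus + w_plus)
        upper[k] = w_plus
    # V_0 = 0 contributes nothing; V_n = v0 moves to the right-hand side.
    rhs[m - 1] = -upper[m - 1] * v0
    # Thomas algorithm: forward elimination ...
    for k in range(1, m):
        f = lower[k] / diag[k - 1]
        diag[k] -= f * upper[k - 1]
        rhs[k] -= f * rhs[k - 1]
    # ... and back substitution (interior unknown k is grid point k + 1).
    v = [0.0] * (n + 1)
    v[n] = v0
    v[m] = rhs[m - 1] / diag[m - 1]
    for k in range(m - 2, -1, -1):
        v[k + 1] = (rhs[k] - upper[k] * v[k + 2]) / diag[k]
    return r, v


def analytic_potential(r, r_c, r_a, v0):
    """Closed-form potential between concentric spheres."""
    return v0 * (1.0 / r_c - 1.0 / r) / (1.0 / r_c - 1.0 / r_a)


if __name__ == "__main__":
    r, v = solve_radial_laplace(5.0, 25.0, 20.0)  # nm, nm, V (example values)
    worst = max(abs(vi - analytic_potential(ri, 5.0, 25.0, 20.0))
                for ri, vi in zip(r, v))
    print(f"largest deviation from the closed form: {worst:.2e} V")
```

The full problem adds the θ direction, the shank, the planar anode, and the resist layer, which turn the tridiagonal system into a sparse two-dimensional one, but the discretization idea is the same.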

The  emission  current  density  at  the  tip  is  given  by  the 
Fowler-Nordheim  equation: 


$$J = \frac{A E^{2}}{\phi}\,\exp\!\left(-\frac{B\,\phi^{3/2}}{E} + \frac{C}{\phi^{1/2}}\right), \qquad (3)$$


where E is the electric field, φ is the work function of the tip,
and A, B, and C are constants. The electric field at the
emitter surface is the gradient of V(r_c, θ). We integrate the
emission over the entire tip surface and adjust the anode distance,
z_a, to obtain the desired total emission current at a fixed bias.
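
The steep field dependence of Fowler-Nordheim emission can be made concrete with a short numerical sketch. It uses the elementary (triangular-barrier) Fowler-Nordheim form J = (a E²/φ) exp(−b φ^{3/2}/E) with the standard first and second Fowler-Nordheim constants; this two-constant form is an assumption made for illustration and omits any image-charge correction, so it is not necessarily the same expression as Eq. (3). The work function of 4.5 eV is a commonly quoted value for tungsten, also assumed here.

```python
import math

# First and second Fowler-Nordheim constants for the elementary form,
# expressed for E in V/nm and phi in eV.
FN_A = 1.541434e-6  # A eV V^-2
FN_B = 6.830890     # eV^-3/2 V nm^-1


def fn_current_density(field_v_per_nm, phi_ev):
    """Elementary Fowler-Nordheim current density in A/nm^2."""
    e = field_v_per_nm
    return (FN_A * e * e / phi_ev) * math.exp(-FN_B * phi_ev ** 1.5 / e)


if __name__ == "__main__":
    phi_w = 4.5  # eV, assumed value for a W tip
    for e in (3.0, 4.0, 5.0):
        j = fn_current_density(e, phi_w)
        print(f"E = {e:.1f} V/nm   J = {j:.3e} A/nm^2")
```

In this form a 25% increase in field changes the current density by more than an order of magnitude, which is why a small adjustment of the anode distance is enough to reach a chosen total current.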

Sample current density is obtained by calculating trajectories
of electrons leaving the tip at different angles, and
mapping the emission current density from the tip to the
sample. We assume that electrons leave the tip normal to its
surface, with zero kinetic energy, and we ignore scattering of
the electrons in the resist film (if present).

Most calculations are carried out using a tip work function,
φ, appropriate for a W tip. The shank angle, γ, has
minimal effect on the electric field at the tip and the
trajectories  of  electrons  traveling  to  the  sample,  except  for 
very  sharp  tips  and  large  tip-sample  distances.  All  “stan¬ 
dard”  tip  simulations  are  carried  out  using  a  shank  angle  of 
20°.  We  also  explore  some  alternate  tip  geometries  in  an 
attempt  to  confine  or  focus  the  electron  current  at  the  sample 
by  tailoring  the  electric  field  in  the  vicinity  of  the  tip. 

III.  RESULTS  AND  DISCUSSION 
A.  Characteristics  of  standard  tips 

We  have  examined  emission  characteristics  of  standard 
tips  as  a  function  of  tip  radius,  anode  voltage,  total  emission 
current,  resist  thickness,  and  tip  work  function.  In  typical 
operation  of  an  STM-based  lithography  system,  the  anode 
voltage  and  emission  current  are  held  constant,  and  the  gap  is 
adjusted  by  a  feedback  circuit  to  maintain  constant  electric 
field  at  the  tip  and  thus  constant  emission.  This  is  a  very 
important  aspect,  since  the  electric  field  configuration  in  the 
gap  and  trajectories  of  electrons  to  the  sample  are  very  sen¬ 
sitive  to  the  tip-sample  gap.  For  direct  comparison  to  this 
mode  of  STM  operation,  all  simulations  presented  here  treat 
the  gap  as  a  free  parameter.  We  have  previously  shown  that 
these  simulations  agree  well  with  the  gap  measured  as  a 
function  of  anode  voltage  in  an  operating  STM.^^  Since  we 
are  primarily  interested  in  the  current  density  distribution  at 
the  sample,  results  are  presented  in  terms  of  the  beam  diam¬ 
eter  at  the  sample  surface.  We  define  the  beam  diameter  as 
that  which  contains  60%  of  the  total  current.  (This  is  ap¬ 
proximately  equivalent  to  the  standard  method  of  measuring 
beam  diameter  in  electron  beam  lithography  systems.) 

Considering  standard  tips  with  radii  of  5-50  nm,  we  cal¬ 
culate  the  beam  current  density  distributions  for  biases  from 
5  to  50  V.  The  current  density  distribution  at  the  sample 
follows a functional form of approximately exp(−r²/a),
where the width parameter, a, depends on the bias and
the  tip-sample  gap.  The  beam  diameter  as  a  function  of  bias 
voltage  is  shown  in  Fig.  2(a).  The  increase  in  beam  diameter 
with  increasing  bias,  for  all  tip  radii,  is  primarily  due  to 
increased  gap  at  increased  bias,  and  resulting  radial  spread¬ 
ing  of  electron  trajectories.  This  is  demonstrated  in  Fig.  2(b), 
where  the  beam  diameter  is  seen  to  increase  approximately 
linearly  with  the  gap  that  is  required  to  produce  1  nA  current 
at  the  given  bias. 
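
The beam diameter definition used here, the diameter that contains 60% of the total current, can be evaluated directly from a radial current density profile. The sketch below integrates J(r) 2πr dr numerically and interpolates to the requested fraction. The function names and the sampled profile are illustrative assumptions. For a profile of the form exp(−r²/a) the enclosed fraction inside radius ρ is 1 − exp(−ρ²/a), so the numerical result can be compared with the closed form d = 2[a ln(1/(1 − f))]^{1/2}.

```python
import math


def enclosed_fraction_diameter(radii, density, fraction=0.6):
    """Diameter of the circle enclosing `fraction` of the total current.

    radii   -- increasing radial sample points, starting at 0
    density -- axially symmetric current density J(r) at those points
    """
    # Cumulative current I(r) = integral of J(r') 2 pi r' dr' (trapezoid rule).
    cumulative = [0.0]
    for i in range(1, len(radii)):
        f0 = density[i - 1] * 2.0 * math.pi * radii[i - 1]
        f1 = density[i] * 2.0 * math.pi * radii[i]
        step = radii[i] - radii[i - 1]
        cumulative.append(cumulative[-1] + 0.5 * (f0 + f1) * step)
    target = fraction * cumulative[-1]
    for i in range(1, len(radii)):
        if cumulative[i] >= target:
            # Linear interpolation between the two bracketing samples.
            span = cumulative[i] - cumulative[i - 1]
            t = (target - cumulative[i - 1]) / span if span > 0 else 0.0
            return 2.0 * (radii[i - 1] + t * (radii[i] - radii[i - 1]))
    return 2.0 * radii[-1]


def gaussian_beam_diameter(a, fraction=0.6):
    """Closed form for J(r) proportional to exp(-r^2 / a)."""
    return 2.0 * math.sqrt(a * math.log(1.0 / (1.0 - fraction)))


if __name__ == "__main__":
    a = 25.0  # nm^2, example width parameter (assumed)
    radii = [0.01 * i for i in range(5001)]
    density = [math.exp(-r * r / a) for r in radii]
    print(enclosed_fraction_diameter(radii, density), gaussian_beam_diameter(a))
```

The numerical routine does not depend on the profile being Gaussian, so it applies equally to a current density obtained by mapping trajectories onto the sample.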

Tips with higher curvature (small r_c) develop higher electric
fields at a given V₀ and gap, so that the feedback circuit
responds by retracting the tip to maintain the preset emission.
For sharper tips emission also occurs over a larger
range of angle, θ, at the apex of the tip. This results in the
phenomenon shown in Fig. 2(a) in which sharp tips give
smaller beam diameters at low bias but larger beam diameters
at increasing bias, due to the larger gap. These results
are in good agreement with the earlier simulations of McCord
and Pease, who demonstrated that there is an optimum
ratio of tip radius to gap which gives minimum beam
diameter. In general, the beam diameter increases approximately
linearly with bias for optimum tip radius, and best
performance is achieved at low bias for all tip radii.

Fig. 2. (a) Beam diameter vs sample bias for various tip radii at 1 nA
emission current; (b) beam diameter vs tip-sample separation for the
calculations shown in (a).

We  have  examined  other  influences  on  beam  diameter  for 
the  standard  tip  geometry.  Dependence  of  beam  diameter  on 
beam  current  is  shown  in  Fig.  3,  for  a  tip  radius  of  30  nm, 
and  for  several  biases.  The  beam  diameter  decreases  slightly 
at high current due to the decreased gap required for higher
fields.  But  the  dependence  is  fairly  weak  over  three  orders  of 
magnitude  of  current  due  to  the  steep  dependence  of  emis¬ 
sion  on  electric  field  characteristic  of  the  Fowler-Nordheim 
emission  process.  Only  very  small  changes  in  gap  are  neces¬ 
sary  to  alter  the  field  sufficiently  for  large  variations  in  cur¬ 
rent.  We  have  ignored  space  charge  effects,  which  can  sub¬ 
stantially  alter  the  trajectories  of  low  energy  electrons  in  high 
current density beams. For currents <1 μA, at typical bias
and tip-sample gap, there is, on average, only one electron
in  the  gap  at  a  time. 
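
The occupancy of the gap can be estimated with a few lines of arithmetic. The sketch below assumes a uniform field across the gap and an electron starting from rest, so the transit time is t = g(2m/eV)^{1/2}, and the mean number of electrons in flight is (I/e)t. The uniform-field approximation, the function names, and the example gap of 17 nm at 20 V (the standard-tip values quoted later in the text) are assumptions made for illustration.

```python
import math

E_CHARGE = 1.602176634e-19  # C
E_MASS = 9.1093837015e-31   # kg


def transit_time(gap_m, bias_v):
    """Time for an electron starting at rest to cross a uniform-field gap."""
    # Constant acceleration a = e V / (m g) and g = a t^2 / 2 give
    # t = g * sqrt(2 m / (e V)).
    return gap_m * math.sqrt(2.0 * E_MASS / (E_CHARGE * bias_v))


def mean_electrons_in_gap(current_a, gap_m, bias_v):
    """Average number of electrons in flight at a given emission current."""
    return (current_a / E_CHARGE) * transit_time(gap_m, bias_v)


if __name__ == "__main__":
    for current in (1e-9, 1e-6):
        n = mean_electrons_in_gap(current, 17e-9, 20.0)
        print(f"I = {current:.0e} A   mean electrons in gap = {n:.2e}")
```

Under these assumptions the mean occupancy stays below one electron even at 1 μA, which is consistent with neglecting space charge at the currents considered.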

Fig. 3. Beam diameter vs emission current for a 30-nm-radius tip at 10-30
V sample bias. Experimental data (from Ref. 14) are for 10 V bias.

The presence of a resist film will alter the electric field at
the tip and in the gap. The primary effect of interposing a
resist film will be to increase the tip-sample separation,
causing increased beam diameter. A resist will also have a
higher dielectric constant than vacuum (or air). Presuming
that the tip is not allowed to penetrate the resist film, the
electric field in the vacuum gap will be approximately
E(gap) = εE(resist), where ε is the resist dielectric constant.
Because E(resist) decreases with increasing resist thickness,
d(gap) must decrease in order to produce adequate electric
field at the tip to maintain emission. For thick resists, the tip
must be very close to the resist surface in order to maintain
emission [d(gap) < 1 nm for resist thickness of 50 nm]. In
general, particularly for resists with low dielectric constant,
as resist thickness increases, it is difficult to maintain an
adequate electric field at the tip to provide the desired
emission current. The tip will then be forced to penetrate the
resist in order to establish the required electric field.
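
The trade-off between resist thickness and vacuum gap can be sketched with a planar two-layer estimate. Assuming parallel plates, the bias divides as V₀ = E(gap) d(gap) + E(resist) t, and with E(gap) = εE(resist) this gives E(gap) = V₀/[d(gap) + t/ε]. The planar geometry ignores the field enhancement at a curved tip, and the function names are illustrative assumptions, so the numbers indicate the trend only.

```python
def gap_field(bias_v, gap_nm, resist_nm, eps_resist):
    """Vacuum-gap field (V/nm) for a planar vacuum gap on top of a resist film."""
    # bias = E_gap * gap + E_resist * resist, with E_gap = eps * E_resist.
    return bias_v / (gap_nm + resist_nm / eps_resist)


def gap_for_field(bias_v, field_v_per_nm, resist_nm, eps_resist):
    """Vacuum gap (nm) that gives the requested gap field.

    A negative result means the field cannot be reached at this bias
    without the tip penetrating the resist.
    """
    return bias_v / field_v_per_nm - resist_nm / eps_resist


if __name__ == "__main__":
    for thickness in (0.0, 10.0, 30.0, 50.0):
        g = gap_for_field(20.0, 1.0, thickness, 3.0)
        print(f"resist {thickness:4.0f} nm -> gap for 1 V/nm at 20 V: {g:6.2f} nm")
```

In this estimate each nanometer of resist removes 1/ε nm from the available vacuum gap, so for a fixed bias and required field the gap shrinks linearly with resist thickness and eventually reaches zero.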

Beam diameter at the resist/vacuum interface is small
because of the reduced d(gap); however, the beam spreads
traversing the resist film. Calculations for a tip radius of 30
nm, 20 V bias, and ε = 3 show that the beam diameter at the
resist/substrate interface increases with resist thickness with
a slope of ~0.3. For this calculation, we assume the electron
travels  through  the  resist  film  without  scattering.  Of  course 
electrons  will  scatter  in  the  resist,  as  they  must  to  deposit 
energy,  and  the  volume  of  exposed  material  will  be  further 
increased  by  the  scattering  of  these  low  energy  electrons, 
comparable  to  the  scattering  of  secondary  electrons  produced 
by  high  energy  electron  beam  exposure. 

Alternate tip materials with lower work function or tips
with low work function adsorbates can be employed. This
leads to higher emission current at a given electric field, as
seen from Eq. (3). This is generally desirable for other field
emission devices, such as emitters for electron microscopes,
vacuum microelectronics, or display devices. However, in a
lithographic  application,  a  low  work  function  tip  will  achieve 
a  given  current  density  at  a  larger  tip-sample  gap,  resulting 
in  larger  beam  diameter  at  a  given  current  and  voltage  than 
for  the  same  geometry  tip  of  a  higher  work  function.  The 
effect  is  quite  dramatic,  and  is  shown  in  Fig.  4.  Plotted  are 
Fig. 4. Effect of tip work function on tip-sample separation and beam
diameter  for  a  30-nm-radius  tip,  20  V  bias,  and  1  nA  emission. 


tip-sample  gap  and  beam  diameter  at  the  sample  as  a  func¬ 
tion  of  tip  work  function  for  a  30-nm-radius  tip  at  20  V  bias 
and  1  nA  current.  The  direct  correlation  of  tip-sample  gap 
and  beam  diameter  is  well  demonstrated  here.  Additional  ef¬ 
fects  of  tip  geometry  and  bias  behave  in  a  similar  manner  to 
Fig.  2. 
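
The link between work function and gap can be illustrated by asking what surface field is needed to reach a fixed current density. The sketch below inverts the elementary two-constant Fowler-Nordheim form by bisection; that form, the target current density of 1e-10 A/nm², and the function names are assumptions chosen for illustration and are not the expression or values used in the simulations. A lower work function needs a lower field, which at fixed bias the feedback loop provides by opening the gap.

```python
import math

FN_A = 1.541434e-6  # A eV V^-2 (first Fowler-Nordheim constant)
FN_B = 6.830890     # eV^-3/2 V nm^-1 (second Fowler-Nordheim constant)


def fn_current_density(field_v_per_nm, phi_ev):
    """Elementary Fowler-Nordheim current density in A/nm^2."""
    e = field_v_per_nm
    return (FN_A * e * e / phi_ev) * math.exp(-FN_B * phi_ev ** 1.5 / e)


def field_for_current_density(j_target, phi_ev, lo=0.1, hi=50.0):
    """Field (V/nm) giving j_target; J rises monotonically with field,
    so plain bisection on the bracket [lo, hi] is sufficient."""
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if fn_current_density(mid, phi_ev) < j_target:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


if __name__ == "__main__":
    for phi in (2.5, 3.5, 4.5):
        e_req = field_for_current_density(1e-10, phi)
        print(f"phi = {phi:.1f} eV   required field = {e_req:.2f} V/nm")
```

Combined with a field solution for a given tip shape, the required field can be converted into a gap, which is the quantity that sets the beam diameter.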


Fig.  6.  Tip  electric  field,  tip  emission  current  density,  and  sample  current 
density  for  standard  and  super  tips.  Emitter  radius  is  5  nm,  bias  is  20  V,  and 
emission  is  1  nA  for  each. 


B.  Alternate  tip  configurations 


Fig.  5.  Equipotential  contours  for  (a)  standard  tip,  (b)  asperity  tip,  and  (c) 
recessed tip. Contour interval is 0.1 V₀. Tip-sample gap is that required for
1  nA  emission  at  20  V  bias. 


The idealized standard tip configuration shown in Fig. 1 is
unlikely to be representative of real electron emitters for a
number of reasons. While tips may appear round and smooth
on a macroscopic scale, invariably, microscopic details of the
tip surface will include local asperities due to tip preparation,
tip alteration, contamination, or even discontinuities due to
atomic steps and defects on the surface. These will all lead to
a local alteration of the electric field and emission characteristics
that deviate from the ideal spherical tip. It is generally
assumed in scanning tunneling microscopy that tunneling
occurs primarily through only a few atoms at a local
asperity, and many tip preparation procedures are effectively
intended to produce small asperities.

Asperities can also be intentionally fabricated in a controlled
fashion to produce a desired electric field configuration
at the tip. “Super tips” have been constructed by
producing controlled asperities on the end of relatively blunt
tips for use as electron and ion emitters with a reduced
angular emission profile. While a very small asperity indicates
a high curvature emitter, the electric field is suppressed at
large polar angles by the proximity of a flat surface, and
emission is restricted to a small range of polar angles at the
apex of the asperity. The result is a beam that is focused in
the forward direction.

This focusing effect can also be useful in lithographic
applications. We have examined two alternate tip configurations
intended to produce this focusing effect, shown in Fig.
5 along with equipotential contours for a tip-sample gap that
would produce 1 nA current at a sample bias of 20 V. The
origin of the effect is clear from the shapes of the equipotential
contours and the recognition that field lines are perpendicular
to the potential contours. The standard tip in Fig. 5(a)
with a 5 nm radius exhibits large curvature of the potential
contours, leading to a large degree of radial spreading of the
electron trajectories. Figure 5(b) depicts an asperity tip
consisting of a hemisphere with 5 nm radius on a nearly flat
surface (shank angle = 85°). The potential contours are
comparatively much flatter in the region of the emitter, giving
electron trajectories focused in the forward direction. Figure
6 shows a comparison of the electric field and emission
profiles at the tip and the current profile at the surface for the
standard tip and the asperity tip in Figs. 5(a) and 5(b). For
the standard tip, at 20 V bias and 1 nA emission, the
tip-sample gap is 17 nm, and emission occurs out to >40° polar
angle on the tip. This results in a diffuse beam with a
diameter of 21 nm. The asperity tip, however, shows high field
and significant emission only out to 30° polar angle, and the
tip-sample gap is reduced to 10 nm. This gives a beam
diameter of 9 nm, or less than half that of the standard tip.

Fig. 7. Beam diameter vs sample bias for standard and asperity tips with
radii of 5 and 10 nm at 1 nA emission. Experimental data are from Ref. 14.

Beam diameter versus sample bias for asperity tips and
standard tips of 5 and 10 nm tip radius is shown in Fig. 7.
The  characteristics  of  the  asperity  tip  emission  follow  the 
same  trends  as  those  for  the  standard  tip,  only  with  decreased 
beam  diameter  due  to  the  focusing  effect  and  reduced  tip- 
sample  separation.  We  note  from  Fig.  7  that  the  effect  is  most 
pronounced  for  small  asperities.  For  the  larger  (10  nm)  as¬ 
perity  the  alteration  of  the  potential  near  the  tip  apex  is  less 
pronounced  and  the  focusing  effect  is  minimized.  We  also 
note  that  the  focusing  effect  is  most  pronounced  at  higher 
bias,  where  the  tip-sample  separation  is  large.  Small  tip- 
sample  separation  at  low  bias  also  results  in  a  minimal  alter¬ 
ation  of  the  potential  near  the  apex,  and  minimal  focusing. 

Comparison of Fig. 7 with Fig. 2(a) shows that the 5 nm
asperity tip produces a beam slightly smaller than the
“optimum” standard tip at all biases above 15 V. However, at
biases  of  less  than  about  15  V,  the  beam  diameter  is  rela¬ 
tively  insensitive  to  the  tip  geometry,  whether  it  is  an  asper¬ 
ity  or  standard  tip.  This  insensitivity  suggests  that  for  low 
voltage  lithography  one  need  not  be  very  concerned  about 
the  nature  of  the  tip. 

Further focusing of electron trajectories can be obtained
using an emitter structure similar to that shown in Fig. 5(c).
This is a recessed emitter that could potentially be fabricated
by ion sputtering techniques similar to those used to make
extremely sharp tips for metrology applications, and
recessed emitters for vacuum microelectronics. In this
schematic  example  the  recessed  tip  is  a  hemisphere  with  a 
diameter  of  5  nm.  The  top  of  the  tip  is  at  the  same  height  as 
the  planar  cathode,  and  the  tip  is  separated  from  the  rest  of 
the  cathode  by  a  10-nm-wide  trench.  The  intention  here  is  to 
use  the  planar  cathode  as  a  Wehnelt  cylinder  to  focus  elec¬ 
tron  trajectories  in  the  forward  direction.  The  focusing  effect 
is  evident,  producing  a  beam  diameter  of  7.2  nm  for  the 
conditions of Fig. 6, compared to 18 nm for the standard tip
and  8.8  nm  for  the  asperity  tip.  The  performance  of  this  tip 
depends  primarily  on  the  width  of  the  trench  between  the  tip 
and  the  planar  cathode.  Structures  with  trench  width  of  >20 
nm  perform  less  well  than  the  tip  of  Fig.  5(b).  Performance 
could  be  increased  by  narrowing  the  trench  or  by  recessing 
the emitter below the surface of the planar cathode. Of
course the corners of the planar cathode would have to be
suitably rounded to prevent development of an excessive
field at this corner. Fabrication of such a tip becomes
problematic, and operation of a recessed tip in close proximity to
a sample may result in the planar cathode being too close to
the sample (with very high electric field at the planar cathode
surface). This type of emitter structure may be useful in other
field emission devices that require a focused electron beam.

We  have  not  attempted  to  model  emitters  consisting  of 
only  a  single  or  very  few  atoms.  Such  ultrasharp  tips  have 
been  fabricated  and  used  as  point  source  emitters  for  electron 
microscopes.  As  expected,  they  demonstrate  very  high 
fields  and  emission  current  densities  at  the  apex.  Cutler  and 
co-workers, however, have shown that ultrasharp tips do
not  obey  the  usual  formulation  of  the  Fowler-Nordheim 
emission  theory,  and  suggest  that  Eq.  (3)  may  be  in  error  by 
orders  of  magnitude  for  tip  radii  <10  nm.  It  appears  that  a 
more  sophisticated  treatment  of  the  field  emission  process  is 
required  for  modeling  atomically  sharp  tips. 

C.  Comparison  to  experiments 

While  most  experimental  demonstrations  of  STM-based 
lithography  have  not  specified  tip  geometry  nor  directly  mea¬ 
sured  electron  current  distributions,  we  can  compare  these 
simulations  to  a  number  of  experimental  results  demonstrat¬ 
ing  feature  sizes  on  the  order  of  10  nm  that  have  appeared 
over  the  past  few  years. 

We have directly measured the electron current distribution
at the sample by imaging the distribution of desorbed H
atoms from Si(001) surfaces in the STM, reported
elsewhere. Above a threshold energy of 6-8 eV, this electron
stimulated desorption process provides a direct measure
of the electron flux distribution with minimal electron
scattering effects. We also measured the tip-sample separation
as  a  function  of  sample  bias.  All  of  these  measurements  were 
made  with  a  W  tip,  prepared  by  electrochemical  etching,  that 
was  not  evaluated  prior  to  introduction  into  the  STM.  Over 
the  range  of  5-30  V  bias,  we  observed  a  beam  diameter  that 
varied from 2 to 8 nm. The measured tip-sample gap ranged
from, nominally, 1 nm at tunneling conditions of 2 V bias to
16 nm at 40 V. The tip-sample gap agrees well with standard
tip simulations for hemispherical emitters with r = 30-50
nm.

The  observed  beam  diameters,  however,  were  smaller 
than  those  calculated  for  standard  tips  of  optimum  radius, 
particularly  at  higher  bias,  but  agree  well  with  the  simula¬ 
tions  including  focusing  effects  associated  with  emission 
from  asperities  on  a  blunt  tip,  shown  in  Fig.  7.  We  do  not 
know  if  emission  is  occurring  from  a  single  site  or  from  a 
number  of  asperities.  (We  have  seen  experimental  evidence 
for multiple site emission, on occasion.) Beam diameter was
observed to be essentially constant over more than two orders
of magnitude of emission current at fixed bias, in good
agreement with the calculations, as shown in Fig. 3. The
estimated  functional  form  of  the  beam  profile  is  also  in  rea¬ 
sonable  agreement  with  the  calculations.  The  experimental 
data  are  best  fit  with  an  exponential  form,  decaying  radially 
as  to  while  the  calculations  indicate  an  depen¬ 
dence.  Feature  sizes  produced  as  a  function  of  exposure  dose 
show  broadening,  consistent  with  the  beam  profile  measured 
at  low  dose,  with  some  evidence  of  electron  reflection  effects 
at high bias and dose. This system, by eliminating scattering
effects and using an ideally thin, self-developing resist,
appears limited only by electron current distribution from the
tip.

Effects of electron scattering, large tip-sample separation
with  thick  resists,  and  postexposure  processing  can  all  limit 
the  size  of  features  that  can  be  fabricated  independently  of 
the  performance  of  a  given  tip.  Exposure  of  self-assembled 
monolayer  resists  should  show  minimal  electron  scattering 
effects  and  minimal  beam  spreading  due  to  increased  tip- 
sample  gap,  and  most  accurately  reflect  the  beam  diameter  at 
the surface. Marrian et al. have exposed monolayers of a
phenylethylenediaminemethoxysilane resist, approximately
1 nm thick, and achieved linewidths of <10 nm for low
voltage (<10 V) exposures. Exposures of
4-chloromethylphenyltrichlorosilane at 8 V produced features <20 nm
(measurement limited by the grain size of the Ni
subsequently plated on the sample). Direct comparison to our
simulations  is  not  possible,  but  these  studies  appear  to  be 
approaching  the  limit  of  patterning  ability  determined  by 
beam  diameter. 

Zhang et al. have examined polymethylmethacrylate
(PMMA) and polyvinylcinnamate films prepared by the
Langmuir-Blodgett technique that give 4.5-14-nm-thick
films. Linewidths of 30-100 nm were observed after
exposure at 20-30 V sample bias, and various development and
pattern  transfer  procedures.  These  dimensions  are  consider¬ 
ably  larger  than  the  expected  beam  diameter  at  these  condi¬ 
tions  and  probably  reflect  the  effects  of  resist  development 
and  subsequent  etching  processes  on  broadening  of  the  fea¬ 
ture  dimension. 

A  number  of  investigators  have  examined  thicker  films  of 
conventional electron beam resists prepared by spin-on techniques.
McCord and Pease report fabrication of 20 nm linewidths in
20-nm-thick films of PMMA. Dobisz and Marrian and Marrian et al.
report linewidths from 30 to 50 nm for 50-80-nm-thick resist
(Shipley SAL-601) films.
These  results  apparently  show  little  broadening  due  to  resist 
processing  and  are  consistent  with  broadening  of  the  electron 
beam  profile  by  the  presence  of  a  thick  resist  film. 

While  a  number  of  studies  appear  to  be  approaching  the 
beam-limited  performance,  definition  of  sub- 10  nm  features 
has  only  been  demonstrated  for  atomic  or  molecular  layer 
resists.  As  suggested  by  the  simulations,  beam  spreading 
(and  ultimately  electron  scattering)  in  thicker  (>10  nm)  re¬ 
sist  films  leads  to  larger  feature  sizes.  Routine  sub- 10  nm 
lithography  may  be  limited  to  very  thin  resists,  such  as  self- 
assembled  monolayer  films,  or  the  adsorbed  atomic  or  mo¬ 
lecular  layer. 

IV.  SUMMARY 

We  have  presented  a  systematic  study  of  the  performance 
of  STM-based,  low  energy  electron  beam  lithography  based 
on  simulations  of  field  emission  from  STM  tips  in  close 
proximity  to  a  substrate  surface.  The  major  factor  in  deter¬ 
mining  the  beam  diameter  is  the  tip-sample  gap,  which  is  set 
to  obtain  the  necessary  field  strength  at  the  tip  for  the  desired 
emission.  The  beam  diameter  increases  as  a  function  of  bias 
because  of  the  larger  gap  at  higher  voltage  and  subsequent 
spreading  of  the  electron  trajectories.  At  a  given  bias  there  is 
a  trade-off  between  tip  radius  and  tip-sample  gap  that  leads 
to  an  optimum  tip  radius  for  minimum  beam  size.  At  low 
bias  (<10  V)  sharp  tips  give  smallest  beam  size,  whereas  at 
high  bias  (>40  V)  more  blunt  tips  result  in  smaller  beam 
diameter.  For  optimal  tip  radius  the  beam  diameter  increases 
linearly  with  bias  from  approximately  2  nm  at  5  V  to  25  nm 
at  50  V.  Beam  diameter  is  nearly  independent  of  emission 
current  over  the  range  0.05-50  nA. 
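
As a rough worked check of the linear trend quoted above (a sketch that assumes a straight line through the two quoted end points; the symbols d and V are introduced here only for illustration):

$$d(V) \approx 2\ \text{nm} + \frac{25 - 2}{50 - 5}\,\frac{\text{nm}}{\text{V}}\,(V - 5\ \text{V}), \qquad d(10\ \text{V}) \approx 2 + 0.51 \times 5 \approx 4.6\ \text{nm}.$$

This interpolated value is consistent with the beam diameter of ~5 nm at 10 V bias reported for all tip geometries.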

Dielectric  resist  films  interposed  between  tip  and  sample 
cause  an  increase  in  beam  diameter  due  to  increased  tip- 
substrate  gap.  Reduction  of  the  tip  work  function  results  in  a 
dramatic  increase  in  beam  diameter  due  to  the  larger  gap 
required  to  obtain  the  desired  emission  from  a  low  work 
function  tip. 

We  have  examined  alternative  tip  structures  intended  to 
tailor  the  electric  field  in  the  vicinity  of  the  tip  and  focus  the 
electron  beam  to  a  smaller  diameter.  Asperities  consisting  of 
a  small  radius  hemisphere  on  a  nearly  flat  surface  produce 
beams  with  significantly  smaller  diameter  than  those  pro¬ 
duced  by  a  standard  tip  of  the  same  emitter  radius.  However, 
for  sample  bias  <15  V  the  beam  diameter  becomes  relatively 
insensitive  to  tip  geometry,  converging  on  a  beam  diameter 
of  ~5  nm  at  10  V  bias  for  all  tips. 

The  simulations  agree  quite  well  with  experiments  that 
directly  measure  the  current  distribution  at  the  sample.  These 
results,  using  adsorbed  atomic  H  as  a  resist,  display  feature 
sizes  limited  only  by  the  characteristics  of  the  electron  emit¬ 
ter.  Experiments  using  molecular  layer  resists,  such  as  self- 
assembled  monolayer  films,  also  approach  resolution  limits 
set  by  the  electron  beam  diameter.  Given  the  results  of  the 
simulations presented here and the beam-diameter-limited
performance of a number of experimental systems, we are
confident that routine sub-10 nm structure fabrication using
STM-based  low  energy  electron  beam  lithography  is  feasible 
with current techniques.

The resolution of electron-beam lithography is limited by the proximity effect, which is due to the
scattering  of  incident  electrons.  This  scattering  leads  to  exposure  of  neighboring  areas  and, 
therefore,  to  pattern  degradations.  It  can  be  modeled  by  convolving  the  incident  dosage  distribution 
with  a  point  spread  function  and  the  subsequent  developing  process  by  a  pointwise  nonlinear 
function.  This  article  presents  an  iterative  algorithm  that  exploits  this  overall  model  to  compute 
corrections  of  the  proximity  effect  in  the  nanometer  range.  A  convex  error  function  is  derived  that 
satisfies  all  physical  constraints  and  the  problem  is  set  up  as  a  convex,  constrained,  and  nonlinear 
minimization  problem.  The  correction  algorithm  is  a  hybrid  of  the  conjugate  gradient  and  gradient 
projection  algorithms.  Its  performance  for  a  test  pattern  is  evaluated  and  then  a  detailed  analysis  of 
the  effect  of  inaccuracies  of  the  model  parameters  and  of  the  necessary  quantization  of  the  corrected 
incident dosage distribution is presented. © 1996 American Vacuum Society.


I. INTRODUCTION

The  resolution  of  electron-beam  lithography  is  limited  by 
the  so-called  proximity  effect,  which  describes  degradations 
of  the  pattern  during  exposure  due  to  forward  scattering  of 
electrons  on  their  path  through  the  resist  and  due  to  back¬ 
ward  scattering  of  electrons  from  the  substrate.  The  scattered 
electrons  change  locally  the  chemical  properties  of  the  resist, 
which  in  turn  causes  different  solution  rates  of  the  developer, 
so  that  areas  exposed  by  scattered  electrons  are  also  partly 
removed. 

In 1975 Chang suggested modeling the scattering as a
convolution  of  the  incident  dosage  distribution  with  a  prox¬ 
imity  function.  The  proximity  function  describes  the  distri¬ 
bution  of  the  received  exposure  at  some  resist  depth  for  a 
pointlike  incident  electron  beam.  His  proximity  function 
consists  of  a  sum  of  two  Gaussian  terms,  a  narrow  one  for 
the  forward  scattering  and  a  wide  one  for  the  backward  scat¬ 
tering.  This  model  has  undergone  some  extensions  since 
1975,  but  is  widely  used  to  design  correction  algorithms  for 
the  proximity  effect. 

The  idea  of  many  correction  techniques  is  to  modify  the 
incident  dosage  distribution  so  that  the  degradations  in  the 
developed  pattern  due  to  the  proximity  effect  are  reduced. 
Specifications  on  the  accuracy  of  the  correction  depend 
strongly  on  several  conditions,  such  as  the  actual  physical 
system,  the  feature  sizes  of  interest,  the  basic  properties  and 
tolerances  on  the  geometry  of  the  features,  etc.  Therefore,  a 
wide variety of approaches and techniques has been developed
during the last 20 years. An issue of major concern
in all the published work is the trade-off between accuracy
and computational efficiency in designing the
correction.  However,  with  decreasing  feature  sizes  more  so¬ 
phisticated  algorithms  are  necessary  to  achieve  the  required 
resolution. 

We  present  here  an  algorithm  that  allows  correction  for 
features in the nanometer range. It incorporates two important
ideas to obtain high resolution. First, it is based on an
extended  mathematical  model  that  includes  the  nonlinear  ef¬ 
fects  of  the  developing  process.  Second,  it  computes  an  in¬ 
dividual  incident  dosage  value  for  every  incident  beam  loca¬ 
tion  to  achieve  the  highest  possible  resolution. 

In Sec. II the assumed model is described in detail.
For  an  efficient  implementation  of  proximity  effect  correc¬ 
tion,  fast  Gaussian  filters  are  necessary.  This  issue  is  ad¬ 
dressed  in  Sec.  III.  Section  IV  then  defines  a  convex  error 
function  which  reduces  the  problem  of  proximity  effect  cor¬ 
rection  to  a  constrained  nonlinear  optimization  problem.  The 
final  algorithm  presented  in  Sec.  V  is  a  hybrid  of  gradient 
projection  and  conjugate  gradient  algorithms.  Simulation  re¬ 
sults  and  an  error  analysis  are  given  in  Secs.  VI  and  VII. 

In  the  following  we  use  the  term  exposure  for  the  received 
energy  in  the  resist  due  to  incident  electrons  and  scattering 
effects. 


II.  DESCRIPTION  OF  THE  MODEL 

The  model  for  the  overall  lithography  process  consists  of 
two  steps,  the  scattering  of  electrons  described  by  the  prox¬ 
imity  function  and  the  chemical  developing  step  leading  to  a 
three-dimensional  resist  relief.  Recent  results showed 
that Chang's double Gaussian model for the proximity function
F(x,y) is not accurate enough if very small feature dimensions
are approached. Since additional Gaussian terms
diminish this problem, we define F(x,y) to be the
general sum of n Gaussians:


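$$F(x,y) = \frac{1}{\pi \sum_{i=1}^{n} \eta_i} \sum_{i=1}^{n} \frac{\eta_i}{\sigma_i^2} \exp\left(-\frac{x^2 + y^2}{\sigma_i^2}\right). \qquad (1)$$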


The $\eta_i$ define the contribution of each term. In this form
F(x,y) is normalized:

$$\int_{-\infty}^{\infty} \int_{-\infty}^{\infty} F(x,y)\,dx\,dy = 1. \qquad (2)$$
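
The normalization can be checked numerically. The following is a minimal sketch, assuming each Gaussian term has the form (eta_i / sigma_i^2) exp(-r^2 / sigma_i^2) with the common prefactor 1 / (pi * sum of eta_i), which is the usual generalization of Chang's double Gaussian; the exact convention is an assumption here. The function names are hypothetical, and only the first two parameter pairs quoted in Sec. VI are used.

```python
import math

def proximity(r, etas, sigmas):
    """Sum-of-Gaussians proximity function F at radius r = sqrt(x^2 + y^2)."""
    norm = math.pi * sum(etas)
    return sum(eta / s ** 2 * math.exp(-(r / s) ** 2)
               for eta, s in zip(etas, sigmas)) / norm

def radial_integral(etas, sigmas, r_max, steps=100_000):
    """Integrate F over the plane in polar form (midpoint rule on 0..r_max)."""
    h = r_max / steps
    total = 0.0
    for i in range(steps):
        r = (i + 0.5) * h
        total += proximity(r, etas, sigmas) * 2.0 * math.pi * r * h
    return total

etas = [1.0, 0.587]      # eta_1, eta_2 as quoted in Sec. VI
sigmas = [0.053, 3.55]   # sigma_1, sigma_2 in micrometers
total = radial_integral(etas, sigmas, r_max=30.0)
print(round(total, 4))
```

Under the assumed convention every term integrates to pi * eta_i, so the prefactor makes the total come out at one regardless of how many terms are used.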


In  all  devised  proximity  correction  algorithms,  it  is  implicitly 
assumed that F(x,y) is space invariant and that at every
pixel linear superposition holds for the exposure. Thus, the
exposure distribution in the resist E(x,y) can be determined
by a two-dimensional convolution of the applied incident
dose distribution D(x,y) and the proximity function F(x,y):

$$E(x,y) = c\,D(x,y) ** F(x,y), \qquad (3)$$

where  c  is  a  constant  factor  and  **  denotes  a  two- 
dimensional  convolution. 

The lithographical process is stepped in horizontal directions,
so that the beam is applied at discrete rectangular grid
points with an equal spacing $\Delta$. Since the incident beam has
a Gaussian distribution for most lithographical equipment, it
can simply be incorporated into F(x,y) by adjusting the
$\sigma_i$ in Eq. (1). The incident dose distribution D(x,y) is then

$$D(x,y) = \sum_{k=0}^{M-1} \sum_{l=0}^{N-1} d(k,l)\,\delta(x - k\Delta,\, y - l\Delta), \qquad (4)$$


where M and N are the number of grid points in the x and y
direction and d(k,l) is the dose applied at the grid point
$(k\Delta, l\Delta)$. Thus, E(x,y) can be determined by the superposition
of a finite number of continuous functions. In particular,
if we are only interested in discrete points on a rectangular
grid with spacing $\Delta/\nu$ with $\nu$ being an integer, then
$E[n_1(\Delta/\nu), n_2(\Delta/\nu)]$ can be determined by a discrete
convolution.

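If we define

$$\bar{d}(n_1,n_2) = \begin{cases} \dfrac{1}{D_0}\, d\!\left(\dfrac{n_1}{\nu}, \dfrac{n_2}{\nu}\right) & \text{if } \dfrac{n_1}{\nu} \text{ and } \dfrac{n_2}{\nu} \text{ are integers} \\ 0 & \text{otherwise,} \end{cases} \qquad (5)$$

where $D_0$ is a normalization dose, and

$$f(n_1,n_2) = \Delta^2 F\!\left(n_1 \frac{\Delta}{\nu},\, n_2 \frac{\Delta}{\nu}\right), \qquad (6)$$

then a discrete convolution equivalent to Eq. (3) can be written
in a unit-free form:

$$e(n_1,n_2) = \frac{c D_0}{E_0 \Delta^2}\, \bar{d}(n_1,n_2) ** f(n_1,n_2), \qquad (7)$$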


where $E_0$ is a normalization factor for the exposure. It is
important to point out that Eq. (7) is exact. It is not the result
of discretization by approximating a continuous integral by a
discrete sum, as is often done in modeling of continuous
systems. For this research, we always take $\nu = 1$ so that
$\bar{d}(n_1,n_2) = (1/D_0)\,d(n_1,n_2)$.
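
The discrete convolution can be illustrated with a direct, unoptimized implementation. This is a minimal sketch for a single Gaussian term with the grid refinement factor set to one; the function names, the truncation radius, and the 41 x 41 grid are assumptions made for the example, and the constant factor in front of the convolution is taken as one.

```python
import math

def sample_kernel(sigma, delta, radius):
    """f(n1, n2) = delta^2 * F(n1*delta, n2*delta) for one Gaussian term."""
    size = 2 * radius + 1
    kernel = [[0.0] * size for _ in range(size)]
    for i in range(size):
        for j in range(size):
            x = (i - radius) * delta
            y = (j - radius) * delta
            kernel[i][j] = (delta ** 2 * math.exp(-(x * x + y * y) / sigma ** 2)
                            / (math.pi * sigma ** 2))
    return kernel

def convolve2d(dose, kernel):
    """Direct 2D convolution; no dose is assumed outside the pattern."""
    rows, cols = len(dose), len(dose[0])
    radius = len(kernel) // 2
    out = [[0.0] * cols for _ in range(rows)]
    for r in range(rows):
        for c in range(cols):
            d = dose[r][c]
            if d == 0.0:
                continue  # pixels without dose contribute nothing
            for i in range(-radius, radius + 1):
                for j in range(-radius, radius + 1):
                    rr, cc = r + i, c + j
                    if 0 <= rr < rows and 0 <= cc < cols:
                        out[rr][cc] += d * kernel[i + radius][j + radius]
    return out

# Forward-scatter term only: sigma = 53 nm on a 10 nm grid, truncated at 200 nm.
kernel = sample_kernel(sigma=53.0, delta=10.0, radius=20)
dose = [[0.0] * 41 for _ in range(41)]
dose[20][20] = 1.0                      # a single exposed grid point
exposure = convolve2d(dose, kernel)
kernel_sum = sum(map(sum, kernel))
```

A single exposed grid point reproduces the sampled proximity function, and the kernel samples sum to nearly one because the grid spacing is small compared with sigma. The direct double loop is far too slow for wide backscatter terms, which motivates the fast filters of Sec. III.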


Fig. 1. Characteristic curve for a positive resist.


In  the  subsequent  developing  step,  the  exposed  areas  are 
removed  in  the  case  of  a  positive  resist.  The  process  itself 
depends  on  many  parameters  such  as  resist  composition, 
thickness  of  the  resist,  type  and  application  of  the  developer, 
bake  conditions,  etc.  However,  for  a  fixed  set  of  these  con¬ 
ditions,  a  resist  can  be  characterized  by  a  nonlinear  curve 
(Fig.  1)  giving  the  amount  of  resist  removed  as  a  function  of 
the exposure. The exposure corresponding to a resist removal
of 50% is normalized to 1. Hence, if we neglect effects
due to horizontal etching and a nonuniform energy distribution
in the vertical direction, the developing process can
be described by a nonlinear, pointwise function s(·) which
applies to every pixel. The two intervals of zero slope of s(·)
in  Fig.  1  indicate  regions  where  the  resist  stays  completely 
unchanged  or  is  totally  removed.  They  can  be  exploited  to 
achieve  better  corrections. 

The  overall  two-step  discrete  pattern  formation  process 
can  now  be  written  as 

$$p(n_1,n_2) = s[\bar{d}(n_1,n_2) ** f(n_1,n_2)], \qquad (8)$$

where $p(n_1,n_2)$ is the final resist pattern. A block diagram is
shown in Fig. 2. The normalization constants $D_0$ and $E_0$
were incorporated into the incident dosage distribution
$\bar{d}(n_1,n_2)$.

Fig. 2. Model for the overall proximity effect: the proximity function f(n1,n2) followed by the characteristic function s(·).

The objective is to compute a $\bar{d}(n_1,n_2)$ such that
$p(n_1,n_2)$ approximates a desired pattern as closely as possible.
The only desired resist patterns considered here are
those which are strictly binary and which can be represented
accurately by a discrete pattern $p_{des}(n_1,n_2)$ on the grid
introduced above. Furthermore, we assume in our approach
that  there  is  no  additional  exposure  from  outside  the  pattern. 
For  convenience,  pixels  of  the  desired  pattern  with  com¬ 
pletely  removed  resist  will  be  called  zero  pixels  or  zero  re¬ 
gions,  and  pixels  with  the  original  resist  thickness  one  pixels 
or  one  regions,  respectively. 

To  obtain  a  more  compact  representation  of  the  following 
mathematical  derivations,  a  matrix  notation  widely  used  in 
image  processing  is  adopted  here.  We  use  lexicographical 
ordering  of  the  two-dimensional  signals  to  represent  them  as 
column  vectors  (bold  small  letter).  The  convolution  with  the 
proximity function f(n1,n2) can be written as a matrix multiplication
with a block Toeplitz matrix F (bold capital letter).
The pointwise nonlinearity s(v) is applied to
every element of the vector v.


III.  EFFICIENT  GAUSSIAN  FILTERS 

For  high-energy  electron  beams,  the  spread  of  the  back- 
scattered  electrons  can  be  several  micrometers,  which  trans¬ 
lates  for  a  grid  spacing  of  a  few  nanometers  into  a  two- 
dimensional  proximity  function  of  several  hundred  pixels  in 
radius.  Since  our  proposed  proximity  correction  technique 
will  be  iterative,  numerous  convolutions  of  the  pattern  with 
the  proximity  function  will  be  necessary.  Thus,  very  efficient 
Gaussian filters are needed to compute corrections in a
reasonable  time. 

The  most  efficient  way  to  implement  a  good  approxima¬ 
tion  of  a  Gaussian  filter  is  to  exploit  the  Central  Limit  Theo¬ 
rem,  which  states,  in  a  general  form,  that  the  convolution  of 
a  large  number  of  positive  functions  with  finite  first  and  sec¬ 
ond  moments  converges  under  mild  constraints  in  the  limit  to 
a  Gaussian  function.  Thus,  a  Gaussian  filter  can  be  imple¬ 
mented  by  iteratively  convolving  a  signal  with  an  arbitrary, 
but  simple,  positive  kernel.  The  uniform  box  filter  is  very 
well suited for that purpose. A box filter of length N and
gain N can be implemented by the recursive equation

$$y[n] = y[n-1] + x[n] - x[n-N] \qquad (9)$$

with  only  one  addition  and  one  subtraction  per  output  pixel. 
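
The recursion of Eq. (9) can be sketched in a few lines. This is a minimal illustration that assumes a causal filter with zero input before the first sample; the function name is hypothetical.

```python
def box_filter(x, n):
    """Running sum of the last n samples: y[k] = y[k-1] + x[k] - x[k-n]."""
    y = []
    acc = 0.0
    for k, xk in enumerate(x):
        acc += xk              # one addition per output sample
        if k >= n:
            acc -= x[k - n]    # one subtraction once the window is full
        y.append(acc)
    return y

y = box_filter([1.0, 2.0, 3.0, 4.0, 5.0], 3)
print(y)
```

The cost per output sample does not depend on the window length n, which is what makes wide filters affordable.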

This  idea  can  be  used  to  implement  good  approximations 
of  two-dimensional  Gaussian  filters  with  only  17  floating 
point  operations  per  output  pixel  by  applying  two  box  filters 
along  each  axis  and  along  each  45°  diagonal.  Furthermore, 
the  amount  of  computation  can  be  reduced  to  less  than  eight 
floating  point  operations  for  wide  Gaussian  filters,  because 
methods  from  multirate  signal  processing  can  be  applied.  If 
less  accuracy  is  acceptable,  implementations  with  fewer  than 
two floating point operations per pixel are possible. A detailed
discussion of the resulting filter structures and a stability
analysis for cascaded box filters can be found
elsewhere. The important result here is that Gaussian filters
of any size can be implemented with only a few floating
point operations per pixel. Thus, there is no computational
difference between narrow Gaussians for the forward scattered
and wide Gaussians for the backward scattered electrons.
These can be treated simultaneously in the 2D convolution
of Eq. (8).
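
The Central Limit Theorem argument can be illustrated in one dimension. The sketch below cascades four recursive box filters of length 9 and compares the impulse response with a Gaussian of the same mean and variance. The filter length, the number of passes, and all names are assumptions chosen for illustration; the two-dimensional axis and diagonal structure described above is not reproduced.

```python
import math

def box_filter(x, n):
    """Recursive running sum over the last n samples (gain n), as in Eq. (9)."""
    y, acc = [], 0.0
    for k, xk in enumerate(x):
        acc += xk
        if k >= n:
            acc -= x[k - n]
        y.append(acc)
    return y

def cascaded_box(x, n, passes):
    """Apply the box filter several times, dividing by n to undo its gain."""
    y = list(x)
    for _ in range(passes):
        y = [v / n for v in box_filter(y, n)]
    return y

n, passes = 9, 4
impulse = [1.0] + [0.0] * 63
response = cascaded_box(impulse, n, passes)

# Each box of length n adds variance (n^2 - 1) / 12 and a delay of (n - 1) / 2.
variance = passes * (n * n - 1) / 12.0
center = passes * (n - 1) / 2.0
gauss = [math.exp(-(k - center) ** 2 / (2.0 * variance))
         / math.sqrt(2.0 * math.pi * variance)
         for k in range(len(response))]
max_error = max(abs(a - b) for a, b in zip(response, gauss))
```

With only four passes the impulse response already follows the Gaussian to within a few percent of its peak value, and the cost per sample remains independent of the filter width.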

IV.  DEFINITION  OF  A  CONVEX  ERROR  FUNCTION 

A  specific  error  measure  has  to  be  defined  in  order  to 
compute an optimal incident dose distribution $\bar{d}(n_1,n_2)$. We
consider the squared Euclidean norm of the distance between
the actual developed pattern and the desired pattern. The
proximity effect correction problem is then

$$\min_{\mathbf{d}} \sum_i \left(p_{des,i} - s(\mathbf{F}\mathbf{d})_i\right)^2, \quad \mathbf{d} \ge 0, \qquad (10)$$

where $s(\mathbf{F}\mathbf{d})_i$ denotes the ith element of the vector $s(\mathbf{F}\mathbf{d})$.
The non-negativity of d is significant, because a negative
value would imply the removal of deposited energy. This
definition of the error function (10) has the disadvantage of
not being convex, because s(·) is a nonconvex function.
Therefore, general algorithms widely used in nonlinear programming
may converge only to suboptimal solutions. This
problem can be overcome by modifying Eq. (10) so that
there is a different error function $e_i(\cdot)$ for every pixel. Let
$e_i(\cdot)$ be the error function for the exposure at vector element
$(\mathbf{F}\mathbf{d})_i$; then we can write the total squared error as

$$E(\mathbf{d}) = \sum_i e_i^2[(\mathbf{F}\mathbf{d})_i]. \qquad (11)$$

Due to the binary nature of $\mathbf{p}_{des}$ it is sufficient to introduce
only two different error functions, one for the zero pixels and
the other for the one pixels. To incorporate the nonlinearity
s(·) we define two new functions, $s_1(\cdot)$ and $s_0(\cdot)$:

$$s_0(x) = \begin{cases} s(x) & \text{if } x \in [1,\infty) \\ s'(1)(x-1) + s(1) & \text{if } x \in [0,1), \end{cases} \qquad (12)$$

$$s_1(x) = \begin{cases} s(x) & \text{if } x \in [0,1) \\ s'(1)(x-1) + s(1) & \text{if } x \in [1,\infty), \end{cases} \qquad (13)$$

where s'(·) is the first derivative of s(·). Both $s_1(\cdot)$ and
$s_0(\cdot)$ are continuously differentiable and they allow a
straightforward definition of $e_0(\cdot)$ and $e_1(\cdot)$ for zero or one
pixels, respectively,

$$e_0(\cdot) = s_0(\cdot), \qquad (14)$$

$$e_1(\cdot) = 1 - s_1(\cdot). \qquad (15)$$

It can easily be shown that the resulting E(d) is convex, but
not strictly convex. Hence, there may be more than one global
minimum point, but the set of all global minima is
convex.
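
The effect of the modification can be checked numerically for a single pixel. The sketch below is only an illustration under stated assumptions: the development curve is a hypothetical logistic function (the measured curve of Fig. 1 is not reproduced), its steepness is borrowed from the contrast value quoted in Sec. VI merely as a placeholder, and the sign convention taken for the per-pixel errors is the zero-pixel error equal to s0 and the one-pixel error equal to 1 - s1. Convexity is tested through second differences on a grid.

```python
import math

K = 9.4  # assumed steepness of the stand-in curve (placeholder value)

def s(x):
    """Hypothetical development curve: remaining resist fraction, s(1) = 0.5."""
    return 0.5 * (1.0 - math.tanh(0.5 * K * (x - 1.0)))

def tangent(x):
    """Tangent line to s at x = 1, where s(1) = 0.5 and s'(1) = -K/4."""
    return -0.25 * K * (x - 1.0) + 0.5

def s0(x):  # Eq. (12): keep the curve above x = 1, use the tangent below
    return s(x) if x >= 1.0 else tangent(x)

def s1(x):  # Eq. (13): keep the curve below x = 1, use the tangent above
    return s(x) if x < 1.0 else tangent(x)

def e0(x):
    return s0(x)

def e1(x):
    return 1.0 - s1(x)

def is_convex(g, lo=0.0, hi=3.0, steps=600):
    """True if all second differences of g on a uniform grid are non-negative."""
    h = (hi - lo) / steps
    xs = [lo + i * h for i in range(steps + 1)]
    return all(g(xs[i - 1]) - 2.0 * g(xs[i]) + g(xs[i + 1]) >= -1e-12
               for i in range(1, steps))

naive_zero = is_convex(lambda x: s(x) ** 2)        # squared error for a zero pixel, Eq. (10)
modified_zero = is_convex(lambda x: e0(x) ** 2)    # with the convexified curve
modified_one = is_convex(lambda x: e1(x) ** 2)
```

For this stand-in curve the unmodified squared error fails the test while both modified errors pass it, which is the property the construction is designed to provide.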

In  the  case  of  a  positive  resist,  a  zero  region  of  a  pattern 
will  develop  correctly,  if  the  remaining  resist  thickness  is 
zero.  On  the  other  hand,  a  one  region  of  a  pattern  will  de¬ 
velop  correctly  if  the  remaining  resist  layer  is  sufficiently 
thick.  This  requirement  may  even  be  met  when  about  half  of 
the thickness of the resist layer is removed. Thus, the deviations
in one and zero regions from the desired pattern are of
very different significance in terms of a good correction. This
observation can be incorporated into our correction technique
by using an individual weighting factor $w_i$ for every
pixel. In general, the larger $w_i$ is, the smaller will be the local
error. The best results are achieved by using different
weights at edges, corners, and in the interior of the zero
regions (see Fig. 3) and unit weight in one regions. The
weights should be largest for the interior part of zero regions,
smaller at the edges, and smallest at corners. Such a distribution
allows us to change the location of transition regions to a
certain extent.

Fig. 3. Distribution of the weighting factors in the weighting matrix W.

Putting  all  this  together,  we  can  formulate  the  following 
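minimization problem. Let $e(\cdot,\cdot)$ be a vector element-wise
function of two vector arguments:

$$e(x_i, y_i) = \begin{cases} e_1(x_i) & \text{if } y_i = 1 \\ e_0(x_i) & \text{if } y_i = 0. \end{cases} \qquad (16)$$

The proximity correction problem becomes a nonlinear, convex,
constrained optimization problem

$$\min_{\mathbf{d}} \left[ e(\mathbf{F}\mathbf{d}, \mathbf{p}_{des})^T\, \mathbf{W}\, e(\mathbf{F}\mathbf{d}, \mathbf{p}_{des}) \right], \quad \mathbf{d} \ge 0, \qquad (17)$$

where W is a diagonal matrix with the weighting factors
$w_i$. We will use the notation E(d) for the overall error
term in Eq. (17).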


V.  PROXIMITY  EFFECT  CORRECTION  ALGORITHM 

Since the problem of proximity effect correction is a constrained
minimization problem, the well established framework
of nonlinear optimization theory can be applied. A hybrid
of the Fletcher-Reeves conjugate gradient algorithm,
Rosen's projected gradient algorithm, and the Goldstein-Levitin-Polyak
algorithm showed best performance for
this particular problem. Here we show only the central ideas
and  concepts  behind  our  new  algorithm.  A  detailed  listing  in 
pseudocode  of  the  algorithm  can  be  found  in  the  Appendix. 

The algorithm is a descent method and has, therefore, the
underlying basic form

$$\mathbf{d}_{k+1} = \mathbf{d}_k + \alpha_k \mathbf{h}_k, \qquad (18)$$

where $\mathbf{d}_k$ is the incident dosage vector after k iterations,
$\mathbf{h}_k$ is the search direction, and $\alpha_k$ is the step length. The
sequence $\{\mathbf{d}_k\}$ constitutes the vertices of a polygon in the
multidimensional search space, which converges to the solution
vector d*. Every segment of the polygon is determined
in two steps. First, a descent direction is determined and
second, an appropriate step length is chosen such that

$$E(\mathbf{d}_{k+1}) \le E(\mathbf{d}_k). \qquad (19)$$

Fig. 4. Explanation of convex and concave corners.

The non-negativity constraint in Eq. (17) can be incorporated
by applying a closest point projection P into the set of
non-negative points after every iteration. To ensure convergence,
more stringent requirements than in the unconstrained case
have to be placed on $\alpha_k$. For the new proximity effect
correction algorithm, diverse experiments showed that $\alpha_k$ can be
determined by solving the unconstrained minimization

$$\alpha_k = \arg\min_{\alpha} E(\mathbf{d}_k + \alpha \mathbf{h}_k). \qquad (20)$$

This yields an appropriate $\alpha_k$ with a minimum of computations.
The total error E(d) is locally approximately quadratic,
so that Eq. (20) can be reduced to minimizing a parabola. A
detailed step size rule for $\alpha_k$ can be found in Appendix 2.
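
The reduction of the line search to a parabola can be sketched with a generic three-point fit. This is not the step size rule of Appendix 2; it is a minimal illustration that assumes the error along the search direction is sampled at step lengths 0, a, and 2a. The function name and the probe spacing a are hypothetical.

```python
def parabola_step(phi, a):
    """Minimizer of the parabola through (0, phi(0)), (a, phi(a)), (2a, phi(2a))."""
    p0, p1, p2 = phi(0.0), phi(a), phi(2.0 * a)
    curvature = p0 - 2.0 * p1 + p2
    if curvature <= 0.0:
        return 2.0 * a  # no interior minimum along this line; use the largest probe
    return a * (3.0 * p0 - 4.0 * p1 + p2) / (2.0 * curvature)

# For an exactly quadratic error the fit recovers the true minimizer.
phi = lambda t: (t - 3.0) ** 2 + 1.0
alpha = parabola_step(phi, 1.0)
print(alpha)
```

Only three error evaluations are needed per iteration, each of which costs one convolution with the proximity function.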

If an element $d_{k,i}$ of the vector $\mathbf{d}_k$ is zero, then $\mathbf{d}_k$ lies on
the border of the set of non-negative points, and the corresponding
constraint is said to be active. For $\mathbf{h}_k$ we use a
conjugate direction vector determined in the same way as in
the Fletcher-Reeves conjugate gradient algorithm. However,
we restrict $\mathbf{h}_k$ to the space induced by the current set of
nonactive constraints by setting all $h_{k,i}$ to zero for which
$d_{k,i} = 0$. This implies that once a constraint becomes active, it
stays active at least until a local minimum is found. Hence,
particular care must be taken in the choice of the initial
incident dosage distribution $\mathbf{d}_0$.
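
The interplay of descent step and projection can be illustrated on a one-dimensional toy problem. The sketch below deliberately uses plain projected gradient descent with step halving, a simpler stand-in for the hybrid conjugate gradient scheme described here, not the algorithm of the Appendix. The logistic development curve, the kernel, the weights, the starting dose, and all function names are assumptions made for illustration.

```python
import math

K = 9.4  # assumed steepness of a hypothetical logistic development curve

def s(x):
    """Remaining resist fraction for exposure x (hypothetical; s(1) = 0.5)."""
    return 0.5 * (1.0 - math.tanh(0.5 * K * (x - 1.0)))

def pixel_error(x, target):
    """Convexified error and its derivative, assuming e0 = s0 and e1 = 1 - s1."""
    line, slope = 0.5 - 0.25 * K * (x - 1.0), -0.25 * K   # tangent to s at x = 1
    curve, dcurve = s(x), -K * s(x) * (1.0 - s(x))
    if target == 0:
        return (curve, dcurve) if x >= 1.0 else (line, slope)
    return (1.0 - curve, -dcurve) if x < 1.0 else (1.0 - line, -slope)

def conv(v, f):
    """Same-size 1D convolution with a symmetric kernel f; zero outside."""
    r, n = len(f) // 2, len(v)
    return [sum(v[j] * f[i - j + r] for j in range(max(0, i - r), min(n, i + r + 1)))
            for i in range(n)]

def total_error(d, f, p_des, w):
    return sum(wi * pixel_error(xi, pi)[0] ** 2
               for xi, pi, wi in zip(conv(d, f), p_des, w))

def gradient(d, f, p_des, w):
    terms = []
    for xi, pi, wi in zip(conv(d, f), p_des, w):
        e, de = pixel_error(xi, pi)
        terms.append(2.0 * wi * e * de)
    return conv(terms, f)  # the kernel is symmetric, so F^T acts like F

def correct(p_des, f, w, d0, iterations=50):
    d, e_now = list(d0), total_error(d0, f, p_des, w)
    for _ in range(iterations):
        g = gradient(d, f, p_des, w)
        alpha = 1.0
        while alpha > 1e-12:
            # Descent step followed by projection onto the non-negative doses.
            trial = [max(0.0, di - alpha * gi) for di, gi in zip(d, g)]
            e_trial = total_error(trial, f, p_des, w)
            if e_trial < e_now:      # accept only steps that reduce the error
                d, e_now = trial, e_trial
                break
            alpha *= 0.5
    return d, e_now

raw = [math.exp(-(k / 2.0) ** 2) for k in range(-4, 5)]
f = [v / sum(raw) for v in raw]                # normalized symmetric kernel
p_des = [1] * 8 + [0] * 6 + [1] * 8            # 0 = resist removed, 1 = resist kept
w = [5.0 if p == 0 else 1.0 for p in p_des]    # illustrative weights only
d0 = [1.0 if p == 0 else 0.0 for p in p_des]   # unit dose on the zero pixels
e_start = total_error(d0, f, p_des, w)
d_opt, e_end = correct(p_des, f, w, d0)
```

Because a step is accepted only when it lowers the error, the sequence of errors is monotonically decreasing and every iterate stays non-negative. Conjugate directions and the active-set restriction would be added on top of this skeleton.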

Experiments have shown that the convergence rate during
the first few iterations is higher when those constraints active
in d* are already active in $\mathbf{d}_0$. In several publications it is
assumed that d* is zero wherever $\mathbf{p}_{des}$ is zero. However, it
turns out that admitting incident dosage in zero regions
around convex corners of the pattern (see Fig. 4) improves
the correction results. We, therefore, use a starting distribution
of the form

$$\mathbf{d}_0 = \beta\,\mathbf{p}_{des} + \epsilon\,\mathbf{c}, \qquad (21)$$

where c is a binary pattern. Figure 5 shows c as gray regions
around the convex corners of the black zero regions. Since
the term $\epsilon\mathbf{c}$ is used only to ensure that the constraints around
the convex corners are not active, $\epsilon$ can be chosen very
small. The scaling factor $\beta$ should minimize $E(\beta\,\mathbf{p}_{des})$. It
takes into account that the required incident dosage depends
among other things on the density of the pattern, because the
scattering of electrons directs some of the incident dosage to
the zero regions of $\mathbf{p}_{des}$. Hence, it holds in general that the
smaller the pattern density is, the higher the incident dosage
has to be, since a higher portion of the dosage is directed to
insignificant areas.

Fig. 5. General form of the starting incident dosage distribution $\mathbf{d}_0$ for
the iterative algorithm. Black regions have value $\beta$ and gray regions have
value $\epsilon$.

VI.  PERFORMANCE  OF  THE  ITERATIVE 
ALGORITHM 

A.  Test  data 

For  test  purposes  a  set  of  parameters  for  the  model  was 
defined  using  the  data  for  a  30  kV  electron  beam  onto  silicon 
substrate given by Gentilli et al. The parameters for the
proximity function F(x,y) in Eq. (1) are n = 3, $\eta_1$ = 1,
$\sigma_1$ = 0.053 µm, $\eta_2$ = 0.587, $\sigma_2$ = 3.55 µm, $\eta_3$ = 0.087, and
$\sigma_3$ = … µm. We use in Eq. (4) a grid of $\Delta$ = 10 nm spacing.
The Gaussian with $\sigma_1$ was implemented as a truncated, separable
finite impulse response (FIR) filter, and the Gaussians
with $\sigma_2$ and $\sigma_3$ were computed with the efficient Gaussian
approximation outlined in Sec. III. We use the data for a high
contrast, positive polymethylmethacrylate (PMMA) developer
with contrast $\gamma$ = 9.4 (see Fig. 1).

A 4 × 4 µm² test pattern was designed. In Fig. 6 the zero
regions of the pattern are black and the one regions are
white. The test pattern contains primarily small features with
sizes down to 50 nm, which is less than $\sigma_1$. The most difficult
features to correct are the square holes with adjacent
corners and sidelengths of 50, 100, 200, and 300 nm (lower
right corner of Fig. 6), the small square with a sidelength of
50 nm (upper left corner of Fig. 6), the double cross structure
with linewidths of 50 and 100 nm (upper left part of Fig. 6),
the pine tree structure with a buried inner line (upper right
part of Fig. 6), and closely spaced narrow lines with a linewidth
and a line spacing of 50 nm each (outer left part of
Fig. 6).

It is important to point out that only minor effects due to
the backscattering of electrons are encountered for the test
pattern, because the test pattern is small compared to the
range of the backscatter Gaussian. Thus, the background dosage
due to this effect is small. As this work is primarily
dedicated to the correction of the proximity effect in
nanolithography, we are mainly interested in the ability of the
proposed algorithm to correct small features and details
primarily degraded by the forward scattering. For that purpose
the size of the test pattern in Fig. 6 is sufficient.

Fig. 6. Test pattern of size 4 × 4 µm².

B.  Correction  of  the  test  pattern 

We  evaluate  the  performance  of  the  algorithm  within  the 
model  defined  in  Sec.  II  and  under  the  various  assumptions 
made  about  the  actual  physical  process.  Within  the  first  few 
iterations  of  the  correction  algorithm  the  error  decreases  rap¬ 
idly.  For  larger  numbers  of  iterations  the  decrease  slows 
down.  Therefore,  it  seems  reasonable  not  to  run  the  algo¬ 
rithm  until  it  converges  to  an  optimal  solution,  but  to  use  the 
result  after  about  100-200  iterations  as  the  proximity  cor¬ 
rected  input  pattern  for  the  lithographical  process.  Further¬ 
more,  we  have  observed  that  the  correction  becomes  more 
sensitive  to  inaccuracies  in  the  model  parameters  when  the 
number  of  iterations  is  high. 

To  evaluate  the  quality  of  the  correction  we  create  gray 
slice  plots  of  the  developed  corrected  pattern  for  three  dis¬ 
tinct  regions.  A  zero  region  is  assumed  to  be  completely 
developed  out  if  the  remaining  resist  thickness  is  less  than 
2%  of  the  original  thickness  and  will  be  marked  black.  On 
the  other  hand,  a  one  region  of  the  pattern  is  developed  cor¬ 
rectly  if  the  remaining  resist  layer  is  sufficiently  thick  (we 
require  at  least  60%  of  the  original  thickness).  The  corre¬ 
sponding  regions  will  be  marked  white.  The  remaining  pixels 
with  intermediate  values  specify  transition  regions  and  are 
marked in gray. Figure 7 shows this kind of gray slice plot
for the developed, uncorrected test pattern. Small features of
100 nm width and less exhibit significant deviations.

Fig. 7. Gray slice image of the developed pattern without proximity effect
correction.

Fig. 9. Proximity effect corrected incident dosage after 150 iterations. Black
denotes the highest incident dosage.
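
The gray slice classification can be written as a simple threshold rule on the remaining resist thickness. This is a minimal sketch using the 2% and 60% limits stated above; the function name and the choice to express thickness as a fraction between 0 and 1 are assumptions.

```python
def gray_slice(thickness):
    """Classify a pixel by its remaining resist fraction (0.0 to 1.0)."""
    if thickness < 0.02:
        return "black"   # developed out: less than 2% of the resist remains
    if thickness >= 0.60:
        return "white"   # at least 60% of the original thickness remains
    return "gray"        # transition region

labels = [gray_slice(t) for t in (0.0, 0.019, 0.02, 0.45, 0.60, 1.0)]
print(labels)
```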

In  our  implementation  of  the  correction  algorithm,  we 
used  for  the  weighting  matrix  W  the  weights  350:5:1  for  the 
interior, the edges, and the corners of the zero regions,
respectively. Figure 8 shows the gray slice plot for the developed
corrected pattern after 150 iterations. Nearly all features
are developed out correctly. The calculated incident
dosage  distribution  can  be  seen  in  Fig.  9.  Obviously,  the 
optimal  incident  dosage  distribution  is  a  combination  of  dose 
modulation  and  edge  displacement. 


Fig.  8.  Gray  slice  image  of  the  developed  pattern  after  150  iterations. 


A better understanding of the improvements in the pattern
obtained by our algorithm can be gained by examining small or
complicated features more closely. Two such regions in Fig.
6 are the double cross and the narrow adjacent lines in the
comb structure. Figures 10(a) and 10(b) show a part of the
developed double cross structure without correction and after
150 iterations, respectively. The improvements are remarkable.
In the case of the narrow adjacent lines (left lower
corner of Fig. 6) in Figs. 11(a) and 11(b), the separating
resist wall between the two leftmost lines has reduced height
and wide transition regions. Here the physical resolution limits
of the system are reached.

VII.  ERROR  ANALYSIS 

In  the  evaluation  of  the  algorithm  it  was  assumed  that  the 
model  parameters  were  known  exactly.  The  determination  of 
the  precise  proximity  function  is,  however,  difficult  and  mea¬ 
sured  parameters  for  similar  systems  vary  significantly  in  the 
literature.  Furthermore,  the  lithographical  equipment  might 
only  be  able  to  change  the  incident  dosage  at  a  point  in 
certain  discrete  steps,  whereas  the  corrected  incident  dosage 
distribution  can  contain  any  positive  value.  Thus,  the  actual 
applied  incident  dosage  distribution  would  be  a  quantized 
version  of  the  corrected  one.  In  this  section,  we  will  assess 
the  influence  of  model  parameter  inaccuracies  and  of  quan¬ 
tization  effects  on  the  quality  of  the  developed  pattern. 

A.  Inaccuracies  of  the  proximity  function 

For  our  examinations  we  use  corrected  incident  dosage 
distributions  after  k  iterations  for  the  test  pattern  and  the 
model parameters of Sec. VI. We then determine $E(\mathbf{d}_k)$ for
perturbed  proximity  functions  F  to  simulate  the  usage  of 
inaccurate  model  parameters.  In  every  experiment  only  one 
Fig. 10. A part of the developed double cross structure (a) without correction
and (b) after 150 iterations.


Fig.  11.  Developed  50  nm  narrow  lines  in  comb  structure  (a)  without  cor¬ 
rection  and  (b)  after  150  iterations. 


Gaussian was modified by varying $\sigma_i$, but the general form of
a triple Gaussian proximity function was maintained. We denote
the standard deviation of the modified Gaussian as $\hat{\sigma}_i$.
It represents the proximity function during the actual lithographical
process. In the experiments $\hat{\sigma}_i$ was varied by 20%
around $\sigma_i$. Thus, a $\hat{\sigma}_i$ larger than $\sigma_i$ was used to simulate too
small model parameters, and vice versa. Figure 12 shows the
results for the wide Gaussian with standard deviation $\sigma_2$
modeling the backward scattering of the electrons. Obviously,
$E(\mathbf{d}_k)$ is more sensitive to too large model parameters
than to too small ones. The effect of varying $\sigma_2$ slightly
decreases with the number of iterations. The main conclusions
for the wide Gaussian are that small deviations of $\sigma_2$
have only little influence on $E(\mathbf{d}_k)$, that this characteristic is
fairly independent of the number of iterations, and that it is
preferable to use a $\sigma_2$ in the model which is too small rather
than too large.

Variations of the narrow Gaussian with $\sigma_1$ modeling the
forward scattering of the electrons (see Fig. 13) reveal a
completely different behavior. With an increasing number of
iterations the sensitivity of $E(\mathbf{d}_k)$ for $\hat{\sigma}_1/\sigma_1 < 1$ increases dramatically.
Of particular importance are the intersections of
error  curves  for  different  numbers  of  iterations.  They  show 
that  inaccuracies  in  the  parameters  for  the  forward  scattering 
Gaussian  can  actually  deteriorate  the  correction  results  when 
the  number  of  iterations  is  increased.  This  effect  can  easily 
be  explained.  The  corrected  incident  dosage  distribution  after 
150  iterations  (see  Fig.  9)  compensates  for  the  proximity 
effect  not  only  by  modulation  of  the  incident  dosage  but  also 


Fig. 12. Sensitivity of $E(\mathbf{d}_k)$ to variations of the standard deviation $\sigma_2$. The
x axis is given in $\hat{\sigma}_2/\sigma_2$.


by  exploiting  the  forward  scattering  of  the  electrons  itself. 
Some  regions,  especially  along  edges  and  at  concave  cor¬ 
ners,  are  not  exposed  to  any  incident  electrons,  only  to  scat¬ 
tered  ones.  The  correct  exposure  in  these  regions  depends 
completely  on  the  accuracy  of  the  narrow  model  Gaussian.  If 
it  is  larger  than  the  Gaussian  in  the  lithographical  process, 
significant  pattern  deteriorations  occur,  as  some  areas  do  not 
receive any exposure. In general it is advantageous to use $\sigma_1$
too small rather than too large.

The influence of the third Gaussian term is insignificant.
For the same kind of experiments the increase of $E(\mathbf{d}_k)$ due
to variations of $\sigma_3$ never exceeded 1%. The main result of
this  analysis  is  that  the  narrow  Gaussian  is  the  most  critical 
part  of  the  proximity  function.  Its  accuracy  determines  the 
achievable  quality  in  the  corrected  developed  pattern.  Since 
the  experimental  determination  of  the  exact  proximity  func¬ 
tion  in  general  and  of  the  narrow  Gaussian  in  particular  is 
very  difficult,  the  best  set  of  model  parameters  probably  has 


Fig. 13. Sensitivity of $E(\mathbf{d}_k)$ to variations of the standard deviation $\sigma_1$. The
x axis is given in $\hat{\sigma}_1/\sigma_1$.



Fig. 14. Sensitivity of $E[Q(\mathbf{d}_k)]$ to uniform quantization for different quantization
step sizes.


to  be  determined  by  several  test  corrections  for  slightly 
modified,  experimentally  determined  sets  of  parameters. 

B.  Errors  due  to  quantization 

To examine the influence of the dosage quantization,
$E[Q(\mathbf{d}_k)]$ was computed for uniform midtread quantizers
$Q(\cdot)$ with different quantization step sizes. In lithography
systems  the  minimum  quantization  step  size  is  given  by 
some  minimum  shutter  time.  Figure  14  shows  the  results  for 
the  corrected  incident  dosage  after  10,  25,  50,  100,  and  150 
iterations  and  normalized  quantization  step  sizes  from  0  to  2. 
The  normalization  constant  is  the  same  as  introduced  in  Sec. 
II,  i.e.,  a  normalized  incident  dosage  of  magnitude  one  over 
a  large  area  leads  to  50%  removal  of  the  resist  during  the 
developing  process.  The  sensitivity  of  the  error  to  quantiza¬ 
tion  seems  to  decrease  in  general  with  an  increasing  number 
of iterations of the correction algorithm. This can be explained
by examining the development of the histogram of
nonzero values of the vector elements of $\mathbf{d}_k$ for increasing $k$.
The histogram of the uncorrected incident dosage values
consists of one main peak at $\beta$ and a small peak at $\epsilon$ in Eq.
(21). Thus, the quantization introduces the same quantization
error at all points of $\mathbf{d}_0$ with value $\beta$ or $\epsilon$, respectively.
Hence, $\mathbf{d}_0$ is very sensitive to quantization. After a few iterations
the histogram of $\mathbf{d}_k$ is still mainly concentrated around
$\beta$ [see Fig. 15(a)]. However, for an increasing number of
iterations the histogram of $\mathbf{d}_k$ widens and loses its distinct
peak [see Fig. 15(b)]. A uniform quantizer is optimal in the
mean  square  sense  for  uniform  distributions.  Thus,  with  an 
increasing  number  of  iterations  the  sensitivity  of  the  overall 
error  to  quantization  decreases  in  general.  However,  as  we 
can  see  for  the  case  of  100  iterations,  extraordinarily  large 
errors  can  be  introduced  under  certain  conditions.  In  this 
particular  case,  the  incident  dosage  of  a  large  part  of  a  zero 
region  was  mapped  to  a  too  small  value.  Due  to  the  high 
weighting  factor  for  zero  regions  in  the  error  norm  (17),  the 
increase  in  the  overall  error  becomes  very  significant.  It  is 
Fig. 15. Histograms for nonzero values of the incident distributions after (a)
50 and (b) 150 iterations.

interesting that this effect occurs only for a certain number of
iterations and disappears again for higher $k$, as the distribution
of $\mathbf{d}_k$ changes. Figure 16 shows the gray slice plot for
$\mathbf{d}_{150}$ developed with subsequent quantization of step size one
and a modified $F$ in the lithographical process with a $\hat{\sigma}_2$ that
is 5% smaller than the $\sigma_2$ in the correction algorithm. The
quality of the result is still very good and only a few additional
artifacts appear.
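
The quantization experiment can be illustrated with a minimal sketch of a uniform midtread quantizer, assuming it rounds each dosage value to the nearest multiple of the step size, so that zero is itself an output level. The function names and the sample values are hypothetical and chosen only for illustration; they are not taken from the experiments described here.

```python
import math


def midtread_quantize(dose, step):
    """Round every value to the nearest multiple of `step` (midtread)."""
    if step <= 0:
        return list(dose)  # a step of 0 is taken to mean "no quantization"
    # floor(x/step + 0.5) rounds halves upward and avoids Python's
    # round-half-to-even behaviour
    return [step * math.floor(x / step + 0.5) for x in dose]


def mean_quantization_error(dose, step):
    """Average signed error introduced by quantizing `dose`."""
    quantized = midtread_quantize(dose, step)
    return sum(q - d for q, d in zip(quantized, dose)) / len(dose)


# Hypothetical dose values: one sharply peaked set and one spread-out set.
peaked = [1.2, 1.2, 1.2, 1.2]
spread = [0.9, 1.1, 1.3, 1.5]
bias_peaked = mean_quantization_error(peaked, 0.5)
bias_spread = mean_quantization_error(spread, 0.5)
```

When every value sits at the same level, each one picks up the same rounding error, whereas spread-out values produce errors that partly cancel on average. This is the intuition behind the decreasing sensitivity for wider histograms.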

VIII.  CONCLUSIONS 

A  new  algorithm  to  compute  corrections  to  the  proximity 
effect  in  the  nanometer  range  was  presented.  It  exploits  prop¬ 
erties  of  the  proximity  effect  and  the  subsequent  chemical 
developing  process.  The  devised  mathematical  model  in¬ 
cludes  a  convex  error  norm,  so  that  the  problem  of  proximity 
effect  correction  can  be  reduced  to  a  nonlinear,  convex,  and 
Fig. 16. Gray slice image of the developed pattern with corrected and quantized
incident dosage distribution after 150 iterations and a modified proximity
function in the lithographical process.

constrained  optimization  problem.  The  suggested  correction 
algorithm  is  a  hybrid  of  different  conjugate  gradient  and  gra¬ 
dient  projection  algorithms.  Since  the  incident  dosage  distri¬ 
bution  has  to  be  convolved  several  times  with  the  proximity 
function  during  the  correction,  very  efficient  Gaussian  filters 
were  used.  The  performance  of  the  algorithm  was  demon¬ 
strated  for  a  test  pattern  with  features  as  small  as  50  nm. 
Finally,  the  effect  of  inaccuracies  in  the  model  parameters 
and  of  quantization  of  the  corrected  incident  dosage  distribu¬ 
tion  was  analyzed.  The  Gaussian  term  modeling  the  forward 
scattering  of  the  electrons  has  to  be  known  precisely  in  order 
to  achieve  the  high  resolution  of  this  correction  technique. 
The  quantization  effects,  on  the  other  hand,  are  not  that  sig¬ 
nificant,  so  that  good  results  should  be  achievable  on  most 
lithographical  equipment. 
APPENDIX
1. The algorithm

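First of all, some operators have to be defined. The closest
point projection $P$ into the set of non-negative points is

$$P\mathbf{d} = \max(0, \mathbf{d}), \tag{A1}$$

where $\max(0, \mathbf{d})$ is the vector whose $i$th element is the maximum
of the set $\{0, d_i\}$. The set of active constraints at $\mathbf{d}$ is
given as

$$A(\mathbf{d}) = \{ i : d_i = 0 \}. \tag{A2}$$

Of interest is the subspace $S_Q$ of $\mathbb{R}^N$, which is induced by a
subset $Q \subseteq \{1, 2, \ldots, N\}$ as

$$S_Q = \{ \mathbf{x} \in \mathbb{R}^N : x_i = 0 \ \text{if} \ i \in Q \}. \tag{A3}$$

Particularly important is $S_{A(\mathbf{d})}$, the linear subspace defined
by the set of active constraints at $\mathbf{d}$. $C_Q$ denotes the linear,
orthogonal projection onto $S_Q$. It satisfies

$$C_Q \mathbf{u} = \mathbf{u}, \quad \forall \mathbf{u} \in S_Q, \tag{A4}$$

$$((I - C_Q)\mathbf{u}, C_Q \mathbf{u}) = 0, \quad \forall \mathbf{u} \in \mathbb{R}^N, \tag{A5}$$

where $I$ is the identity operator and $(\cdot,\cdot)$ denotes the inner product.
$C_Q$ can be implemented very efficiently, as the $i$th vector
element of $C_Q \mathbf{u}$ has the form

$$(C_Q \mathbf{u})_i = \begin{cases} 0 & \text{if } i \in Q, \\ u_i & \text{if } i \notin Q. \end{cases} \tag{A6}$$

The conditions for a global solution $\mathbf{d}^*$ of Eq. (17) are

$$C_{A(\mathbf{d}^*)} \nabla E(\mathbf{d}^*) = 0, \tag{A7}$$

$$(I - C_{A(\mathbf{d}^*)}) \nabla E(\mathbf{d}^*) \ge 0. \tag{A8}$$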

Using  these  definitions,  the  algorithm  can  be  written  in  the 
following  form: 


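(1) Use Eq. (21) as the incident dosage $\mathbf{d}_0$.

(2) Determine the set of active constraints $Q = A(\mathbf{d}_0)$.

(3) Compute the projected gradient $\mathbf{g}_0 = C_Q \nabla E(\mathbf{d}_0)$ and set
$\mathbf{h}_0 = -\mathbf{g}_0$.

(4) For $k = 0, 1, \ldots, n-1$:

(a) Use the step size rule of Appendix 2 to determine $\alpha_k$.

(b) Set $\mathbf{d}_{k+1} = P(\mathbf{d}_k + \alpha_k \mathbf{h}_k)$.

(c) Determine the set of active constraints $Q = A(\mathbf{d}_{k+1})$.

(d) If $k < n-1$:

(e) Compute the projected gradient $\mathbf{g}_{k+1} = C_Q \nabla E(\mathbf{d}_{k+1})$.

(f) If $\mathbf{g}_{k+1} \ne 0$:

(g)     Compute $\beta_k = (\mathbf{g}_{k+1}, \mathbf{g}_{k+1}) / (\mathbf{g}_k, \mathbf{g}_k)$;

(h)     set $\mathbf{h}_{k+1} = -\mathbf{g}_{k+1} + \beta_k C_Q \mathbf{h}_k$.

(i) Else $\mathbf{g}_{k+1} = 0$:

(j)     Determine $\mathbf{g}_{k+1} = (I - C_Q) \nabla E(\mathbf{d}_{k+1})$;

(k)     if $\mathbf{g}_{k+1} \ge 0$, $\mathbf{d}_{k+1}$ is the global minimum. Stop;

(l)     else, set $Q$ to  and go to 3.

(m) Next $k$.

(5) Replace $\mathbf{d}_0$ by $\mathbf{d}_n$ and go to 3.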

In practice, $n$ can be set to $\infty$. A more comprehensive
discussion  of  the  algorithm  itself,  its  properties,  and  its  per¬ 
formance  in  comparison  to  other  algorithms  can  be  found  in 
the  thesis.^^ 
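
The structure of the algorithm can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the implementation used for the results in this article: the error norm $E$ is replaced by a toy convex quadratic, an exact line search capped at the constraint boundary stands in for the step size rule of Appendix 2, the conjugate gradient factor is taken in the Fletcher-Reeves form, and the rule for shrinking $Q$ when the projected gradient vanishes (keep only constraints whose gradient component is non-negative) is an assumption. The descent safeguard and all function names are likewise additions for illustration.

```python
def project(d):
    """Closest-point projection onto d >= 0, as in Eq. (A1)."""
    return [max(0.0, x) for x in d]


def active_set(d, tol=1e-12):
    """Indices with d_i = 0, as in Eq. (A2); tol absorbs rounding."""
    return {i for i, x in enumerate(d) if x <= tol}


def c_q(u, q):
    """Orthogonal projection C_Q: zero every component whose index is in q."""
    return [0.0 if i in q else x for i, x in enumerate(u)]


def dot(u, v):
    return sum(x * y for x, y in zip(u, v))


def minimize_nonneg(a, b, d0, n=50, max_iter=500, tol=1e-12):
    """Minimize 0.5*d^T a d - b^T d subject to d >= 0 (toy stand-in for E)."""
    def grad(d):
        return [dot(row, d) - bi for row, bi in zip(a, b)]

    d = project(d0)
    q = active_set(d)
    g = c_q(grad(d), q)                  # step (3): projected gradient
    h = [-x for x in g]
    k = 0
    for _ in range(max_iter):
        full = grad(d)
        if dot(g, g) <= tol:
            # Steps (i)-(l): the projected gradient has vanished.
            if all(full[i] >= -1e-9 for i in q):
                return d                 # optimality conditions hold
            # Assumed release rule: keep constraints with gradient >= 0.
            q = {i for i in q if full[i] >= 0.0}
            g = c_q(full, q)
            h = [-x for x in g]
            k = 0
            continue
        if dot(full, h) >= 0.0:          # safeguard (an addition): force descent
            h = [-x for x in g]
        ah = [dot(row, h) for row in a]
        alpha = -dot(full, h) / dot(h, ah)   # exact line search (quadratic only)
        for di, hi in zip(d, h):             # do not step past the boundary
            if hi < 0.0:
                alpha = min(alpha, -di / hi)
        d = project([di + alpha * hi for di, hi in zip(d, h)])
        q = active_set(d)
        d = c_q(d, q)                    # snap near-zero entries to exactly zero
        g_new = c_q(grad(d), q)
        k += 1
        if k % n == 0:
            h = [-x for x in g_new]      # step (5): periodic restart
        else:
            beta = dot(g_new, g_new) / dot(g, g)
            h = [-gn + beta * hq for gn, hq in zip(g_new, c_q(h, q))]
        g = g_new
    return d


# Toy usage: the closest non-negative point to (1, -2, 3).
identity = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
closest = minimize_nonneg(identity, [1.0, -2.0, 3.0], [0.0, 0.0, 0.0])
```

For the real error norm the gradient and the line search would come from Eq. (17) and Appendix 2; only the bookkeeping of the active set and the search direction carries over from this sketch.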


2.  The  step  size  rule 

As the point-wise error functions $e_0(\cdot)$ in Eq. (14) and
$e_1(\cdot)$ in Eq. (15) are nonlinear only over a small interval, a
local linearization of $e_0(\cdot)$ and $e_1(\cdot)$ will introduce very
little error if the step length $\alpha$ to the local minimum of the
line search problem $E(\mathbf{d}_k + \alpha \mathbf{h}_k)$ is small and if many of the
points $(F\mathbf{d})_i$ for which either $e_0(\cdot)$ or $e_1(\cdot)$ is evaluated lie
in the linear regions of $e_0(\cdot)$ or $e_1(\cdot)$, respectively. In this
case, $E(\cdot)$ in Eq. (17) is well approximated locally by a
quadratic functional. Hence, $E(\mathbf{d}_k + \alpha \mathbf{h}_k)$ is for small $\alpha$ well
approximated by a parabola and the minimization of the
function can be replaced by the determination of the minimum
of the approximating parabola. A parabola is specified
uniquely by three parameters, e.g., from two values and from
the derivative at one point:


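$$y_0 = E(\mathbf{d}_k + 0\,\mathbf{h}_k),$$

$$y_0' = \left. \frac{dE(\mathbf{d}_k + \alpha \mathbf{h}_k)}{d\alpha} \right|_{\alpha = 0} = (\nabla E(\mathbf{d}_k), \mathbf{h}_k),$$

$$y_1 = E(\mathbf{d}_k + \alpha_{k-\gamma} \mathbf{h}_k),$$

where $\gamma$ is some small integer. The first two values can be
computed very efficiently as byproducts of step 4e so that
only one extra evaluation of $E(\cdot)$ is necessary. With this data
$\alpha_k$ is

$$\alpha_k = -\frac{\alpha_{k-\gamma}^2 \, y_0'}{2 \, (y_1 - y_0 - \alpha_{k-\gamma} \, y_0')}. \tag{A9}$$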
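
The parabolic fit behind Eq. (A9) can be sketched as follows. The sketch assumes the parabola is fixed by the value and slope at step length zero plus one further value at a trial step length, here called `beta` (the earlier step size in the text); the function name and the quadratic used in the usage lines are hypothetical.

```python
def parabola_step(y0, dy0, y1, beta):
    """Step length minimizing the parabola through (0, y0) with slope dy0
    at 0 and through (beta, y1)."""
    # Writing the parabola as y0 + dy0*a + c*a**2, this term equals c*beta**2.
    curvature_term = y1 - y0 - beta * dy0
    if curvature_term == 0.0:
        raise ValueError("the three data describe a straight line")
    return -dy0 * beta ** 2 / (2.0 * curvature_term)


# Toy check on E(a) = (a - 3)**2 + 1, whose minimum lies at a = 3.
def toy_error(a):
    return (a - 3.0) ** 2 + 1.0


step = parabola_step(toy_error(0.0), -6.0, toy_error(1.0), 1.0)
```

For an exactly quadratic function the fit recovers the minimizer from any trial step; for the real error norm it is only an approximation. A negative result signals a fit with negative curvature, which is why a safeguard is needed.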


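We have to take special precautions if Eq. (A9) provides a
negative $\alpha_k$ or leads to an $\alpha_k$ which does not decrease the error
norm. This problem is attacked in the following step size
rule:

(1) Determine $\alpha_k$ with Eq. (A9).

(2) If $\alpha_k < 0$:

(a) Set $\beta_1 = \alpha_{k-\gamma}$.

(b) Set $\beta_2 = 2\beta_1$.

(c) Set $y_2 = E(\mathbf{d}_k + \beta_2 \mathbf{h}_k)$.

(d) Determine

$$\alpha_k = -\frac{1}{2} \, \frac{\beta_1^2 (y_0 - y_2) + \beta_2^2 (y_1 - y_0)}{\beta_1 (y_2 - y_0) + \beta_2 (y_0 - y_1)}.$$

(e) If $\alpha_k < 0$:

(f)     Set $y_1 = y_2$, $\beta_1 = \beta_2$.

(g)     Go back to 2b.

(3) If $E(\mathbf{d}_k + \alpha_k \mathbf{h}_k) \ge E(\mathbf{d}_k)$:

(a) Set  =

(b) Set $\beta_2 = \alpha_k$.

(c) Set $y_2 = E(\mathbf{d}_k + \beta_2 \mathbf{h}_k)$.

(d) Determine

$$\alpha_k = -\frac{1}{2} \, \frac{\beta_1^2 (y_0 - y_2) + \beta_2^2 (y_1 - y_0)}{\beta_1 (y_2 - y_0) + \beta_2 (y_0 - y_1)}.$$

(e) If $E(\mathbf{d}_k + \alpha_k \mathbf{h}_k) > E(\mathbf{d}_k)$:

(f)     If $y_1 > y_2$, set $y_1 = y_2$, $\beta_1 = \beta_2$.

(g)     Set $y_2 = E(\mathbf{d}_k + \alpha_k \mathbf{h}_k)$, $\beta_2 = \alpha_k$.

(h)     Go back to 3d.

Step (2) repeats parabolic fits as long as the resulting $\alpha_k$ is
negative. The way the successive $\beta_2$ are picked ensures that
this procedure needs only a finite number of steps. Step (3)
also repeats parabolic fits until the overall error is decreased.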

In all numerical experiments with several test patterns and
hundreds of iterations, steps (2) and (3) were
required only very few times. Thus steps (2) and (3) are
safeguards rather than essential parts of the step size
rule. In almost every case, the result of Eq. (A9) is sufficient.
Hence, the proposed step size rule needs in general only two
additional evaluations of $E(\cdot)$.
Independent parallel lithography using the atomic force microscope
Independent parallel features have been lithographically patterned with a 2×1 array of individually
controlled  cantilevers  using  an  atomic  force  microscope.  Control  of  the  individual  cantilevers  was 
achieved  with  an  integrated  piezoelectric  actuator  in  feedback  with  a  piezoresistive  sensor.  Patterns 
were  formed  on  (100)  single  crystal  silicon  by  using  a  computer  controlled  tip  voltage  to  locally 
enhance  the  oxidation  of  the  silicon.  Using  the  piezoresistor  directly  as  a  force  sensor,  parallel 
images  can  be  simultaneously  acquired  in  the  constant  force  mode.  A  discussion  of  electrostatic 
forces  due  to  applied  tip  voltages,  hysteresis  characteristics  of  the  actuator,  and  the  cantilever  system 
is  also  presented.  ©  1996  American  Vacuum  Society. 


I. INTRODUCTION

The  scanning  probe  microscope  (SPM)  has  demonstrated 
itself  to  be  a  versatile  and  effective  tool  for  patterning  sur¬ 
faces  at  the  nanometer  scale.  Two  common  methods  for 
modifying  surfaces  using  probe  microscopes  are  direct 
physical  patterning  and  electric  field  assisted  patterning. 
While  both  methods  of  surface  modification  are  quite  differ¬ 
ent,  they  both  require  that  a  sharp  tip  interacts  with  the  sur¬ 
face  to  be  patterned. 

Physical  patterning  consists  of  scribing  or  indenting  a 
sample  using  the  tip  of  the  SPM.  Jung^  has  used  this  process 
to  scribe  patterns  into  polymer  surfaces,  and  Mamin^  has 
used  the  physical  indentation  process  in  conjunction  with 
laser  heating  to  store  100  nm  bits  at  100  kHz  in  a  polymer 
surface.  This  approach  has  the  advantage  that  the  sample  is 
typically  much  softer  and  generally  unreactive  with  the  tip, 
thereby  reducing  tip  wear.  The  literature  on  electrical  modi¬ 
fication  of  surfaces  with  probe  microscopes  is  much  more 
extensive.  Scanning  probe  lithography  was  pioneered  by 
Dagata,^  who  patterned  (111)  silicon  with  the  scanning  tun¬ 
neling  microscope  (STM),  and  Lyding"^  has  used  this  same 
technique  in  ultrahigh  vacuum  (UHV)  to  pattern  features  less 
than  a  few  nanometers.  Snow  and  Campbell  have  modified 
this  technique  and  patterned  Si^  and  GaAs^  with  the  atomic 
force  microscope. 

This  form  of  lithography  can  be  used  to  pattern  a  variety 
of  surfaces.^  Using  the  atomic  force  microscope  (AFM)  or 
STM,  various  materials  have  been  patterned,  including  chro¬ 
mium  by  Song,^  titanium  by  Sugimura,^  and  amorphous  sili¬ 
con  by  Kramer.  Recently,  we  have  demonstrated  the  use  of 
amorphous  silicon  as  a  general  resist  for  lithography.^^  Also, 
organic  materials  have  been  modified  by  electrical  means,  as 
demonstrated  by  Majumdar^^  using  PMMA.  Finally, 
Sugimura^^  has  used  the  STM  to  pattern  amorphous  silicon 
and  then  globally  modify  the  surface  by  gold  electroplating. 

Further work in this area has yielded features more complex
than simple lines on substrates. Previously we demonstrated
the compatibility of scanning probe lithography with
semiconductor processing by fabricating a 0.1 μm n-metal-oxide-semiconductor
field-effect transistor (nMOSFET^"^)
with  the  AFM.  Campbell^^  has  fabricated  a  side-gated  tran¬ 
sistor  on  a  silicon-on-insulator  wafer,  and  Matsumoto^^  has 
fabricated  a  single  electron  transistor  by  modifying  titanium. 
More  recently,  Snow^^  has  demonstrated  real  time  litho¬ 
graphic  monitoring  using  AFM  lithography  in  order  to  fabri¬ 
cate  metal-oxide-metal  junctions. 

While  scanning  probe  lithography  has  revealed  its  poten¬ 
tial  value  by  fabricating  operational  devices,  it  suffers  the 
drawback  of  being  very  slow  since  the  scanning  probe  is  an 
inherently  serial  device.  For  this  form  of  lithography  to  be 
accepted  beyond  the  single  device  level,  the  throughput  of 
this  approach  must  be  greatly  improved.  We  have  addressed 
this  problem  in  two  ways,  by  increasing  the  tip  velocity  and 
by  devising  a  parallel  scheme. 

In  general,  the  tip  velocity  of  scanning  probes  is  governed 
by  the  speed  of  the  feedback  control  of  the  microscope, 
which  is  itself  limited  by  the  resonant  frequency  of  the  ac¬ 
tuator  (typically  less  than  1  kHz).  By  integrating  the  actuator 
onto  the  cantilever,  it  is  now  possible  to  work  up  to  near  the 
cantilever  resonant  frequency  (20-100  kHz),  allowing  the  tip 
velocity  to  be  significantly  increased.  With  our  integrated 
actuator,  we  have  shown  tip  velocities  in  feedback  operation 
of  up  to  3  mm/s  with  a  piezoresistive  sensor^ ^  and  up  to  1 
cm/s  with  optical  lever  detection. 

Increased  tip  velocity  reduces  the  time  that  the  tip  will 
take  to  cover  a  given  area,  but,  even  with  the  increased 
speed,  the  time  necessary  for  a  single  tip  to  cover  a  wafer 
area  is  unacceptably  long.  Parallelism  is  an  alternative  ap¬ 
proach  to  circumvent  this  problem.  Previously  we  have  used 
parallel  arrays  of  five  cantilevers  for  imaging  and 
lithography,^^  but  in  that  work,  we  did  not  have  individual 
control  of  the  cantilevers  and  therefore  could  not  operate 
each  cantilever  in  feedback.  We  have  since  fabricated  an  ar¬ 
ray  of  cantilevers  with  integrated  sensors  and  actuators  and 
used  them  for  parallel  imaging  in  feedback.^^  In  this  article, 
we advance our previous work on the 2×1 arrays by modifying
their design to show parallel imaging without the need
for  force  correction.  We  also  demonstrate  that  the  areas 
Fig. 1. Schematic cross-sectional view of a piezoresistive cantilever with
integrated piezoelectric actuator. The piezoresistor lies on the surface of the
silicon, separated from the ZnO actuator by silicon dioxide and silicon nitride.
The piezoresistor doping extends to the apex of the tip.


scanned  by  the  two  cantilevers  can  be  simultaneously  pat¬ 
terned  using  a  system  with  feedback  and  compatible  with 
high  speed  AFM. 


II.  CANTILEVERS 

The  cantilevers  used  in  this  work  are  silicon  piezoresistive 
cantilevers  with  integrated  single  crystal  silicon  tips  devel¬ 
oped  by  Tortonese.^^  We  have  modified  the  cantilever  struc¬ 
ture  to  include  a  zinc  oxide  (ZnO)  actuator,  which  is  fabri¬ 
cated  on  top  of  the  silicon  cantilever  (Fig.  1).  Since  ZnO  is 
piezoelectric,  an  applied  electric  field  will  cause  the  ZnO 
film  to  expand  in  the  direction  of  the  electric  field  and  con¬ 
tract  in  the  lateral  direction.  Due  to  the  bimorph  nature  of  the 
films,  this  contraction  results  in  bending. 

When  imaging,  a  feedback  loop  monitors  the  cantilever 
deflection  with  the  piezoresistor  to  determine  the  voltage  that 
the  ZnO  actuator  needs  to  maintain  constant  force  between 
the  tip  and  the  sample.  The  fabrication  process  and  initial 
results  using  this  type  of  AFM  cantilever  with  an  integrated 
sensor  and  actuator  are  presented  in  Ref.  20. 

The  main  disadvantage  of  the  system  described  in  Ref.  20 
is  that  the  sensor  and  actuator  were  not  completely  mechani¬ 
cally  decoupled.  Since  the  piezoresistive  sensor  extended 
over  the  entire  length  of  the  cantilever,  the  portion  of  the 
sensor  beneath  the  ZnO  deflects  with  the  ZnO,  generating  a 
signal  unrelated  to  the  force  on  the  tip.  In  order  to  take  a 
feedback  image,  it  is  necessary  to  electronically  compensate 
this  mechanical  coupling  which  limited  the  useful  range  of 
the  actuator. 

In  this  work,  we  have  modified  the  cantilever  by  doping 
the  area  beneath  the  ZnO^^  to  eliminate  sensor-actuator  cou¬ 
pling  and  the  need  for  the  correction  circuit.  A  schematic 
diagram  of  the  side  view  of  the  cantilever  including  the  new 
implant is shown in Fig. 1. The increased doping serves two
purposes.  The  piezoresistive  effect^^  can  be  described  by  the 
following  equation: 




Fig. 2. Parallel constant force AFM image of a microcircuit. The ZnO maintains
constant force over 2 μm of topography. Beneath the images (ZnO
signal)  are  the  corresponding  error  signals.  The  reduced  error  signal  scale 
indicates  constant  force  is  being  maintained. 


$$\frac{\Delta\rho}{\rho}(x,y,z) = \pi_L \, S(x,y,z), \tag{1}$$

where $\rho$ is the resistivity of the cantilever material, $\pi_L$ is the
longitudinal piezoresistive coefficient, and $S$ is the stress.
Assuming  a  constant  piezoresistor  width  and  depth,  AR/R  is 
found  by  integrating  and  normalizing  Eq.  (1)  over  the  length 
of  the  resistor.  Our  sensing  circuitry  measures  changes  in 
resistance  of  the  entire  cantilever  (both  the  area  under  the 
ZnO  and  the  tip  area,  which  contains  the  active  piezoresis¬ 
tive  sensor).  By  increasing  the  doping  under  the  ZnO  region, 
we decrease the piezoresistive coefficient ($\pi_L$) by 80%, a
reduction  that  directly  affects  the  contribution  to  the  sensor 
signal  induced  by  the  actuator.  Another  improvement  comes 
from  the  nature  of  the  fractional  change.  As  the  doping  in¬ 
creases,  the  resistance  decreases,  and  as  a  result,  the  absolute 
change  in  resistance  from  a  given  stress  is  reduced. 
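
The two effects of the increased doping can be made concrete with a small sketch. It assumes a resistor of constant width and depth split into segments with constant resistivity, piezoresistive coefficient, and stress, so that the fractional resistance change is the resistivity-weighted average of the product of coefficient and stress. The function name and all numbers are hypothetical, dimensionless placeholders, not values from the device described here.

```python
def fractional_resistance_change(segments):
    """Return dR/R for a series resistor of constant cross section.

    `segments` holds tuples (length, resistivity, pi_l, stress), each with
    constant values along the segment.
    """
    total = sum(length * rho for length, rho, _, _ in segments)
    delta = sum(length * rho * pi_l * stress
                for length, rho, pi_l, stress in segments)
    return delta / total


# Stress applied only under the actuator; the tip segment is unstressed.
tip = (1.0, 1.0, 1.0, 0.0)
before = fractional_resistance_change([(1.0, 1.0, 1.0, 1.0), tip])
# Coefficient reduced by 80% under the actuator.
lower_pi = fractional_resistance_change([(1.0, 1.0, 0.2, 1.0), tip])
# Additionally assume a tenfold lower resistivity under the actuator.
lower_both = fractional_resistance_change([(1.0, 0.1, 0.2, 1.0), tip])
```

In this toy setting the actuator-induced signal drops with the coefficient and drops again once the heavily doped segment contributes less to the total resistance.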

The  small  coupling  permits  imaging  without  the  need  for 
correction  circuitry.  An  uncorrected  parallel  constant  force 
image  of  an  integrated  circuit  containing  vertical  topography 
of 2 μm is presented in Fig. 2. The top images are the voltages
controlling the ZnO actuator and represent the surface
topography,  while  the  lower  images  are  the  piezoresistor  sig¬ 
nals  which  represent  the  error  signals.  For  constant  force 
imaging  the  error  signals  should  be  zero.  The  reduced  scale 
on  the  error  signal  images  shows  that  constant  force  is  effec¬ 
tively  maintained. 

The  frequency  response  of  the  cantilevers  has  been  pre¬ 
sented  in  detail  in  previous  work.^^"^^  The  hysteresis  and 
linearity  measurements  of  the  cantilever  are  presented  in  Fig. 
3.  Linearity  and  hysteresis  measurements  of  the  actuator 
were  obtained  by  measuring  the  cantilever  deflection  with  an 
optical lever system. The optical system is calibrated to the
out-of-contact movement of the cantilever in the following
manner: The tip is brought close to the surface of a calibrated
piezotube. The voltage to the ZnO is ramped until the tip
strikes the sample. The piezotube is then retracted from the
tip  by  a  known  amount.  The  amount  of  additional  voltage  to 
Fig. 3. (a) Hysteresis curve for the ZnO actuator. The points represent the
data,  while  the  solid  line  is  a  simulation  of  a  Gaussian  beam  passing  over  a 
split  photodiode.  Inset:  Expanded  view  of  the  data  around  zero  applied 
voltage  showing  20  nm  maximum  hysteresis,  (b)  Linearity  plot  for  the  ZnO 
actuator.  The  solid  line  is  a  composite  curve  of  four  photodiode  positions 
representing  the  overall  linearity  curve  of  the  actuator.  The  points  of  the  first 
and  final  windows  used  to  construct  the  overall  plot  are  included.  This 
method  was  required  because  the  cantilever  displaced  the  beam  beyond  the 
linear  range  of  the  photodiode. 


the  ZnO  required  to  bring  the  tip  back  into  contact  with  the 
surface  provides  a  calibration  for  tip  movement  in  air  for  a 
change  in  ZnO  voltage.  The  cantilever  is  then  deflected  by 
the  ZnO  in  air  while  the  output  of  the  laser  is  monitored.  The 
slope  of  the  response  combined  with  the  previous  result 
gives  the  change  in  laser  output  for  a  given  displacement  of 
a  freely  suspended  tip. 

Figure  3(a)  shows  the  output  of  a  split  photodetector 
when  the  ZnO  voltage  is  swept  from  —40  to  40  V  and  then 
back  to  -40  V.  The  inset  of  Fig.  3(a)  shows  an  expanded 
view  of  the  hysteresis  loop  around  zero  volts  applied  voltage. 
The  maximum  hysteresis  of  the  ZnO  in  this  voltage  range  is 
20  nm.  Unfortunately,  it  is  not  possible  to  determine  the  lin¬ 
earity  of  the  actuator  from  Fig.  3(a).  The  split  photodiode 
detector  used  to  measure  the  position  of  the  beam  reflected 
off  the  cantilever  has  a  limited  range  for  which  the  output  is 
linear  with  respect  to  cantilever  deflection.  This  limitation 
occurs  because  the  size  of  the  reflected  beam  is  finite  and 
must  cover  an  adequate  portion  of  both  sides  of  the  detector. 
In  our  microscope,  the  extent  of  the  linear  range  covers 
roughly 1 μm of deflection and is insufficient to characterize
the  entire  range  of  the  ZnO  actuator.  It  is  interesting  to  note 
that  the  sigmoidal  shape  of  Fig.  3(a)  reflects  the  Gaussian 
distribution  of  our  laser  diode.  We  have  also  simulated  a 
Gaussian  beam  traversing  a  split  photodiode  and  plotted  the 
results  with  the  data  in  Fig.  3(a). 
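
The sigmoidal response can be reproduced with a short sketch. It assumes an ideal Gaussian beam with $1/e^2$ intensity radius `w`, a split photodiode with no gap between its halves, and a normalized difference signal $(A-B)/(A+B)$; under those assumptions the signal is an error function of the beam position. The function name and parameter values are hypothetical, and this is not necessarily the simulation plotted in Fig. 3(a).

```python
import math


def split_photodiode_signal(x, w):
    """Normalized difference signal for a Gaussian beam centered a distance
    x from the split, with 1/e^2 intensity radius w."""
    return math.erf(math.sqrt(2.0) * x / w)


# Response is nearly linear close to the split and saturates far from it.
near = split_photodiode_signal(0.01, 1.0)
far = split_photodiode_signal(3.0, 1.0)
small_signal_slope = 2.0 * math.sqrt(2.0 / math.pi)  # per unit x/w near zero
```

The linear window is therefore a fraction of the beam radius on the detector, which is consistent with the limited range noted in the text.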

In  order  to  measure  linearity  of  the  actuator,  we  manually 
translated  the  linear  window  of  the  detector  in  discrete  steps 
so  that  it  covered  the  entire  range  of  the  reflected  laser  beam. 
This  provided  us  with  a  series  of  curves,  each  of  which  has 
only  a  small  linear  range.  We  then  extracted  and  shifted  the 
linear  data  from  each  curve  to  construct  the  overall  linearity 
curve.  The  data  was  shifted  such  that  overlapping  voltages  in 
adjacent  windows  had  the  same  deflection. 
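
The stitching procedure can be sketched as follows, assuming each window is stored as a mapping from applied voltage to measured deflection, that adjacent windows share at least one voltage, and that a single constant offset per window is enough to align them. The function name and the synthetic numbers are hypothetical.

```python
def stitch_windows(windows):
    """Merge overlapping voltage -> deflection windows into one curve.

    Each later window is shifted by a constant so that its deflections at
    the shared voltages agree (on average) with the curve built so far.
    """
    composite = dict(windows[0])
    for window in windows[1:]:
        shared = [v for v in window if v in composite]
        if not shared:
            raise ValueError("adjacent windows must overlap in voltage")
        offset = sum(composite[v] - window[v] for v in shared) / len(shared)
        for v, z in window.items():
            composite.setdefault(v, z + offset)
    return composite


# Synthetic example: a linear actuator seen through two shifted windows.
first = {v: 0.05 * v for v in (-40, -30, -20, -10, 0)}
second = {v: 0.05 * v - 0.7 for v in (0, 10, 20, 30, 40)}
curve = stitch_windows([first, second])
```

Averaging over all shared voltages, rather than matching a single point, keeps one noisy sample from shifting an entire window.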

Two  such  windows,  which  represent  the  initial  and  final 
data  curves  used  for  constructing  the  overall  plot,  along  with 
the final construction, are plotted in Fig. 3(b). In these windows,
the detector position was adjusted so that the response is
linear  at  the  maximum  excursion  of  the  cantilever.  Once  the 
beam  is  completely  deflected  to  one  side  of  the  photodiode, 
the  detector  cannot  provide  information  about  the  cantilever 
deflection  and  the  curves  flatten.  The  composite  curve  of  Fig. 
3(b)  shows  excellent  linearity  of  the  device  over  its  entire 
range of 3 μm.

III.  ELECTROSTATIC  FORCES 

Electrical  AFM  lithography  involves  applying  a  voltage 
between  the  tip  and  the  substrate  to  be  modified.  The  force 
produced by the applied voltage is the negative spatial derivative
of the energy, $U = \tfrac{1}{2} C V^2$, where $C$ is the capacitance
and  V  is  the  voltage.  The  capacitance  is  primarily  a  function 
of  the  distance  between  the  tip  and  the  sample,  and  the  ef¬ 
fective  area  of  the  tip. 

When the AFM operates in a repulsive mode (contact
imaging), the electrostatic attraction produced by an applied
voltage between the tip and sample increases the contact
force but does not cause the cantilever to deflect. If the
sample is soft, the applied voltage provides a method for
physical modification using an electronic signal. If the
sample is hard, the increased forces accelerate the wear on
the delicate tip. Since the detector is sensitive only to
cantilever deflection, it is difficult to accurately monitor
this attractive force while the tip remains in constant contact
with the surface.

Fig. 4. (a) Force curve trace as the cantilever is retracted from the sample.
The sticking is due to the electrostatic attraction of the applied voltage
between the tip and sample. (b) The electrostatic attraction between the tip
and the sample as a function of applied voltage.

In order to measure the contact force produced by the
electrostatic attraction, we retracted and extended the sample
while monitoring the cantilever deflection with the
piezoresistor. A force curve was obtained by plotting the
deflection versus vertical position. Figure 4(a) shows a
typical force curve where the sample is biased to −7.5 V and is
being retracted from the tip. An additional bias is provided by
the piezoresistor measurement process. For this data, the extra
bias is 0.6 V at the tip. The electrostatic attraction can be
determined by multiplying the cantilever spring constant by
the distance for which the tip is electrostatically fixed to the
sample during retraction. The electrostatic force is labeled
“EF” in Fig. 4(a). In our experiments, we find that the
electrostatic force is significantly greater than the meniscus
force. The electrostatic force can be measured as a function
of applied voltage by reproducing the data in Fig. 4(a) for
many voltages; the result is shown in Fig. 4(b). A second order
polynomial describes the data well in Fig. 4(b), indicating
that the force is proportional to the square of the applied
voltage.
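The two steps of this measurement can be sketched as follows.
The first function is Hooke's law as described above. The
second fits the single coefficient a in F = a·V², a simplified
stand-in for the second order polynomial fit; dropping the
linear and constant terms is an assumption of this sketch, and
the function names are illustrative.

```python
def force_from_sticking(spring_constant_n_per_m, sticking_distance_m):
    """Electrostatic force = spring constant x distance the tip stays stuck."""
    return spring_constant_n_per_m * sticking_distance_m


def fit_quadratic_coefficient(voltages, forces):
    """Least-squares estimate of a in F = a * V**2.

    Minimizing sum((F - a*V**2)**2) gives a = sum(F*V**2) / sum(V**4).
    """
    numerator = sum(f * v ** 2 for v, f in zip(voltages, forces))
    denominator = sum(v ** 4 for v in voltages)
    return numerator / denominator
```

If the fitted curve tracks the measured points closely, the
force scales with the square of the voltage, which is the
conclusion drawn from Fig. 4(b).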

Figure 5(a) shows a scanning electron micrograph of the
tip used to obtain the data in Fig. 4. This tip initially had a
radius of curvature of ~300 Å, but due to the process we
used to measure the contact force versus voltage, the tip
sharpness was reduced to ~1.5 μm, as shown in Fig. 5(d).
During initial measurements made with this tip, we measured
a contact force of 20 nN for a voltage of 5 V. As the
measurement process was repeated, this force continued to
increase until stabilizing at a force of 1.25 μN. Presumably the
tip had dulled to a point that no further damage was
occurring. The results of Figs. 4 and 5 indicate that the
contact force between the tip and sample under an applied
voltage is dependent on the magnitude of the voltage and the
degree of tip wear.

Preliminary results show that tip wear can be a significant
problem. In our lithographic experiments, we worked with
single crystal silicon tips patterning single crystal silicon
substrates. Due to the hardness of the substrate and the
chemical interaction between silicon tip and silicon sample,
and because of the increased force due to the applied voltage,
the impact on the tip is substantial. Tips showing various
states of wear are shown in Fig. 5. In Fig. 5(a), we show a
new tip which has a radius of curvature of ~300 Å. Figure
5(b) shows a tip that has performed and is still capable of
performing high quality lithography, although the radius of
curvature has increased to ~2000 Å. Figure 5(c) shows a tip
that was originally effective in performing lithography but
has been worn down to the point where lithography is no
longer possible. The surface diameter of the tip is about
1 μm. It can be seen that the tip wear is due to the scanning
process and is exacerbated by the electrostatic forces from
the voltage used for lithography: the straight angle on the
worn tip corresponds to the scanning angle of our AFM. We
believe that lithography fails as the tip wears because of the
decrease in electric field due to the dulling geometry of the
tip and because most of the conductive implant in the tip
region has worn off.

Fig. 5. SEM micrographs of (a) a new tip, (b) a slightly worn tip which is
still useful for silicon lithography, (c) a heavily worn tip which is no longer
useful for silicon lithography, and (d) the tip used for the measurement of
Fig. 4. In (d), the heavy damage is due to the measurement process
described in Fig. 4.

IV.  LITHOGRAPHY 

Our preferred method for performing lithography is to use
the conductive path in the piezoresistor to apply appropriate
voltages to the conducting tip. Initially the ZnO was used to
control the lithography by biasing the sample and pulling the
tip off the surface when lithography was not desired, but this
method showed limited success due to the large electrostatic
forces. In other attempts, we tried to use the ZnO as both the
sensor and actuator, and dedicated the piezoresistor for
lithographic voltages only. While this method provided good
imaging capabilities, the lithographic signal would often
interfere with the feedback loop. We believe that this approach
is still attractive, but at this time our cantilevers are not
optimized for the use of the ZnO as a sensor.

In order to use the piezoresistor as both a sensor and an
electrical conductor for lithography, it is necessary to
separate the two signals. By using an ac bridge, we can sense
piezoresistor deflections at high frequency (typically 100
kHz) while sending the electrical pulses for lithography
through the cantilever at lower frequencies (filtered to 100
Hz). A benefit of this system is that very high scan speeds
can be obtained using the ac bridge (see Ref. 18). Another
consideration when using the piezoresistor as a path for
lithography is electrical breakdown between the piezoresistor
and the bottom ZnO electrode. Referring back to Fig. 1, the
piezoresistor is separated from the bottom electrode by a thin
film of dielectric. Since the bottom electrode is held at
ground, a lithography signal can cause breakdown between
the films. It is therefore necessary to bias both ZnO
electrodes and the piezoresistor with the lithography voltage
used for writing the lithographic patterns.
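The frequency separation can be illustrated with a short
numerical sketch of synchronous (lock-in) detection. It is not
a model of the actual bridge electronics. The 100 kHz carrier
and 100 Hz lithography band come from the text; the sampling
rate, amplitudes, and function names are assumptions chosen for
illustration. Multiplying the combined signal by a reference at
the carrier frequency and averaging over whole periods recovers
the small sensing amplitude even when a much larger slow signal
shares the same conductor.

```python
import math


def bridge_signal(n, sample_rate_hz, carrier_hz, carrier_amp, slow_hz, slow_amp):
    """Synthetic signal: small sensing carrier plus a large slow lithography term."""
    samples = []
    for i in range(n):
        t = i / sample_rate_hz
        samples.append(carrier_amp * math.sin(2.0 * math.pi * carrier_hz * t)
                       + slow_amp * math.sin(2.0 * math.pi * slow_hz * t))
    return samples


def lock_in_amplitude(samples, sample_rate_hz, carrier_hz):
    """Estimate the carrier amplitude by multiplying with sin() and averaging.

    Assumes a known carrier phase and a record covering whole periods of
    both the carrier and the slow signal.
    """
    acc = 0.0
    for i, s in enumerate(samples):
        acc += s * math.sin(2.0 * math.pi * carrier_hz * i / sample_rate_hz)
    return 2.0 * acc / len(samples)
```

With a 1 MHz sampling rate and a 10 ms record, a 1 mV carrier
is recovered in the presence of a 20 V, 100 Hz component,
because the two sinusoids are orthogonal over the record.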

Also, for high resolution imaging it is necessary to use a
high gain low noise amplifier for sensing the cantilever
deflection. Unfortunately, the large voltage needed for
lithography normally precludes the use of such amplifiers in
our circuitry.

To address the problems of tip speed, dielectric
breakdown, and low noise amplification, two transformers and a
battery powered amplifier were used in order to decouple the
cantilever from ground. By biasing the entire cantilever (both
ZnO electrodes and the piezoresistor) to the lithography
voltage, the piezoresistor could be measured at ac without being
influenced by the high voltage lithographic signals. Also, by
biasing the entire cantilever, breakdown between the films is
no longer a concern. By duplicating the electronics for each of
the cantilevers within the array, parallel independent
lithography in feedback is readily accomplished.

In general, we have found that more reliable pattern
generation is obtained by using a combination of open and
closed loop feedback. The open loop portion occurs only
when the lithography pulse is on. When writing, the ZnO
exerts a force to partially oppose the electrostatic forces in
order to reduce tip wear and increase tip lifetime. In this way,
cantilevers are maintained in constant force mode as they
profile the surface. Only when writing voltage is applied do
they pull back from the surface to compensate for the
electrostatic force. Once the lithography signal is turned off,
the tips return to the normal feedback mode. Lithography has
been accomplished using both this mode and regular
closed loop feedback.

Fig. 6. (a) Independent parallel lithography using cantilevers with integrated
sensors and actuators. The lithography covers 200 μm × 100 μm with one tip
writing “N A” over “I” and the other simultaneously writing “N O” over
“V” in a single pass. (b) Magnified AFM image of (a).

Figure 6(a) shows an optical image of a large scale
independent parallel lithographic pattern using the open and
closed feedback system. The substrate is (100) silicon and is
patterned with a lithographic voltage of 20 V. The presumed
mechanism for the lithography is that the large electric field
desorbs hydrogen and increases the oxidation rate on the
exposed silicon. Details of this process are described
elsewhere. The latent image is transferred to the substrate
with a KOH etch. In Fig. 6(a), each tip has traversed a
100 μm × 100 μm area of the 200 μm × 100 μm total area in a
raster pattern. A computer program controlled the scanning
and coordinated the lithography voltages. With one pass, the
left tip patterned the letters “N A” over the “I” and the right
tip patterned the letters “N O” over the “V”. Figure 6(b) is
an enlarged AFM image of the pattern in the square in Fig.
6(a). The modulation in the lines is caused by the digitization
from computer control.
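A minimal sketch of how such a control program might
coordinate a shared raster scan with independent writing
voltages is given below. It is not the program used in this
work. The bitmap encoding ('#' marks a pixel to write) and the
function name are hypothetical; only the 20 V writing voltage
is taken from the text.

```python
def raster_commands(left_bitmap, right_bitmap, write_volts=20.0):
    """Yield (row, col, v_left, v_right) in raster order.

    Both tips follow the same scan motion, but each reads its own bitmap,
    so the two writing voltages are independent of each other.
    """
    if len(left_bitmap) != len(right_bitmap):
        raise ValueError("bitmaps must have the same number of rows")
    for row, (left_line, right_line) in enumerate(zip(left_bitmap, right_bitmap)):
        if len(left_line) != len(right_line):
            raise ValueError("bitmap rows must have the same length")
        for col, (left_px, right_px) in enumerate(zip(left_line, right_line)):
            v_left = write_volts if left_px == "#" else 0.0
            v_right = write_volts if right_px == "#" else 0.0
            yield (row, col, v_left, v_right)
```

Because each command is issued at a discrete pixel, lines
written this way are stepped rather than smooth, which is
consistent with the digitization visible in Fig. 6(b).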

V.  CONCLUSIONS 

Expansion of scanning probe microscope lithography
through parallelism, combined with the high scan speeds
enabled by integrated actuators, is an important step that
begins to address the issue of throughput for a probe based
lithography system. Using this approach, we have achieved the
first demonstration of parallel, independent AFM lithography
using cantilevers with independent control spanning an area
of 20 000 μm².

We believe further improvement in cantilever design can
be implemented. Optimization of the ZnO geometry to
improve its simultaneous sensing and actuating capabilities
would allow the elimination of the piezoresistor, leaving the
silicon exclusively as the lithographic signal path.
Alternatively, adding a separate conducting path to the tip
would serve the same function, while allowing us to maintain
the current design.

Y. M. Gueorguiev^{a)} and G. M. Mladenov

Laboratory  of  Physical  Problems  of  Electron-Beam  Technologies,  Institute  of  Electronics,  Bulgarian 
Academy  of  Sciences,  Tzarigradsko  shose  12,  1784  Sofia,  Bulgaria 

D.  I.  Ivanov 

DIPI  Group,  B.  Petkov  Str.  65,  Sofia,  Bulgaria 
(Received  27  October  1995;  accepted  24  May  1996) 

In the present work the inclined incidence of accelerated electrons to solids is simulated using the
Monte Carlo technique. Spatial distributions of absorbed electron energy density in a 125 nm
poly(methylmethacrylate) resist layer on bulk Si substrate are obtained for angles of incidence 30°,
45°, and 60° at two beam energies (25 and 50 keV), together with the energy and angular
distributions of the backscattered electrons. The results show strong asymmetry of the exposure
distributions. Their peaks are significantly lower, wider, and 40-100 nm shifted, and their shapes are
different in comparison with those for normal incidence of electrons. The ratio between the
maximum values of exposure distributions due to the forward scattered and the backscattered
electrons decreases with decreasing angle of incidence. These peculiarities of exposure distributions
may lead to enhanced proximity effects and cause deviation from required pattern shapes. Therefore,
if inclined incidence of accelerated electrons to any surface occurs during electron beam
lithography, it has to be taken into account. © 1996 American Vacuum Society.


I. INTRODUCTION

Monte Carlo simulation (MCS) is the most widely used
technique for modeling electron scattering in solids and, in
particular, for obtaining spatial distributions of the absorbed
electron energy density in resists as applied to electron-beam
lithography (EBL), due to its advantages over other
(especially experimental) methods.

However, one of the advantages of MCS, namely the
possibility of modeling the scattering of fast electrons
impinging on targets of complex topography, has, to the best of
our knowledge, not been properly studied or exploited so far in
regard to EBL, but it may be of interest in some particular
applications (e.g., fabrication of high-resolution concave
Bragg-Fresnel zone plate x-ray lenses, EBL of very large
scale integrated (VLSI) devices with stepped or other kinds of
complex topography surfaces). This problem can be
considered, in the first approximation, as the simulation of
inclined incidence of accelerated electrons to solids. There are
several articles concerned with the latter; however, they
consider mainly backscattering of electrons and particularly
energy, angular, and spatial distributions of backscattered
electrons. As far as we know, all the results on energy
deposition of fast electrons reported hitherto are obtained for
normal incidence of primary electrons.

Therefore, the aim of the present work is to simulate the
inclined incidence of fast electrons to the target surface and
to obtain spatial distributions of absorbed energy density in
the resist, deposited on a substrate of the material most widely
used in microelectronics, namely silicon.


a) Electronic mail: JGEORG@BGEARN.ACAD.BG


II.  MONTE  CARLO  SIMULATION 

Since the MCS of electron scattering in solids is discussed
in detail elsewhere, here only the main features of the
present model will be described.

In our program we use the conventional Monte Carlo
technique based on a single scattering model and the continuous
slowing down approximation, assuming the screened
Rutherford elastic scattering cross section with the screening
parameter of Nigam et al. and the Bethe energy loss equation.
In order to prevent the failure of the latter at low electron
energies, the parabolic extrapolation of $(dE/dx)^{-1}$ derived by
Rao-Sahib and Wittry is introduced below a certain electron
energy. In this way the incident electrons can be followed
until their energy decreases to 50 eV.
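The two ingredients named above, elastic scattering from a
screened Rutherford cross section and continuous energy loss
from the Bethe equation, can be sketched in a few functions.
This is an illustration under stated assumptions, not the
program used in this work. The screening parameter uses a
commonly quoted empirical form rather than that of Nigam et
al., the mean ionization potential uses the Berger-Seltzer fit,
and the Rao-Sahib and Wittry low-energy extrapolation is
omitted, so the stopping power below is only meaningful well
above the mean ionization potential.

```python
import math


def screening_parameter(z, energy_kev):
    """Empirical screening parameter 3.4e-3 * Z**0.67 / E, with E in keV.

    Assumed form for this sketch; the text uses the parameter of Nigam et al.
    """
    return 3.4e-3 * z ** 0.67 / energy_kev


def sample_cos_theta(alpha, r):
    """Polar scattering angle from a uniform random number r in [0, 1].

    Obtained by inverting the cumulative screened Rutherford distribution,
    whose density in cos(theta) is proportional to
    1 / (1 - cos(theta) + 2*alpha)**2.
    """
    return 1.0 - 2.0 * alpha * r / (1.0 + alpha - r)


def mean_ionization_kev(z):
    """Berger-Seltzer fit for the mean ionization potential (Z >= 13), in keV."""
    return (9.76 * z + 58.5 * z ** -0.19) * 1.0e-3


def bethe_stopping_kev_per_cm(z, a, rho_g_cm3, energy_kev):
    """Bethe energy loss dE/ds in keV/cm; negative because energy is lost."""
    j = mean_ionization_kev(z)
    return (-7.85e4 * rho_g_cm3 * z / (a * energy_kev)
            * math.log(1.166 * energy_kev / j))
```

In a trajectory loop, `sample_cos_theta` would be called once
per elastic event with a fresh random number, and the energy
would be reduced between events according to the stopping power
and the path length travelled.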

Another peculiarity of the energy loss calculation is that
instead of the conventional two-point difference scheme we
use the three-point difference scheme as proposed by
Valiev et al. The application of the three-point difference
scheme increases the accuracy of the energy loss calculation,
especially at low beam energies as well as for heavy
targets.

To account for differences in both the scattering and the
stopping properties of materials at the interfaces between
different layers of the target, a procedure for recalculation of
the free path length and the energy loss is included.

III.  RESULTS  AND  DISCUSSION 

In this work calculations are performed on an IBM 4381
machine using a total of 30 000 electron trajectories for each
simulation. The spatial distributions of absorbed energy
density are obtained for a zero-width δ function beam at two
different energies of primary electrons, namely 25 and 50 keV.
The angle of incidence is 30°, 45°, or 60° with respect to the
target surface.

Fig. 1. Spatial distributions of the logarithm of the absorbed energy density
in a 125 nm PMMA resist layer on a Si substrate at E_0 = 25 keV and angles
of incidence 30° (a), 45° (b), and 60° (c).

In Figs. 1 and 2 the spatial distributions of the logarithm
of absorbed energy density in a 125 nm poly(methylmethacrylate)
(PMMA) resist layer on bulk Si substrate at two beam
energies, 25 and 50 keV, respectively, and angles of
incidence 30° (a), 45° (b), and 60° (c) are shown, while in Fig. 3
the lateral distributions of absorbed energy density for the
same conditions are presented in comparison with those for
normal incidence. Since the resist layer is chosen to be
relatively thin, the energy loss of electrons in the resist is
averaged over the whole of its thickness. It is necessary to point
out that the electron beam propagates in the plane xOz (y = 0
and azimuth angle φ = 0) and, in the case of inclined
incidence, is directed to the positive side of the x axis.
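The beam geometry just described fixes the initial direction
cosines of each trajectory. A minimal sketch follows; the
choice of a z axis pointing into the target and the function
name are assumptions made for illustration.

```python
import math


def initial_direction(angle_from_surface_deg):
    """Direction cosines (cx, cy, cz) of the incident beam.

    The angle is measured from the target surface, so 90 degrees is normal
    incidence. The beam lies in the xOz plane (cy = 0) and is tilted toward
    positive x; z is taken to point into the target.
    """
    phi = math.radians(angle_from_surface_deg)
    return (math.cos(phi), 0.0, math.sin(phi))
```

At 30° the beam travels mostly along +x, which is why the
deposited energy is expected to be skewed toward x > 0.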

As expected, all these figures clearly show strong asymmetry
of the distributions at inclined incidence of electrons. The
exposure of the resist caused by backscattered electrons at
x>0 is significantly greater compared with that at x<0 as well
as with that of normal incidence of electrons, especially at
lower beam energy (25 keV). In addition, the peaks of the
distributions which are due to the exposure by the forward
scattered electrons also differ from those for normal
incidence. They are significantly wider (half widths at the
level where the peaks turn into the backscattered
distributions in the direction of x>0 are listed in Table I) and
lower (the ratios of maximum values of the peaks at inclined
incidence of electrons to those at normal incidence are also
listed in Table I), and their maximum values are shifted from
the incident point in the direction of x>0 (see Table I). Such
distributions may lead to enhanced proximity effects not only
at x>0 but also, surprisingly, at x<0, due to considerably
lower ratios between exposures caused by forward scattered
and backscattered electrons for all cases of inclined
incidence.

Fig. 2. Spatial distributions of the logarithm of the absorbed energy density
in a 125 nm PMMA resist layer on a Si substrate at E_0 = 50 keV and angles
of incidence 30° (a), 45° (b), and 60° (c).

Table I. Calculated values of some parameters of the peaks of exposure distributions at inclined incidence of
electrons.

Parameter    Half widths of      Max_incl/Max_normal     Shift of maximum values
             peaks (nm)                                  of peaks (nm)
E_0 (keV)    25       50         25        50            25       50
30°          600      600        0.0412    0.0549        90       100
45°          430      430        0.0620    0.0777        70       80
60°          350      350        0.1215    0.1487        40       50

Fig. 3. Comparison between the lateral distributions of the absorbed energy
density in a 125 nm PMMA resist layer on a Si substrate at angles of
incidence 30°, 45°, 60°, and 90° and E_0 = 25 keV (a) or E_0 = 50 keV (b).
Horizontal axes: lateral distance x (μm).

Figure 4 compares the simulated energy distributions of
backscattered electrons at E_0 = 25 keV and angles of
incidence 30°, 45°, 60°, and 90° with that for bulk Al (whose
atomic number Z = 13 is close to that of Si, Z = 14) obtained
experimentally at E_0 = 30 keV, normal incidence, and takeoff
angle 45° by Kulenkampff and Spyra. Here dn/dQ is the
number of backscattered electrons leaving the target with an
energy in the interval [Q_i, Q_i + dQ], multiplied by
the number of energy intervals in the range [0,1], and
normalized by the number of incident electrons. Note that
$\int f(Q)\,dQ \neq 1$ since dn/dQ is not normalized by the
number of all backscattered electrons. This normalization is
avoided because it makes the levels of all energy distributions
almost equal and would not show the increase in the number
of backscattered electrons with the decrease in the angle of
incidence.
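The normalization described above can be made concrete with a
short sketch. It assumes that Q is the exit energy divided by
the primary energy, so that Q lies in [0, 1]; the function
names are illustrative. With this normalization the area under
the histogram equals the fraction of incident electrons that
are backscattered, which is why it differs from unity and grows
as more electrons escape.

```python
def energy_distribution(q_values, n_incident, n_bins):
    """Histogram dn/dQ of normalized exit energies Q in [0, 1].

    Each bin count is multiplied by the number of bins and divided by the
    number of incident electrons, following the definition in the text.
    """
    counts = [0] * n_bins
    for q in q_values:
        index = min(int(q * n_bins), n_bins - 1)  # Q == 1 goes in the last bin
        counts[index] += 1
    return [count * n_bins / n_incident for count in counts]


def integral(distribution):
    """Riemann sum over [0, 1]; equals the backscattered fraction."""
    return sum(distribution) / len(distribution)
```

The same scheme applies to the angular distribution, with the
bins spanning the range of escape angles instead of [0, 1].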

In Fig. 4 it can be seen that the distribution for the target
PMMA/Si at normal incidence of electrons is close to the
experimental distribution for the Al target. With decreasing
angle of incidence the level of distributions increases and
their maximum values move to greater energies (see Table
II).

In Fig. 5 the angular distributions of backscattered
electrons at E_0 = 25 keV and angles of incidence 30°, 45°, 60°,
and 90° are shown in comparison with sin θ. Here θ is the
escape polar angle of backscattered electrons measured from
the target plane and dn/dθ is the number of backscattered
electrons leaving the target under an angle in the interval
[θ_i, θ_i + dθ], multiplied by the number of angular intervals
in the range [0,180°], and normalized by the number of
incident electrons. Again, $\int f(\theta)\,d\theta \neq 1$ for the
same reason as in the case of f(Q). It is seen that the
distribution at 90° can be considered as a Gaussian one rather
than as being proportional to sin θ, but in our opinion it is a
reasonable estimate of the actual angular distribution at
normal incidence of electrons. The decrease in the angle of
incidence leads to an increase in the level of distributions as
well as to a shift of their maximum values to smaller angles
(see Table II).

Fig. 4. Comparison of simulated energy distributions of backscattered
electrons from a 125 nm PMMA/Si target at E_0 = 25 keV and angles of
incidence 30°, 45°, 60°, and 90° with an experimental distribution for Al target
at E_0 = 30 keV and normal incidence. Horizontal axis: normalized energy
Q (= E/E_0).

Table II. Calculated maximum values and their positions of the energy and
angular distributions of backscattered electrons at E_0 = 25 keV.

Parameter   (dn/dQ)_max   Position of    (dn/dθ)_max   Position of
                          (dn/dQ)_max                  (dn/dθ)_max
30°         0.847         0.86           0.7203        [illegible]
45°         0.495         0.74           0.4581        66°
60°         0.339         0.72           0.3422        [illegible]
90°         0.2775        0.66           0.2406        90°

IV.  CONCLUSIONS 

In this article Monte Carlo simulation of scattering of fast
electrons impinging at angles 30°, 45°, and 60° on the 125
nm PMMA/Si target surface is performed. The spatial
distributions of absorbed energy density in the resist layer are
obtained at two beam energies, namely 25 and 50 keV,
together with the angular and energy distributions of
backscattered electrons. Strong asymmetry of backscattered parts
of exposure distributions is observed as well as considerable
changes of forward scattered parts in comparison with those
for normal incidence of electrons. These features of the
distributions may cause a significant enhancement of proximity
effects and have to be taken into account when inclined
incidence of fast electrons to any surface occurs during EBL.

Fig. 5. Comparison of simulated angular distributions of backscattered
electrons from a 125 nm PMMA/Si target at E_0 = 25 keV and angles of
incidence 30°, 45°, 60°, and 90° with the sin θ law. Horizontal axis: escape
angle θ (deg).

The results show that the decrease in the angle of
incidence leads to an increase in the levels of energy and angular
distributions of backscattered electrons as well as to the shift
of maximum values of energy distributions to greater
energies and of angular distributions to smaller angles.

Radial and azimuthal distortion aberrations increase with the image side lens bore
radius in the range from 1.25 to 5 times as large as the maximum image field radius. This
phenomenon is inconsistent with our previous understanding. An assumption is made that these
large distortions for the large bore radii come from the influence of the magnetic field of one lens on the
other, thereby destroying the symmetry. This assumption is confirmed by the following
simulation.  When  these  distortions  are  calculated  for  the  ideal  case  where  the  magnetic  fields  are 
calculated  in  the  condition  without  the  other  lens,  they  are  decreased  to  around  ^  of  those  for  the 
case  where  the  magnetic  fields  are  calculated  in  the  real  condition  with  the  lenses  in  proximity. 

When the object-image distance is 800 mm and the bore radii of lens 1 are 100 and 20 mm, the
residual radial and azimuthal distortions are 1.5 and 0.7 nm, and the beam blur is smaller than 45 nm for
the beam semiangle from 0.05 to 0.7 mrad, where the main-field and subfield sizes in the image
plane are 20 mm and 250 μm, respectively, the beam energy is 100 keV, and the space charge effects
are neglected. © 1996 American Vacuum Society.


I. INTRODUCTION

Today, worldwide, light optical exposure tools define the
device pattern in the resist layer of silicon wafers using an
i-line or deep ultraviolet optical stepper with wavelengths of
365 or 248 nm, respectively. However, there are several
other  lithography  tools  that  use  a  much  shorter  wavelength. 
By  using  an  x-ray,  electron,  or  ion  beam  for  the  exposure 
instead  of  a  light  optical  system,  the  resolution  and  depth  of 
focus  increase  dramatically.  In  scanning-electron-beam  li¬ 
thography,  for  example,  character  projection  lithography,^ 
the  size  of  the  scan  field  is  limited  not  only  by  the  optical 
performance  such  as  deflection  distortion,  blur,  and  nonver¬ 
tical  beam  landing,  but  also  by  electronics  such  as  the  accu¬ 
racy  of  the  deflection  amplifier  and  the  resolution  of  the  digi¬ 
tal  to  analog  converter  (DAC)  that  defines  the  beam  position 
on  a  Cartesian  digital  address  grid,  typically  2^^-2^®  points 
along  both  the  x  and  y  axes.  In  projection-electron-beam 
lithography,  the  size  of  the  main  field  is  limited  only  by  the 
optical  performance  such  as  field  distortion,  beam  blur,  and 
nonvertical beam landing. Projection systems are better
suited to large field sizes, which allow high-volume
production and avoid main-field stitching errors.

For the lens system, the symmetric magnetic doublet
and projection exposure with variable axis immersion
lenses (PREVAIL) have been studied. We obtained an
aberration of less than 0.02 μm in distortion and 0.12 μm in beam
resolution for a 27-mm-square field size through a computer
simulation. However, this large field image projection
causes many problems; for example, it requires a high
emittance electron gun. To resolve these problems, the large field
is divided into many small subfields, and each subfield is
projected with a dynamical aberration correction. We also
calculated the residual beam resolution and distortion (33
and 1.1 nm, respectively) with a dynamically compensated
field aberration for an object-image distance of 1.5 m, and
for main-field and subfield sizes in the image plane of 20 and
0.25 mm, respectively. For an object-image distance
of 50 cm, residual aberration was calculated from the
trajectory calculation after the correction of lens current,
crossover, and beam positions. The PREVAIL design is
promising because it may permit large beam semi-angles if a large
field is available with reasonable distortion. Unfortunately,
its detailed calculation has not been reported yet.

For the lens bore radius it was stated that the bore radius
must be chosen sufficiently large so that aberrations higher
than third order are not significant. The ratio of image field
radius to image side bore radius was designated as 0.212.
However, for a ratio of 0.667, good aberration characteristics
were obtained, where the aberrations were calculated from
the trajectory calculation so that all the higher-order
aberrations could be calculated. Therefore, we have questioned
previous statements and studied aberrations as a function of the
bore radius to ascertain whether the large lens bore radii are
adequate for the reducing image projection system.

An optical system for the electron-beam projection
system for a 4-Gbit DRAM pattern is designed and the
aberrations are calculated.

II.  LENS  MODELS  AND  CALCULATION 

Figure 1 shows a magnetic lens model for the symmetric magnetic doublet. The object to image distance and pole gap for lens 1 are kept at 800 and 560 mm, respectively. The crossover, at Z = 2140 mm, is an important position: it divides the distance from the object to the image in the ratio 4:1. Wide gap lenses, in which the fields are not concentrated in relatively small volumes, are used because such lenses are necessary to obtain small third-order aberration coefficients. The object side and crossover side bore radii for lens 1 are $R_{oo}$ and $R_{oc}$ mm. The reduction factor is 4 and all the lens parameters for lens 2 are one quarter of those for lens 1. $R_{oo}$ is varied from 50 to 200 mm and $R_{oc}$ is varied from 13.33 to 30 mm. For these models, the magnetic vector potential at each mesh point is calculated using the finite element method. The total node numbers are 130 (Z direction) × 40 (R direction). Third-order elements are used to obtain accurate magnetic vector potentials, so the potentials at 390 × 120 points are calculated. The boundary condition is that the magnetic vector potential on the optical axis is zero. The finite element computation covers the area 0 < R < 1200 mm for 0 < Z < 2120 mm, 0 < R < 77 520 − 36Z mm for 2120 < Z < 2145 mm, and 0 < R < 300 mm for 2145 < Z < 2675 mm. The magnetic vector potentials between the nodes are interpolated with a third-order spline function. With the use of these magnetic vector potentials, the electron trajectories are calculated.

Fig. 1. Lens model and lens parameters for the symmetric magnetic doublet. The lens parameters for lens 2 are a quarter of those for lens 1.
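The piecewise computation area quoted above can be expressed as a single function giving the outer radius of the finite element domain at each axial position. The following is a minimal sketch and not the authors' code; the function name `r_max_mm` is hypothetical, and only the three ranges stated in the text are encoded.

```python
def r_max_mm(z_mm: float) -> float:
    """Outer radius (mm) of the computation area at axial position z_mm.

    Encodes the three ranges quoted in the text; the name and the
    closed-interval handling at the break points are assumptions.
    """
    if 0.0 <= z_mm <= 2120.0:
        return 1200.0
    if z_mm <= 2145.0:
        # Linear taper between the two lens regions: R < 77520 - 36 Z
        return 77520.0 - 36.0 * z_mm
    if z_mm <= 2675.0:
        return 300.0
    raise ValueError("z_mm lies outside the computed range 0..2675 mm")


if __name__ == "__main__":
    for z in (0.0, 2120.0, 2130.0, 2145.0, 2675.0):
        print(z, r_max_mm(z))
```

The taper is continuous with both neighbouring ranges: it gives 1200 mm at Z = 2120 mm and 300 mm at Z = 2145 mm.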

Figure 2 shows the calculated magnetic field lines; the curved lines represent lines of $A(R,Z)=\text{const}$. Since the excitation currents of lens 1 and lens 2 are opposite in direction, the magnetic flux from one lens is repulsed by the magnetic flux from the other lens. These features will be discussed later.

Fig. 2. Magnetic flux for the real lens system, where the object and crossover side bore radii $R_{oo}$ and $R_{oc}$ are 200 and 20 mm, respectively.

Fig. 3. Third-order aberration coefficients calculated from the axial magnetic field distribution and its first derivative as a function of the crossover side bore radius $R_{oc}$ of lens 1, where the object side bore radius $R_{oo}$ is 100 mm. The coefficients plotted are the axial chromatic (m/rad), astigmatism [(rad m)$^{-1}$], distortion (m$^{-2}$), spherical (m rad$^{-3}$), transverse chromatic (dimensionless), field curvature [(rad m)$^{-1}$], coma length (rad$^{-2}$), and coma radius (rad$^{-2}$) aberration coefficients.

The axial magnetic field distribution and its first derivative are also calculated. Using these values, third-order aberration coefficients are calculated when the crossover side bore radius $R_{oc}$ and the object side bore radius $R_{oo}$ are varied. The results are shown in Figs. 3 and 4 for the axial chromatic, astigmatism, distortion, spherical, transverse chromatic, field curvature, coma length, and coma radius aberration coefficients.

As seen in Fig. 3, only the astigmatism aberration coefficient is an increasing function of the crossover side bore radius $R_{oc}$; all the other coefficients decrease slowly when $R_{oc}$ increases. As seen in Fig. 4, the astigmatism and spherical aberration coefficients are decreasing functions of the object side bore radius $R_{oo}$, and the other coefficients are increasing functions of $R_{oo}$.

The field curvature, astigmatism, transverse chromatic, and distortion aberrations are calculated from the beam trajectories. The calculation procedures are as follows. First, the lens excitation parameter $k$ is determined. For the trial lens excitation parameter $k_1$, the principal ray trajectory is determined. Two electrons are emitted from the crossover toward the object at two emission angles, $\theta_1$ and $\theta_2$, with respect to the optical axis, and their trajectories are calculated. From the radial coordinates of those two trajectories at the object, the emission angle $\theta_0$ for which the radial coordinate at the object is 40 mm is calculated by interpolation or extrapolation. The trajectory with this $\theta_0$ is calculated from the crossover to the object. At the object, trajectory conditions are calculated by the method described in Ref. 9. These conditions contain the X and Y coordinates $X_0$ and $Y_0$, and the radial and azimuthal angles $\theta$ and $\phi$, respectively. To obtain the in-focus condition, four trajectories with the angular conditions ($\theta$, $\phi+0.1^\circ$), ($\theta$, $\phi-0.1^\circ$), ($\theta+0.1^\circ$, $\phi$), and ($\theta-0.1^\circ$, $\phi$) are calculated from the object to the image. From these four trajectories, a focus position $Z_1$ for the lens parameter $k_1$ is calculated. For the next trial excitation parameter $k_2$, the same procedure is followed and a focus position $Z_2$ for the lens parameter $k_2$ is obtained. From these two focus positions, $Z_1$ and $Z_2$, for the lens parameters $k_1$ and $k_2$, the in-focus lens excitation parameter $k_0$ is calculated by interpolation or extrapolation.

Fig. 4. Third-order aberration coefficients as a function of the object side bore radius $R_{oo}$ of lens 1, where the crossover side bore radius $R_{oc}$ is 20 mm. The notations are the same as in Fig. 3.
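Both the emission angle that lands at 40 mm on the object and the in-focus excitation parameter are obtained from two trial points by interpolation or extrapolation. The sketch below assumes the interpolation is linear, which the text does not state explicitly; the function name `linear_solve` and all numerical values in the example are hypothetical and serve only as illustration.

```python
def linear_solve(x1: float, y1: float, x2: float, y2: float, y_target: float) -> float:
    """Return x such that the straight line through (x1, y1) and (x2, y2)
    takes the value y_target. Works for interpolation and extrapolation."""
    if y1 == y2:
        raise ValueError("the two trial points give the same value; cannot solve")
    return x1 + (y_target - y1) * (x2 - x1) / (y2 - y1)


if __name__ == "__main__":
    # Hypothetical trial excitations k1, k2 with focus positions Z1, Z2 (mm),
    # solved for the excitation that focuses at an assumed image plane.
    k1, z1 = 1.00, 2680.0
    k2, z2 = 1.02, 2670.0
    z_image = 2675.0
    print(linear_solve(k1, z1, k2, z2, z_image))
```

The same helper applies to the emission angle step, with the two trial angles as `x1`, `x2`, the radial coordinates at the object as `y1`, `y2`, and 40 mm as the target.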

Second, the trajectory conditions are determined for various radial positions in the subfield. For the lens excitation parameter $k_0$, the emission angle $\theta_0$ is calculated. Nineteen trajectories, with emission angles indexed by $n$ ($n=0,1,\ldots,9$), are calculated from the crossover to the object. For each trajectory, the trajectory conditions at the object are calculated by the method mentioned above.

For each of the 19 trajectory conditions, four trajectories diverging by ±0.1° in the radial and azimuthal directions are calculated, and the out-of-focus length $\Delta F$ and the astigmatism difference $\Delta A$ are obtained.

For the principal ray trajectory conditions, 17 trajectories with beam energies of 100 keV, 100 keV ± 1 eV, 100 keV ± 2 eV, 100 keV ± 5 eV, 100 keV ± 10 eV, ..., and 100 keV ± 200 eV are calculated, from which the transverse chromatic aberration is obtained.

For the 19 trajectory conditions, the trajectories are calculated from the object to the image. From the radial and Y coordinates at the object and image, $R_o$, $R_i$, $Y_o$ (=0), and $Y_i$, the radial distortion $R_o/4-R_i$ and the azimuthal distortion $\phi_d$ are calculated.

Fig. 5. Out-of-focus length $\Delta F$ and astigmatism difference $\Delta A$ as a function of $R_{oc}$, where $R_{oo}$ is 100 mm.

Figure 5 shows the astigmatism difference $\Delta A$ and the out-of-focus length $\Delta F$ as a function of the crossover side bore radius $R_{oc}$. Like the field curvature and astigmatism calculated from the third-order aberration coefficients shown in Fig. 3, they are almost independent of the crossover side bore radius when the object side bore radius is 100 mm. Figure 6 shows $\Delta F$ and $\Delta A$ as a function of the object side bore radius $R_{oo}$, where the solid curves are for a crossover side bore radius $R_{oc}$ of 20 mm and the broken curves are for $R_{oc}=R_{oo}/5$. As seen, the out-of-focus length $\Delta F$ is almost independent of $R_{oo}$, and the astigmatism difference $\Delta A$ is a decreasing function of $R_{oo}$.

Figure 7 shows the radial and azimuthal transverse chromatic aberrations for an energy width of 1 eV, $\Delta R_c$ and $\Delta\phi_c$, as a function of the crossover side bore radius $R_{oc}$, where these aberrations are calculated for the trajectories emitted from the radial position of 40 mm at the object. As seen, $\Delta R_c$ and $\Delta\phi_c$ increase slowly when $R_{oc}$ is increased. Figure 8 shows the beam landing angle $\theta_i$ with respect to the normal at the image. The radial and azimuthal transverse chromatic aberrations $\Delta R_c$ and $\Delta\phi_c$ per 1 eV are also shown as a function of the object side bore radius $R_{oo}$. The radial and azimuthal transverse chromatic aberrations are decreasing functions of the object side bore radius $R_{oo}$. The beam landing angle $\theta_i$ increases from 0.4° to 2.5° when $R_{oo}$ increases from 50 to 200 mm.

Fig. 6. $\Delta F$ and $\Delta A$ as a function of $R_{oo}$. The solid curves are for $R_{oc}$ of 20 mm, and the broken curves are for $R_{oc}=R_{oo}/5$.

Fig. 7. Radial and azimuthal transverse chromatic aberrations per 1 eV, $\Delta R_c$ and $\Delta\phi_c$, as a function of $R_{oc}$, where $R_{oo}$ is 100 mm.

The radial and azimuthal distortions $R_o/4-R_i$ and $\phi_d$ as a function of the image radius are shown in the lower part of Fig. 9 (“real” case), where the bore radii $R_{oo}$ and $R_{oc}$ are 200 and 20 mm, respectively. The “ideal” case is discussed below in Sec. III. As seen, both distortions are linear functions of $R_i$, so the distortion values per mm of the image radius, $\Delta R_d$ (nm/mm) and $\Delta\phi_d$ (nm/mm), may be defined. When the radial distortion is corrected once for each subfield, the residual radial distortion is calculated as $\Delta R_d R_{sub}/2$, where $R_{sub}$ is the radial subfield size. When the azimuthal distortion is corrected at the main field center by a rotation stage of the mask or wafer, the residual distortion per 1 mm of the image radius is $\Delta\phi_{dr}=\Delta\phi_d-\Delta\phi_{d0}$, where $\Delta\phi_{d0}$ is the azimuthal distortion per 1 mm of the image radius for the center of the main field.

Fig. 8. Beam landing angle from the normal at the image, $\theta_i$ (degrees), and radial and azimuthal transverse chromatic aberrations $\Delta R_c$ and $\Delta\phi_c$, as a function of $R_{oo}$, where $R_{oc}$ is 20 mm.

Fig. 9. Radial and azimuthal distortions $R_o/4-R_i$ and $\phi_d$ as a function of the image radius $R_i$. Bottom and top are for the real and ideal cases. Radial and azimuthal distortions per 1 mm image radius, $\Delta R_d$ and $\Delta\phi_d$, are also defined.

Fig. 10. Radial, azimuthal, and residual azimuthal distortions $\Delta R_d$, $\Delta\phi_d$, and $\Delta\phi_{dr}$, respectively, as a function of the bore radius $R_{oc}$, where $R_{oo}$ is 100 mm.

Fig. 11. $\Delta R_d$, $\Delta\phi_d$, and $\Delta\phi_{dr}$ as a function of the bore radius $R_{oo}$, where the broken curves are for $R_{oc}$ of 20 mm, and the solid curves are for $R_{oc}=R_{oo}/5$.

Figure 10 shows the radial distortion per 1 mm of image radius $\Delta R_d$, the azimuthal distortion per 1 mm of image radius $\Delta\phi_d$, and the residual azimuthal distortion per 1 mm of the image radius $\Delta\phi_{dr}$ as a function of the crossover side bore radius $R_{oc}$, where $R_{oo}$ is 100 mm. As seen in Fig. 10, $\Delta\phi_d$ increases slowly when $R_{oc}$ increases. The radial distortion per 1 mm of the image radius $\Delta R_d$ increases rapidly when $R_{oc}$ increases. Though the azimuthal distortion $\Delta\phi_d$ is large, the residual azimuthal distortion $\Delta\phi_{dr}$ is smaller than 20 nm/mm.

Fig. 12. Magnetic flux for the ideal lens system, where the lens parameters are the same as in Fig. 2.

Fig. 13. Magnetic vector potential differences as a function of the radius $R_2$. Solid curves are $A_1+A_2$, broken curves are $A_{2i}-A_{2r}$, and dotted curves are the normalized values $(A_1+A_2)/A_1$, where subscripts 1 and 2 are for lenses 1 and 2, r and i are for the real and ideal cases, and 50 or 200 is $R_{oo}$ and 20 is $R_{oc}$.

Figure 11 shows $\Delta R_d$, $\Delta\phi_d$, and $\Delta\phi_{dr}$ as a function of $R_{oo}$, where the solid lines are for $R_{oc}=R_{oo}/5$ and the broken lines are for $R_{oc}$ of 20 mm. As seen, the radial and azimuthal distortions increase rapidly when $R_{oo}$ increases from 50 to 200 mm. As these calculated distortions include aberrations of all orders, these results are inconsistent with our previous understanding that the bore radius must be chosen sufficiently large that aberrations higher than third order are not significant. A large bore radius is harmful if wide gap lenses are used. The maximum residual azimuthal distortion is 40 nm/mm. When the subfield is 250 µm square, the residual azimuthal distortion is only 5 nm, which poses no problem.
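The residual figures quoted in this section follow from multiplying a distortion per mm of image radius by half the subfield size. The following minimal sketch reproduces that arithmetic; the function name `residual_distortion_nm` is hypothetical, and applying the same half-subfield factor to the azimuthal term is inferred from the 40 nm/mm and 5 nm values in the text.

```python
def residual_distortion_nm(distortion_nm_per_mm: float, subfield_mm: float) -> float:
    """Residual distortion (nm) left after one correction per subfield.

    The largest uncorrected offset inside a subfield is half the subfield
    size times the distortion per mm of image radius.
    """
    return distortion_nm_per_mm * subfield_mm / 2.0


if __name__ == "__main__":
    # 40 nm/mm residual azimuthal distortion over a 250 um (0.25 mm) subfield
    print(residual_distortion_nm(40.0, 0.25))
```

With the values from the text (40 nm/mm and a 0.25 mm subfield) this evaluates to 5 nm.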

III.  DISCUSSION 

A.  Mechanism  for  generation  of  the  distortions 

To study the mechanism of generation of the radial and azimuthal distortions, the ideal case, in which the magnetic fields of lenses 1 and 2 are each calculated in the absence of the other lens, is studied. Where Z is smaller or larger than the coordinate of the crossover, the magnetic vector potentials for lens 1 or lens 2, respectively, are selected for the trajectory calculation. Figure 12 shows the magnetic flux profile for the ideal case, where the lens parameters are the same as in Fig. 2. In Fig. 12, the magnetic flux formed by lenses 1 and 2 is distributed in the region where Z is smaller or larger than 2140 mm, respectively. In contrast, in Fig. 2, the magnetic flux formed by lens 2 is absent from the region where Z is smaller than 2140 mm, and that formed by lens 1 is absent from the region where Z is larger than 2140 mm, only where R is smaller than 100 mm.

JVST B - Microelectronics and Nanometer Structures
2472 M. Nakasuji and H. Shimizu: Distortion aberration in symmetric magnetic doublet

From the comparison between the real and the ideal cases, it is expected that the magnetic vector potential in the region through which the electrons pass is distorted in the real case. The magnetic vector potential $A_1$ as a function of radius at the center of lens 1, at Z = 1800 mm, and $A_2$ as a function of radius at the center of lens 2, at Z = 2225 mm, are studied; that is, $A_1=A(1800\ \text{mm}, R_1)$, $A_2=A(2225\ \text{mm}, R_2)$, and $R_2=R_1/4$, where the lens parameters ($R_{oo}$, $R_{oc}$) are (200, 20) and (50, 20). The vector potentials $A_1$ at $R_1$ and $A_2$ at $R_1/4$ are of opposite sign and nearly equal in absolute value. Three quantities derived from the magnetic vector potentials are calculated as a function of radius and are shown in Fig. 13. Solid curves are $A_1+A_2$, broken curves are $A_{2i}-A_{2r}$, and dotted curves are the normalized values $(A_1+A_2)/A_1$ for the real case. As seen in the solid curves, for the large bore radius in the real case, $A_1+A_2$ is a decreasing function of radius $R_2$, whereas for the large bore radius in the ideal case and the small bore radius in the real case, $A_1+A_2$ is an increasing function of $R_2$. As seen from the broken curves, the difference between the real case and the ideal case for the large bore radius is ten times as large as that for the small bore radius. This means that the magnetic vector potential is influenced more for the large bore radius than for the small bore radius. As seen in the dotted curve for the small bore radius, the normalized magnetic vector potential difference $(A_1+A_2)/A_1$ is almost independent of $R_2$; however, $(A_1+A_2)/A_1$ for the large bore radius depends on $R_2$. This means that, for trajectories at different radial positions, the small bore radius lens system gives a nearly equal lens effect, whereas the large bore radius lens system gives a different lens effect for each trajectory. As a result, the distortions for the small bore radius lens system are small and those for the large bore radius lens system are large. Consequently, it may be concluded that the large radial and azimuthal distortions for the large bore radii $R_{oo}$ and $R_{oo}/4$ are generated because the magnetic fields of the two lenses influence each other, even if the crossover side bore radii of both lenses are small enough.

B.  Design  example 

Requirements for the optics of the electron-beam projection system for the 4-Gbit DRAM pattern are assumed as follows. The main field and subfield sizes, distortion aberration, beam blur, and beam energy are 20 mm, 0.25 mm square, 20 nm, 45 nm, and 100 keV, respectively. Figure 14 shows the aberrations as a function of the beam half-angle, where the object-image distance and the bore radii $R_{oo}$ and $R_{oc}$ are 800, 100, and 20 mm, respectively. They are calculated from the following equations: $D=0.61\lambda/\alpha$, $\Delta C_s=C_s\alpha^3$, $\Delta C_l=C_lR_i\alpha^2$, $\Delta C_r=C_rR_i\alpha^2$, and $\Delta C_{ax}=C_{ax}\alpha\,\Delta U/U$, where $R_i$ is the maximum image radius and $\alpha$ is the beam semiangle, and where the spherical, axial chromatic, coma length, and coma radius aberration coefficients $C_s$, $C_{ax}$, $C_l$, and $C_r$, respectively, are taken from Fig. 3. The field curvature and astigmatism aberrations $F$ and $A$ are calculated as $F=(0.25/2)\,\Delta F\,\alpha$ and $A=(0.25/2)\,\Delta A\,\alpha$, where $\Delta F$ and $\Delta A$ are shown in Fig. 5 at $R_{oc}$ of 20 mm. The radial and azimuthal transverse chromatic aberrations are calculated as $\Delta U\,\Delta R_c$ and $\Delta U\,\Delta\phi_c$, where the energy width $\Delta U$ is 3 eV and $\Delta R_c$ and $\Delta\phi_c$ are shown in Fig. 7. The residual radial and azimuthal distortions are calculated from Fig. 10 as 1.5 and 0.7 nm, respectively. The total beam blur $T$ is shown by the solid curve. Since the beam blur specification is 45 nm, the beam blur budget for the space charge effect is shown by the broken curve. For beam half-angles from 0.05 to 0.7 mrad, the beam blur is smaller than 45 nm when the space charge effect is neglected. Though the astigmatism and the transverse chromatic aberrations are decreasing functions of the lens bore radii, these aberrations are not dominant in the total beam blur, as seen in Fig. 14. Therefore, the beam blur is not improved in the case of large bore radii.

Fig. 14. Aberrations as a function of the beam half-angle $\alpha$. $T$ is the total beam blur, and the broken curve is the beam blur budget for the space charge effect.
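The diffraction term $D=0.61\lambda/\alpha$ in the equations above can be evaluated directly once the electron wavelength is known. The sketch below assumes the standard relativistic de Broglie wavelength for a 100 keV beam; the text does not say how $\lambda$ was obtained, so that choice, the CODATA constants, and the function names are assumptions made for illustration.

```python
import math

# Physical constants in SI units (CODATA values)
H = 6.62607015e-34       # Planck constant, J s
M_E = 9.1093837015e-31   # electron rest mass, kg
Q_E = 1.602176634e-19    # elementary charge, C
C = 299792458.0          # speed of light, m/s


def electron_wavelength_m(energy_ev: float) -> float:
    """Relativistic de Broglie wavelength (m) for kinetic energy energy_ev."""
    t = energy_ev * Q_E                      # kinetic energy in joules
    rest = M_E * C ** 2                      # rest energy in joules
    momentum = math.sqrt(2.0 * M_E * t * (1.0 + t / (2.0 * rest)))
    return H / momentum


def diffraction_blur_nm(alpha_rad: float, energy_ev: float = 100e3) -> float:
    """Diffraction blur D = 0.61 * lambda / alpha, returned in nm."""
    return 0.61 * electron_wavelength_m(energy_ev) / alpha_rad * 1e9


if __name__ == "__main__":
    for alpha_mrad in (0.05, 0.1, 0.3, 0.7):
        print(alpha_mrad, round(diffraction_blur_nm(alpha_mrad * 1e-3), 2))
```

Under these assumptions the diffraction term alone is about 45 nm at 0.05 mrad and a few nm at 0.7 mrad, so it is the dominant contribution at the small-angle end of the range discussed in the text.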

IV.  CONCLUSIONS 

Radial and azimuthal distortion aberrations are increasing functions of the image side lens bore radius in the range from 1.25 to 5 times the maximum image field radius. This phenomenon is inconsistent with our previous understanding. We assume that these large distortions for the large bore radii come from the influence of the magnetic field of one lens on the other, which destroys the symmetry. This assumption is confirmed by the following simulation. When these distortions are calculated for the ideal case, where the magnetic field of each lens is calculated without the other lens, they are decreased to a fraction of those for the real case, where the magnetic fields are calculated with the lenses in proximity. When the object-image distance is 800 mm and the bore radii of lens 1 are 100 and 20 mm, the residual radial and azimuthal distortions are 1.5 and 0.7 nm and the beam blur is smaller than 45 nm for beam semiangles from 0.05 to 0.7 mrad, where the main-field and subfield sizes in the image plane are 20 mm and 250 µm, respectively, the beam energy is 100 keV, and the space charge effects are neglected.

Proximity effect correction for reticle making by the dose correction method is discussed. A new algorithm for calculating the optimum dose is proposed, which is based on the dose formula method and the representative figure method. Its main feature is that dose evaluation points are fixed at individual small regions whose size is sufficiently small compared with the backscattering range. The computer calculation speed that is sufficient to keep the correction time below the writing time is evaluated. The required calculation speed is at most 500 MIPS × 4 CPUs for a minimum feature size greater than 0.2 µm on the reticle. This result suggests that real-time proximity effect correction is possible for making reticles. Furthermore, when the algorithm is applied to x-ray mask fabrication and the direct writing process, the calculation time can be kept below the writing time. © 1996 American Vacuum Society.


I.  INTRODUCTION 

Electron beam (EB) writing systems have been powerful manufacturing tools for making reticles. Both the pattern dimensions on the wafer and the magnification ratio of the reticle become smaller with succeeding LSI generations. Correspondingly, the minimum feature size on the reticle is also getting smaller, and higher accuracy is required for reticle fabrication. The minimum feature size on the reticle will be 0.8 to 0.5 µm and the permitted dimension error will be less than ±15 to ±10 nm for the 1 and 4 Gbit DRAM generations.

To obtain such high accuracy, both the beam resolution and the forward scattering range of electrons in the resist must be small, of the order of 30 nm. Accordingly, an EB system with high acceleration voltage will be required, even for reticle fabrication. Reticle writing by a high acceleration voltage EB, however, is subject to many problems, for example, the proximity effect and resist heating.

In this article, proximity effect correction by the dose correction method is discussed for reticle fabrication. The dose correction method has an advantage over Ghost and Phantom with regard to high speed writing, because subsidiary exposure with a blurred beam is not needed. However, the time for calculating an optimum dose is critically long. The objectives of this article are (1) to propose a new algorithm for calculating the optimum dose, (2) to show that the time for calculating the optimum dose can be kept below the writing time when the new algorithm is applied to reticle fabrication, and (3) to show the same for x-ray mask fabrication and the direct writing process. The new algorithm is based on the dose formula method and the representative figure method. As a modification of the previous algorithm, the dose evaluation points are fixed at the centers of individual small regions whose size is sufficiently small compared with the backscattering range $\sigma_b$.

In this article, the new algorithm is shown to reduce the calculation time to about 1/30 of that of the previous algorithm, without deteriorating the total correction accuracy. The computer calculation speed that is sufficient to keep the correction time below the writing time is evaluated. The required calculation speed is at most 500 MIPS × 4 CPUs for a minimum feature size greater than 0.2 µm on the reticle. The result suggests that real-time proximity effect correction is possible for reticle fabrication.

The contents of this article are as follows. In Sec. II, the assumptions and conditions of this article are explained. In Sec. III, the data amount for proximity effect correction is evaluated and the model system for real-time correction is explained. In Sec. IV, the new algorithm for calculating the optimum dose is explained. The accuracy of the method is evaluated in Sec. V. In Sec. VI, the software system for evaluating the algorithm is explained. In Sec. VII, the calculation speed is evaluated. In Sec. VIII, the computer calculation speed that is sufficient to keep the correction time below the writing time is evaluated. This article is summarized in Sec. IX. In Appendices A and B, the applications of the new algorithm to x-ray mask fabrication and the direct writing process are discussed. In Appendix C, the evaluation of the edge error is explained in detail.


II.  ASSUMPTIONS,  CONDITIONS,  AND  SOME 
NOTES 


In this article, the double Gaussian approximation for deposited energy is used. It is assumed that a high voltage EB system is used and that the forward scattering range is 0. The deposited energy $E(x)$ at the position $x$ is then described by Eq. (1), where $\eta$ is the ratio between the energy deposited by backscattered electrons and that deposited by forward scattered electrons. In this article, the acceleration voltage of the electrons is set at 50 kV, and the backscattering range $\sigma_b$ is set at 10 µm. When the forward scattering range can be approximated by zero, however, our discussion can be used for any acceleration voltage by scaling all dimensions by $\sigma_b$. The reason is that any quantity (e.g., the correction accuracy) depends on the acceleration voltage only through $\sigma_b$.
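For orientation, a form of the double Gaussian model that is commonly used in the limit of zero forward scattering range is sketched below. It is given as an assumption rather than as the expression printed in this article: the normalization constant $C$, the two-dimensional position vector $\mathbf{x}$, and the convention that the Gaussian width enters as $\exp(-|\mathbf{x}|^2/\sigma_b^2)$ are choices made here, the last one because it matches the arguments of the error functions used later in the dose formula.

```latex
% Assumed form: delta-function forward term plus normalized backscatter Gaussian
E(\mathbf{x}) = C\left[\,\delta(\mathbf{x})
  + \frac{\eta}{\pi\sigma_b^{2}}
    \exp\!\left(-\frac{|\mathbf{x}|^{2}}{\sigma_b^{2}}\right)\right]
```

In this form the backscatter term integrates to $\eta$ times the forward term, which is the meaning of $\eta$ given in the text, and every length appears only in the combination $|\mathbf{x}|/\sigma_b$, which is why results can be rescaled to other acceleration voltages through $\sigma_b$.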

In many articles, one of two assumptions is used for the ideal correction of the proximity effect. Case A: ideal correction makes the deposited energy uniform over the entire exposed region. Case B: ideal correction makes the deposited energy uniform at the edges of all figures. In this article, case A is used for simplicity. The results of this article, such as the calculation speed, do not change, however, even when the other assumption is used.

The magnification ratio of the reticle is assumed to be ×4. When a length or dimension is described as 2 µm (w: 0.5 µm) in this article, this means 2 µm on the reticle and 0.5 µm on the wafer. When a dimension is described as 2 µm without any comment, this means 2 µm on the reticle.

III.  DATA  AMOUNT  AND  REAL-TIME  CORRECTION 

In order to realize highly accurate proximity effect correction, figures must be cut into small figures whose maximum size is 2 µm or so, in order to suppress the edge error. The small figures obtained are called elements. The data amounts for elements are evaluated as follows.

A gate array of 130K gates was used to evaluate the worst case. The (black/white) inverted pattern of the wiring layer was selected for the evaluation because the layer has no array structure and is the most complicated layer. The chip size is 6×6 cm (w: 1.5×1.5 cm) and the minimum feature size in the layer is 6 µm (w: 1.5 µm).

For the systematic evaluation of the data amount, it is assumed that the progress of device technology by one generation reduces the design rule to 1/2 and does not change the chip size. Then, the following relation holds for the data amount:

$$Q(\lambda) = (\mu/\lambda)^2\, q(\lambda), \qquad (2)$$

where $Q(\lambda)$ is the data amount for the future device with the minimum feature size of $\lambda$. The quantity $q(\lambda)$ is that for the pattern to which the present device is shrunk to have the minimum feature size of $\lambda$. The quantity $\mu$ is the minimum feature size of the present device without shrinkage; i.e., $\mu$ = 6 µm (w: 1.5 µm). The factor $(\mu/\lambda)^2$ keeps the chip size invariant. Relation (2) holds not only for the data amount but also for the number of shots and the calculation time.
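Relation (2) is a simple area scaling and can be sketched in a few lines. The function name `scaled_amount` and the example values are hypothetical; the only content taken from the text is the factor $(\mu/\lambda)^2$ applied to the quantity measured on the shrunk pattern.

```python
def scaled_amount(q_shrunk: float, mu_um: float, lam_um: float) -> float:
    """Apply relation (2): Q(lambda) = (mu / lambda)**2 * q(lambda).

    q_shrunk is the quantity (data amount, shot count, or calculation time)
    measured on the present pattern shrunk to minimum feature size lam_um;
    mu_um is the minimum feature size of the present, unshrunk pattern.
    """
    if mu_um <= 0.0 or lam_um <= 0.0:
        raise ValueError("feature sizes must be positive")
    return (mu_um / lam_um) ** 2 * q_shrunk


if __name__ == "__main__":
    # Illustrative only: shrinking a 6 um pattern to 1.5 um on the reticle
    # multiplies the amount measured on the shrunk chip by 16.
    print(scaled_amount(1.0, 6.0, 1.5))
```

The factor counts how many shrunk copies are needed to fill a chip of unchanged size, which is why it applies equally to shot counts and calculation time.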

The data amounts of elements are shown in Fig. 1. The data size for an element was assumed to be 10 bytes. Figure 1 shows that the element division increases the data amount to 10-20 Gbytes. It then takes 5 h or so to store such a large volume of data to a hard disk from a storage medium such as a magneto-optical disk. This time is critically long.

Fig. 1. Data amount of elements and shot number. A gate array was used for the evaluation. The chip size is 6×6 cm on the reticle. Both the element size and the maximum shot size are 2×2 µm.


There are various possible solutions to this problem. One solution is (pseudo) real-time correction in an EB system. In order to realize real-time correction, the time both for obtaining the representative figures and for evaluating the optimum dose must be less than the writing time. In the following sections, we discuss how to satisfy this condition. Even if real-time correction is not applied, satisfying this condition is effective in shortening the turnaround time for making reticles.


Fig. 2. Model system for real-time proximity effect correction. A1, A2, B, and C are high speed buses, such as direct memory access (DMA) buses. Original figures are transferred through paths A1 and A2. Computer X prepares the representative figures. Representative rectangles are transferred through path B. Computer Y calculates the optimum correction dose. The calculated optimum dose is transferred through path C.

Fig. 3. The method for correction calculation. (a) Original figures. (b) Elements; dots show the centers of the small regions at which the correction dose is evaluated. (c) Reference rectangles (representative figures). (d) Calculated optimum dose. S1: preparation of representative rectangles (reference rectangles). S2: correction calculation using the dose formula method. S3: data transfer to EB system.


Figure 2 shows a simple model system for real-time correction, which is referred to in the following discussions. The calculation is carried out in units of stripes, where a stripe is the region exposed by one continuous stage motion. A pipeline architecture is assumed; that is, while one stripe is processed in computer Y, the next stripe is processed in computer X. The data transfer time on paths A1 and A2 is needed even if the proximity effect correction is not carried out. Therefore, this time is not considered in this article. The data transfer time on paths B and C is discussed in the following section.

IV.  NEW  CORRECTION  ALGORITHM 

Figure  3  shows  the  new  algorithm.  The  improvement 
from  the  previous  algorithm^^  is  that  the  dose  evaluation 
point  is  fixed  at  the  center  in  individual  small  regions  for 
elements. 

Step 1: Representative rectangles are made from the original LSI patterns in individual regions $S_r$, whose size ($2\Delta_r \times 2\Delta_r$ μm) is comparable with the backscattering range $\sigma_b$. Each is used as a reference rectangle in step 2. In the case of the system in Fig. 2, step 1 is carried out by computer X.

Step 2: The optimum dose is evaluated at the center $(x_i, y_j)$ of the individual small regions $S_e$, whose size ($2\Delta_e \times 2\Delta_e$ μm) is sufficiently smaller than the backscattering range. Any suitable formula for the optimum dose can be used by referring to the representative rectangles obtained in step 1. The following shows an example where Pavkovich's formula is used.


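The optimum dose $D(x_i, y_j)$ at the position $(x_i, y_j)$ can be written as

$$D(x_i, y_j) = \frac{c}{1/2 + k\,U(x_i, y_j)}, \tag{3}$$

$$U(x_i, y_j) = \sum_m \phi_1(x_i, y_j, m)\,\phi_2(x_i, y_j, m), \tag{4}$$

$$\phi_1(x_i, y_j, m) = \mathrm{erf}[(r_m - x_i)/\sigma_b] - \mathrm{erf}[(l_m - x_i)/\sigma_b], \tag{5}$$

$$\phi_2(x_i, y_j, m) = \mathrm{erf}[(u_m - y_j)/\sigma_b] - \mathrm{erf}[(d_m - y_j)/\sigma_b], \tag{6}$$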

where $c$ is a constant and $k$ is an optimization parameter for correction. The $x$ coordinates of the right and left sides of the reference rectangle $m$ are denoted by $r_m$ and $l_m$, respectively. The $y$ coordinates of the upper and lower sides of the reference rectangle $m$ are denoted by $u_m$ and $d_m$, respectively. The function $\phi_1$ ($\phi_2$) corresponds to the correlation between a dose evaluation point and one reference rectangle for the $x$ ($y$) coordinate. The summation $\sum$ is carried out over the reference rectangles $m$ that are partly or wholly located within a threshold distance $L$ from the evaluation point $(x_i, y_j)$. For example, the threshold distance is set at $2\sigma_b$. The calculation is carried out by referring to a table of error functions for the reference rectangles.

Step 3: The EB system exposes all shots in a small region $S_e$ with the optimum dose assigned to that small region.
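Steps 1 to 3 can be illustrated with a minimal Python sketch of the dose evaluation in step 2. It is not the STEALTH implementation. It assumes the dose formula has the form $D = c/(1/2 + kU)$ with $U$ a sum of products of error-function differences, as in Eqs. (3)-(6); the function names, the rectangle tuple layout, and the default values of `c` and `k` are hypothetical placeholders, and `math.erf` stands in for the error function table.

```python
import math
from typing import List, Tuple

# Reference rectangle as (l, r, d, u): left, right, lower, upper edges (um).
# This tuple layout is an assumption made for the sketch.
Rect = Tuple[float, float, float, float]


def phi(hi: float, lo: float, p: float, sigma_b: float) -> float:
    """Eqs. (5)/(6): erf[(hi - p)/sigma_b] - erf[(lo - p)/sigma_b]."""
    return math.erf((hi - p) / sigma_b) - math.erf((lo - p) / sigma_b)


def distance_to_rect(x: float, y: float, rect: Rect) -> float:
    """Shortest distance from (x, y) to the rectangle (0 if inside)."""
    l, r, d, u = rect
    dx = max(l - x, 0.0, x - r)
    dy = max(d - y, 0.0, y - u)
    return math.hypot(dx, dy)


def optimum_dose(x: float, y: float, rects: List[Rect], sigma_b: float,
                 c: float = 1.0, k: float = 0.2,
                 threshold: float = None) -> float:
    """Assumed form of Eq. (3): D = c / (1/2 + k*U), with U from Eq. (4)."""
    if threshold is None:
        threshold = 2.0 * sigma_b  # example threshold distance from the text
    u_sum = 0.0
    for rect in rects:
        # Skip reference rectangles lying wholly beyond the threshold distance.
        if distance_to_rect(x, y, rect) > threshold:
            continue
        l, r, d, u = rect
        u_sum += phi(r, l, x, sigma_b) * phi(u, d, y, sigma_b)
    return c / (0.5 + k * u_sum)


def dose_map(x0: float, y0: float, nx: int, ny: int, delta_e: float,
             rects: List[Rect], sigma_b: float,
             c: float = 1.0, k: float = 0.2) -> List[List[float]]:
    """One dose per small region of size 2*delta_e, evaluated at its center."""
    doses = []
    for i in range(nx):
        xc = x0 + (2 * i + 1) * delta_e
        row = []
        for j in range(ny):
            yc = y0 + (2 * j + 1) * delta_e
            row.append(optimum_dose(xc, yc, rects, sigma_b, c, k))
        doses.append(row)
    return doses
```

In this sketch every shot inside a small region would simply receive the dose stored for that region, which corresponds to step 3.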

This procedure reduces the total calculation time compared with the previous algorithm, for two reasons. First, the process for obtaining elements is omitted; only the process for obtaining reference rectangles is needed, and its calculation time is considerably shorter than that for obtaining elements (1/2-1/5). Second, the procedure reduces the calculation time for the correction itself (step 2), for the following reason:

It is found from Eqs. (5) and (6) that the relations

$$\phi_1(x_i, y_1, m) = \phi_1(x_i, y_2, m) = \phi_1(x_i, y_3, m) = \cdots, \tag{7}$$

$$\phi_2(x_1, y_j, m) = \phi_2(x_2, y_j, m) = \phi_2(x_3, y_j, m) = \cdots \tag{8}$$

hold for arbitrary $i$, $j$, and $m$. This is a consequence of fixing the dose evaluation point at the center of the individual small regions $S_e$. Because of relations (7) and (8), the number of references to the error function table is reduced. For example, suppose that the correction is carried out in units of a certain area $A$, such as a subfield, and that $N_e \times N_e$ is the number of small regions $S_e$ in the area $A$. Then the number of references to the error function table for one reference rectangle is $2 \times N_e$. On the other hand, when the evaluation point is not fixed, as in the previous algorithm, relations (7) and (8) do not hold, and the number of references to the error function table is $2 \times N_e \times N_e$. Fixing the evaluation points thus reduces the number of references to the error function table by a factor of $1/N_e$, which is how the high-speed calculation is realized.
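The saving that follows from relations (7) and (8) can be checked with a small Python sketch. It is illustrative only: it counts one evaluation of $\phi_1$ or $\phi_2$ as one reference to the error function table (an assumption about how the count is defined), uses `math.erf` in place of a table, and all function names are hypothetical.

```python
import math


def phi(hi, lo, p, sigma_b):
    """erf[(hi - p)/sigma_b] - erf[(lo - p)/sigma_b], as in Eqs. (5)/(6)."""
    return math.erf((hi - p) / sigma_b) - math.erf((lo - p) / sigma_b)


def u_direct(xs, ys, rects, sigma_b):
    """Evaluate phi_1 and phi_2 afresh at every (x_i, y_j): 2*Ne*Ne per rectangle."""
    calls = 0
    u = [[0.0] * len(ys) for _ in xs]
    for (l, r, d, up) in rects:
        for i, x in enumerate(xs):
            for j, y in enumerate(ys):
                u[i][j] += phi(r, l, x, sigma_b) * phi(up, d, y, sigma_b)
                calls += 2
    return u, calls


def u_separable(xs, ys, rects, sigma_b):
    """Use relations (7)/(8): phi_1 depends only on x_i, phi_2 only on y_j."""
    calls = 0
    u = [[0.0] * len(ys) for _ in xs]
    for (l, r, d, up) in rects:
        p1 = [phi(r, l, x, sigma_b) for x in xs]   # Ne evaluations
        p2 = [phi(up, d, y, sigma_b) for y in ys]  # Ne evaluations
        calls += len(xs) + len(ys)
        for i in range(len(xs)):
            for j in range(len(ys)):
                u[i][j] += p1[i] * p2[j]
    return u, calls
```

For $N_e = 10$ and one reference rectangle, the direct version makes 200 evaluations and the separable version 20, a ratio of $1/N_e$, while both return the same values of $U$.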

The number of calculations for the correction itself (step 2) in this procedure depends only on the region sizes $2\Delta_r$ and $2\Delta_e$ (i.e., on the backscattering range of electrons $\sigma_b$) and does not depend on the number of original LSI patterns. Therefore, the calculation time of the correction itself is independent of the design rule and device generation. This is the advantage of the representative figure method.


V.  CORRECTION  ACCURACY 

A  new  type  of  error  is  caused  by  fixing  the  dose  evalua¬ 
tion  point  at  the  center.  The  error  is  called  F  error  in  this 
article. 

Figure 4 gives a schematic explanation of both the worst case of the edge error and the F error. It is found from Fig. 4 that the worst edge error exceeds the F error. This means that the new algorithm does not deteriorate the correction accuracy compared with the previous algorithm in Ref. 18.

The worst error of the deposited energy is evaluated and is shown in Fig. 5. The worst error appears at the edge of a large pattern whose size is sufficiently larger than the backscattering range $\sigma_b$. The error depends on the normalized small region size ($2\Delta_e/\sigma_b$) for element when the region size for reference rectangle is fixed. The error in Fig. 5 involves all errors; that is, the edge error, the error of Pavkovich's formula, and the intrinsic error of the representative figure method.


VI. SOFTWARE SYSTEMS

In order to evaluate the calculation speed, the software system P3C (PPPC; preprocessor for proximity effect correction) is used, and the new software system STEALTH (system for proximity effect correction for advanced lithography and reticle) is developed.

Fig. 4. Edge error and F error. Small arrows show the points at which the optimum dose is evaluated. (a) shows the worst edge error, (b) shows the case where the evaluation point is at the center of gravity of the pattern in a small region, and (c) shows the F error, where the evaluation point is fixed at the center of the small region. Deposited energy at the evaluation point is almost ideal for (a), (b), and (c) in that only the intrinsic error of the representative figure method and the error of the correction algorithm appear at the point. The inclination of the deposited energy becomes maximum in case (a) because the effect of backscattered electrons is maximum. Thus, the deposited energy deviates in a small region as shown in (d). The worst edge error exceeds the F error.


Fig. 5. Error dependence on the small region size for element $2\Delta_e$. Correction error of deposited energy was evaluated by numerical calculation. The region size for reference rectangle was fixed at 10 × 10 μm. The worst error appears at the edge of a large figure (whose size is sufficiently larger than the backscattering range $\sigma_b$).
[Figs. 6 and 7 diagram labels: GROUP1, GROUP2, BLOCK1, BLOCK2. Group fields: COORDINATE = X1, Y1; SIZE OF BLOCK = W1, H1; NUM. OF BLOCKS = 2; POINTER TO BLOCK DATA. Block fields: COORDINATE = X2, Y2; NUM. OF REP. FIG. = 2; COORDINATE (L1, R1, U1, D1); COORDINATE (L2, R2, U2, D2); NUM. OF REP. FIG. = 3.]

Fig. 6. Data format for STEALTH (1). Layout in a chip is shown. Shaded rectangles are the representative rectangles, which are referred to for correction calculation. Data consists of group, block, and representative figures. Corresponding data format is shown in Fig. 7.

Fig. 7. Data format for STEALTH (2). Representative figures in Fig. 6 are described according to this format.


A.  P3C  (obtaining  representative  rectangles) 

Details  of  the  system  are  explained  in  Ref.  20.  Only  the 
rough  sketch  of  the  system  is  given  in  the  following. 

P3C carries out step 1 in Sec. IV and makes the representative figures from the original LSI pattern. The P3C system carries out the calculation in parallel using 4 CPUs. The data format for the EB writing system EX-8D is used for input and output data. The output from a data conversion system is used as input data for the P3C system.

The data format is different from the format used in STEALTH, which is explained in the next subsection. However, the difference is negligible for evaluating the calculation time.

B.  STEALTH 

The STEALTH software system carries out the correction calculation in step 2 described in Sec. IV.

The system does not use a hard disk at the application (software) level. The STEALTH system accesses the input data in memory and stores the calculated optimum dose in system memory. This is (1) to eliminate the disk access time and (2) to allow application to a real-time system, such as the system shown in Fig. 2.

The  data  format  for  input  data  is  schematically  explained 
in  Figs.  6  and  7.  The  data  consists  of  the  block  in  which 
some  representative  rectangles  exist.  The  format  is  deter¬ 
mined  to  reduce  data  handling  time. 
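The group/block/representative-figure hierarchy can be pictured with a short Python sketch. The field names below are taken loosely from the labels visible in Fig. 7 (coordinate, size of block, number of blocks, number of representative figures, and the edges L, R, U, D); how those fields are actually packed in STEALTH is not given in the text, so the class layout is an assumption.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class RepRect:
    """One representative (reference) rectangle: edges L, R, U, D."""
    left: float
    right: float
    upper: float
    lower: float


@dataclass
class Block:
    """A block with its own coordinate and the representative figures it holds."""
    x: float
    y: float
    rects: List[RepRect] = field(default_factory=list)

    @property
    def num_rep_fig(self) -> int:
        return len(self.rects)


@dataclass
class Group:
    """A group: origin, common block size, and its blocks ("pointer to block data")."""
    x: float
    y: float
    block_w: float
    block_h: float
    blocks: List[Block] = field(default_factory=list)

    @property
    def num_blocks(self) -> int:
        return len(self.blocks)
```

Keeping the rectangles grouped by block means a correction step only has to touch the blocks near its target, which is in line with the stated aim of reducing data handling time.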

The STEALTH system calculates the optimum dose block by block; the block being processed is called the target block. The main sequence is as follows:

(1) (P-1) Search for nearby blocks that touch or are within the threshold distance from the edge of the target block. Figure 8 shows the searched blocks and the target block.

(2) (P-2) Obtain the correlations $\phi_1$ and $\phi_2$ by referring to the reference rectangles and using the error function table.

(3) (P-3) Evaluate the value of $U$ [calculate Eq. (4)].

(4) (P-4) Calculate the optimum dose [Eq. (3)].
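Step P-1 of the sequence above can be sketched in Python as a search over a regular grid of blocks. This is only one possible reading: it assumes blocks are square, indexed by integer grid position, that "within the threshold distance" means a Euclidean edge-to-edge gap no larger than the threshold, and that the target block itself is included. The function name and the dictionary layout are hypothetical.

```python
import math
from typing import Dict, List, Tuple

BlockIndex = Tuple[int, int]


def nearby_blocks(target: BlockIndex,
                  blocks: Dict[BlockIndex, list],
                  block_size: float,
                  threshold: float) -> List[BlockIndex]:
    """Return indices of existing blocks that touch the target block or lie
    within `threshold` of its edge (the target block itself is included)."""
    tx, ty = target
    # A block n index steps away has an edge gap of (n - 1) * block_size.
    reach = int(math.floor(threshold / block_size)) + 1
    found = []
    for ix in range(tx - reach, tx + reach + 1):
        for iy in range(ty - reach, ty + reach + 1):
            if (ix, iy) not in blocks:
                continue  # empty blocks are not stored
            gap_x = max(0, abs(ix - tx) - 1) * block_size
            gap_y = max(0, abs(iy - ty) - 1) * block_size
            if math.hypot(gap_x, gap_y) <= threshold:
                found.append((ix, iy))
    return found
```

With 20 μm blocks and a 20 μm threshold, this reading selects the 5 × 5 neighborhood of the target block minus its four corner blocks.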

The  STEALTH  system  is  not  applied  to  the  arrayed  pattern, 
because  the  array  structure  can  be  easily  expanded  by  P3C 
without  a  burden.  P3C  can  make  one  representative  figure 
for  several  arrayed  patterns  in  a  small  region  and  the 
obtained  representative  figure  can  be  used  several  times  for 
expansion.  Such  a  process  requires  relatively  little  time. 

Fig. 8. Blocks used for correction. The position marked as (x) in a target block shows the center of the small region at which the optimum dose is evaluated. Shaded rectangles are the reference rectangles (representative figures). Dotted rectangles show the blocks which touch or are within $2\sigma_b$ from the edge of the target block. Reference rectangles in the dotted blocks (including the target block) are referred to for correction calculation.

When the STEALTH system is used in the model system in Fig. 2, attention must be paid to the data transfer time on paths B and C. It is assumed that (1) the data transfer speed is 20 Mbytes/s, (2) the small region sizes $2\Delta_e$ and $2\Delta_r$ are 2 and 10 μm, respectively, and (3) the data are transferred in units of a stripe whose size is 0.1 × 14 cm. Regarding path B (input of STEALTH), the data amount for one stripe is at most about 22 Mbytes when the data amount for one representative figure is 16 bytes. The corresponding data transfer time on path B is 0.8 s. Regarding path C (output of STEALTH), the data amount for one stripe is 35 Mbytes when the data amount for one optimum dose is 1 byte. The corresponding data transfer time is about 1.8 s. The time for writing one reticle will be over 1000 s for the writing area of 200 cm² (≈14 × 14 cm), as will be shown in Sec. VIII. Thus, the time for writing one stripe is over 7 s. The estimated data transfer times are sufficiently small compared with the time for writing one stripe, so the data transfer time does not limit the throughput of the EB system.
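The data amounts quoted for paths B and C can be reproduced with a short Python calculation. It assumes one representative figure per 10 μm region and one dose value per 2 μm region, takes 1 Mbyte as 10^6 bytes, and uses hypothetical function and parameter names. Under these simple assumptions the path B transfer time comes out at about 1.1 s, the same order as the 0.8 s quoted in the text; the difference presumably reflects details not stated there.

```python
def stripe_budget(stripe_w_um=1000.0, stripe_h_um=140000.0,
                  region_r_um=10.0, region_e_um=2.0,
                  bytes_per_rect=16, bytes_per_dose=1,
                  bus_bytes_per_s=20e6,
                  reticle_area_cm2=200.0, reticle_write_s=1000.0):
    """Per-stripe data amounts, transfer times, and writing time."""
    n_r = (stripe_w_um / region_r_um) * (stripe_h_um / region_r_um)
    n_e = (stripe_w_um / region_e_um) * (stripe_h_um / region_e_um)
    in_bytes = n_r * bytes_per_rect    # path B: representative rectangles
    out_bytes = n_e * bytes_per_dose   # path C: optimum doses
    stripe_area_cm2 = (stripe_w_um * 1e-4) * (stripe_h_um * 1e-4)
    n_stripes = reticle_area_cm2 / stripe_area_cm2
    return {
        "path_B_bytes": in_bytes,
        "path_C_bytes": out_bytes,
        "path_B_seconds": in_bytes / bus_bytes_per_s,
        "path_C_seconds": out_bytes / bus_bytes_per_s,
        "write_seconds_per_stripe": reticle_write_s / n_stripes,
    }


if __name__ == "__main__":
    for name, value in stripe_budget().items():
        print(f"{name}: {value:.3g}")
```

Both transfer times stay well below the roughly 7 s needed to write one stripe, which is the point of the estimate.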

VII. CALCULATION TIME

A.  P3C  (step  1) 

An  engineering  workstation  (EWS)  with  4  CPUs  was 
used.  The  processing  speed  of  each  CPU  is  52  MIPS.  (Be¬ 
cause  superscalar  architecture  is  adopted  for  the  micropro¬ 
cessor,  effective  MIPS  value  depends  on  a  program.  The  ef¬ 
fective  value  is  supposed  to  be  larger  than  52  MIPS  for  P3C. 
The  MIPS  value  should  be  considered  as  a  guideline.  The 
value  of  SPECint  92  is  40  for  the  processor.)  The  P3C  sys¬ 
tem  carries  out  the  calculation  in  parallel  using  4  CPUs. 

When  the  device  pattern  has  array  structure,  the  calcula¬ 
tion  time  becomes  short.  For  example,  the  calculation  time 
for  64  M  DRAM  is  less  than  1  min  for  even  the  most  com¬ 
plicated  layer  of  the  device.  Thus,  the  gate  array  described  in 
Sec. II is used for the worst-case evaluation. Relation (2) is used for systematic evaluation, and the evaluated calculation time is converted to the condition that the writing area is 200 cm² on the reticle.

The  results  are  shown  in  Fig.  9.  The  calculation  time  is 
about  1/4-1/70  of  the  data  conversion  time.  The  acceleration 
ratio  by  4  CPU  multiprocessing  is  about  3.7  (370%),  in  the 
case  of  a  130  k  gate  array. 

B.  STEALTH  (step  2) 

Only 1 CPU of the computer described in the previous subsection was used. The small region size $2\Delta_r$ for the reference rectangle is fixed at 10 μm. The block size was set at 20 × 20 μm, which corresponds to $2\sigma_b$. The threshold distance is also set at $2\sigma_b$. The corresponding cutoff error is 0.25% at most, and is negligibly small.

Proximity effect correction was carried out for a region of 1 × 1 cm. The evaluation was carried out under the condition that representative figures exist in all small regions $S_r$. The number of representative figures is $1 \times 10^6$.

The obtained (elapsed) calculation time is shown in Fig. 10. The calculation time is of the order of 1 min. The calculation speed per MIPS by STEALTH is 30 times faster than that by the previous system SPIDER when the small region size $2\Delta_e$ is 2 μm.


[Figure 9 axes: number of gates (130K, 520K, 2.1M, 8.3M, 33M, 134M); minimum feature size on reticle (μm).]

Fig. 9. Calculation time by P3C. (a) Time for obtaining representative rectangles by P3C. (b) Data conversion time (reference). Parallel processing using 4 CPUs was adopted. Calculation speed of 1 CPU is 52 MIPS. A gate array was evaluated. Chip size was 6 × 6 cm on reticle. Small region size for reference rectangles $2\Delta_r$ was set at 10 μm on reticle.


Figure 10 shows that almost all of the calculation time consists of the time for evaluating Eq. (4) (P-3 in Sec. VI). The time required for searching the nearby blocks is small. This shows the effectiveness of the data format described in Sec. VI. The time for evaluating Eqs. (5) and (6) (P-2 in Sec. VI) is also small, as expected in Sec. IV. This shows the effectiveness of the new algorithm; that is, fixing the evaluation point at the center of the small region $S_e$.

[Figure 10 axes: size of small region $S_e$, $2\Delta_e$ (μm); number of small regions $S_e$ in $2\sigma_b \times 2\sigma_b$ (20 × 20 μm).]

Fig. 10. Calculation time for correction itself (by STEALTH). A 52 MIPS, 1 CPU computer was used. A region of 1 × 1 cm on reticle was corrected. Small region size for reference rectangle $2\Delta_r$ was fixed at 10 × 10 μm. Calculation time depends on the small region size for element. (a) Time for searching nearby blocks (Sec. VI, P-1); (b) time for calculating the correlation $\phi$ (Sec. VI, P-2); (c) time for calculating Eq. (4) in Sec. IV (Sec. VI, P-3); (d) time for calculating Eq. (3) in Sec. IV (Sec. VI, P-4); (e) total calculation time by STEALTH.

[Figure 11 axes: number of gates; minimum feature size on reticle (μm).]

Fig. 11. Time for writing gate array on reticle. Writing area is 200 cm². Cycle time of the EB system will decrease in accordance with the minimum feature size of the devices which will be written in the future. The values 100-400 ns shown in the figure are the cycle times of the EB system. Black circles show the combinations of minimum feature size and cycle time that are used or will be used in the future.

VIII.  DISCUSSION 

In this section, the calculation speed of the computer required to keep the calculation time below the writing time is evaluated.

A.  Writing  time 

The shot number of the gate array described in Sec. II was evaluated and is shown in Fig. 1. Equation (2) was used for the evaluation. The maximum shot size was assumed to be 2 × 2 μm.

The corresponding writing time is shown in Fig. 11. The writing area was set at 200 cm². The shot number in Fig. 1 was converted in proportion to the difference in the writing area. The writing time was evaluated as (shot number × cycle time), where the cycle time means the sum of the time for one shot exposure and the preparation time from the end of one shot exposure to the start of the next. Other times, such as the time for positioning a subfield, were ignored. The cycle time will decrease with the minimum feature size of the devices to be written by the EB system in the future. In this and the following subsections, three cycle times (400, 200, and 100 ns) were used for the evaluation. When the minimum feature size is 0.2 μm, the writing time becomes 3200 s ≈ 0.9 h.
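The writing-time estimate is a single multiplication, sketched below in Python. The function names are hypothetical. Pairing the 3200 s figure with the 100 ns cycle time follows the combination quoted later in this section, and the shot count it implies (about 3.2 × 10^10) is derived here rather than stated in the text.

```python
def writing_time_s(shot_number: float, cycle_time_ns: float) -> float:
    """Writing time = shot number x cycle time (other overheads ignored)."""
    return shot_number * cycle_time_ns * 1e-9


def implied_shot_number(writing_time: float, cycle_time_ns: float) -> float:
    """Invert the relation to see how many shots a quoted time corresponds to."""
    return writing_time / (cycle_time_ns * 1e-9)


if __name__ == "__main__":
    shots = implied_shot_number(3200.0, 100.0)
    print(f"shots: {shots:.2e}, hours: {writing_time_s(shots, 100.0) / 3600:.2f}")
```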

B.  P3C 

In this and the next subsections, parallel processing using 4 CPUs is assumed, and the calculation speed of the computer required to keep the calculation time below the writing time (in Fig. 11) is evaluated. It is assumed that a correction error of 6% is permissible for the deposited energy. Correspondingly, the sizes of the small regions $S_e$ and $S_r$ are fixed at 2 and 10 μm, respectively, from Fig. 5.

[Figure 12 axes: number of gates (130K, 520K, 2.1M, 8.3M, 33M, 134M); minimum feature size on reticle (μm).]

Fig. 12. Required calculation speed of computer for obtaining the representative figures by P3C. Parallel processing using 4 CPUs was assumed. The figure shows the calculation speed of the computer that can keep the calculation time by P3C below the writing time. The small region size for reference rectangle was set at 10 μm. The values 100-400 ns shown in the figure are the cycle times of the EB system. Black circles show the combinations of minimum feature size and cycle time that are used or will be used in the future.

Figure  12  shows  the  evaluated  calculation  speed  of  com¬ 
puter,  which  is  required  for  P3C  to  finish  the  calculation 
within  the  writing  time.  The  estimation  was  carried  out  by 
comparing  the  writing  time  and  the  calculation  time  of  P3C 
for  a  unit  area. 

The required calculation speed increases with each successive device generation for the following reason: The complexity of the device pattern increases with the device generation, and the corresponding calculation time by P3C increases. On the other hand, the number of shots (i.e., the writing time) does not increase much with each device generation.

When the minimum feature size is 3.0 μm (w: 0.8 μm) and the cycle time is 400 ns, the corresponding writing time is 30 min. The required calculation speed of the computer is 40 MIPS/CPU × 4 CPU. Present EWSs satisfy this condition. The highest calculation speed is required when the minimum feature size is 0.2 μm (w: 0.05 μm) and the cycle time is 100 ns. The corresponding writing time is 3200 s. The required calculation speed is 500 MIPS/CPU × 4 CPU. Although the present EWS does not satisfy this condition, the required calculation speed is not a problem, because microprocessors of 1000 MIPS are expected to be produced by the time the minimum feature size is 0.2 μm (w: 0.05 μm).


[Figure 13 axes: number of gates (130K, 520K, 2.1M, 8.3M, 33M, 134M); minimum feature size on reticle (μm).]

Fig. 13. Required calculation speed of computer for obtaining the correction dose by STEALTH. Parallel processing using 4 CPUs was assumed. The figure shows the calculation speed of the computer that can keep the calculation time for the correction itself below the writing time. Small region sizes for reference rectangle $2\Delta_r$ and for element $2\Delta_e$ were set at 10 and 2 μm, respectively. The values 100-400 ns shown in the figure are the cycle times of the EB system. Black circles show the combinations of minimum feature size and cycle time that are used or will be used in the future.

C.  STEALTH 

The acceleration ratio by 4 CPU parallel processing is assumed to be 3.6. The real ratio will be better than this value because STEALTH accesses a high-speed bus and does not access a low-speed hard disk.

Figure 13 shows the calculation speed required for STEALTH to complete the calculation within the writing time. The estimation was carried out by comparing the writing time and the calculation time of STEALTH for a unit area.

The required calculation speed decreases with each successive device generation under the condition that the cycle time is independent of the device generation. The reasons are that (1) the calculation time for the correction itself is independent of the device generation because of the representative figure method, while (2) the number of shots (i.e., the writing time) increases with each successive device generation.

When the minimum feature size is 0.2 μm (w: 0.05 μm) and the cycle time is 100 ns, the corresponding writing time is 3200 s. Then, a calculation speed of 120 MIPS/CPU × 4 CPU is required. Present EWSs already satisfy this condition. On the other hand, the highest calculation speed of the computer is required when the minimum feature size is 1.5 μm (w: 0.4 μm) and the cycle time is 200 ns. The corresponding writing time is 1100 s. The required calculation speed is 350 MIPS/CPU × 4 CPU. This calculation speed can be effectively realized, for example, by using two computer systems (200 MIPS/CPU × 4 CPU/system × two systems).

Although the system shown in Fig. 2 was assumed in this article for simplicity, other types of systems can also realize real-time correction. For example, a super multiprocessing computer and dedicated data processing hardware are other candidates for real-time correction.

IX.  SUMMARY 

A new algorithm for obtaining the optimum correction dose has been proposed and discussed for application to reticle writing. The representative figure method and the dose formula method are used for the main algorithm. A characteristic feature of the algorithm is that the dose evaluation point is fixed at the center of the small regions $S_e$. This leads to high-speed correction calculation without deteriorating the correction accuracy. The STEALTH system, which realizes the correction algorithm, has been developed. The calculation time by STEALTH was 1/30 of that by the previous system.

Under the assumption that parallel processing is adopted, the calculation speed of the computer has been evaluated that satisfies the condition that both (1) the time for obtaining the representative rectangles and (2) the calculation time for the correction itself are less than the writing time. The evaluation has been carried out using a gate array pattern down to a minimum feature size of 0.2 μm on reticle (0.05 μm on wafer). The required calculation speed was 500 MIPS/CPU × 4 CPUs at most.

This  result  suggests  that  (1)  the  calculation  time  for  cor¬ 
rection  can  be  suppressed  to  less  than  writing  time,  and  (2) 
the  real-time  proximity  effect  correction  is  possible  for 
reticle  fabrication.  Furthermore,  when  the  new  algorithm  is 
applied  to  x-ray  mask  fabrication  and  direct  writing  system, 
the  calculation  time  can  be  suppressed  to  less  than  the  writ¬ 
ing  time,  as  shown  in  Appendices  A  and  B. 

Note added in proof: Certain items discussed in this article are also discussed in Ref. 20. In the present article, however, these items are discussed in greater detail or are modified.
APPENDIX A: SCALING AND APPLICATION TO X-RAY MASK FABRICATION

Highly accurate proximity effect correction is needed to make a 1:1 x-ray mask. The new algorithm of this article can be applied to x-ray mask fabrication.

When the acceleration voltage is 50 kV, the backscattering range of the electron $\sigma_b$ for x-ray mask fabrication is about 1/4 of that for reticle fabrication. Suppose that all lengths are scaled by (1/4), as shown in Table I, corresponding to the difference in the backscattering range. The writing region of 12.5 cm² (≈3.5 × 3.5 cm) is reasonable for x-ray mask fabrication because 2-4 chips will be written in a mask. Table I shows that the maximum shot size for x-ray mask fabrication is (1/4) of that for reticle fabrication. This condition is also reasonable because the optimum size of the maximum shot decreases with the minimum feature size of the pattern.

Table I. Scaling relation between x-ray mask fabrication and reticle fabrication. Acceleration voltage is 50 kV.

                                  Reticle fabrication             X-ray mask fabrication (scaling: 1/4)
Backscattering range              10 μm                           2.5 μm
Maximum shot size                 2 μm                            0.5 μm
Small region size 2Δr             10 μm                           2.5 μm
Small region size 2Δe             2 μm                            0.5 μm
Minimum feature size (example)    0.75 μm on reticle              0.2 μm
                                  (0.2 μm on wafer)
Writing region (example)          200 cm² on mask                 200/4² = 12.5 cm²
                                  (200/4² = 12.5 cm² on wafer)

Under the scaling conditions of Table I, the discussion and the results in this article can be directly applied to x-ray mask fabrication. When the cycle time is 200 ns and the minimum feature size is 0.75 μm (≈0.2 μm × 4) on reticle, the writing time is 1600 s and the required calculation speed for STEALTH is 230 MIPS for reticle fabrication under the conditions of Table I. Correspondingly, when the cycle time is 200 ns and the minimum feature size is 0.2 μm, the writing time and the required calculation speed for STEALTH are likewise 1600 s and 230 MIPS, respectively, for x-ray mask fabrication.
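The reason the two cases give identical numbers can be made explicit with a short worked relation. It assumes, as a simplification not written in this form in the text, that the shot number $N_{\mathrm{shot}}$ and the number of small regions $N_e$ are proportional to the writing area $A$ divided by the square of the relevant length (the maximum shot size $s$ and the region size $2\Delta_e$); primes denote the x-ray mask case.

```latex
\begin{align*}
N_{\mathrm{shot}}' &\propto \frac{A/4^{2}}{(s/4)^{2}} = \frac{A}{s^{2}} \propto N_{\mathrm{shot}}, \\
N_e' &= \frac{A/4^{2}}{(2\Delta_e/4)^{2}} = \frac{A}{(2\Delta_e)^{2}} = N_e, \\
\frac{12.5\ \mathrm{cm}^2}{(0.5\ \mu\mathrm{m})^{2}} &= \frac{200\ \mathrm{cm}^2}{(2\ \mu\mathrm{m})^{2}} = 5 \times 10^{9}.
\end{align*}
```

Under this assumption both the shot number and the number of dose evaluations are unchanged by the (1/4) scaling, so at the same cycle time the writing time and the required calculation speed are unchanged as well.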

This  discussion  suggests  that  real-time  correction  can  be 
carried  out  for  x-ray  mask  fabrication. 


APPENDIX  B:  APPLICATION  TO  DIRECT  WRITING 
PROCESS 

In  this  Appendix,  all  types  of  length  are  measured  on  a 
wafer. 

The algorithm described in Sec. IV can be applied to the direct writing process. When the data format in Ref. 17 is used, the amount of dose data for one chip is at most 30 Mbytes. Thus, the data amount is not a problem, and real-time correction is not needed in the case of the direct writing process. However, the high-speed correction calculation has a great advantage, because the correction calculation itself occupies 50%-99% of the total calculation time when the previous algorithm is used.

The calculation time using the new algorithm is evaluated. The gate array described in Sec. II and Eq. (2) are used for the evaluation. Parallel processing using 4 CPUs is assumed. Two calculation speeds per CPU are assumed: 50 MIPS and 200 MIPS. The acceleration ratio by the parallel processing is set at 3.7 for the calculation of the correction itself.

Fig. 14. Application to the direct writing process. Time for calculating optimum dose. The (black/white) inverted pattern of the gate array was used for evaluation (wiring layer). Chip size is 1.5 × 1.5 cm. Small region sizes for reference rectangle and for element were set at 10 and 2 μm, respectively. (a) Time for obtaining reference rectangles; (b) calculation time for correction itself; (c) total time for correction (a)+(b); (d) data conversion time (reference).

The result of the evaluation is shown in Fig. 14. The new algorithm in this article completes the calculation for the correction itself within 2 min or so. This value is negligibly small compared with the writing time. The calculation time by STEALTH is about (1/30) of that by the previous system.

Even when the minimum feature size is 0.05 μm, the total calculation time is 1.5 and 0.4 h using 50 MIPS × 4 CPU and 200 MIPS × 4 CPU computers, respectively. The time for writing 100 chips was evaluated as about 11 h. The total time for correction is sufficiently small compared with the writing time.


APPENDIX  C:  EDGE  ERROR 

When  the  representative  figure  method  is  used,  the  whole 
pattern  in  a  small  region  Sg  is  exposed  by  a  uniform  dose. 
This  causes  the  error.  The  error  is  called  edge  error  because 
the  error  becomes  maximum  at  the  edge  of  a  large  pattern. 

In Ref. 16, an estimate of the edge error was given;
however, the details of the estimation were omitted. They
are described in the following. Figures 15(a) and 15(b)
show the deposited energy with and without proximity effect
correction, respectively. It is assumed that the profile of the
deposited energy in the region (p) in Fig. 15(a) is the same
as that in the region (q) in Fig. 15(b). Then the edge error
$\pm\delta E$ is approximated by the error $\pm\delta E^{*}$.

In the following, the error $\pm\delta E^{*}$ at the edge of the lines
and spaces (L/S) pattern is discussed. Pattern density is denoted
as p. When p is 1, the L/S pattern becomes a large
figure and the error takes the largest value. The size of the small
region is described as $2\Delta$.

Fig. 15. Edge error. Finiteness of the small region size for element causes
the edge error. A uniform dose is assigned to all figures in a small region; this
causes the edge error. (a) and (b) show the cases with and without proximity
effect correction, respectively.

The  deposited  energy  is  described  as 


J  pattern  1  + 

Xexp[  — (jc— jc')^/cr^]|  tijc'.  (Cl) 

Assuming that the linewidth and space width are much smaller than $\sigma_b$, and
that the edge of the L/S pattern is at x = 0, Eq. (C1) is modified as

X  exp[  ~{x~x')^/ al]  ^dx\  (C2) 

Setting  u=x—x',  Eq.  (C2)  is  modified  to 

(C3) 


Vf 

S{u)-^  —j= —  cxp(  ~u^/cri) 

yj7T(Tt 


du. 


The inclination q of the deposited energy at the edge (x = 0)
is obtained from the second term of Eq. (C3) (the backscattering
term) as

q  =  dE{x)ldx\j^=Q 


-PVE 


du 


/  dX^=:Q 


=  PVeI\  1  +  ] .  (C4) 

Therefore,  the  edge  error  ±  SE  is  described  as 

SE~  SE*  =  qA=pAvEf[  1  +  7,^/2)].  (C5) 

When p = 1, the equation coincides with the estimate of
Ref. 16 and gives the maximum value of the edge error.

The estimation of the edge error is useful not only when
using the representative figure method but also for other aspects
of proximity effect correction, as follows:

Case 1: When the character projection method is
used and the dose correction method is applied, the dose within
one character becomes uniform. This causes an error, which
can be regarded as a kind of edge error. Thus,
the estimation gives a guideline for the maximum character
size.

Case 2: When the dose correction method is used and the
representative figure method is not used, the correction dose
is assigned to a small figure, which is also called an element.
The dose for an element differs from that for the adjacent
element. This causes a connection error between two
shots. This error can also be regarded as a kind of edge
error and can be evaluated by Eq. (C5). The equation gives
the maximum element size even when the representative
figure method is not used.

Case 3: When Ghost or Phantom is used, alignment
is needed for the subsidiary exposure. Equation (C5) gives
a rough guideline for the permissible alignment error.

Case 4: When the dose formula method is used, a table
of the error function must be made. One value of the error
function is assigned to a finite range of distance, such as
($x/\sigma_b$ = 0.5)-($x/\sigma_b$ = 0.55). The finite grid size causes a
calculation error. This error is also roughly estimated by Eq.
(C5) by setting p = 1.

In cases 1-3, Eq. (C5) gives a size (and alignment error)
of 2 μm as a guideline to keep the error below
3.3% when the acceleration voltage is 50 kV, the substrate is
Si, and $\eta$ is 0.8. In case 4, a grid size of 0.1 μm is sufficient
to keep the error below 0.17%.
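Because Eq. (C5) writes the edge error as the product of the slope q and the size, the estimate scales linearly with the size when everything else is held fixed. The sketch below uses only that proportionality to relate the two guidelines quoted above; the function names are illustrative, and holding q (pattern density, beam parameters) constant between the two cases is an assumption of this example.

```python
def scaled_edge_error(ref_error, ref_size, new_size):
    """Scale an edge-error estimate linearly with size (error = q * size).

    Assumes the slope q is unchanged between the reference and the new case.
    """
    return ref_error * new_size / ref_size


def max_size_for_error(ref_error, ref_size, target_error):
    """Invert the linear relation: largest size that keeps the error at target."""
    return ref_size * target_error / ref_error


if __name__ == "__main__":
    # Reference guideline from the text: 2 um corresponds to 3.3 %.
    err = scaled_edge_error(ref_error=3.3, ref_size=2.0, new_size=0.1)
    print(f"estimated error for a 0.1 um grid: {err:.3f} %")
```

Scaling the 2 μm guideline down to a 0.1 μm grid in this way gives a value close to the 0.17% quoted for case 4.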
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Surface  tension  of  substrates  and  layers  used  in  semiconductor  technology  (e.g.,  Si,  Si02,  Si3N4, 
AlCu,  and  WTi)  as  well  as  surface  tension  of  materials  of  photolithography  as  resists  and  developers 
were  determined  by  contact  angle  measurements.  Resists  and  developers  could  be  divided  into  two 
groups  with  polar  and  nonpolar  behavior,  respectively.  The  pretreatment  of  the  substrates  (cleaning, 
dehydration, and exposure to humidity) influences the surface tension. It was found that surface
modification by primers, for example, hexamethyldisilazane and trimethylsilyldiethylamine, provides
stable and reproducible surface tension. To reduce the defect density of resist structures, a general
approach for optimizing surface tension was derived and successfully applied, using Si and AlCu
as substrates in a typical photolithographic process. The results show that good resist adhesion is
achieved if the work of adhesion is greater than 5 dyn/cm. A process
window for the contact angle is suggested for certain combinations of resist and developer types.
©  1996  American  Vacuum  Society. 


I.  INTRODUCTION 

Surface tension of photolithographic materials such as resist
and developer is of great importance for the photolithographic
process itself. For instance, homogeneity of spin coatings,
planarization, and adhesion depend directly on the surface
tension of the materials involved. In this article, the influence of
the surface tension of photoresists and developers on adhesion
and wetting is investigated and a method to optimize the
process is suggested. In particular, for feature sizes
<1 μm, optimizing the adhesion of these small resist
structures is an important factor in decreasing defect
density.

Generally, the lithographic process requires a modification
of substrate surfaces before resist coating to prevent small
features from detaching during developing, water rinsing, and
subsequent wet etch processes. Primers such as hexamethyldisilazane
(HMDS), trimethylsilyldiethylamine (TMSDEA), and
others reduce the surface tension of materials used in
semiconductor technology, such as Si, SiO2, Si3N4, AlCu, and WTi
(Fig. 1). The primer reacts chemically with water deposits
held by high-energy substrate surfaces or, more precisely,
with bound OH groups. Thus the surface tension is
reduced, and so is the possibility of penetration of developer,
water, or etchants between the resist and the modified substrate
surface.

II.  THEORETICAL  BASICS  AND  EXPERIMENTAL 
METHODS 

A.  Surface  tension  measurement  of  solids  and  fluids 

Applying the drop method (Figs. 2 and 3), the nonpolar
part of the surface tension $\sigma^{d}$ (caused by dispersion energy
between molecules) and the polar part of the surface tension $\sigma^{p}$
(caused by dipole interaction, induced dipole moments, and
hydrogen bonds) are measured at once (measurement device:
SURFTENS from OEG GmbH). The basic additive approach

$$\sigma = \sigma^{d} + \sigma^{p} \qquad (1)$$

is based on publications of Fowkes, Gardon, Hansen and
Beerbower, and Meyer et al. The model of the
harmonic mean is used to approximate the interaction of
low-energy phases (symbols: see Fig. 2):

$$W_{asl} = (1+\cos\Theta)\,\sigma_{gl} = 4\left[\frac{\sigma_{gl}^{d}\,\sigma_{gs}^{d}}{\sigma_{gl}^{d}+\sigma_{gs}^{d}} + \frac{\sigma_{gl}^{p}\,\sigma_{gs}^{p}}{\sigma_{gl}^{p}+\sigma_{gs}^{p}}\right]. \qquad (2)$$


The work of adhesion $W_{asl}$ is defined as the reversible work
necessary to create an interface area of 1 cm² between two
chemically different materials in the solid and fluid phases, or to
separate them. This work of adhesion is given by the Dupré
equation as follows:

$$W_{asl} = \sigma_{gl} + \sigma_{gs} - \sigma_{ls}. \qquad (3)$$

In the following, all surfaces are modified by organic
compounds. These compounds strongly reduce the spreading
pressure, which is therefore neglected in what follows.
Using two testing liquids of different polarity,
it is possible to determine the surface tension of the substrate
by an analytical solution of Eq. (2). In our experiments,
water (16-17 MΩ cm; $\sigma_{gl}^{p}$ = 50.7 dyn/cm, $\sigma_{gl}^{d}$ = 22.1 dyn/cm)
and methylene iodide (CH2I2; $\sigma_{gl}^{p}$ = 6.7 dyn/cm, $\sigma_{gl}^{d}$ = 44.1
dyn/cm) were used. If the surface tensions of the substrates
used are known ($\sigma_{s1}$, $\sigma_{s2}$), the surface tension of liquids can
be determined by solving Eq. (2) using the Newton-Raphson
method. To support the iteration, a start value was calculated
[see Eq. (4a)]:
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[1  ] 

[Reaction scheme: 2 Si-OH (hydrophilic surface) + ((CH3)3Si)2NH -> 2 Si-O-Si(CH3)3 (hydrophobic surface)]

Fig. 1. Modification of high-energy hydrophilic surfaces by applying
HMDS to improve resist adhesion.


Fig.  3.  Examples  for  contact  angle  of  water  dy,  after  different  surface  modi¬ 
fications  of  Si  with  HMDS/5  %  TMDS  (hotplate  temperature  Tp=\lQ°C) 
using only different prime times. (a) Prime time = 2 s, $\Theta_w$ = 40°; (b) prime
time = 20 s, $\Theta_w$ = 70°.


o-gi(  start)  =  (TsiF(  e  1 )  +  0-f2^(  ©  2) . 

o-^i(  start)  =  o-f  1  F(  e  1 )  +  0  2) . 

l-(l  +  (cos  e,)/4) 

^  2(1 +  (cos  0/)/4) 


(4a) 

(4b) 


Measurement conditions were: clean room class <100, relative
humidity 45% ± 2%, and room temperature 21 °C. The drop
volume was about 5-10 μl. Within 3 s after applying the testing
liquid, a picture was taken and the contact angle was
calculated by the tangent method.
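The two-liquid evaluation described in this section can be sketched numerically. The example below takes the harmonic-mean relation of Eq. (2) and the liquid data quoted in the text, and recovers the dispersive and polar parts of a solid from the contact angles of water and methylene iodide. It is a minimal sketch, not the authors' procedure: it uses a two-dimensional Newton-Raphson iteration in place of the analytical solution mentioned in the text, and the function names, the start value (20, 20), and the tolerances are assumptions made for illustration.

```python
import math

# (dispersive, polar) parts in dyn/cm for the two test liquids quoted in the text.
WATER = (22.1, 50.7)
METHYLENE_IODIDE = (44.1, 6.7)


def work_of_adhesion(liquid, solid):
    """Harmonic-mean work of adhesion, right-hand side of Eq. (2)."""
    ld, lp = liquid
    sd, sp = solid
    return 4.0 * (ld * sd / (ld + sd) + lp * sp / (lp + sp))


def contact_angle_deg(liquid, solid):
    """Contact angle implied by Eq. (2); 0 deg when the liquid spreads."""
    cos_theta = work_of_adhesion(liquid, solid) / sum(liquid) - 1.0
    return math.degrees(math.acos(max(-1.0, min(1.0, cos_theta))))


def solid_from_angles(theta_water, theta_mi, start=(20.0, 20.0),
                      tol=1e-10, max_iter=50):
    """Solve Eq. (2) for the solid's (dispersive, polar) parts by Newton-Raphson."""
    liquids = (WATER, METHYLENE_IODIDE)
    targets = [(1.0 + math.cos(math.radians(t))) * sum(liq)
               for t, liq in zip((theta_water, theta_mi), liquids)]
    sd, sp = start
    for _ in range(max_iter):
        f0, f1 = (work_of_adhesion(liq, (sd, sp)) - t
                  for liq, t in zip(liquids, targets))
        # Partial derivatives of Eq. (2) with respect to sd and sp.
        (a, b), (c, d) = [(4.0 * (ld / (ld + sd)) ** 2,
                           4.0 * (lp / (lp + sp)) ** 2) for ld, lp in liquids]
        det = a * d - b * c
        step_d = (-f0 * d + f1 * b) / det
        step_p = (-f1 * a + f0 * c) / det
        sd = max(sd + step_d, 1e-6)  # keep both parts positive
        sp = max(sp + step_p, 1e-6)
        if abs(step_d) + abs(step_p) < tol:
            break
    return sd, sp


if __name__ == "__main__":
    # Contact angles of the kind listed for primed Si (water, CH2I2), in degrees.
    print(solid_from_angles(65.0, 52.5))
```

A round trip (computing both angles for a known solid with `contact_angle_deg` and feeding them back into `solid_from_angles`) is a convenient self-check of the iteration.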


B.  Valuation  of  adhesion 

Figure 4 illustrates how liquid penetrates into the interface
(resist/substrate) and separates the resist. This occurs if the
substrate has a greater affinity (small contact angle) for developer
or water in the environment than for the resist. Adhesion
of the resist is determined by the work of adhesion $W_{a12}$ in an
air environment, which is given by

$$W_{a12} = \sigma_{1} + \sigma_{2} - \sigma_{12}, \qquad (5)$$

where $\sigma_{1}$ is the surface tension of the resist, $\sigma_{2}$ is the surface
tension of the substrate, and $\sigma_{12}$ is the interfacial tension
between resist and substrate, and by the work of adhesion $W_{l12}$ in
the environment of the liquid (developer or water),

$$W_{l12} = \sigma_{13} + \sigma_{23} - \sigma_{12}, \qquad (6)$$

where $\sigma_{13}$ is the interfacial tension between resist and liquid,
and $\sigma_{23}$ is the interfacial tension between substrate and liquid.
The interfacial tension may be calculated according
to Eqs. (2) and (3) as follows:

$$\sigma_{ij} = \sigma_{i} + \sigma_{j} - 4\left[\frac{\sigma_{i}^{d}\,\sigma_{j}^{d}}{\sigma_{i}^{d}+\sigma_{j}^{d}} + \frac{\sigma_{i}^{p}\,\sigma_{j}^{p}}{\sigma_{i}^{p}+\sigma_{j}^{p}}\right]. \qquad (7)$$

Fig. 2. Effect of surface tension using the drop contact angle method:
$\sigma_{gs} = \sigma_{ls} + \sigma_{gl}\cos\Theta$, $\sigma_{gs} = \sigma_{s} - \pi_{e}$; $\sigma_{gs}$: surface tension gas/solid,
$\sigma_{gl}$: surface tension gas/liquid, $\sigma_{ls}$: interfacial tension liquid/solid,
$\sigma_{s}$: surface tension of the solid under vacuum, and $\pi_{e}$: spreading pressure,
i.e., the reduction of surface tension in air or in a vapor at saturated vapor pressure.


Also, the adhesion factor $f_{A}$ is used to evaluate the quality
of adhesion:

$$f_{A} = W_{l12}/W_{a12}. \qquad (8)$$

In the case of $f_{A} \le 0$, no adhesion appears (spontaneous
separation).
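The valuation of adhesion in this section can be followed step by step in code. The sketch below evaluates the interfacial tension with the harmonic-mean expression, then the work of adhesion in air and in a liquid, and finally the adhesion factor. The function names are illustrative; the input values are rounded entries of the kind listed in Tables I and IV (type B resist, type A developer, clean and HMDS-primed Si), used here only as example data.

```python
def harmonic_term(a, b):
    """4 * [d_a*d_b/(d_a+d_b) + p_a*p_b/(p_a+p_b)] for (dispersive, polar) pairs."""
    (ad, ap), (bd, bp) = a, b
    return 4.0 * (ad * bd / (ad + bd) + ap * bp / (ap + bp))


def interfacial_tension(a, b):
    """Interfacial tension between phases a and b (harmonic-mean model)."""
    return sum(a) + sum(b) - harmonic_term(a, b)


def work_of_adhesion_air(resist, substrate):
    """Work of adhesion resist/substrate in air."""
    return sum(resist) + sum(substrate) - interfacial_tension(resist, substrate)


def work_of_adhesion_liquid(resist, substrate, liquid):
    """Work of adhesion resist/substrate immersed in a liquid."""
    return (interfacial_tension(resist, liquid)
            + interfacial_tension(substrate, liquid)
            - interfacial_tension(resist, substrate))


def adhesion_factor(resist, substrate, liquid):
    """Ratio of the work of adhesion in the liquid to that in air."""
    return (work_of_adhesion_liquid(resist, substrate, liquid)
            / work_of_adhesion_air(resist, substrate))


# Example data as (dispersive, polar) in dyn/cm; rounded values, for illustration.
RESIST_B = (30.0, 10.0)
DEVELOPER_A = (40.0, 20.0)
SI_CLEAN = (29.0, 45.0)
SI_HMDS = (24.0, 18.0)

if __name__ == "__main__":
    for name, substrate in (("clean Si", SI_CLEAN), ("HMDS-primed Si", SI_HMDS)):
        w = work_of_adhesion_liquid(RESIST_B, substrate, DEVELOPER_A)
        f = adhesion_factor(RESIST_B, substrate, DEVELOPER_A)
        print(f"{name}: W_l12 = {w:.2f} dyn/cm, f_A = {f:.3f}")
```

With these example inputs the work of adhesion in developer is negative on the clean surface (spontaneous separation) and positive on the primed one, which is the qualitative behavior the section describes.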


C.  The  value  of  wetting 

Another important quantity in photolithographic processes is
the wetting of photochemical materials. Adequate wetting of the
entire resist surface with developer assures a uniform development
velocity. In that case, resist edge roughness
and variations of the structures are avoided and the
development reaction takes place even in
very small structures. A small wetting angle is crucial for a
sufficient penetration depth in small structures. To remove
residues completely by the penetration of developer between
the resist and the substrate, a certain wetting of the substrate
surface with developer must be guaranteed. To avoid pinholes
in the resist layer, good wetting of the substrate
with resist also has to be maintained during the coating process.
The demands on the wetting angle are partially in
conflict with the demands on the contact angle for good
adhesion. Several papers have been published investigating,
by mathematical means, the optimization of the wetting
angle in order to produce polymer layers of high quality. In
these papers, it was shown that wetting and adhesion are
optimal if the polarities of the substrate and the wetting
material are equal (polarity $x_{p} = \sigma^{p}/\sigma^{d}$). In practice the materials
are more or less fixed, but modifications of the surface are a
possibility to improve adhesion and wetting characteristics.
For evaluation purposes, the wetting factor $f_{w}$ due to Wu is
used; it is reciprocal to defect density (unwetted area):

(9)

In the case of spontaneous wetting (contact angle $\Theta = 0$),
$\lambda_{ls} \ge 0$ and $f_{w} \ge 1$ are valid. At contact angles $\Theta > 0$, the defect
density rises because $\lambda_{ls} < 0$ and $f_{w} < 1$. The wetting
coefficient $\lambda_{ls}$ is defined as the difference between the work of
adhesion at the substrate/liquid interface and the work of cohesion
of the liquid:

$$\lambda_{ls} = W_{asl} - W_{cl} = W_{asl} - 2\sigma_{gl}, \qquad (10)$$

where $W_{cl}$ is the work of cohesion of the liquid. The wetting
may also be evaluated from the
contact angle itself. We assume that contact angles <62° are
suited for sufficient wetting.

Fig. 4. Principle of resist separation according to Yanazawa (Ref. 4).

Table I. Overview of kinds of resists and developers with respect to their
surface tension based on experimental results (Tables II and III).

                 Polar type (type A)              Nonpolar type (type B)
                 $\sigma^{p}$ (dyn/cm)  $\sigma^{d}$ (dyn/cm)    $\sigma^{p}$ (dyn/cm)  $\sigma^{d}$ (dyn/cm)
Resist (R)       20             30               10             30
Developer (D)    20             40               10             40
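The wetting coefficient of Eq. (10) can be evaluated directly once the work of adhesion is known. The sketch below combines it with the harmonic-mean work of adhesion of Eq. (2) and also returns the implied contact angle. The function names are illustrative, and the example inputs (a type A developer on clean and on HMDS-primed Si) are rounded values of the kind listed in Tables I and IV, used here only for illustration.

```python
import math


def work_of_adhesion(liquid, solid):
    """Harmonic-mean work of adhesion, Eq. (2), for (dispersive, polar) pairs."""
    ld, lp = liquid
    sd, sp = solid
    return 4.0 * (ld * sd / (ld + sd) + lp * sp / (lp + sp))


def wetting_coefficient(liquid, solid):
    """Eq. (10): work of adhesion minus work of cohesion (2 * liquid tension)."""
    return work_of_adhesion(liquid, solid) - 2.0 * sum(liquid)


def contact_angle_deg(liquid, solid):
    """Contact angle from (1 + cos(theta)) * sigma = W; 0 deg if the liquid spreads."""
    cos_theta = work_of_adhesion(liquid, solid) / sum(liquid) - 1.0
    return math.degrees(math.acos(max(-1.0, min(1.0, cos_theta))))


# Example data as (dispersive, polar) in dyn/cm; rounded, for illustration only.
DEVELOPER_A = (40.0, 20.0)
SI_CLEAN = (29.0, 45.0)
SI_HMDS = (24.0, 18.0)

if __name__ == "__main__":
    for name, solid in (("clean Si", SI_CLEAN), ("HMDS-primed Si", SI_HMDS)):
        lam = wetting_coefficient(DEVELOPER_A, solid)
        theta = contact_angle_deg(DEVELOPER_A, solid)
        print(f"{name}: lambda = {lam:.1f} dyn/cm, contact angle = {theta:.0f} deg")
```

A positive coefficient corresponds to spontaneous spreading (contact angle clipped to zero), while a negative one gives a finite contact angle that can be compared with the 62° limit assumed in the text.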

III.  RESULTS 

A.  Surface  tension  of  photochemical  materials 

The results of the surface tension measurements of
photochemical materials are shown in Tables I-III. I-line resists
from HOECHST, SHIPLEY, and ALLRESIST were investigated.
The determination of the surface tension of the liquids is based
on several combinations of substrates with the investigated
materials, including their modification with adhesion promoter
and other polymer layers, for example, polymethylmethacrylate
(PMMA). The measurement precision of the
surface tension of liquids is about 10%. Based on the resulting data,
the materials we used can be divided into two groups (Table
I). We think the division can be generalized to all resists and
developers. A decrease of the surface tension of the developer
(type B) can be achieved by adding special wetting
agents (Table III, for instance, developer E6). The surface tension
of the resist layers depends on softbake parameters and
on material composition (resins, photoactive compounds, solvents,
and wetting agents). Spin coating was used to achieve
a resist thickness of 1.5 μm, and the softbake parameters
were kept constant for all resists (hotplate: 60 s, 90 °C).

Many resists and developers can be divided in this sense.
The measurements of Fadda et al. also show the obvious division
of materials into two groups as in Table I.

Table III. Surface tension of liquid photoresists and developers.

Liquids                              $\sigma^{p}$ (dyn/cm)   $\sigma^{d}$ (dyn/cm)
Photoresists                         1.5-2          33-34
MF501 (type B)                       8              39
MFCD (type A)                        20             42
AZ351 (1:5) (type A)                 23             43
AZ726MIF (type A)                    20             36
E6 (type B)                          9              40
E6/without wetting agent (type A)    16             42

B.  Surface  tension  of  materials  in  semiconductor 
technology 

Modified and pure substrate materials were used: (1) Si,
(2) SiO2 (TEOS)/phosphorus-doped SiO2 (PSG)/Si3N4 layers,
and (3) AlCu0.5 (0.5% Cu) and WTi10 (10% Ti) layers. An
overview of the measured contact angles and the surface
tensions calculated according to Eq. (2) is given in Tables IV-VI
and Fig. 5. The measurement precision of the surface tension


Table II. Surface tension of photoresist layers (hotplate 90 °C, 60 s).

                           Contact angle        Surface tension
Resist type   Resist       H2O      CH2I2       $\sigma^{p}$ (dyn/cm)   $\sigma^{d}$ (dyn/cm)
A             SPR511 (a)   60°      38°         21             30
              SPR511 (b)   63°      36°         19             30
B             AZ6612 (a)   78°      39°         11             32
              AZ6612 (b)   80°      43°         11             30
              AZ711B (a)   73°      32°         12             33
              AZ711B (b)   77°      31°         13             33
              AZ7510 (a)   81°      43°         11             29
              AZ7510 (b)   80°      38°         10             32
              ARP374 (a)   79°      43°         11             30
              ARP374 (b)   75°      48°         13             29

(a) Exposed.
(b) Nonexposed.
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Table  IV.  Examples  for  priming  with  HMDS/5%  TMDS. 


                  Contact angle        Surface tension
Material          H2O      CH2I2       $\sigma^{p}$ (dyn/cm)   $\sigma^{d}$ (dyn/cm)
Si                2°       34°         45             29
Si HMDS (a)       65°      52.5°       18             24
SiO2              2°       33°         45             29
SiO2 HMDS (a)     65°      59°         22             21
Si3N4             3.2°     33°         45             29
Si3N4 HMDS (a)    56°      39°         23             29

(a) Priming procedure: hotplate 20 s, 120 °C.


calculated from the measured contact angle is estimated to be
about 5%. The measured values for Al (Fig. 5) show good
correlation with those of Kaelble and Dynes. They determined
the surface tension of Al as $\sigma^{p}$ = 38 dyn/cm, $\sigma^{d}$ = 28
dyn/cm. Also, the decay of surface tension with increased
time of exposure to humidity is documented in this article.
Further values of the surface tension of Al were published by
Moreau ($\sigma^{p}$ = 38 dyn/cm, $\sigma^{d}$ = 25 dyn/cm), Zissmann
(critical surface tension $\sigma_{c}$ = 45 dyn/cm), and Eley ($\sigma$ = 93
dyn/cm).

Comparative values were found in publications concerning
Si by Fowkes ($\sigma$ = 78 dyn/cm) and Moreau ($\sigma_{c} \approx 78$
dyn/cm), concerning SiO2 by Moreau ($\sigma^{p}$ = 35 dyn/cm,
$\sigma^{d}$ = 15 dyn/cm), and concerning Si3N4 by Yanazawa
($\sigma$ = 46.5 dyn/cm).

Modification of the surface properties was done with primer
(HMDS/5% TMSDEA) in a gaseous atmosphere at a hotplate
temperature of $T_{p}$ = 120 °C to reduce the surface tension of
highly hydrophilic materials. This was done successfully with Si,
SiO2, and Si3N4, but achieving a sufficient reduction
of the surface tension of metals remained a problem. For this it is
necessary to use a different primer or to change the priming procedure.
The differences between Si, SiO2, and Si3N4 are small; for
this reason, the investigations for the optimization of adhesion
were carried out with Si as a representative material.

The pretreatment of the substrates (cleaning, dehydration,
and exposure to humidity) greatly influences their surface
tension (see Tables V and VI). Contamination with water
(dependent on relative ambient humidity) and organic materials
on high-energy surfaces occurs during the time of exposure
to air, which strongly reduces the surface tension of
those substrates ($\pi_{e} \approx 316$ dyn/cm at SiO2).

Fig. 5. Contact angle obtained with HMDS/5% TMSDEA for different
priming times (hotplate 120 °C).

Ambient oxygen and cleaning processes with an oxidizing
effect lead to thin oxide layers on substrate surfaces such as
Si and Al and thus to high-energy surfaces. We used several
cleaning processes: SC1 cleaning (ammonium hydroxide/
hydrogen peroxide mixture), SC2 cleaning (hydrochloric acid/
hydrogen peroxide mixture), HNO3 cleaning, and O2-plasma
treatment for Si (Table V) and for AlCu (Table VI).

AlCu surfaces were investigated by SIMS/AES after the
cleaning processes were completed. It could be shown that
the changes of contact angle and surface tension are influenced
by the thickness of the oxide layer on the surface
(Table VI).

By means of these spectra, a distinctly lower thickness of
the oxide layer on untreated AlCu (contact angle $\Theta_{w}$ = 29°,
surface tension $\sigma^{p}$ = 31 dyn/cm, $\sigma^{d}$ = 30 dyn/cm) compared with
AlCu treated with the strongly oxidizing medium HNO3 (contact
angle $\Theta_{w}$ = 4°, surface tension $\sigma^{p}$ = 42 dyn/cm, $\sigma^{d}$ = 34 dyn/cm)
could be proved. The thickness of the Al-oxide layer
$d_{ox}$ of untreated AlCu was determined, on the basis of the
escape depth of the Al-KLL Auger electrons (about 2.5
nm) and the existence of elemental Al-KLL signals, to be
about 1 nm < $d_{ox}$ < 2.5 nm. The oxide layer of
AlCu treated with HNO3 is about 5-8 times thicker.


Table V. Contact angle and surface tension of Si for different treatments.

                                         Contact angle      Surface tension
Pretreatment                             H2O     CH2I2      $\sigma^{p}$ (dyn/cm)   $\sigma^{d}$ (dyn/cm)   $\sigma$ (dyn/cm)
No pretreatment, delivery quality        31°     41°        38             26.5           64.5
SC 1 cleaning                            2°      32°        45             30             75
SC 1,2 cleaning                          2°      33°        45             29             74
SC 1,2 cleaning, exposure time = 5 days  16°     38°        44             27             71
SC 1,2 cleaning, O2-plasma treatment     6°      35°        45             28             73
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Table  VI.  Contact  angle  and  surface  tension  of  Al/Cu  for  different  treatments. 


                                                  Contact angle      Surface tension
Pretreatment                                      H2O     CH2I2      $\sigma^{p}$ (dyn/cm)   $\sigma^{d}$ (dyn/cm)   $\sigma$ (dyn/cm)
No pretreatment, exposure time <10 min            29°     31°        37             30             67
No pretreatment, exposure time 1 day              32°     40°        31             30             61
No pretreatment, exposure time 5 days             73°     51°        15             25             40
HNO3 cleaning, DI water                           4°      20°        42             34             76
HNO3 cleaning, DI water, temp. 200 °C, 30 min     8°      24°        43             32             75
HNO3 cleaning, DI water, exposure time = 5 days   28°     30°        37             30             68
HNO3 cleaning, DI water, O2-plasma treatment      2.5°    34°        45             29             74

In accordance with the literature cited, it can be generalized
that all high-energy surfaces (glass, quartz, metals, etc.)
show, in practice, almost identical surface tension after a short
time of exposure to ambient humidity (i.e., some hours).

The  polar  part  of  those  high-energy  surfaces  is  about 
dyn/cm  and  the  nonpolar  component  is  about  c/«^30  dyn/ 
cm.  But  after  increased  time  of  exposure  to  ambient  humid¬ 
ity,  the  surface  tension  is  more  and  more  reduced  depending 
on  cleaning  processes  used  and  ambient  conditions. 

The  aim  of  special  surface  modification  is  to  provide 
stable  surface  conditions  for  reproducible  and  nearly  defect 
free  process  steps.  To  determine  the  surface  tension  the  con¬ 
tact  angle  of  water  is  well  suited  as  shown  in  the  following 
section. 

C.  Adhesion  of  resist  in  semiconductor  technology 

In Figs. 6 and 10, the measured values of the contact
angle of water on modified surfaces of AlCu and Si are
plotted. The work of adhesion $W_{a12}$ clearly decreases with
increasing process time of the HMDS coating. The real
effect of improved adhesion, which depends on the HMDS
process time, can be estimated from the behavior of the
resist layer during developing and rinsing with water [Figs.
6(b) and 6(c)].

The work of adhesion of the resist/developer combination
RB/DA at $\Theta_{w}$ < 47° is negative and spontaneous separation
occurs. In that experiment, a lifting of large resist areas was
observed. For RB/DA combinations, the surface must therefore
be conditioned by primer treatment to achieve water
contact angles of $\Theta_{w}$ > 47°.

Even slight HMDS treatment is sufficient for uncritical
water rinsing after development [Fig. 6(b)]. This is caused
by the large difference in polarity between water ($x_{p}$ = 2.3)
and resist ($x_{p}$ = 0.25, ..., 0.5).


For experimental verification of the adhesion of small resist
structures, the defect density of a test structure consisting of 1
μm × 1 μm and 0.8 μm × 0.8 μm dies was monitored (Figs. 7
and 8). These structures were exposed with a GCA
8000 i-line stepper (NA = 0.35, coherence parameter S = 0.5).
Evaluation was done in the middle of the image area at an
exposure dose of 1.8 $H_{0}$ ($H_{0}$ is the exposure necessary
for the total development of large areas of
resist during the developing process). In our case, dip
developing was used. The defect density was evaluated
by the light/dark field optical technique. Figure 9 shows
examples of defect densities using SPR511 and AZ6612 resists.
It can be generally concluded that for defect-free structures in
the submicrometer range a work of adhesion of $W_{l12}$ > 5
dyn/cm is necessary (for developer and water treatment).

Figure 6 shows that adhesion is most critical
in the development process. For the resist/developer
combinations RB/DA, RA/DA, and RB/DB, a surface
modification that adjusts the contact angle of water to
$\Theta_{w} \ge 65°$ assures good adhesion of the resist
structures even during the development step.

For the combination RA/DB, no defects were observed
even at smaller contact angles but, in this case, the
contact angle of water should not exceed 85° according to
Fig. 6(c).

Generally, when resist with high polarity
($\sigma^{p}$ > 15 dyn/cm) is combined with developer of low polarity ($\sigma^{p}$ < 15
dyn/cm), only a short primer process should be applied,
whereas the use of resist with low polarity ($\sigma^{p}$ < 15 dyn/cm)
requires a longer primer process to adjust a contact angle
of water of more than 65°.
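The rule of thumb above amounts to a small classification by the polar part of the surface tension. The sketch below encodes it; the function names and the returned wording are illustrative, and assigning a value of exactly 15 dyn/cm to the low-polarity group is an assumption, since the text only gives strict inequalities. The developer values in the usage example are the polar parts listed in Table III.

```python
POLAR_THRESHOLD = 15.0  # dyn/cm, threshold quoted in the text


def polarity_type(sigma_p):
    """Return 'A' (polar) or 'B' (nonpolar) from the polar part of the surface tension.

    A value exactly at the threshold is treated as type B (assumption).
    """
    return "A" if sigma_p > POLAR_THRESHOLD else "B"


def primer_recommendation(resist_sigma_p, developer_sigma_p):
    """Map a resist/developer pair to the primer guidance given in the text."""
    resist = polarity_type(resist_sigma_p)
    developer = polarity_type(developer_sigma_p)
    if resist == "B":
        return "longer primer process (water contact angle above 65 deg)"
    if developer == "B":
        return "short primer process"
    return "not covered by this rule"


if __name__ == "__main__":
    developers = {"MF501": 8, "MFCD": 20, "AZ351 (1:5)": 23, "E6": 9}
    for name, sigma_p in developers.items():
        print(f"{name}: type {polarity_type(sigma_p)}")
```

The pair of polar resist and polar developer is left unclassified here because the sentence above does not address it.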

These results are in accordance with the publication of
Deckert and Peters, who determined that the best adhesion
is obtained if substrate and resist have low values of $\sigma^{p}$.
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Fig.  6.  Work  of  adhesion  for  resist  as  function  of  contact  angle  of  modified 
Si  and  AlCu  substrates  (a)  under  air,  (b)  under  water,  and  (c)  under  devel¬ 
oper  environment. 


Fig. 8. Examples for measurement of defects of photoresist dies (0.8 μm
× 0.8 μm) on AlCu with dark-field image: (a) photoresist/developer:
SPR511/MFCD, substrate defect density = 0%; (b) photoresist/developer:
SPR511/MFCD, substrate defect density = 30%.


In contrast to Deckert and Peters, who recommend combining
substrates with high $\sigma^{p}$ only with resist of low
$\sigma^{p}$, we found that, when a developer with low $\sigma^{p}$ is applied,
substrates and resists with high $\sigma^{p}$ can be combined successfully
(combination RA/DB in Figs. 6 and 10).


Fig. 7. Test structure for investigation of adhesion (SPR511 on AlCu,
$\Theta_{w}$ = 25°).
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Rg.  9.  Density  of  die  defects  depending  on  work  of  adhesion  of  resist  on 
AlCu  and  on  Si02  in  developer  solution  (surface  primer:  HMDS/5%  TMS- 
DEA). 
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Fig.  10.  Quality  function  [Eq.  (11)]  vs  contact  angle  of  water. 


To apply resist with low polarity on metal surfaces, it is
necessary to use other primers (e.g., concentrated TMSDEA)
for coating in a gaseous atmosphere or to coat those surfaces
by spinning primer solutions in order to adjust a contact angle of
water of $\Theta_{w}$ > 50°.

D.  Wetting  characteristics  of  resist  and  development 

In Table VII, the measured contact angles and wetting
factors for developer on Si, AlCu, and resist layers as well as
for liquid resist on Si and AlCu are shown. To guarantee
good wetting for any liquid, the contact angle of that liquid (for
example, of developer and liquid resist) should be $\Theta$ < 62°.
That corresponds to a wetting factor of $f_{w}$ > 0.30.

Thus, wetting problems are only to be expected
when using developer of type A (high surface tension
$\sigma$ > 60 dyn/cm) with substrates of very low surface tension
($\sigma$ < 35 dyn/cm), e.g., after a long primer process (Table
VII).

Table VIII. Limits for contact angle of water on materials of semiconductor
technology modified with primer.

                                       Contact angle (water)
Resist/developer   (Example)           $\Theta_{min}$    $\Theta_{max}$    $\Theta_{optimal}$
RA/DB              (SPR511/MF501)      10°      75°      45°
RA/DA              (SPR511/MFCD)       60°      85°      70°
RB/DB              (ARP374/E6)         60°      85°      70°
RB/DA              (AZ6612/AZ351)      65°      85°      80°


E.  Optimizing  adhesion  and  wetting 

For the evaluation of all process components, the following
quality function was defined:

$$Q = f_{AD}\, f_{AW}\, f_{wD1}\, f_{wD2}\, f_{wR}, \qquad (11)$$

where $f_{AD}$ is the adhesion factor of the resist/substrate interface
in developer, $f_{AW}$ the adhesion factor of the resist/substrate
interface in water, $f_{wD1}$ the wetting factor of developer on
resist, $f_{wD2}$ the wetting factor of developer on substrate, and
$f_{wR}$ the wetting factor of resist on substrate.

The results of this evaluation are plotted in Fig. 10 (only
positive values are depicted). On the basis of these results,
the following limits of the contact angle of water on the
substrates can be fixed and used for control purposes (Table
VIII).
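For control purposes, the limits of Table VIII can be held in a small lookup and compared with a measured water contact angle. The sketch below is one possible way to do this; the names `Window`, `PROCESS_WINDOWS`, and `check_contact_angle` are illustrative, and treating the limits as inclusive is an assumption.

```python
from typing import NamedTuple


class Window(NamedTuple):
    """Water contact angle limits in degrees for one resist/developer combination."""
    theta_min: float
    theta_max: float
    theta_opt: float


# Limits as listed in Table VIII.
PROCESS_WINDOWS = {
    "RA/DB": Window(10.0, 75.0, 45.0),
    "RA/DA": Window(60.0, 85.0, 70.0),
    "RB/DB": Window(60.0, 85.0, 70.0),
    "RB/DA": Window(65.0, 85.0, 80.0),
}


def check_contact_angle(combination, theta_w):
    """Return (inside_window, deviation_from_optimum) for a measured angle.

    The limits are treated as inclusive (assumption).
    """
    window = PROCESS_WINDOWS[combination]
    inside = window.theta_min <= theta_w <= window.theta_max
    return inside, theta_w - window.theta_opt


if __name__ == "__main__":
    for combo, angle in (("RB/DA", 70.0), ("RA/DA", 45.0)):
        ok, delta = check_contact_angle(combo, angle)
        print(f"{combo} at {angle:.0f} deg: inside={ok}, offset={delta:+.0f} deg")
```

A measured angle outside the window would then indicate that the primer process needs to be adjusted before coating.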


Table VII. Wetting parameters of developers (type A, B) and liquid photoresist on Si, AlCu and resist layers.

                 H2O      Developer type A            Developer type B            Photoresist (liquid)
Substrate        $\Theta_{w}$      $\Theta$      $\lambda_{32}$     $f_{wD}$        $\Theta$      $\lambda_{32}$     $f_{wD}$        $\Theta$      $\lambda_{R2}$     $f_{wR}$
Si (HMDS)        2°       0°     2.6      1.07        3.6°   -0.1     1.0         19°    -2       0.95
                 45°      26°    -6.1     0.82        23°    -4.1     0.87        24°    -3       0.89
                 55°      41°    -15      0.59        37°    -10      0.68        37°    -7       0.76
                 60°      46°    -18      0.51        41°    -12      0.62        40°    -8       0.72
                 65°      51°    -22      0.43        44°    -14      0.56        42°    -8.5     0.69
                 70°      54°    -25      0.38        48°    -16      0.50        43°    -9.5     0.65
                 73°      57°    -27      0.33        50°    -18      0.47        44°    -10      0.62
                 84.5°    64°    -43      0.24        55°    -21      0.38        46°    -11      0.58
AlCu (HMDS)      3°       0°     8.1      1.25        0°     6        1.18        0°     3.3      1.09
                 7.5°     0°     5.5      1.16        0°     3.5      1.10        0°     1.3      1.04
                 12°      0°     5.3      1.16        0°     3.4      1.10        0°     1.1      1.03
                 25°      0°     1.5      1.04        0°     0.4      1.01        13°    -0.9     0.97
                 33°      14°    -1.9     0.94        15°    -1.7     0.95        20°    -2.1     0.94
                 41°      22°    -4.2     0.87        19°    -2.6     0.92        20°    -2.1     0.93
                 44°      23°    -4.8     0.86        19°    -2.8     0.91        20°    -2.1     0.93
Resist type A    61°      36°    -11.4    0.66        25°    -4.7     0.83
Resist type B    81°      54°    -24.7    0.38        40°    -11.4    0.60
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IV.  SUMMARY 

In accordance with experimental results (Figs. 8 and 9), it
is possible to adjust optimal conditions for adhesion of resist
on different materials by measuring the contact
angle of water if the surface tensions of the applied resist and
developer are known. It could be shown that a combination of
resist with high $\sigma^{p}$ and developer with low $\sigma^{p}$ is a good
prerequisite for adhesion and wetting, whereas for resist
with high $\sigma^{p}$ combined with developer with high $\sigma^{p}$, good
adhesion can only be guaranteed if the surface tension of the
substrate is strongly reduced. We think that similar results
can be achieved in etching processes combining resists with
high $\sigma^{p}$ with etchants with high $\sigma^{p}$. According
to Fig. 9, for resist layers in liquids (developer, water,
etchants), the work of adhesion should be >5 dyn/cm.
Polysilicon gate etching in high density plasmas. III. X-ray photoelectron
spectroscopy investigation of sidewall passivation of silicon
trenches using an oxide hard mask

F. H. Bell and O. Joubert a)

France  Telecom,  CNET,  38243  Meylan  Cedex,  France  and  LPCM  UMR  110  IMN, 

44072  Nantes  Cedex  03,  France 

(Received  15  January  1996;  accepted  3  June  1996) 

The characteristics of poly-Si trench etching in high density plasma processes were studied by x-ray
photoelectron spectroscopy. Poly-Si films on SiO2-covered Si(100) substrates were masked with a
200-nm-thick oxide hard mask. The 200 mm wafers were then etched downstream using a helicon
high density plasma source and a chlorine-based gas chemistry. After etching, samples were
transferred under ultrahigh vacuum to a surface analysis chamber equipped with an x-ray
photoelectron spectrometer. Regular arrays of trenches were used to determine the photoelectron
signals originating from the tops, sidewalls, and bottoms of the features. A thin oxide film was found
on  the  sides  of  the  oxide  masked  poly-Si  trenches.  The  origin  of  this  film  can  be  related  to  the 
sputtering  and  redeposition  of  oxide  from  the  quartz  tube  of  the  helicon  source  located  in  the  plasma 
generation  region.  A  substantial  amount  of  chlorine  was  present  on  the  poly-Si  sidewall  of  the 
features,  whereas  less  chlorine  was  found  on  the  oxide  surfaces.  The  poly-Si  sidewalls  were  covered 
by  a  small  amount  of  oxygen.  ©  1996  American  Vacuum  Society. 


I. INTRODUCTION

High  density  plasma  sources  are  increasingly  being  inves¬ 
tigated  to  achieve  high  etch  rates  and  selectivities  and  to 
improve  control  of  etching  profiles.  In  addition,  ion  energy 
control  via  wafer  biasing  is  important  to  minimize  substrate 
and gate electrode damage. Low pressure high density plasma
sources, such as the helicon source, can meet these
requirements.^"^ 

In  a  recent  publication,  we  demonstrated  how  to  achieve 
anisotropic  and  selective  etching  of  photoresist  masked 
poly-Si over SiO2 using a Cl2/He gas chemistry. We developed
a two-step etching process resulting in anisotropic etch¬
ing  profiles  without  any  undesirable  etching  anomalies.^  The 
chemical composition of the poly-Si trenches was also determined
by x-ray photoelectron spectroscopy (XPS) after etching and
sample transfer under high vacuum."^ Regular arrays
of lines and spaces were analyzed to determine the
chemical  composition  on  the  tops,  sidewalls,  and  bottoms  of 
the features. A thin oxide film was found on the sidewalls of
the photoresist and poly-Si; it was caused by the sputtering
of the quartz tube located in the plasma generation region of
the helicon source. Substantial amounts of chlorine and carbon
were formed on the sidewalls of the features during etching.
Smaller amounts of carbon were found on the bottoms
of  the  trenches.  We  concluded  that  the  carbon  rich  film  and 
the  thin  oxide  film  on  the  sidewalls  might  play  an  important 
role  in  passivating  the  sidewalls  of  the  features. 

In  the  present  study,  we  investigated  the  etching  charac¬ 
teristics  when  replacing  the  photoresist  mask  by  an  oxide 
hard  mask.  Furthermore,  we  characterized  the  chemical  to¬ 
pography  of  the  features  by  XPS  and  compared  the  results  to 
those  obtained  using  the  photoresist  masked  wafers. 


a)Electronic mail: olivier.joubert@cns.cnet.fr


II.  EXPERIMENTAL  PROCEDURE 

The  experimental  system  has  been  described  in  detail  in 
recent  publications.^’"^  The  ultrahigh  vacuum  (UHV)  cluster 
tool  consists  of  four  chambers.  The  load-lock  chamber  is 
located in a Class 10 clean room and accepts
up to twenty-five 200 mm wafers. A wafer can then be
transferred  via  a  transfer  chamber  into  the  etching  reactor  or 
into  the  analysis  chamber.  Wafer  transfer  between  the  cham¬ 
bers  of  the  cluster  tool  is  computer  controlled  by  a  robot  arm 
located  in  the  transfer  chamber.  The  analysis  chamber  is 
equipped with an x-ray photoelectron spectrometer (Fisons
Surface Systems ESCALAB 220i), a single electrostatic lens
flood gun (VG model LEG 41), and a mass spectrometer
(Micromass 386). A nonmonochromatized x-ray source consisting
of a water cooled standard twin anode (Al Kα, 1487
eV/Mg Kα, 1253.6 eV) and a monochromatized Al Kα source are
provided for XPS analysis; further details about the analysis
system  can  be  found  elsewhere."^  The  helicon  reactor  made 
by  Lucas  Labs^’^  consists  of  a  plasma  generation  region  in 
which  a  high  density  plasma  helicon  source  is  operated  at 
13.56  MHz,^“^^  and  a  plasma  diffusion  region.  Two  solenoid 
magnets  are  necessary  to  support  the  plasma  in  the  plasma 
generation  region.  Two  other  solenoid  magnets  around  the 
diffusion  region  maintain  a  high  plasma  density  on  the 
sample  and  improve  the  plasma  uniformity.  The  helicon 
plasma  source  is  operated  at  2500  W  forward  power  and  50 
W  reflected  power  at  a  frequency  of  13.56  MHz.  The  sample 
can  be  rf  biased  using  a  600  W  power  supply  at  13.56  MHz. 
Helium backside cooling at a pressure of 10 Torr
is provided to maintain a constant wafer platen
temperature  of  25  °C  during  etching. 

The  oxide  and  silicon  etch  rates  were  measured  using  a 
home-made,  real  time  in  situ  HeNe  laser  ellipsometer.^^  The 
ellipsometer  is  an  automated  rotating  ellipsometer  working 
in the polarizer-compensator-sample-analyzer (PCSA) configuration.
The changes in the ellipsometric parameters Ψ
and Δ (corresponding to the changes in amplitude ratio and phase
shift of the reflected elliptically polarized light, respectively)
are related to the etching of poly-Si or oxide. By determining the
ellipticity of the reflected light, information can be obtained on
Ψ and Δ, and hence on the optical constants, of the analyzed
material and its overlayers. After data acquisition, a three- or two-layer
simulation program can be used to calculate the poly-Si and
oxide etch rates (assuming that the refractive indices of
poly-Si and oxide are 3.866 + i × (−0.028) and 1.465, respectively).

The  etching  conditions  used  in  this  study  were  identical  to 
those  used  for  the  poly-Si  gate  patterning  described  in  a  re¬ 
cent  publication.^  An  etching  recipe  was  used  to  simulta¬ 
neously  switch  off  the  gas  flows,  source  power,  and  bias 
power  after  etching,  resulting  in  a  more  realistic  “finger¬ 
print”  of  the  plasma  treated  surface."^  Following  etching,  the 
wafer  was  transferred  into  the  UHV  analysis  chamber  within 
30  s.  Although  the  XPS  data  acquisition  required  up  to  sev¬ 
eral  days,  analysis  of  the  same  area  at  the  beginning  and  end 
of  acquisition  showed  that  the  surface  kept  under  ultrahigh 
vacuum  remained  stable. 

The monochromatized Al Kα x-ray source was used for
all experiments performed in this study. The source is provided
with a source defined small spot toroidal monochromator.
We used the 1000 µm spatial resolution mode, resulting
in an x-ray spot diameter of less than 1 mm.

The  200  mm  wafers  used  in  this  work  were  poly-Si  gate 
structures  patterned  with  an  oxide  hard  mask.  Silicon  (100) 
was  coated  with  a  6-nm-thick  oxide  film.  Poly-Si  with  a 
thickness  of  300  nm  was  then  deposited  on  the  oxide.  The 
wafers  were  patterned  with  a  200-nm-thick  oxide  mask.  At 
the center of the wafer, an 18 mm² unmasked area allows the
in situ measurement of Ψ and Δ with the HeNe laser ellipsometer.
The smallest feature dimensions were 0.25 µm isolated
and nested lines.

The  mask  design  used  in  this  study  allows  characteriza¬ 
tion  of  the  chemical  topography  of  the  tops,  sidewalls,  and 
bottoms  of  the  features  by  XPS.  Photoelectron  signals  from 
different  portions  of  the  features  are  separated  using  geo¬ 
metrical  shadowing  of  photoelectrons  and  differential  charg¬ 
ing  of  the  insulating  and  conducting  surfaces.  The  patterned 
areas  analyzed  in  this  study  consist  of  regular  arrays  of 
trenches, a blanket substrate, and an unpatterned mask material,
with the size of the areas being at least 1.5 mm². This
geometry  allows  the  x-ray  beam  (1  mm  in  diameter)  to  be 
focused  on  one  individual  array,  the  resulting  signal  being 
the  average  of  many  identical  lines  and  spaces.  Variations  in 
the  line  and  space  widths  enable  analysis  of  the  desired  por¬ 
tion  of  the  features.  The  electron  energy  analyzer  detects 
photoelectrons  from  a  ±6°  cone,  as  was  experimentally 
determined."^ Before analyzing the features, one wafer was
etched and the dimensions of the patterned features were
determined by scanning electron microscopy. This metrology
step provides an accurate measurement of the height of
the mask and the width of the polysilicon lines after the
etching process, so that the array that allows analysis of
the desired portion of the feature can be selected.
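
The geometrical shadowing argument above can be illustrated with a short sketch. It is not taken from the original analysis: it assumes a single take-off angle of 45° measured from the wafer normal (the ±6° acceptance cone is ignored), perfectly vertical sidewalls, and nominal feature heights of 200 nm (mask) and 300 nm (poly-Si). The function names are hypothetical.

```python
import math


def visible_sidewall_depth(space_nm, takeoff_deg=45.0):
    """Depth below the feature top (nm) that keeps a clear line of sight.

    takeoff_deg is measured from the wafer normal, in the plane
    perpendicular to the lines (assumed geometry).
    """
    return space_nm / math.tan(math.radians(takeoff_deg))


def analyzed_regions(space_nm, mask_nm, poly_nm, takeoff_deg=45.0):
    """Split the visible sidewall into mask and poly-Si contributions."""
    depth = visible_sidewall_depth(space_nm, takeoff_deg)
    total = mask_nm + poly_nm
    seen = min(depth, total)
    return {
        "mask_sidewall_nm": min(seen, mask_nm),
        "poly_sidewall_nm": max(0.0, seen - mask_nm),
        # The trench bottom is only reached if the line of sight clears
        # the full feature height.
        "bottom_visible": depth > total,
    }


# Nominal 0.35 um spaces with a 200 nm mask on 300 nm poly-Si (assumed values).
regions = analyzed_regions(350.0, 200.0, 300.0)
print(regions)
```

Under these assumptions, a 0.35 µm space exposes the full mask sidewall and roughly the upper half of the poly-Si sidewall while the trench bottom stays shadowed, which is why the space width determines the portion of the feature that is analyzed.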

The  quantitative  surface  analysis  by  XPS  is  dependent  on 
several  aspects  such  as  shadowing  of  the  photoelectrons,  the 
relative  response  of  the  analyzer  across  the  acceptance  cone, 
and  the  angular  response  of  the  electron  analyzer.^’ How¬ 
ever,  in  our  configuration,  these  effects  can  be  considered  to 
be  negligible;  further  details  on  mask  design  and  analysis 
procedures  can  be  found  elsewhere."^ 

After  etching,  the  wafer  is  transferred  into  the  analysis 
chamber.  The  wafer  is  placed  in  the  focal  plane  of  the  elec¬ 
tron  energy  analyzer  and  x-ray  sources  using  a  stage  elevator. 
The  ability  to  rotate  the  wafer  around  the  vertical  axis  and  to 
move  it  in  all  horizontal  directions  allows  the  same  array  of 
lines  and  spaces  to  be  analyzed.  Lines  and  spaces  can  be 
arranged  parallel  or  perpendicular  to  the  axis  of  the  electron 
analyzer.  Consequently,  the  tops  of  the  features  and  bottoms 
of  the  spaces  (in  the  parallel  mode)  or  tops  and  desired  parts 
of  the  sidewalls — depending  on  the  space  between  the 
lines — of  the  features  (in  the  perpendicular  mode)  can  be 
analyzed.  The  angle  between  the  monochromatized  x-ray 
source  and  the  electron  analyzer  is  57°,  and  the  angle  be¬ 
tween  the  x-ray  source  and  vertical  axis  of  the  sample  holder 
is  12°.  A  charge  coupled  device  (CCD)  video  camera  in¬ 
stalled  on  the  “hat”  of  the  chamber  is  aligned  with  the  focal 
point of the x-ray source and electron energy analyzer and allows
selection  of  the  desired  area  of  the  sample  to  be  analyzed. 

III.  EXPERIMENTAL  RESULTS  AND  DISCUSSION 

A.  Selective  and  anisotropic  polysilicon 
trench  etching  using  an  oxide  hard  mask 

We  now  study  the  poly-Si  etching  profiles  obtained  using 
an  oxide  hard  mask.  The  etching  conditions  were  those  es¬ 
tablished  in  a  recent  publication:^  a  two-step  etching  process 
is  performed  to  etch  the  poly-Si  gate  anisotropically  and 
highly  selectively  between  the  poly-Si  and  the  gate  oxide. 
First,  a  high  rf  bias  power  of  240  W  (etch  rate:  250  nm/min) 
is  applied  to  obtain  an  anisotropic  etching  profile;  second,  40 
nm  before  reaching  the  gate  oxide  film,  the  rf  bias  power  is 
switched  to  20  W  (etch  rate:  60  nm/min).  The  low  bias 
power  applied  during  the  second  etching  step  increases  the 
poly-Si  to  oxide  selectivity  and  suppresses  the  trenching  ef¬ 
fect.  Identical  process  conditions  to  the  second  etching  step 
are applied during the 50% overetch time; further details can
be found elsewhere.^ Figure 1 shows scanning electron microscopy
(SEM)  micrographs  of  oxide  masked,  poly-Si  nested,  and 
isolated  gates  etched  using  the  two-step  etching  process.  The 
micrographs  show  that  the  features  are  etched  anisotropically 
without  any  trenching,  bowing,  or  other  undesirable  etching 
anomalies.  As  with  the  photoresist  mask,  trenching  can  be 
observed  when  etching  the  sample  using  high  bias  power 
conditions.  The  trenching  phenomenon  is  suppressed  during 
the  second  low  bias  power  etching  step  (not  shown  here). 
The  etch  rates  for  poly-Si  measured  by  in  situ  real  time  el- 
lipsometry  are  found  to  be  similar  for  both  types  of  masks 
(240  nm/min  during  the  first-step  process  and  60  nm/min 
during the second-step process). Indeed, the difference in
terms of etching behavior between the photoresist and oxide
masked samples is the gate oxide etch rate measured by in
situ real time ellipsometry. The selectivity in an open area
increases by a factor of >3 when replacing the photoresist
mask by an oxide hard mask. These studies will be presented
in detail in another article.

Fig. 1. Secondary electron micrograph of a poly-Si/SiO2 0.25 µm isolated
and nested line and space structure masked with an oxide hard mask.
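
As a rough illustration of the two-step recipe described in this subsection, the step durations can be estimated from the quoted thickness and etch rates. This is a sketch only: it uses the 250 nm/min and 60 nm/min rates and the 40 nm switch point from the text, and it assumes that the 50% overetch is referenced to the total main etch time, which the text does not state explicitly. The function name is hypothetical.

```python
def two_step_etch_times(poly_nm=300.0, switch_before_oxide_nm=40.0,
                        rate_step1_nm_min=250.0, rate_step2_nm_min=60.0,
                        overetch_fraction=0.5):
    """Return the duration of each phase of the two-step etch in seconds."""
    # Step 1: high bias power, stops 40 nm above the gate oxide.
    step1_s = 60.0 * (poly_nm - switch_before_oxide_nm) / rate_step1_nm_min
    # Step 2: low bias power for the remaining poly-Si.
    step2_s = 60.0 * switch_before_oxide_nm / rate_step2_nm_min
    # Overetch: assumed to be a fraction of the total main etch time.
    overetch_s = overetch_fraction * (step1_s + step2_s)
    return {"step1_s": step1_s, "step2_s": step2_s, "overetch_s": overetch_s}


times = two_step_etch_times()
for phase, seconds in times.items():
    print(f"{phase}: {seconds:.1f} s")
```

With these inputs, the high-bias step lasts about one minute, the low-bias step about 40 s, and the overetch about 50 s, so the gate oxide is exposed only to the low-bias conditions.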


Fig. 2. Si 2p XPS spectra recorded on blanket oxide mask area (a), and
blanket substrate (b). Spectra recorded with the 0.35 µm lines and spaces
aligned parallel to the electron energy analyzer, and the charge neutralizer
turned off (c) and turned on (d). The spectra in (e) and (f) are recorded with
the 0.35 µm lines and spaces aligned perpendicular to the electron energy
analyzer. Again, spectra were taken with the charge neutralizer turned off (e)
and turned on (f).


B.  Chemical  topography  of  the  etched  features 
determined  using  x-ray  photoelectron  spectroscopy 

In  a  previous  article,  we  investigated  the  chemical  topog¬ 
raphy  of  the  etched  features  when  using  a  photoresist  mask."^ 
In  this  article,  similar  experiments  were  conducted  using  an 
oxide  hard  mask.  The  XPS  spectra  shown  were  recorded  on 
wafers  etched  using  the  process  conditions  described  in  Sec. 
III.  The  mask  material  was  a  200-nm-thick  oxide  hard  mask. 
The  thickness  of  the  poly-Si  was  300  nm  and  the  gate  oxide 
thickness  6  nm.  The  elements  present  on  the  etched  surfaces 
were  silicon,  chlorine,  and  oxygen.  No  carbon  was  found  on 
any  of  the  analyzed  surfaces.  A  charge  neutralizer,  providing 
low  energy  electrons,  was  used  to  charge  surfaces  in  order  to 
differentiate  the  XPS  peaks  resulting  from  insulating  oxide 
mask surfaces and conducting silicon surfaces. In Sec.
III B 1, we present a complete set of the Si 2p, Cl 2s, and O
1s spectra. In Sec. III B 2, we give a quantification of all
elements  present  on  the  surfaces,  followed  by  a  comparison 
of  the  surface  chemistry  on  photoresist  and  oxide  hard  mask 
features. 


1. Determination of the chemical constituents on
oxide masked poly-Si structures using XPS

Figure 2 shows a complete set of Si 2p spectra recorded
on a blanket oxide mask [Fig. 2(a)], an unpatterned gate
oxide [Fig. 2(b)], 0.35 µm line and space (L/S) features with
the poly-Si lines aligned parallel to the electron energy analyzer
[Figs. 2(c) and 2(d)], and on 0.35 µm L/S features with
the lines perpendicular to the electron energy analyzer [Figs.
2(e) and 2(f)]. When the polysilicon lines are aligned parallel
with  respect  to  the  electron  energy  analyzer,  only  photoelec¬ 
trons  from  the  tops  and  bottoms  of  the  features  contribute  to 
the  XPS  signal.  When  the  polysilicon  lines  are  perpendicular 
to  the  electron  energy  analyzer,  the  analyzed  areas  are  the 
tops  and  sidewalls  of  the  features. 

The  broad  peak  observed  in  Fig.  2(a)  can  be  attributed  to 
the SiO2 originating from the oxide hard mask. This peak is
shifted from 105 to 92 eV due to the charging of the surface
when the charge neutralizer is turned on. Not shown here is
the spectrum with the charge neutralizer turned off. For the
latter case, the oxide peak is located at 105 eV and no Si 2p
peak at a binding energy of 99.6 eV is detected on the oxide.
Figure 2(b) shows the Si 2p peak recorded on an unpatterned
gate oxide area. The peak at 104 eV can be attributed to the
SiO2 from the gate oxide, and the Si 2p peak at 99.6 eV
originates from the underlying silicon substrate. Figure 2(c)
shows the Si 2p peak recorded in 0.35 µm L/S parallel to the
electron energy analyzer with the charge neutralizer off.
Since no silicon is present on top of the oxide mask [see Fig.
2(a)], the peak at 99.6 eV can unambiguously be attributed to
the Si 2p peak originating from the underlying silicon substrate.
The spectrum shows two other peaks located at 104
and 105.2 eV. The SiO2 peak at 104 eV results from the gate
oxide film and the peak at 105.2 eV from the oxide hard
mask (slightly shifted to a higher binding energy due to the
charging of the insulating oxide mask). The spectrum is also
recorded with the charge neutralizer on [Fig. 2(d)]. The SiO2
peak resulting from the oxide mask is shifted to smaller binding
energies (between 86 and 96 eV), whereas the Si 2p and SiO2
peaks from the substrate and the gate oxide remain unshifted. The
integrated peak area of the shifted SiO2 peak from the oxide
mask is identical to the integrated area of the SiO2 peak
recorded with the flood gun off. Figure 2(e) shows the Si 2p
spectrum recorded in 0.35 µm L/S features perpendicular to
the electron energy analyzer with the charge neutralizer
turned off. Due to shadowing, the analyzed areas are the tops
of the oxide mask and sidewalls of the features. The peak at
99.6 eV can be attributed to the Si 2p peak originating from
the poly-Si sidewall. The spectrum also shows a SiO2 peak
(between 101 and 106 eV) coming from the sidewalls of the
features and tops of the oxide mask. The spectrum is also
recorded with the charge neutralizer on [Fig. 2(f)]. The SiO2
peak is now separated into two components, whereas the Si
2p peak at 99.6 eV remains unshifted. One SiO2 component
is located at 103 eV, whereas the second SiO2 component is
shifted to smaller binding energies (between 87 and 97 eV).
The sum of the integrated peak areas of the shifted and unshifted
SiO2 peaks is almost identical (within 5%) to the integrated
area of the SiO2 peak recorded with the flood gun
off. The shifted SiO2 peak can be attributed to the SiOx film
present on the sidewalls and tops of the insulating oxide
mask, whereas the unshifted SiOx peak results from the conducting
poly-Si sidewall. Due to limitations in mask dimensions,
no spectra could be recorded that only involved analysis
of the tops and sidewalls of the oxide mask.

The Cl 2s spectra recorded on the same sample are displayed
in Fig. 3. Only spectra obtained with the charge neutralizer
turned on are shown. Figure 3(a) shows the Cl 2s
peak recorded on the blanket oxide mask. The peak is shifted
from 270 eV to a lower binding energy at 261 eV due to the
charging of the insulating oxide mask. Figure 3(b) shows the
Cl 2s spectrum recorded on an unpatterned gate oxide area.
The Cl 2s peak is located at 270 eV, indicating that the thin
gate oxide on top of the silicon substrate behaves like a conducting
surface. Cl 2s spectra recorded in 0.35 µm L/S features
with the polysilicon lines parallel to the electron energy
analyzer are shown in Fig. 3(c). The Cl 2s peak splits into
two peaks at binding energies of 270 and 261 eV. The unshifted
peak at 270 eV can be attributed to chlorine present
on the bottom of the conducting gate oxide, whereas the
second peak can be attributed to chlorine present on top of
the surface of the insulating oxide mask. Figure 3(d) shows
the Cl 2s peak recorded with the lines and spaces aligned
perpendicularly to the electron energy analyzer (the analyzed
areas are the tops and sidewalls of the feature). The unshifted
Cl 2s peak at 270 eV can be attributed to chlorine present on
the conducting polysilicon sidewall. The Cl 2s peak at 260
eV can be attributed to chlorine present on the tops and sidewalls
of the oxide hard mask. Note that no carbon (at the
binding energy of 284 eV) could be observed on any of the
analyzed areas.

Fig. 3. Cl 2s XPS spectra recorded on blanket oxide mask area (a), blanket
substrate (b), 0.35 µm lines and spaces aligned parallel (c) and perpendicular
(d) to the electron energy analyzer. The charge neutralizer was turned on
for the acquisition of all spectra.

O 1s spectra are shown in Fig. 4. Similar to Si 2p and Cl
2s, the O 1s peak can be differentiated when the charge
neutralizer is turned on. Figure 4(a) shows the O 1s spectrum
recorded on a blanket oxide mask. The O 1s peak shifts
from 534 to 522 eV when the charge neutralizer is turned on.
The O 1s spectrum recorded on the unpatterned gate oxide
area is displayed in Fig. 4(b). The peak remains unshifted at
a binding energy of 533 eV. When the features are analyzed
with the lines and spaces parallel to the electron energy analyzer,
the O 1s peak splits into two components. The unshifted
peak at 533 eV can be attributed to oxygen resulting
from the conducting bottom of the features (gate oxide) and
the peak at 522 eV results from the insulating top of the
oxide mask. The O 1s spectrum displayed in Fig. 4(d) originates
from the tops and sidewalls of the oxide mask and from
the sidewalls of the polysilicon. The O 1s peak is split into
two peaks. The unshifted peak at 533 eV originates from the
conducting poly-Si sidewalls, whereas the broad peak centered
at 524 eV can be attributed to oxygen from the tops and
sides of the oxide mask.

Fig. 4. O 1s XPS spectra recorded on blanket oxide mask area (a), blanket
substrate (b), 0.35 µm lines and spaces aligned parallel (c) and perpendicular
(d) to the electron energy analyzer. The charge neutralizer was turned on
for all spectra.
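
The peak assignment by differential charging used throughout this subsection can be summarized in a small sketch. It is an illustration only: the 2 eV tolerance and the function name are assumptions, while the binding energies are the values quoted in the text for spectra recorded with the charge neutralizer turned on.

```python
def classify_peak(observed_ev, reference_ev, tolerance_ev=2.0):
    """Assign a peak to a conducting or insulating surface.

    A peak that stays within tolerance_ev of its reference binding energy
    is taken to come from a conducting surface; a peak shifted further
    is taken to come from a charged insulating surface.
    """
    shift = observed_ev - reference_ev
    surface = "conducting" if abs(shift) <= tolerance_ev else "insulating"
    return surface, shift


# (core level, observed binding energy, unshifted reference), values from the text.
peaks = [
    ("Cl 2s", 270.0, 270.0),  # gate oxide bottom or poly-Si sidewall
    ("Cl 2s", 261.0, 270.0),  # oxide hard mask
    ("O 1s", 533.0, 533.0),   # gate oxide bottom or poly-Si sidewall
    ("O 1s", 522.0, 533.0),   # oxide hard mask
]
for level, observed, reference in peaks:
    surface, shift = classify_peak(observed, reference)
    print(f"{level} at {observed:.0f} eV: {surface} (shift {shift:+.0f} eV)")
```

The shifts of roughly 9 to 13 eV reported for the oxide mask are far larger than any chemical shift expected here, so a coarse threshold is sufficient to separate the two groups of peaks.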

2.  Quantitative  analysis  of  the  chemical 
concentrations  on  the  oxide  masked  features 

We  now  present  the  results  of  the  quantitative  analysis  of 
the  chemical  element  concentrations  on  the  features.  The 
quantitative  coverages  of  silicon,  chlorine,  and  oxygen  are 
derived  from  spectra  recorded  with  the  charge  neutralizer 
turned  on.  The  integrated  peak  intensities  were  divided  by 
the Scofield cross sections (Si 2p: 0.82; Cl 2s: 1.69; O 1s:
2.93).^^  The  angle  between  all  the  analyzed  areas  and  the 
electron  energy  analyzer  being  45°,  direct  comparisons  can 
be  drawn  between  the  coverages  of  the  different  elements 
present  on  the  tops,  sidewalls,  and  bottoms  of  the  features. 
The  calculated  concentrations  of  the  elements  present  on  the 
surfaces of the features are summarized in Table I.
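
The quantification step described above amounts to dividing each integrated peak area by its cross section and normalizing to 100%. A minimal sketch follows; the cross sections are those quoted in the text, whereas the peak areas in the example are hypothetical numbers chosen only to show the calculation, and the function name is an assumption.

```python
# Scofield cross sections quoted in the text, keyed by core level.
SCOFIELD = {"Si 2p": 0.82, "Cl 2s": 1.69, "O 1s": 2.93}


def element_concentrations(peaks):
    """Convert integrated peak areas into concentrations in percent.

    peaks maps a component label to (integrated area, core level).
    Both silicon components (elemental Si and SiO2) use the Si 2p value.
    """
    corrected = {label: area / SCOFIELD[level]
                 for label, (area, level) in peaks.items()}
    total = sum(corrected.values())
    return {label: 100.0 * value / total for label, value in corrected.items()}


# Hypothetical peak areas (arbitrary units), not measured data.
example = {
    "Si 2p": (0.82, "Si 2p"),
    "SiO2": (0.82, "Si 2p"),
    "Cl 2s": (1.69, "Cl 2s"),
    "O 1s": (2.93, "O 1s"),
}
print(element_concentrations(example))
```

Because the example areas were chosen equal to the cross sections, each component comes out at 25%; with measured areas the same function would return values of the kind listed in Table I.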

Table I. Element concentrations calculated using quantitative XPS analysis
after the standard etching process. The mask material was a 200-nm-thick
oxide hard mask; the gate oxide thickness was 6 nm.

                                        Element concentration (%)
After standard process                  Si 2p   SiO2    Cl 2s   O 1s
Blanket substrate                       3.8     31.9    15.3    49.0
Blanket oxide mask                              34.0    16.1    50.0
0.35 µm L/S poly-Si sidewall            51.3    8.1     32.5    7.9
0.35 µm L/S oxide mask top and sidewall         35.9    12.1    52.0
0.35 µm L/S oxide mask top                      34.6    16.5    48.9
0.35 µm L/S substrate bottom            3.9     32.4    13.9    49.8

a. Blanket substrate (gate oxide), unpatterned oxide mask,
top of the oxide mask, bottom of the substrate, and oxide
mask (sidewall and top). The calculated element concentrations
on a blanket substrate (corresponding to the gate oxide),
the unpatterned oxide mask, the top of the oxide mask,
and the bottom of the features are rather similar. Slightly less
SiO2 is found on the blanket substrate and the bottom of the
features. This is because the underlying silicon
substrate contributes to the XPS signal, thereby reducing
the relative amount of SiO2. A smaller amount of chlorine
is adsorbed on the substrate bottom. Since the chlorine
coverage  increases  with  ion  energy  bombardment,"^  the  lower 
chlorine  concentration  might  be  due  to  a  loss  of  chlorine  ion 
energy  when  reaching  the  bottom  of  the  feature.  The  element 
concentrations  calculated  from  the  tops  and  sides  of  the  ox¬ 
ide  mask  show  a  lower  total  amount  of  chlorine  and  a  higher 
amount of SiO2 and oxygen. The chlorine concentration
found  is  30%  less  than  on  the  blanket  oxide  mask.  Assuming 
that  the  top  of  the  oxide  mask  contributes  to  the  chlorine 
signal  by  16.1%  and  by  taking  into  account  the  geometry  of 
the  features,  the  true  chlorine  coverage  on  the  sides  of  the 
features  can  be  estimated  to  be  less  than  9%.  This  result  can 
be  explained  by  the  lack  of  ion  bombardment  on  the  side- 
walls  of  the  features,  which  reduces  the  adsorption  of  chlo¬ 
rine  on  the  vertical  surfaces. 
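
The sidewall estimate above can be reproduced in spirit with a simple two-area mixing model. This sketch is not the authors' calculation, whose geometric weighting is not given: it assumes that the top and sidewall signals add in proportion to their projected areas at 45°, taken here as a 350 nm mask top and a 200 nm mask sidewall. The function name and the weighting are hypothetical.

```python
def sidewall_coverage(mixed_pct, top_pct, top_fraction):
    """Solve mixed = f * top + (1 - f) * sidewall for the sidewall value."""
    if not 0.0 <= top_fraction < 1.0:
        raise ValueError("top_fraction must lie in [0, 1)")
    return (mixed_pct - top_fraction * top_pct) / (1.0 - top_fraction)


# Chlorine concentrations from Table I (percent).
cl_top_and_sidewall = 12.1
cl_blanket_mask = 16.1

# Assumed weighting: 350 nm top width versus 200 nm mask sidewall height,
# both seen at 45 degrees, so the projection factors cancel.
top_fraction = 350.0 / (350.0 + 200.0)

estimate = sidewall_coverage(cl_top_and_sidewall, cl_blanket_mask, top_fraction)
print(f"estimated Cl on mask sidewall: {estimate:.1f} %")
```

Under this assumed weighting the sidewall chlorine comes out at about 5%, which is consistent with the upper bound of 9% stated in the text; a smaller top fraction moves the estimate toward that bound.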

b. poly-Si sidewall. In addition to the expected high
amount of silicon (51.3%), significant contributions of chlorine
(32.5%), SiO2 (8.1%), and oxygen (7.9%) are
found on the sides of the poly-Si (Table I). The amount of chlorine is
roughly double the concentration found on the oxide
surfaces, indicating that the adsorption of chlorine on silicon
surfaces is higher than on oxide surfaces.

3. Comparison of the chemical topography of oxide
and photoresist masked poly-Si features

In  our  recent  article,"^  we  studied  the  chemical  topography 
of  photoresist  masked  poly-Si.  We  found  that  a  thin  oxide 
film is formed during etching on the sidewalls of the features.
We concluded that the source of the oxide film was not
the  redeposition  of  volatile  etching  products  but  the  sputter¬ 
ing  of  the  quartz  tube  located  in  the  plasma  generation  re¬ 
gion.  We  also  observed  a  carbon  rich  film  on  the  sidewalls  of 
the  features,  possibly  acting  as  an  additional  passivation 
layer  on  the  sidewalls  of  the  trenches. 

We  discuss  here  the  chemical  composition  of  the  surfaces 
for both types of mask material, i.e., the sidewalls of the
poly-Si,  the  bottom  of  the  features,  and  unpatterned  areas. 
We  also  observed  a  thin  oxide  film  present  on  the  sidewalls 
of  the  poly-Si  trenches  when  using  the  oxide  hard  mask.  The 
likely  sources  of  this  oxide  are  either  the  quartz  tube  located 
in  the  plasma  generation  region  of  the  helicon  source  or  the 
sputtering  and  redeposition  of  oxide  species  from  the  oxide 
hard  mask.  To  determine  the  source  of  the  oxide  film,  we 
calculated the ratio of the SiO2 to Si 2p peak areas from the
sidewalls  of  the  poly-Si  trenches.  The  ratio  is  0.16  for  the 
oxide  mask  and  0.17  when  using  a  photoresist  mask,"^  indi¬ 
cating  that  the  thickness  of  the  deposited  oxide  film  is  ap¬ 
proximately  the  same  for  both  types  of  masks.  Since  the 
process  conditions  were  identical  for  both  types  of  masks, 
we  conclude  that  the  same  mechanism  as  that  found  for  the 
photoresist  masked  sample  is  responsible  for  the  oxide  for¬ 
mation:  the  oxide  is  sputtered  from  the  quartz  tube  during 
etching  and  redeposited  on  the  sidewalls  of  the  features.  This 
result  is  rather  surprising,  since  one  could  expect  the  sputter¬ 
ing  of  oxide  species  from  the  oxide  hard  mask  to  also  con¬ 
tribute  to  the  formation  of  the  oxide  passivation  layer.  Since 
this  has  not  been  experimentally  observed,  we  suggest  that 
the  mask  etching  products  are  volatile  and  pumped  away  by 
the  pumping  system.  On  the  other  hand,  the  quartz  tube  in 
the plasma generation region of the helicon source is exposed
to a higher plasma density than the wafer in the downstream
region. In addition, the temperature of the quartz tube
is much higher than that of the backside-cooled wafer, and the
inner surface area of the quartz tube is calculated to be about
1340 cm², 30 times larger than the oxide mask coverage on
the wafer. Furthermore, an increase in the roughness of the inner
surface of the quartz tube was observed as a function of
source operation time. As a consequence, the oxide surface area
of the quartz tube exposed to the plasma increases with time.
This  results  in  the  deposition  of  an  oxide  film  on  a  blanket 
silicon  wafer  (without  biasing)  at  a  rate  of  15  nm/min,  as 
measured  by  HeNe  laser  ellipsometry. 
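
The area comparison above can be checked with a few lines of arithmetic. The following is a minimal sketch, assuming the 200 mm wafer diameter stated in the conclusions; the variable names are illustrative and the mask coverage is back-calculated from the quoted factor of 30, not a value reported in the text.

```python
import math

# Values taken from the text.
QUARTZ_TUBE_AREA_CM2 = 1340.0   # inner surface area of the quartz tube
AREA_RATIO = 30.0               # tube area / oxide mask coverage on the wafer
WAFER_DIAMETER_CM = 20.0        # 200 mm wafer


def wafer_area_cm2(diameter_cm):
    """Area of a circular wafer of the given diameter."""
    return math.pi * (diameter_cm / 2.0) ** 2


# Back-calculated quantities (assumptions of this sketch, not reported values).
mask_area_cm2 = QUARTZ_TUBE_AREA_CM2 / AREA_RATIO
mask_fraction = mask_area_cm2 / wafer_area_cm2(WAFER_DIAMETER_CM)

print(f"implied oxide mask coverage: {mask_area_cm2:.1f} cm^2")
print(f"fraction of the wafer covered by mask: {mask_fraction:.1%}")
```

Under these assumptions the oxide mask covers roughly one seventh of the wafer, so the quartz tube presents a far larger oxide surface to the plasma than the mask does.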

We  found,  on  the  other  hand,  a  substantial  amount  of 
carbon  on  the  sides  of  the  poly-Si  when  using  a  photoresist 
mask,  indicating  that  carbon  from  the  photoresist  is  sputtered 
and  redeposited  on  the  sidewalls.  The  problem  is  to  under¬ 
stand  why  carbon  from  the  photoresist  contributes  to  the  pas¬ 
sivation  film  on  the  sidewalls  of  the  features.  One  plausible 
explanation  is  that  the  etch  rate  of  the  photoresist  mask  is 
approximately  four  to  five  times  higher  than  the  etch  rate  of 
the  oxide  mask.  Consequently,  more  etching  species  from  the 
mask  material  are  present  in  the  gas  phase  of  the  plasma. 
Assuming  that  the  carbon-containing  etching  products  are 
less  volatile  than  the  volatile  oxide  species  from  the  oxide 
mask,  the  photoresist  mask  can  contribute  to  the  formation  of 
a sidewall film. The results obtained using an oxide hard
mask  show  perfect  anisotropic  profiles  and  no  carbon  present 
on  the  surfaces.  We  can  therefore  conclude  that,  by  using  a 
nonpolymerizing  chlorine  chemistry,  a  carbon  rich  layer  may 
enhance  anisotropic  etching  but  is  not  necessary  to  obtain 
vertical  sidewalls. 

When  using  a  photoresist  mask,  carbon  is  present  on  all 
surfaces  and  therefore  contributes  to  the  quantification  of  the 
element  concentrations.  In  order  to  compare  the  chlorine 
coverage on the surfaces with both masks, the carbon contribution
was excluded and the O 1s, Cl 2p, and Si 2p element
concentrations were calculated again. Following this procedure,
the chlorine coverage found on the sidewalls of the
photoresist  masked  poly-Si  increases  from  21.4%  (Ref.  4)  to 
30%,  which  is  of  the  same  order  as  the  32.5%  calculated 
with  the  oxide  mask.  The  amount  of  oxygen  increases  from 
4.5%  to  6.5%  for  the  photoresist  mask,  which  is  slightly  less 
than  the  calculated  7.9%  for  the  oxide  mask.  In  blanket  ar¬ 
eas,  similar  calculations  show  that  the  chlorine  coverage  is 
16.7%  for  the  photoresist  mask  and  15.3%  for  the  oxide 
mask.  Since  the  thickness  of  the  gate  oxide  film  was  4.5  nm 
for  the  photoresist  masked  sample  and  6  nm  for  the  oxide 
masked  sample,  no  direct  comparison  of  the  oxygen  contri¬ 
bution  can  be  drawn. 
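
The renormalization described above, in which carbon is dropped and the remaining concentrations are rescaled to 100%, can be sketched as follows. The carbon and silicon percentages in the example are hypothetical: carbon is back-calculated from the reported chlorine values (21.4% before, 30% after) and silicon is taken as the remainder, so neither is a measured number from the text.

```python
def renormalize_without(concentrations, excluded):
    """Drop one element and rescale the remaining percentages to sum to 100."""
    kept = {k: v for k, v in concentrations.items() if k != excluded}
    total = sum(kept.values())
    return {k: 100.0 * v / total for k, v in kept.items()}


def implied_excluded_fraction(before_pct, after_pct):
    """Fraction of the excluded element implied by one element's change."""
    return 1.0 - before_pct / after_pct


# Carbon fraction implied by the chlorine coverage rising from 21.4% to 30%.
carbon_fraction = implied_excluded_fraction(21.4, 30.0)

# Hypothetical sidewall composition (C and Si are assumptions, see lead-in).
sidewall = {"C": 28.7, "Cl": 21.4, "O": 4.5, "Si": 45.4}
rescaled = renormalize_without(sidewall, "C")

print(f"implied carbon fraction: {carbon_fraction:.1%}")
for element, pct in rescaled.items():
    print(f"{element}: {pct:.1f}%")
```

With these inputs the rescaled oxygen value comes out near 6.3%, close to but not identical with the reported 6.5%, which is the kind of difference expected from rounding in the quoted percentages.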

In  summary,  comparison  of  the  chemical  topography  of 
the  photoresist  and  oxide  masked  samples  shows  that  the 
surface  modifications  after  etching  are  similar  for  both  types 
of  mask  material.  Approximately  the  same  amount  of  silicon 
oxide,  chlorine,  and  oxygen  is  present  on  the  sides  and  bot¬ 
toms  of  the  features.  The  difference  found  is  the  carbon  con¬ 
tamination  on  all  surfaces  when  using  the  photoresist  mask; 
the  effect  of  the  carbon  contamination  on  the  poly-Si  to  ox¬ 
ide  selectivity  will  be  discussed  in  a  subsequent  article. 

IV.  CONCLUSIONS 

We  have  studied  high  density  plasma  etching  processes  of 
0.25 μm polysilicon trenches. Polysilicon films on
SiO2-covered 200 mm silicon wafers were patterned using an
oxide  hard  mask,  and  then  etched  downstream  using  a  high 
density  plasma  helicon  source.  A  two-step  etching  process 
developed  for  a  photoresist  masked  sample  is  equally  effi¬ 
cient  at  etching  the  features  highly  anisotropically  with  a 
high  selectivity  between  the  poly-Si  and  the  gate  oxide. 

After  etching,  the  200  mm  wafers  were  transferred  (under 
high  vacuum)  to  an  ultrahigh  vacuum  analysis  chamber 
equipped  with  an  x-ray  photoelectron  spectrometer  to  deter¬ 
mine  the  chemical  composition  of  the  surfaces.  We  found  a 
thin  oxide  film  on  the  sidewalls  of  the  poly-Si  which  may 
passivate  the  sidewalls  and  therefore  prevent  lateral  etching 
of  the  poly-Si  gate.  The  thickness  of  this  film  found  on  the 
sidewalls  was  of  the  same  order  as  the  thickness  obtained 
using a photoresist mask. The source of this film was attributed
to the slight etching of the quartz tube in the plasma
generation region of the helicon source. Thus, the oxide mask
material played no role in passivating the poly-Si trenches. The
controlled  erosion  of  the  quartz  tube  might,  however,  be  of 
significant importance for the long-term process stability. A
substantial amount of chlorine was found on the sidewalls of
the  poly-Si,  whereas  about  half  this  amount  was  present  on 
the  oxide  surfaces  of  the  oxide  mask,  gate  oxide  bottom,  and 
blanket  gate  oxide  areas.  A  smaller  amount  of  oxygen  cov¬ 
ered  the  sidewall  of  the  poly-Si.  The  chlorine  and  oxygen 
coverages  were  comparable  to  those  observed  for  the  photo¬ 
resist  masked  samples. 
Influence of patterning in silicon quantum well structures
on  photoluminescence 
This article reports on a study of the photoluminescence (PL) characteristics of SiO2/Si/SiO2
quantum well structures formed by the thermal oxidation of Si lines with rectangular cross sections
and (111) side planes. The PL spectra have a peak at a wavelength of 760 nm regardless of Si width,
indicating that the emission is due to the recombination of carriers at the Si/SiO2 interface. The Si
width required to obtain the maximum emission is about 5 nm, which is thicker than theoretically
predicted. The reason for this is clarified by measuring the difference in the PL characteristics for
patterned and unpatterned SiO2/Si/SiO2 film structures. In the film structures, the characteristics are
found to change when the ultrathin Si layer is patterned and then oxidized; the Si thickness
yielding the maximum intensity increases and a wide range of Si thickness contributes to the
emission.  This  behavior  is  consistent  with  that  of  the  oxidized  Si  lines  and  is  possibly  due  to  pattern 
deformation  caused  by  oxidation.  The  stress  accompanying  the  deformation  is  probably  the  reason 
for  the  increase  in  the  PL  intensity  in  thicker  Si.  Therefore,  the  PL  characteristics  for  Si  are  strongly 
influenced  by  the  patterning  and  subsequent  oxidation.  ©  1996  American  Vacuum  Society. 


I.  INTRODUCTION 

The  properties  peculiar  to  nanometer-size  silicon  (Si) 
structures  are  expected  to  play  an  essential  role  in  realizing 
new  electronic  and  optoelectronic  devices  based  on  quantum 
effects.  In  particular,  the  optical  properties  of  Si  have  been 
attracting  a  great  deal  of  attention  from  the  viewpoint  of 
optoelectronics  since  strong  visible  photoluminescence  (PL) 
from porous Si was observed. A great deal of work has so far been
carried out on either porous Si with 2-50 nm mean diameter
fabricated by electrochemical anodization or chemical vapor
deposited (CVD) films of nanocrystalline Si. In both cases,
the PL emission was observed in various colors between red
and blue because the position of the PL maximum varied in
the energy range of 1.4-2.6 eV according to the difference in
sample fabrication processes. On the basis of these results,
various PL emission mechanisms, such as quantum
confinement and surface effects, which include Si species
covered by siloxene, the Si-H complex, and the SiO2
matrix, have been suggested. Nonetheless, the mechanism of
PL emission from nanometer-size Si structures
remains unclear.

From the viewpoint of applying this strong PL to Si
optoelectronics, the fabrication of well-controlled Si nanostructures
is essential. For this purpose, nanolithography, especially
with an electron beam, is effective because patterns
of the desired size can easily be fabricated with an ultrafine
beam. In fact, nanometer-size resist patterns have
been formed by electron-beam nanolithography. By using
such resist patterns as an etching mask, Si nanostructures
were fabricated and PL measurements were attempted on these
structures. However, Si nanostructures did not always
show PL emission. For example, PL was not observed from
10-nm Si structures passivated with hydrofluoric acid. This
is due to the ambiguity of the relationship between the Si
pattern size and PL, as well as the confusion surrounding the
emission mechanism.

We  have  studied  PL  spectra  from  two-dimensional  (2D) 
Si  nanolines  formed  using  electron-beam  lithography,  Si 
etching,  and  oxidation.  In  this  article,  we  report  on  the  rela¬ 
tionship  between  the  PL  emission  characteristics  of  the  Si 
lines  and  the  pattern  size.  The  Si  lines  were  fabricated  by 
etching  a  (110)  SIMOX  (separation  by  implanted  oxygen) 
wafer  with  a  KOH  solution.  Owing  to  orientation-dependent 
etching,  patterns  with  a  rectangular  cross  section  were 
formed.  The  Si  patterns  were  separated  electrically  from 
the substrate by the buried SiO2 layer. After the width of the
lines decreased due to oxidation, the Si pattern shapes and
sizes were evaluated by TEM and PL spectra from the oxidized
Si lines were measured. In addition, PL from patterned
and unpatterned SiO2/Si/SiO2 film structures was measured
to understand the characteristics of PL from the oxidized Si
lines. Based on a comparison of these results, factors affecting
PL from oxidized Si lines are also discussed.

II.  EXPERIMENT 

A 5-10 Ω cm (110) p-type SIMOX wafer with a 100-nm-thick
top Si layer was used for nanoline fabrication. After
washing the wafer surface with hydrofluoric acid, 50-nm-thick
positive-type resist ZEP-520 (Nippon Zeon Co.) was spun
onto it and exposed to the electron beam. Resist lines about
15 nm wide along <112> directions were formed with a period
of 70 nm in a 100 μm × 100 μm area after development.
The Si surface exposed between the lines was oxidized by
ECR oxygen plasma to a thickness of 2 nm to form an
etching mask. After removal of the resist film with acetone
and ultrasonic agitation, the top Si layer of unoxidized regions
was etched in a 20% KOH aqueous solution at 10 °C.
Consequently, Si lines with rectangular cross sections
and (111) side planes were formed. The width of the Si
lines was decreased to the desired thickness by thermal
oxidation in a dry oxygen ambient at 800 °C.

Table I. Si thickness after oxidation at 800 °C.

          Oxidation time    Remaining Si thickness (nm)(a)
Sample    (min)             Middle        Top
a         0                 14.6±1.8      14.6±1.8
b         30                11.6±2.4      12.7±1.8
c         80                6.5±1.4       10.1±1.5
d         94                4.6±2.0       9.1±1.5
e         125               1.0±1.4       7.7±1.3
f         170               0             7.1±1.4

(a) Si thickness obtained from TEM micrographs.

Thin-film  structures  were  formed  on  (111)  and  (100) 
SIMOX  wafers  with  a  7-14  nm-thick  top  Si  layer.  After 
resist  patterning  by  lithography,  the  exposed  Si  layer  was 
etched  with  chlorine-based  gas  plasma.  Then  film  structures 
composed of 15-nm-thick thermal SiO2, 0-7-nm-thick Si,
and 400-nm-thick buried SiO2 layers were obtained after
thermal oxidation in a dry oxygen ambient at 800 °C.

PL measurements were carried out using 488-nm excitation
from an Ar-ion laser and a monochromator with a
thermoelectrically cooled photomultiplier coupled to a GaAs
cathode. A laser beam with a power of 4 mW was focused on
a spot about 10 μm in diameter. The sample temperature was
adjusted by controlling the amount of liquid He in the
cryostat.

III.  RESULTS  AND  DISCUSSION 

A.  Characterization  of  Si  pattern  shape  after 
oxidation 

Five  samples  about  15  nm  in  width  and  100  nm  in  height 
were  oxidized  under  the  conditions  shown  in  Table  I.  Figure 
1  shows  TEM  micrographs  of  these  samples  before  and  after 
oxidation.  The  Si  width  decreases  with  increasing  oxidation 
time. In addition, the Si lines become deformed as oxidation
proceeds, as shown in Fig. 1. It is known that the oxidation
of Si causes a volume expansion because SiO2 is about two
times larger than Si in volume per Si atom. The expansion
tends to be suppressed at the corners of the pattern and therefore
the stress generated there is greater than that in the
middle region. Because the oxidation rate decreases with
increasing stress, the result is the pattern deformation shown in
the figure. For instance, in Fig. 1(e), Si remains at the corners
after  oxidation,  but  the  Si  in  the  middle  regions  is  completely 
oxidized.  Even  after  a  longer  period  of  oxidation,  the  width 
of  the  Si  at  the  corners  remains  about  the  same  because  of 
the  large  stress,  as  shown  in  Fig.  1(f).  Such  a  large  stress  at 
the  corners  would  affect  the  amount  of  stress  in  the  middle 
region.  It  is  estimated  by  simple  simulation  that  the  stress  in 
the  whole  oxide  on  the  Si  line  ranges  from  10^  to  10^^ 
dyn/cm^,  but  it  is  difficult  to  determine  the  stress  exactly.  The 
Si  thickness  in  the  middle  and  upper  parts  of  the  lines  after 
oxidation  is  summarized  in  Table  I. 
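
The slower oxidation in the upper part of the lines can be made concrete by computing mean thinning rates from Table I. This is a minimal sketch using only the tabulated central values; the function name and the choice of the 0-125 min interval are illustrative, and the quoted uncertainties are ignored.

```python
# Table I: (oxidation time in min, middle thickness in nm, top thickness in nm)
TABLE_I = [
    (0, 14.6, 14.6),
    (30, 11.6, 12.7),
    (80, 6.5, 10.1),
    (94, 4.6, 9.1),
    (125, 1.0, 7.7),
    (170, 0.0, 7.1),
]

MIDDLE, TOP = 1, 2  # column indices in TABLE_I


def mean_thinning_rate(rows, column, t_start, t_end):
    """Mean decrease in Si thickness (nm/min) between two tabulated times."""
    by_time = {row[0]: row[column] for row in rows}
    return (by_time[t_start] - by_time[t_end]) / (t_end - t_start)


middle_rate = mean_thinning_rate(TABLE_I, MIDDLE, 0, 125)
top_rate = mean_thinning_rate(TABLE_I, TOP, 0, 125)

print(f"middle: {middle_rate:.4f} nm/min")
print(f"top:    {top_rate:.4f} nm/min")
print(f"ratio middle/top: {middle_rate / top_rate:.2f}")
```

Over the first 125 min the middle of the line thins roughly twice as fast as the top, which is consistent with the stress-retarded oxidation near the corners described above.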


Fig. 1. TEM micrographs of the Si lines after (a) 0, (b) 30, (c) 80, (d) 94,
(e) 125, and (f) 170 min of dry oxidation at 800 °C. Scale bar: 20 nm.


B. PL spectra from Si nanolines after oxidation

Figure 2 shows the PL spectra of the samples in Fig. 1. All
oxidized  Si  lines  clearly  exhibit  PL,  whereas  no  PL  was  ob¬ 
served  from  sample  (a),  which  was  not  oxidized.  The  PL 
intensity  for  samples  (c)  and  (d),  in  which  Si  remains  in  the 
middle  regions,  is  stronger  than  for  samples  (e)  and  (f), 
which have no Si in the middle. When the remaining Si is
still thick, as in sample (b), the intensity tends to decrease.
The spectra for the oxidized Si lines have
a peak at a wavelength of around 760 nm (1.65 eV). This
1.65-eV PL is the same as the signal observed from oxidized
porous Si and a Si(100) 2D structure. The spectra
are broad and the peak position is independent of the Si
width.

Fig. 2. PL spectra of the oxidized Si lines (horizontal axis: wavelength in
nm). These spectra are obtained by subtracting the background level from
the original spectra. Sample temperature was maintained at 50 K.

Fig. 3. The dependence of the PL intensity at 760 nm on the Si width
(horizontal axis: Si width in nm). Sample temperature was maintained at
50 K. The Si widths were measured in the middle of the lines. The line is
extrapolated under the assumption of a Gaussian distribution of the Si
thickness.

Fig. 5. The schematic of the SiO2/Si/SiO2 film structure used to examine the
patterning effect. The patterns were fabricated by the etching of the top Si
layer in (111) SIMOX substrates and subsequent oxidation. The Si thickness
remaining was from 0 to 3 nm. The pattern size is 1 μm × 0.75 μm.
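
The correspondence between the quoted peak wavelength and photon energy can be checked with the relation E = hc/λ. The sketch below uses the standard values of the Planck constant and the speed of light; the function name is illustrative.

```python
PLANCK_EV_S = 4.135667696e-15   # Planck constant in eV s
LIGHT_SPEED_M_S = 299792458.0   # speed of light in m/s


def photon_energy_ev(wavelength_nm):
    """Photon energy in eV for a vacuum wavelength given in nm."""
    return PLANCK_EV_S * LIGHT_SPEED_M_S / (wavelength_nm * 1e-9)


for wavelength in (750.0, 760.0, 770.0):
    print(f"{wavelength:.0f} nm -> {photon_energy_ev(wavelength):.3f} eV")
```

A wavelength of exactly 760 nm corresponds to about 1.63 eV, while 1.65 eV corresponds to about 750 nm, so the quoted pair of values is consistent with a peak "around" 760 nm on a broad spectrum.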

The  dependence  of  the  PL  intensity  at  760  nm  on  the  Si 
width in the middle of the line is shown in Fig. 3. The intensity
is maximum at a thickness of about 5 nm, which is
thicker than estimated by the effective-mass
approximation. In this figure, the full width at half-maximum
(FWHM) is considerably large, indicating that a
wide range of Si width contributes to emission. In addition,
the emission is observed even when there is no Si left in the
middle regions of the lines. This means that the small amount of
Si remaining in the corners contributes strongly to the emission
once its size becomes comparable to that of the Si
which contributes to the emission.

Figure 4 shows the temperature dependence of the maximum
PL intensity. It decreases as the temperature increases,
though the rate of decrease is not large from 20 to 80
K. The drop in PL intensity indicates that the ratio of
nonradiative recombination processes, such as nonradiative carrier
capture assisted by phonons on Si dangling bonds, to
radiative recombination becomes large with increasing
temperature.

Fig. 4. PL maximum intensity as a function of sample temperature
(horizontal axis: temperature in K).

C.  PL  spectra  from  ultrathin  Si  film 

In  order  to  investigate  the  factors  causing  the  interesting 
PL  characteristics  of  the  Si  nanoline,  we  measured  PL  for 
patterned and unpatterned SiO2/Si/SiO2 film structures
formed on (111) SIMOX substrates. The patterned
SiO2/Si/SiO2 film structures were prepared by Si etching and
subsequent  oxidation  so  as  to  correspond  to  nanolines  which 
were  laid  flat,  as  shown  in  Fig.  5.  For  these  film  structures, 
the  Si-thickness  effect  on  the  PL  intensities  can  be  estimated 
exactly  because  the  thickness  variation  of  the  Si  layer  is  less 
than  0.5  nm.  The  Si  film  in  the  patterned  structures  is  de¬ 
formed  just  like  the  oxidized  Si  lines,  although  the  degree  of 
deformation  is  smaller  than  in  the  lines. 

Figure 6 shows the PL spectra from unpatterned
SiO2/Si/SiO2 film structures with several Si thicknesses in
the pattern center region. The peak position is near 770 nm.
In addition, strong PL emission is observed in the ~1-nm-thick
Si layer having the (111) crystal plane as the top surface.
The PL characteristics for the patterned structures are different
from those for the unpatterned structures. The peak
intensities for each Si thickness at 760-770 nm are shown in
Fig. 7. The reason for the emission at the Si thickness of 0
nm is probably that Si grains remain even when the Si layer
is expected to be completely oxidized and they contribute to
emission. This is supported by the TEM observation that a few
Si particles remain in the sample for which a negative
thickness is indicated. As can be seen in Fig. 7, the maximum
intensity in the patterned samples is lower than in the unpatterned ones.
The proportion of the intensity drop is roughly consistent
with that of the Si-area decrease caused by Si etching and
oxidation. Other particular points of interest in Fig. 7 are that
a wide range of Si thickness in the patterned samples contributes
to the emission and the peak shifts to a somewhat
larger Si thickness value than that of the unpatterned sample.
This behavior of the patterned sample is similar to that of the
oxidized Si nanolines shown in Fig. 3, although the degree of
the shift is different between the patterned film and the lines.
Therefore, it is suggested that the unusual behavior of PL
from the oxidized lines is caused by the patterning and
oxidation of the structures.

Fig. 6. PL spectra from the unpatterned regions of the SiO2/Si/SiO2 film
structures as a function of Si thickness. Sample temperature was maintained
at 50 K. The reason for the emission at the Si thickness of 0 nm is probably
that Si grains remain even when the Si layer is expected to be completely
oxidized and they contribute to emission.

Fig. 7. The relationship between the maximum intensity and the Si thickness
(horizontal axis: Si thickness in nm) in the patterned and unpatterned
SiO2/(111)Si/SiO2 film structures. The Si thickness was obtained by
subtracting the Si thickness consumed by oxidation from the initial Si
thickness. Sample temperature was maintained at 50 K.

Fig. 8. The relationship between the maximum intensity and the Si thickness
(horizontal axis: Si thickness in nm) in the patterned and unpatterned
SiO2/(100)Si/SiO2 film structures. Sample temperature was maintained at 50 K.

Fig. 9. PL intensity as a function of sample temperature (horizontal axis:
temperature in K) in the (a) patterned and (b) unpatterned SiO2/Si/SiO2 film
structure. The intensity at several temperatures is normalized by that at 20 K.

These  PL  phenomena  due  to  the  patterning  of  the  ultrathin 
Si  film  are  not  special  to  the  film  structure  formed  in  (111) 
Si.  Figure  8  shows  the  relationship  between  the  PL  peak 
intensity,  whose  position  is  at  the  wavelength  of  750  nm,  and 
Si  thickness  in  a  (100)  SIMOX  wafer.  It  is  interesting  that 
the  result  from  the  (100)  Si  film  is  similar  to  the  (111)  Si 
film.  It  is  presumed  from  Figs.  7  and  8  that  there  is  not  much 
difference  in  the  excitation  efficiency  to  the  lowest  excitation 
state  in  (100)  and  (111)  Si,  if  the  emission  efficiency  of  (100) 
Si is similar to that of (111) Si. In any case, whatever crystal
planes the Si structure may have, the Si has to be quite thick
to obtain maximum intensity, and a wide range of Si
thickness contributes to the emission when the Si structure is
patterned, though the intensity drops.

Figure 9 shows the temperature dependence of maximum
PL intensities in the patterned and the unpatterned
SiO2/Si/SiO2 layer structures. For both structures, the intensity
decreases with increasing temperature in the range of
20-120 K. However, the proportion of the decrease for the
patterned structure is smaller than that for the unpatterned
structure. This is similar to the situation for the oxidized Si
lines shown in Fig. 4. The cause is probably related to a
small increment in the nonradiative process with increasing
temperature. In the patterned samples, the probability of the
phonon-related capture of carriers would not increase so
much with increasing temperature because the lattice is under
strain due to the stress, as discussed below.

D.  Origin  of  PL  from  oxidized  Si  nanostructures 

The  fact  that  the  PL  peak  position  is  at  760  nm  regardless 
of  the  Si  thickness,  as  shown  in  Fig.  2,  indicates  that  the 
emission does not come directly from the Si nanolines. This
suggests that the mechanism of the PL emission from the
oxidized Si lines is similar to that proposed for porous Si.
In that mechanism, radiative recombination occurs at the
interface between Si and SiO2: carriers become excited in the
Si region where the band gap is widened due to quantum
confinement effects and their energy eventually exceeds the
energy separation of the radiative recombination centers at
the Si/SiO2 interface, i.e., 1.65 eV. The carriers then localize
at the interface and the radiative recombination occurs there.
Consequently, the peak position is independent of the Si
width since emission is based on the constant band-gap
energy of the interface regions. While this appears to be the
primary  mechanism  involved  in  oxidized  Si  lines,  it  alone 
cannot  explain  the  PL  behavior  in  Fig.  3. 
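
The threshold idea in this mechanism, that the confined band gap must exceed the 1.65 eV interface energy, can be illustrated with a simple particle-in-a-box estimate. This is only a sketch under stated assumptions and is not the effective-mass calculation cited in the text: it assumes an infinite one-dimensional well, a bulk Si gap of 1.12 eV, and placeholder effective masses equal to the free-electron mass. All three choices are illustrative and should be replaced by values appropriate to the structure.

```python
import math

HBAR_J_S = 1.054571817e-34         # reduced Planck constant (J s)
ELECTRON_MASS_KG = 9.1093837015e-31
EV_J = 1.602176634e-19             # joules per electron volt


def confinement_shift_ev(width_nm, m_e_rel, m_h_rel):
    """Ground-state gap widening (eV) of an infinite 1D well of given width.

    m_e_rel and m_h_rel are electron and hole effective masses in units of
    the free-electron mass (placeholders in this sketch).
    """
    width_m = width_nm * 1e-9
    prefactor = (HBAR_J_S * math.pi) ** 2 / (2.0 * ELECTRON_MASS_KG * width_m ** 2)
    return prefactor * (1.0 / m_e_rel + 1.0 / m_h_rel) / EV_J


def width_for_gap_nm(target_gap_ev, bulk_gap_ev, m_e_rel, m_h_rel):
    """Well width (nm) at which the confined gap equals target_gap_ev."""
    shift_needed = target_gap_ev - bulk_gap_ev
    shift_at_1nm = confinement_shift_ev(1.0, m_e_rel, m_h_rel)
    return math.sqrt(shift_at_1nm / shift_needed)  # shift scales as 1/width^2


# Placeholder masses (1.0, 1.0) and an assumed bulk gap of 1.12 eV.
threshold_nm = width_for_gap_nm(1.65, 1.12, 1.0, 1.0)
print(f"width at which the confined gap reaches 1.65 eV: {threshold_nm:.2f} nm")
```

Because the shift scales as the inverse square of the width, lighter effective masses move the threshold to larger widths; the sketch is meant only to show why such a model predicts a narrow optimum at small Si thickness, in contrast to the broad maximum near 5 nm in Fig. 3.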

In the thin-film samples in which the Si layers were patterned,
changing the number of divisions of the Si layer while
keeping the total Si area the same did not result in any clear change in
the PL intensities. If the neck region between the corner and
the center region of the pattern (see Fig. 5) were mainly
related to the emission, the PL intensity would increase with
an increasing number of divisions, i.e., the quantity of
neck regions. From these results, we conclude that the Si
regions  remaining  abundantly  in  the  center  of  the  pattern 
chiefly  contribute  to  the  emission.  This  is  expected  to  also  be 
the  case  for  the  line  pattern. 

The  reason  that  the  Si  yielding  the  maximum  intensity  in 
the  patterned  2D-Si  layers  is  thicker,  as  is  seen  in  Figs.  3,  7, 
and 8, is thought to be as follows. As described in Sec. III A,
the stress in the Si lines is not uniform. Such stress is expected
to change the band structure, but exactly how it
changes is difficult to estimate. For example, the indirect
band-gap energy is known to decrease under the simple application of
hydrostatic pressure. When tensile tangential stress is applied
to a Si layer through oxidation, as in this experiment,
the indirect band-gap energy is expected to increase. On
the other hand, it is expected that the number of dangling
bonds acting as nonradiative centers increases at the Si/SiO2
interface due to strain brought on by the large amounts of
stress.  Consequently,  the  number  of  carriers  captured  by  the 
nonradiative  centers  increases,  which  could  lower  the  emis¬ 
sion  efficiency  at  the  interface  of  the  patterned  ultrathin  (0-2 
nm  thick)  Si  region.  As  a  result  of  the  combination  of  the 
above  phenomena,  it  is  thought  that  Si  thicker  than  predicted 
produces  maximum  intensity  when  the  Si  layer  is  patterned 
regardless  of  whether  the  silicon  is  in  the  form  of  lines  or 
films.  The  reason  the  emissions  come  from  Si  regions  with  a 
broad  range  of  thickness  is  probably  due  to  the  large  varia¬ 
tion  in  the  stress  generated  in  the  lines  or  films. 

IV.  CONCLUSION 

The  Si  lines  with  (111)  side  plane  and  rectangular  cross 
sections  were  fabricated  using  (110)  SIMOX  substrates  with 
electron-beam  lithography  and  KOH  etching.  After  the  lines 
were  oxidized  at  several  thicknesses,  PL  was  measured  from 
the oxidized Si lines. Clear spectra with peak intensity at a
wavelength of around 760 nm were observed and the emission
was found to be due to the recombination of carriers at
the Si/SiO2 interface, though the detailed excitation processes
are unknown. In addition, TEM images showing the
Si  thickness  in  conjunction  with  the  PL  intensity  from  the 
lines  lead  us  to  conclude  that  Si  lines  with  a  central  thickness 
of  5  nm  produce  the  maximum  intensity.  Furthermore,  a 
wide  range  of  the  Si  thickness  contributes  to  the  emission. 
The causes for this were clarified by discussing the difference
in the PL characteristics for patterned and unpatterned
SiO2/Si/SiO2 layer structures. First, it was found that the
(111) Si layer in the unpatterned SiO2/Si/SiO2 layer structures
has a maximum intensity at a thickness of around 1 nm.
With  patterning,  however,  the  Si  thickness  required  to  pro¬ 
duce  maximum  intensity  increases  and  a  wide  range  of  the  Si 
thickness  contributes  to  the  emission,  which  is  similar  to  the 
characteristics  of  the  oxidized  Si  lines.  Therefore,  the 
changes  in  PL  behavior  are  probably  caused  by  the  increased 
amount  of  stress  produced  by  the  Si  patterning  and  subse¬ 
quent  oxidation  process. 
A computer simulation has been carried out to study the dynamics of pore formation in porous
silicon.  Porous  structures  were  generated  by  a  simple  three-dimensional  Monte  Carlo  calculation, 
based  on  a  modified  percolation  model  for  pore  growth  in  lightly  and  heavily  doped  silicon 
substrates. The results are consistent with recent observations. As an approximate guide, the type
of morphology formed was shown to depend mainly on the doping level of the substrate. A strong
dependence  of  the  average  porosity  on  the  doping  level  of  the  substrate  is  also  predicted.  ©  1996 
American  Vacuum  Society. 


I.  INTRODUCTION 

The  discovery  of  visible  room-temperature  photolumines¬ 
cence  from  electrochemically  etched  highly  porous  silicon 
(PS)^’^  has  generated  much  interest  in  this  material  because 
of  the  possibility  of  an  application  to  Si-based  optoelectronic 
devices.  A  large  number  of  papers  on  optical  and  structural 
properties  of  PS  have  been  published  during  the  last  few 
years,  but  most  basic  questions  concerning  the  origin  of  the 
photoluminescence  (PL)  and  the  morphology  of  PS  film  still 
remain  open.  No  complete  understanding  of  pore  morphol¬ 
ogy  exists,  and  it  is  likely  that  improvements  in  our  under¬ 
standing  will  come  about  through  the  application  of  high- 
resolution  microscopy.  Transmission  electron  microscopy 
and  scanning  tunneling  microscopy  have  already  proved  es¬ 
sential  in  probing  the  porous  structure.  PS  layers  are  inevi¬ 
tably  fragile  and  this  makes  their  evaluation  more  difficult, 
and  necessitates  the  development  of  some  novel  approaches 
to  specimen  preparation.  A  correlation  between  the  surface 
morphology  of  PS  layers  and  their  respective  efficiency  of 
the  visible  photoluminescence  signal  has  been  reported  by 
Enachescu et al. Several theoretical models are available,
including the quantum model and the diffusion limited
model. Very recently, a computer simulation of pore growth
in n-type silicon has been developed by Erlebacher et al.

It is well known that strong luminescence from PS is often observed on highly porous samples
grown on lightly doped (nondegenerate) p-type substrates.^ However, PS layers in lightly doped
p-type silicon are more difficult to characterize by computer simulation because of the smaller
length scales.^ For lightly doped p-type silicon, a depletion layer is also set up, but the
field strength is relatively small in comparison with the conditions for pore formation in
heavily doped (degenerate) silicon. Recent studies suggest that the structure of the PS layer
obtained from a lightly doped p-type substrate is a network of interconnected nanocrystallites
showing a disordered spongelike morphology, rather than a series of parallel columns as
obtained from heavily doped n-type substrates.

Electronic mail: jinglin@fudah.ihep.ac.c

It is well known that the percolation model^ describes the random diffusion and flow of a fluid
in disordered porous materials. The approach is one of the most useful methods of understanding
the properties of some disordered systems. It differs from ordinary diffusion models such as
the diffusion limited model: the randomness in the percolation process is due to the disordered
structure of the material itself. Recently, we have shown that the Hausdorff dimension of the
PS surface is D = 1.88, which is in good agreement with numerical simulations of the invasion
percolation model. The pore formation in PS can be described by the dynamic process of invasion
percolation of the HF solution in electrochemical anodization.

The purpose of this study is to perform a Monte Carlo calculation, based on a modified
percolation model, to simulate the dynamics of pore formation on a silicon substrate during
electrochemical anodization, and to obtain a better understanding of the problem. The
simulations presented here model the dissolution on a two- or three-dimensional grid
representing a section of silicon. Disordered spongelike morphologies are formed through a
simulated dissolution process by a modified percolation model. Many other groups have used
computer simulations to model pore growth in silicon.^’ Our model, in contrast, considers an
approximation for the electric field within the silicon bulk and its resultant influence on the
dynamics of carrier transport in the vicinity of the pore front. Previous models, in which the
charge carriers follow random walks to the silicon/HF interface, do not predict the spongelike
feature of PS. We emphasize that our model helps to clarify many morphological characteristics
of PS on both the larger and the smaller length scales, although these preliminary results make
no predictions about atomic scale features.

II. MODEL FOR HOLE TRANSPORT DURING PORE GROWTH

A widely accepted mechanism of pore formation in anodized silicon involves the local
dissolution of silicon during the anodization.^^ The anodic etching of silicon leads to the
following reaction on the surface:^^

$\mathrm{Si} + 2\mathrm{HF} + (2-n)\,e^{+} = \mathrm{SiF_2} + 2\mathrm{H^{+}} + n\,e^{-}$, (1)

where $n < 2$, and $e^{+}$ and $e^{-}$ represent a hole and an electron, respectively. Here it
is assumed that holes take part.

SiF2 is unstable:

$2\,\mathrm{SiF_2} = \mathrm{Si} + \mathrm{SiF_4}$. (2)

A very thin surface porous film (SPF) is formed by the deposition of the elemental silicon
produced by the disproportionation reaction (2) on the surface of the lightly doped p-type
substrate. After this initial process, the surface porous film can be considered as a porous
network connected by narrow throats, similar to a percolation network. Under a small constant
current, the flow rate of the HF solution is kept very low, so that viscous forces can be
neglected in comparison with capillary forces. The dynamic process of random diffusion and
invasion percolation of the HF solution in electrochemical anodization can be described by the
model suggested in Ref. 14.

Our model is similar to the percolation model. In our simulation, a 200×200×100 cubic lattice
is considered. Each point in the cubic lattice represents a unit of silicon atoms.

A point can take one of four values: 0, 1, 4, or 8. A value of 0 means that the point has been
etched, 1 means that the point is active, 4 means that it is a surface point, and 8 means that
it is an inner point. In the anodization process, anodic etching can occur only at the
interface between the HF acid and the silicon. An inner atom cannot be etched, and the inner
points represent such silicon atoms. Moreover, the etching process occurs only at certain
places on the interface. If every place on the interface had the same probability of being
etched, no porous morphology would be obtained. The places where etching happens are active
areas on the interface, and their counterparts in our model are active points. Other atoms on
the interface, which have little chance of being etched, are represented by surface points in
our model. The structure of a general simulation is shown in Fig. 1(a).

Our simulation was carried out on a 200×200×100 cubic lattice with a surface porous film, as
shown in Fig. 1(b). Initially, a very thin (thickness = 1) SPF with a porosity of 0.1 was
assumed. Under this initial condition, the surface could be regarded as a porous network,
similar to percolation. On the SPF, a group of surface points were set to active points. They
were randomly dispersed on the top of the 200×200×100 cubic lattice, and the topmost layer of
the lattice is considered to be the silicon/HF interface. The total number of initial active
points is named the seeds number, which represents the initial current value. Different porous
silicon can be obtained under different current conditions. By changing the seeds number in our
model, different morphologies of porous silicon can be produced.

Fig. 1. (a) Simulation structure showing important features. (b) The 200×200×100 cubic lattice
with a surface porous film, on which our simulation was carried out.
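
The initial state described above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the authors' program: the function and variable names are hypothetical, the lattice is indexed as lattice[z][y][x] with z = 0 taken as the silicon/HF interface, and exposing the points directly beneath the initial pores is left out for brevity.

```python
import random

# Point states, using the numeric values given in the text.
ETCHED, ACTIVE, SURFACE, INNER = 0, 1, 4, 8


def make_lattice(nx, ny, nz, n_seeds, spf_porosity=0.1, rng=None):
    """Return lattice[z][y][x] with a one-layer surface porous film at z = 0.

    n_seeds is the "seeds number": how many surface points start as active.
    """
    rng = rng or random.Random()
    # Everything starts as inner silicon that the acid cannot reach.
    lattice = [[[INNER] * nx for _ in range(ny)] for _ in range(nz)]

    # Shuffle the top-layer sites so that pores and seeds are placed at random.
    sites = [(x, y) for y in range(ny) for x in range(nx)]
    rng.shuffle(sites)
    n_pores = int(round(spf_porosity * nx * ny))
    surface_sites = sites[n_pores:]
    if n_seeds > len(surface_sites):
        raise ValueError("more seeds than available surface points")

    for x, y in sites[:n_pores]:
        lattice[0][y][x] = ETCHED      # pores already present in the SPF
    for x, y in surface_sites:
        lattice[0][y][x] = SURFACE     # silicon in contact with the HF solution
    for x, y in surface_sites[:n_seeds]:
        lattice[0][y][x] = ACTIVE      # initial active points (the seeds)
    return lattice


def count_state(lattice, state):
    """Count the lattice points that currently hold the given state."""
    return sum(row.count(state) for layer in lattice for row in layer)
```

A reduced lattice such as make_lattice(40, 40, 20, n_seeds=50) is convenient for experimentation, because a 200×200×100 nested list is slow to scan in pure Python.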

In the anodization simulation, the cubic lattice was scanned several times. The number of scans
represents the duration of the anodization process. In each scan, every point on the lattice
was examined. If the point currently scanned is an etched or inner point, no action is taken.
If it is a surface point, it has some probability of being changed into an active point. This
is one way to produce an active point. If the point currently scanned is an active one, it is
etched (its value is set to zero) and adjacent inner points are made surface points. There is
also a probability for one of the adjacent points to be changed into an active point; this is
the other way to produce an active point. Because the porous silicon is formed under a
constant-current condition, the total number of active points is kept equal to the initial
seeds number at all times.
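
One scan of the lattice, as described above, might be sketched as follows. The code is a simplified reading of the text, and several choices are assumptions: every etched active point hands its activity to one randomly chosen neighboring surface point whenever one exists, and the random activation of surface points is used only to top the number of active points back up to the seeds number. All names are hypothetical.

```python
import random

ETCHED, ACTIVE, SURFACE, INNER = 0, 1, 4, 8
STEPS = [(1, 0, 0), (-1, 0, 0), (0, 1, 0), (0, -1, 0), (0, 0, 1), (0, 0, -1)]


def points_in_state(lattice, state):
    """List the (x, y, z) coordinates of all points holding the given state."""
    return [(x, y, z)
            for z, layer in enumerate(lattice)
            for y, row in enumerate(layer)
            for x, value in enumerate(row)
            if value == state]


def scan_once(lattice, n_seeds, rng):
    """Perform one scan on lattice[z][y][x], modifying it in place."""
    nz, ny, nx = len(lattice), len(lattice[0]), len(lattice[0][0])
    # Snapshot first, so points activated during this scan are etched next scan.
    for x, y, z in points_in_state(lattice, ACTIVE):
        lattice[z][y][x] = ETCHED
        candidates = []
        for dx, dy, dz in STEPS:
            px, py, pz = x + dx, y + dy, z + dz
            if not (0 <= px < nx and 0 <= py < ny and 0 <= pz < nz):
                continue
            if lattice[pz][py][px] == INNER:
                lattice[pz][py][px] = SURFACE   # newly exposed silicon
            if lattice[pz][py][px] == SURFACE:
                candidates.append((px, py, pz))
        if candidates:
            px, py, pz = rng.choice(candidates)
            lattice[pz][py][px] = ACTIVE        # the active point "moves" here

    # Constant current: restore the number of active points to n_seeds.
    deficit = n_seeds - len(points_in_state(lattice, ACTIVE))
    if deficit > 0:
        surface = points_in_state(lattice, SURFACE)
        for px, py, pz in rng.sample(surface, min(deficit, len(surface))):
            lattice[pz][py][px] = ACTIVE
```

In this sketch the neighbor is chosen uniformly; a field-dependent bias toward the downward direction would replace rng.choice with a weighted choice.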

Etching always occurs where the static electric field strength is largest. It is known that the
field strength is very large at the tip of a charged conductor. The HF acid can be regarded as
a good conductor compared with the porous silicon, whose resistivity is very high. At the tips
of the pores, the field strength is larger than elsewhere on the interface, so etching always
happens at the tips. In our model, points next to an active point therefore have a greater
chance of being changed into active points and etched at the next scan.

The process in which an active point is etched and an adjacent point is changed into an active
point can be regarded as the transfer of an active point from one place to another. The
probability with which the active point moves downward or in other directions determines the
morphology of the porous silicon. If the field strength along the Z axis in Fig. 1(a) is not
strong, it is a small length scale case.^ In such a case, the bias of the holes to move toward
the tips is not very strong, and their movement can be regarded as mainly random. Thus the
holes reach the tips from different directions with almost equal probabilities, which is
equivalent to the active points moving in different directions with nearly equal probabilities.
If the field strength along the Z axis is very large, it is a large length scale case.^ The
holes are swept toward the pore tips by the electric field along the Z axis, and the influence
of the random walk on the holes is much weaker. The holes always reach the tips from below, as
if the active points always moved downward along the Z axis. In lightly doped silicon, the
static electric fields near the tips are much weaker than those in heavily doped silicon. In
lightly doped silicon, the probability of the active point moving downward is only a little
greater than in other directions, as shown in Fig. 2(a). In heavily doped silicon, however, it
is much greater [see Fig. 2(b)]. This difference causes the difference in morphologies.

Fig. 2. The probability ball construction for the active points on a cubic lattice. Under
little bias (a), the active point moves in the Z direction with a slightly larger probability
than in other directions. Under strong bias (b), the probability of the active point moving in
the Z direction is much greater than that in other directions.
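
The directional bias described above can be illustrated with a short sketch. The text gives no numerical probabilities, so the values used here (a downward probability p_down of 0.2 for weak bias and 0.8 for strong bias, with the remainder shared equally among the other five directions) are purely illustrative assumptions, as are all function names. The walkers are free and ignore lattice occupancy.

```python
import random

DOWN = (0, 0, 1)  # +z is taken to point into the silicon
DIRECTIONS = [DOWN, (1, 0, 0), (-1, 0, 0), (0, 1, 0), (0, -1, 0), (0, 0, -1)]


def direction_probabilities(p_down):
    """Give the downward step probability p_down; split the rest equally."""
    if not 1.0 / 6.0 <= p_down <= 1.0:
        raise ValueError("p_down must lie between 1/6 (no bias) and 1")
    p_other = (1.0 - p_down) / 5.0
    return {d: (p_down if d == DOWN else p_other) for d in DIRECTIONS}


def choose_direction(p_down, rng):
    """Draw one step direction for an active point."""
    probs = direction_probabilities(p_down)
    return rng.choices(list(probs), weights=list(probs.values()), k=1)[0]


def mean_depth_after(n_steps, p_down, n_walkers=500, seed=0):
    """Average depth reached by free walkers after n_steps biased steps."""
    rng = random.Random(seed)
    total = 0
    for _ in range(n_walkers):
        z = 0
        for _ in range(n_steps):
            z += choose_direction(p_down, rng)[2]
        total += z
    return total / n_walkers
```

With these assumed values the expected downward drift per step is p_down - (1 - p_down)/5, that is 0.04 for p_down = 0.2 and 0.76 for p_down = 0.8, which is the qualitative contrast between spongelike and channellike growth drawn in the text.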

III.  RESULTS  AND  DISCUSSION 

Here  we  describe  the  simulation  results  and  discuss  the 
effect  of  electric  field  on  the  resultant  morphologies.  Finally, 
we  discuss  the  implications  of  this  model  for  pore  formation 
in  silicon. 

Figures 3-8 are pseudo-three-dimensional morphologies of PS. Because of the limitations of our
computer, Figs. 3-8 show only the morphologies of 50×50 areas in one layer of the cubic
lattice. In the figures showing surface morphologies, the 50×50 areas are those where
"$z = 0$, $A \le x \le A + 50$, $B \le y \le B + 50$" in the cubic lattice, where A and B are
arbitrary numbers between 0 and 150. In the figures showing a cross-sectional view, the 50×50
areas are those where "$x = A$, $0 \le z \le 50$, $B \le y \le B + 50$" or "$y = A$,
$0 \le z \le 50$, $B \le x \le B + 50$" in the lattice, where A is an arbitrary number between
0 and 200, and B is an arbitrary number between 0 and 150.

Fig. 3. Surface morphology (50×50 area) of PS with high porosity grown on a lightly doped
silicon.

Fig. 4. Cross-sectional view (50×50 area) of the same case in Fig. 3.
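
The windows defined above map directly onto slices of the lattice. The following sketch assumes a nested-list lattice indexed as lattice[z][y][x] and uses half-open ranges, so that each window contains exactly size × size points; the function names are hypothetical. A helper for the porosity of a window (the fraction of etched points, value 0) is included because layer porosities are quoted in the discussion.

```python
ETCHED = 0


def surface_window(lattice, a, b, size=50):
    """Surface view: z = 0, a <= x < a + size, b <= y < b + size."""
    return [row[a:a + size] for row in lattice[0][b:b + size]]


def cross_section_x(lattice, a, b, size=50):
    """Cross section in the plane x = a: 0 <= z < size, b <= y < b + size."""
    return [[lattice[z][y][a] for y in range(b, b + size)] for z in range(size)]


def cross_section_y(lattice, a, b, size=50):
    """Cross section in the plane y = a: 0 <= z < size, b <= x < b + size."""
    return [[lattice[z][a][x] for x in range(b, b + size)] for z in range(size)]


def porosity(window):
    """Fraction of points in a two-dimensional window that have been etched."""
    cells = [value for row in window for value in row]
    return cells.count(ETCHED) / len(cells)
```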

As results of the simulation, Fig. 3 shows the surface morphology of PS grown on lightly doped
silicon (where a depletion layer is also set up but its width is relatively large, so the field
strength is relatively small), formed with a large anodic current. Figure 4 shows the
cross-sectional view of the same case. The PS exhibits a spongelike morphology: the pores wind
their way into the silicon and connect to each other. The silicon grains also connect to each
other. There is no treelike or channellike feature. The porosity in a layer is about 80%. The
morphologies of the cross section and of the surface of the PS are similar. The results shown
in Figs. 3 and 4 are in agreement with the experimental observations presented in Ref. 16. It
must be emphasized that intense luminescence is often observed on PS of high porosity, while
the photoluminescence from PS of low porosity is very weak.

Fig. 5. Surface morphology (50×50 area) of PS with low porosity grown on a lightly doped
silicon.

Fig. 6. Cross-sectional view (50×50 area) of the same case in Fig. 5.

Fig. 7. Surface morphology (50×50 area) of PS with high porosity grown on a heavily doped
silicon.

Figure 5 shows the surface morphology of PS on a lightly doped substrate under a small anodic
current. For comparison, Fig. 6 shows the cross section of that PS. The morphologies of PS
formed under a small anodic current and under a large current differ in two respects. First,
the porosity of the surface is smaller. Second, the cross-sectional morphology differs from
that of the surface: there is less connection among the pores.

Figure 7 shows the surface morphology of PS grown on heavily doped silicon (where a depletion
layer is also set up but its width is relatively small, so the field strength is relatively
large), formed with a large anodic current. Figure 8 is its cross-sectional view. There is a
clear difference between substrates of different doping levels. The surface is similar to that
of the lightly doped one, but the cross-sectional view is different: it shows a channellike
character.

Fig. 8. Cross-sectional view (50×50 area) of the same case in Fig. 7.

Our results confirm that the effect of the electric field in biasing hole motion within the
bulk of silicon is an important aspect of pore growth in silicon. An important new insight into
PS emerges from these studies. In lightly doped silicon, either p or n type, the microstructure
consists of an apparently spongelike random array of fine holes, markedly different from the
heavily doped structure, which shows a channellike cross-sectional feature. In the case of pore
propagation in heavily doped silicon, the local electric field distribution plays an important
role in controlling the rate of interfacial charge transfer, in contrast to lightly doped
silicon. Our dynamical simulation results are in favorable agreement with those reported by
Beale et al.^

The difference between the two kinds of substrates is that the active points in heavily doped
silicon move downward much faster than those in lightly doped silicon. This results in
different morphologies in the two kinds of PS. The simulated morphologies are in accordance
with the experimental results, which supports the usefulness of the model. This simple model
reproduces the main difference and explains some of the observed phenomena.

In addition, a clear dependence of the porosity on the doping level of the substrate is
predicted. The porosity increases with the doping level.

IV.  CONCLUSION 

In  summary,  the  principal  results  of  this  paper  are  as  fol¬ 
lows: 

(1)  A  simple  three-dimensional  Monte  Carlo  simulation  of 
PS  formation  has  been  developed. 

(2) Our results confirm that the effect of the electric field in biasing hole motion within the
bulk of silicon is an important aspect of pore growth in silicon. The model presented does not
consider the details of the chemistry of the process; however, the formation of a porous
nanostructure is shown to be determined by the semiconductor physics rather than by the
chemistry.

Two types of PS morphology, spongelike and channellike structures, have been reported, and
details of their differing microstructures presented. The type formed was shown to depend only
on the doping level of the substrate.

As an approximate guide, it appears that lightly doped silicon produces a fine network of
pores, whereas heavily doped material produces more of a channel structure.

(3)  A  strong  dependence  of  the  average  porosity  on  the 
doping  level  of  the  substrate  is  predicted.  The  porosity  in¬ 
creases  with  the  doping  level. 

(4)  The  result  clearly  shows  the  presence  of  spongelike 
morphology  in  intense  luminescent  PS. 

Our  results  confirm  that  the  effect  of  electric  field  biasing 
hole  motion  within  the  bulk  of  silicon  is  an  important  aspect 
of  pore  growth  in  silicon.  We  believe  that  the  present  results 
provide  a  good  guide  to  better  understanding  the  structure 
and  the  properties  of  PS. 

Etching of polysilicon features using a helical resonator plasma source is evaluated. Performance
metrics consist of etching rate, etching rate uniformity, and profile control using HBr/O2-He
gas-phase chemistry. The effect of source power, rf-bias power, and reactor pressure on etching rate
and  uniformity  is  examined  using  a  response  surface  experiment.  Feature  profile  control  is 
determined  by  examining  nested  and  isolated  lines  and  trenches  using  oxide  mask/polysilicon/oxide 
structures.  Good  uniformity  and  vertical  profiles  are  obtained  at  low  reactor  pressures,  high  source 
power, and rf-bias between 50 and 60 W. The operating point for best uniformity is at 3.5 mTorr,
3000 W source power, and 53 W rf-bias power. At this point, the etching rate is 3700 Å/min and the
nonuniformity is less than 1.0%, over 125-mm-diam wafers. Radial profiles of electron temperature
and  ion  density  near  the  wafer  surface  are  presented  as  a  function  of  source  power,  rf-bias  power, 
and  reactor  pressure.  The  ion  density  was  found  to  be  in  the  mid-10^^  cm“^  range  and  electron 
temperatures  were  5-7  eV.  An  increase  in  source  power  and  reactor  pressure  results  in  an  increase 
in  ion  density;  however,  the  electron  temperature  shows  a  weaker  dependence.  Finally,  these  results 
are  compared  to  those  using  helicon  and  multipole  electron  cyclotron  resonance  plasma  sources 
evaluated  in  previous  studies.  We  found  that  all  three  plasma  sources  provide  high  ion  density  at  low 
pressures  to  meet  performance  demands  for  polysilicon  etching;  however,  the  helical  resonator 
source  offers  somewhat  higher  etching  rate  and  better  bulk  plasma  uniformity.  ©  1996  American 
Vacuum  Society. 


I.  INTRODUCTION 

The  fabrication  of  ultra-large-scale  integrated  circuits  re¬ 
quires  increasingly  tighter  control  of  feature  profiles  in  order 
to  achieve  high  circuit  densities.^  Low  pressure,  high  plasma 
density  sources  are  becoming  the  accepted  technology  in 
many  development  laboratories  and  will  be  installed  in  sig¬ 
nificant  numbers  in  production  lines  in  the  near  future.  Due 
to  low  pressure  operation,  these  sources  provide  more  aniso¬ 
tropic  etching  without  sacrificing  ion  density  and  thus,  etch¬ 
ing  rates.  An  important  attribute  of  etch  systems  utilizing 
these  sources  is  the  ability  to  control  the  ion  density  and  the 
ion  energy  independently  through  wafer  biasing.^  Many 
techniques  are  available  for  achieving  high  plasma  densities 
at  low  pressures,  including  microwave  excitation,  such  as  in 
electron  cyclotron  resonance  (ECR)  sources,^"^  and  rf  exci¬ 
tation,  such  as  in  helicon  and  helical  resonator  sources.^’^  A 
quantitative  evaluation  and  comparison  of  these  plasma 
sources  is  necessary  in  order  to  make  intelligent  choices  for  a 
desired  application. 

The first two plasma sources we studied^” were as follows. (i) A multipole ECR, manufactured by
Wavemat of Plymouth, Michigan: The ECR plasma source consists of a 2.45 GHz microwave power
supply and a microwave cavity surrounding a bell jar. High plasma densities are achieved by
confining electrons in magnetic field cusps established by equally spaced permanent magnets
surrounding the wafer plane. (ii) An rf inductively coupled helicon, manufactured by Lucas Labs
of Sunnyvale, California: The helicon plasma source consists of an upper antenna section and a
lower confinement section. Power at 13.56 MHz is coupled into the plasma using a proprietary
antenna design. Four solenoid magnets are used to control the magnetic field. The upper two
magnets are necessary to support the helicon wave, while the bottom two magnets are used to
improve uniformity and help maintain high plasma density at the wafer plane.

Fig. 1. Helical resonator plasma source.

Fig. 2. Electron temperature (eV) in the helical resonator as a function of source power,
reactor pressure, and rf-bias.

In  our  effort  to  compare  commercially  available,  low 
pressure,  high  density  plasma  sources  using  identical  reac¬ 
tion  chambers  to  exclude  platform  dependent  effects,  we 
present  here  results  obtained  with  the  third  source,  a  12-in.- 
diam  helical  resonator  manufactured  by  Prototech  Research 
of  Tempe,  Arizona.  We  perform  a  statistical  evaluation  of 
polysilicon  etching  rates  and  uniformities  as  a  function  of 
source  power,  rf-bias  power,  and  reactor  pressure.  Based  on 
these  results,  operating  conditions  for  optimum  uniformity 
for  the  helical  resonator  source  are  determined.  The  effect  of 
process  parameters  on  polysilicon  profiles  is  then  evaluated 
and  compared.  We  also  present  radial  Langmuir  probe  mea¬ 
surements  of  the  plasma  properties  above  the  wafer.  These 
results  are  then  compared  with  those  obtained  in  our  previ¬ 
ous  studies,  using  helicon  and  ECR  plasma  sources. 

II.  EXPERIMENTAL  DETAILS 

The setup of the helical resonator plasma source is shown in Fig. 1. The plasma source consists
of a coil wrapped around a 12-in.-diam quartz tube. Radio frequency power up to 5000 W, at
13.56 MHz, is inductively coupled into the plasma through the coil. A Faraday shield inserted
between the coil and the quartz tube reduces capacitive coupling that would result in
sputtering of the quartz wall.

Fig. 3. Ion density (cm⁻³) in the helical resonator as a function of source power, reactor
pressure, and rf-bias.

The  125-mm-diam  wafer  can  be  rf-biased  (—13.56  MHz) 
through  the  wafer  platen  with  a  600  W  power  supply.  Helium 
backside  cooling  is  used  to  remove  heat  from  the  wafer  to 
the  platen.  Liquid  nitrogen  is  circulated  through  the  wafer 
platen  to  remove  the  heat,  and  the  platen  temperature  can  be 
controlled  from  -100  to  +30  °C. 

In order to characterize etching rates and uniformities, timed etches were performed on wafers
with 5000-Å-thick undoped, blanket polysilicon over 1000 Å of thermal oxide. Etching was
carried out for 15, 60, and 120 s, and five-point thickness measurements (center point and four
edge points) were taken before and after etching using a Nanometrics 216S scanning UV film
thickness measurement instrument. All etching times were shorter than that required to clear
the polysilicon film and reach the underlying oxide film. The initial polysilicon thickness is
uniform to within ±1% of the average value obtained from the five-point measurement. Uniformity
was determined as:

where ER_min, ER_max, and ER_av are, respectively, the minimum, maximum, and average etching
rates.

Fig. 4. Etching rates with the helical resonator, obtained from 5-point Nanometrics
measurements: (A) as a function of applied source power and applied rf-bias power at 3.5 mTorr,
(B) as a function of reactor pressure and applied rf-bias power at 4000 W applied source power.

Fig. 5. Etching rate uniformity with the helical resonator, obtained from 5-point Nanometrics
measurements: (A) as a function of applied source power and applied rf-bias power at 3.5 mTorr,
(B) as a function of reactor pressure and applied rf-bias power at 4000 W applied source power.
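
The defining expression for the uniformity is not legible in this copy, so the following sketch rests on an assumption: it uses a common convention in which the uniformity is quoted as plus or minus half the spread of the point etching rates relative to their average, that is ±100(ER_max - ER_min)/(2 ER_av) percent. This is consistent with the quantities named above and with the "±%" units of Table I, but it may differ from the expression actually used by the authors. The function names and the sample numbers are hypothetical.

```python
def etching_rates(thickness_before, thickness_after, etch_time_s):
    """Point etching rates in Å/min from film thicknesses (Å) and time (s)."""
    return [(t0 - t1) * 60.0 / etch_time_s
            for t0, t1 in zip(thickness_before, thickness_after)]


def uniformity_pm_percent(rates):
    """Assumed convention: half the min-to-max spread over the mean, in percent."""
    er_min, er_max = min(rates), max(rates)
    er_av = sum(rates) / len(rates)
    return 100.0 * (er_max - er_min) / (2.0 * er_av)


# Hypothetical five-point measurement (center and four edge points), 60 s etch.
before = [5000.0, 5010.0, 4990.0, 5005.0, 4995.0]
after = [1400.0, 1430.0, 1380.0, 1410.0, 1390.0]
rates = etching_rates(before, after, 60.0)
print(round(sum(rates) / len(rates)), "Å/min average")
print(round(uniformity_pm_percent(rates), 2), "± % uniformity")
```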

Etching  rates  and  uniformities  were  also  measured  by  full 
wafer  imaging  (FWI)  interferometry  (Model  No.  100-IS, 
made  by  Low  Entropy  Systems,  Brighton,  MA).  The  FWI  (5- 
or  49-point)  technique  takes  advantage  of  the  plasma  emis¬ 
sion  as  the  light  source  for  determining  in  situ  film  thickness 
and  etching  uniformity  over  the  full  area  of  the  wafer. 

The goal of any etching process development is to achieve the desired profiles while minimizing
loss of process uniformity and etching rate. In order to evaluate profile control after
etching, we etched polysilicon using hard masks since this will most likely be the mask of
choice for 0.35 µm devices. The corresponding wafers consist of 2000 Å of boron phosphorus
doped tetraethoxysilane (BPTEOS) hard mask over 5000 Å of polysilicon. The test pattern
consists of isolated and nested lines and spaces ranging from 0.30 to 0.60 µm. A scanning
electron microscope (Model No. DS-130C, made by International Scientific Instruments of
Milpitas, CA) is used to evaluate etched profiles.

Bulk  plasma  properties  (electron  temperature  and  ion  den¬ 
sity)  are  determined  by  using  a  Langmuir  probe  (made  by 
Plasma  and  Materials  Technologies  of  Chatsworth,  CA).  The 
measurements  were  performed  using  argon  discharges,  due 
to  the  difficulty  in  maintaining  probe  cleanliness  in  HBr  plas¬ 
mas.  More  details  concerning  the  probe  data  acquisition  and 
analysis  system  are  reported  elsewhere,  where  the  same 
probe  was  used  to  characterize  the  ECR  and  the  helicon 
plasma  sources. 

III.  RESULTS  AND  DISCUSSION 
A.  Bulk  plasma  properties 

The bulk plasma properties were spatially determined under the same range of conditions used to
determine the etching rate and profile control. Langmuir probe measurements of electron
temperature and electron density were performed as a function of the process parameters: source
power, reactor pressure, and rf-bias. Based on these results, the plasma source uniformity was
determined. Figure 2 shows the radial dependence of the electron temperature (eV) as a function
of source power, reactor pressure, and rf-bias power. The electron temperature is approximately
6.3 eV at a distance of 2.5 cm above the wafer surface. The electron temperature does not vary
significantly with applied rf-bias power or reactor pressure. On the other hand, the ion
density is strongly dependent on applied source power (Fig. 3). At the center of the wafer, the
ion density increases from 2.5×10^^ to 6×10^* cm⁻³ as the source power is increased from 3000
to 5000 W. The ion density varies from 2×10^^ to 4×10^^ cm⁻³ as the reactor pressure is
increased from 1 to 5 mTorr. Unlike the applied source power and reactor pressure, rf-bias
power only weakly affects ion density.

Table I. Etching conditions for response surface statistical experiment, mean etching rates,
and etching rate uniformities obtained with the helical resonator from three techniques:
(1) Nanometrics 5-point, (2) FWI 5-point, and (3) FWI 49-point measurements.

                                               Etching rate (Å/min)       Uniformity (±%)
Run   Stat.  Pressure  Power   rf-bias  Time   Nanom.   FWI      FWI      Nanom.   FWI      FWI
no.   pos.   (mTorr)   (W)     (W)      (s)    5-point  5-point  49-point 5-point  5-point  49-point
      XYZ    X         Y       Z
1     002    2.0       3000    75       90     3731     3239     3239     0.3      0.9      0.9
2     010    2.0       4000    25       120    2605     2302     2302     2.3      1.3      1.3
3     022    2.0       5000    75       75     3958     3392     3305     1.9      2.6      3.5
4     020    2.0       5000    25       120    2606     2278     2244     2.0      3.6      4.6
5     202    5.0       3000    75       90     4208     3762     3773     0.7      1.1      1.6
6     112    3.5       4000    75       90     4083     3661     3639     0.9      1.1      1.7
7     220    5.0       5000    25       120    3169     2834     2801     1.7      2.2      3.7
8     021    2.0       5000    50       120    3100     2808     2761     2.0      2.9      3.7
9a    022    2.0       5000    75       75     3862     3255     3183     2.7      3.8      4.5
10    101    3.5       3000    50       120    b        3178     3178     b        0.7      1.3
11    000    2.0       3000    25       120    2556     2178     2187     0.7      0.0      1.0
12    111    3.5       4000    50       90     3678     3228     3203     1.0      1.3      1.9
13    211    5.0       4000    50       90     4047     3596     3608     0.8      0.6      1.2
14a   202    5.0       3000    75       90     4239     3843     3820     0.7      1.1      1.6
15a   220    5.0       5000    25       120    3118     2873     2785     1.8      2.1      3.7
16    122    3.5       5000    75       75     4154     3586     3499     2.1      4.0      4.1
17    222    5.0       5000    75       75     4560     4023     3916     2.3      3.1      3.7
18a   000    2.0       3000    25       120    2580     2228     2231     0.5      0.9      1.9
19    200    5.0       3000    25       120    2708     2808     2808     0.4      0.7      1.5
20a   101    3.5       3000    50       90     3686     3178     3178     0.2      0.7      0.7

a Repeats.
b No data available due to clearing of polysilicon.


B.  Etching  rates  and  uniformities 

The statistically designed experiment was based on the
systematic variation of three main variables: reactor pressure
(2-5 mTorr), plasma source power (3000-5000 W), and rf-bias
power (25-75 W). Figure 4 shows the etching rates obtained
from 5-point Nanometrics measurements (A) as a function of
applied source power and applied rf-bias power at 3.5 mTorr,
and (B) as a function of reactor pressure and applied rf-bias
power at 4000 W applied source power. The rf-bias power has a
stronger effect on the measured etching rate than either the
source power or the reactor pressure. Under the experimental
conditions examined, etching rates in the range of 2600-3000
Å/min can be achieved at low reactor pressures, moderate
applied rf-bias powers, and high source power.
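
The three-variable design described above can be sketched in a few lines. This is only an illustration: the three-digit run codes in Table I appear to encode the level (0, 1, 2) of reactor pressure, source power, and rf-bias power, in that order, but that reading is inferred from the table rows rather than stated by the authors, and the names `decode_run` and `full_factorial` are hypothetical.

```python
from itertools import product

# Low, middle, and high levels of each variable as they appear in Table I.
# The digit-to-level mapping is an inference from the table, not a quoted rule.
PRESSURE_MTORR = (2.0, 3.5, 5.0)
SOURCE_POWER_W = (3000, 4000, 5000)
RF_BIAS_W = (25, 50, 75)


def decode_run(code: str) -> dict:
    """Translate a three-digit run code such as '220' into process settings."""
    p, s, b = (int(ch) for ch in code)
    return {
        "pressure_mTorr": PRESSURE_MTORR[p],
        "source_power_W": SOURCE_POWER_W[s],
        "rf_bias_W": RF_BIAS_W[b],
    }


def full_factorial() -> list:
    """All codes of the full three-level, three-variable grid.

    The study itself ran only a subset of these points plus repeats.
    """
    return ["".join(str(d) for d in levels) for levels in product(range(3), repeat=3)]


print(decode_run("220"))
print(len(full_factorial()), "points in the full grid")
```

Decoding a code this way makes it easy to check a table row against its listed settings, for example that run code 220 corresponds to 5.0 mTorr, 5000 W, and 25 W.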

Figure 5 displays the etching rate uniformities obtained
from 5-point Nanometrics measurements (A) as a function of
applied source power and applied rf-bias power at 3.5 mTorr,
and (B) as a function of reactor pressure and applied rf-bias
power at 4000 W applied source power. Under our experimental
conditions, the source power has a larger effect on etching
rate uniformity than the applied rf-bias power or the reactor
pressure. The best uniformity is achieved at high applied
source powers, low reactor pressures, and applied rf-bias
powers between 50 and 75 W. Under these conditions the
uniformity is approximately ±0.2%.

Measurements of the etching rate and uniformity were also
performed with full wafer interferometry (FWI) techniques (5
and 49 point). The corresponding results are summarized in
Table I. The etching rate and etching rate uniformity values
obtained from 5-point Nanometrics measurements are smaller
than those obtained from 5-point and 49-point FWI
measurements. This difference may be caused by a difference in
the polysilicon refractive index assumed by the two
techniques. In addition, averaging over 49 points instead of 5
points explains the difference in the values obtained within
the same (FWI) technique.

C.  Profile  control 

Figure 6 shows the SEM micrographs of isolated lines
(a1), nested lines (b1), and isolated trenches (c1) obtained
with the helical resonator at 2 mTorr, 50 W rf-bias, and 3000
W source power using 100 sccm of pure HBr. For comparison,
we show in the same figure the corresponding SEM
micrographs (a2, b2, c2) obtained with the helicon source,
using the same chemistry, pressure, and rf-bias, but at 2500
W source power. For both sources, there is a slight undercutting
of the mask but no microtrenching (i.e., the relatively
faster etching rates observed near sidewalls). A decrease in
the applied rf-bias power while holding both the reactor pressure
and applied source power constant results in an increase
in mask undercutting, as indicated in Fig. 7. As the rf-bias
decreases, two factors may contribute to the increase in
undercutting. First, the decrease in rf-bias decreases the ion
energy normal to the etching surface and therefore decreases
the ratio of vertical to lateral etching rates. Second, the
decrease in ion energy normal to the surface decreases the
amount of oxide material sputtered from the mask. Oxide
material deposited in the feature may act as a passivation
layer. The sputtered oxide would be quickly removed from
the bottom of the feature, where significant ion bombardment
is present. However, oxide material deposited on the sidewall
of the feature would greatly reduce the lateral etching rate.
At lower applied rf-bias powers, the oxide material is no
longer available to passivate the feature sidewalls.

Fig. 6. SEM micrographs of isolated lines, nested lines, and isolated trenches obtained using pure HBr chemistry, at 2 mTorr, 50 W rf-bias, with the helical
resonator (a1, b1, c1) at 3000 W source power, and with the helicon (a2, b2, c2) at 2500 W source power.

Fig. 7. SEM micrographs of isolated lines, nested lines, and isolated trenches obtained using pure HBr chemistry, at 2 mTorr, 25 W rf-bias, with the helical
resonator (a1, b1, c1) at 3000 W source power, and with the helicon (a2, b2, c2) at 2500 W source power.

An increase in reactor pressure from 2 to 5 mTorr at fixed
rf-bias and source powers results in a very small increase in
mask undercutting, for both the helicon and helical resonator
sources (Fig. 8). The increase in mask undercutting with an
increase in reactor pressure may be due to an increase in the
reactive neutral concentration, i.e., a decrease in the
ion-to-neutral ratio.

Fig. 8. SEM micrographs of isolated lines, nested lines, and isolated trenches obtained using pure HBr chemistry, at 5 mTorr, 50 W rf-bias, with the helical
resonator (a1, b1, c1) at 3000 W source power, and with the helicon (a2, b2, c2) at 2500 W source power.

An optimized recipe for obtaining vertical profiles without
mask undercutting was developed. Acceptably high etching
rates and high etching rate uniformity had to be maintained,
since a recipe that gives vertical profiles without these
attributes is clearly not useful. As discussed before, mask
undercutting is most likely a result of the high lateral
etching rate by free-radical species produced in the
plasma. The presence of an oxide on the feature sidewall,
either due to oxide mask sputtering or addition of oxygen to
the 100 sccm HBr feed gas, decreases the lateral etching rate.
To increase the oxide passivation of the sidewall in the
optimum etching recipe, a small amount of oxygen was added to
the feed gas. The optimum oxygen chemistry was 25 sccm of
an O2-He (20%-80%) mixture. Furthermore, to decrease
the intrinsic etching rate of free-radical species, the wafer
temperature was lowered to 0 °C. Figure 9 shows the vertical
profiles obtained for the wafers etched in the helical
resonator (a1, b1, c1) and the helicon (a2, b2, c2) plasma
sources, using the optimized recipe (100 sccm HBr and 25 sccm
O2-He, 2 mTorr, 50 W rf-bias, 3000 W source power for the
helical resonator and 2500 W for the helicon).

Fig. 9. SEM micrographs of isolated lines, nested lines, and isolated trenches obtained using the optimized recipe, a mixture of 100 sccm HBr and 25 sccm
O2-He, at 2 mTorr, 50 W rf-bias, with the helical resonator (a1, b1, c1) at 3000 W source power, and with the helicon (a2, b2, c2) at 2500 W source power.

IV. COMPARISON WITH OTHER SOURCES AND CONCLUSION

We have examined the etching performance of 0.35 μm
polysilicon features in a helical resonator plasma source. We
made similar studies for the helicon and electron cyclotron
resonance sources previously. In this section, we summarize
the values of etching rates, uniformities, and bulk
plasma parameters, in order to quantify the advantages and
disadvantages among the three plasma sources.


(1) The three sources produce sufficient ion and reactive
neutral densities to give etching rates in the 2000-3700
Å/min range. They also provide high uniformity (less than
±5% across the 125-mm-diam wafers), as indicated in Table
II. The helical resonator offers a higher etching rate and better
uniformity, probably due to the higher ion density induced by
high excitation power and a large plasma generation volume.

(2) The behavior of etched profiles is similar for all three
sources as a function of rf-bias and reactor pressure. In all
sources, vertical sidewalls are attained by adding oxygen to
the feed gas and lowering the wafer platen temperature to
0 °C. The addition of oxygen results in an increase in
passivation of feature sidewalls during the etching process,
greatly decreasing mask undercutting. The decrease in wafer
temperature further reduces mask undercutting by decreasing
the lateral etching rate due to neutral species. The optimized
profiles are the same for almost the same process recipe.
Only the applied source power is unique for each source in
the optimized recipes: 1000 W for the ECR, 2500 W for the
helicon, and 3000 W for the helical resonator.

Table II. Bulk plasma properties, mean etching rate, and etching rate uniformity for the electron cyclotron resonance (ECR), helicon (H), and helical resonator
(HR) sources.

         Optimum etching rates and uniformities            Bulk plasma parameters   Plasma unif. (±%)
Plasma   Source     Press.    rf bias  Etch rate  Unif.    n_i             T_e      From   From   From
sources  power (W)  (mTorr)   (W)      (Å/min)    (±%)     (10^11 cm^-3)   (eV)     n_i    T_e    etching
ECR      1370       2.8       60       2300       1.1      0.5-3           5-9      3.6    4.1    4.1
H        2500       2.0       57       2400       2.6      2-5             5-7      2.7    3.3    3.3
HR       3000       3.5       53       3700       <1.0     2-6             5-7      <1.0   <1.0   <1.0

(3) Bulk plasma properties are similar for the polysilicon
etching process window of 2-5 mTorr and 25-75 W rf-bias.
As indicated in Table II, the ECR plasma density is slightly
lower, consistent with the observed lower etching rates. The
best bulk plasma uniformity is obtained with the helical
resonator source, due to its large plasma generation volume.
The four solenoids in the helicon could in principle provide
more flexibility in optimizing uniformity for different
operating conditions; for these studies, however, no explicit
optimization of the magnetic field in the helicon was
performed.

(4) The similarity in the performance of the three examined
sources indicates that factors other than etching rate,
uniformity, and profile control may be key in deciding
which source is best. These factors include plasma-induced
damage, ease of source operation, and process
reliability. Based on carrier lifetime, spectroscopic
ellipsometry, and x-ray photoelectron spectroscopy measurements,
these plasma sources produce minimal damage during silicon
etching under optimum conditions. Thus, manufacturability
and ease of use will determine which source succeeds
commercially. Ease of operation, plasma stability, and
repeatable plasma ignition become relatively more important
when deciding which source to use for a particular
manufacturing application. For reproducibility and multistep
process recipes, automatic matching is necessary. The helicon
and the helical resonator both have automatic matching and
display high plasma stability, which greatly simplifies the
etching process optimization. The ECR source we used lacks
an automatic matching network and displays plasma
instability (flickering and frequent mode changes) at high
applied source power. This flickering is accompanied by large
changes in reflected power. The lack of automatic matching
for the ECR requires the use of dummy wafers to determine
matching settings for each experimental condition, another
disadvantage. Of course, one must consider that these
advanced sources are continually being improved, and we
expect technological problems such as tuning and mode
stability in the ECR to be solved eventually.

In conclusion, we have examined three commercially
available, low pressure, high density plasma sources using an
etching platform consisting of identical reactor chambers to
exclude platform-dependent effects such as wafer platen
design and reactor geometry. We studied the etching of 0.35
μm polysilicon gate structures for complementary
metal-oxide-semiconductor devices. Etching rate, uniformity,
profile control, and bulk plasma parameters were compared for
the three sources. We found that the three plasma sources
(ECR, helicon, and helical resonator) can all meet
sub-0.35-μm etching demands. Manufacturability and ease of
use will determine which source succeeds commercially.
Temperature-dependent surface morphologies of Si(100) resulting from Br etching at 700 and 800
K have been studied using scanning tunneling microscopy. Quantitative analysis of linear pit
distributions allows us to interpret the mechanisms involved in surface pattern formation. We show
that linear chains growing at 800 K on 450-Å-wide terraces are in quasi-steady-state conditions in
that their lengths follow the most probable Flory-Schulz distribution. Results obtained under
equivalent conditions at 700 K, however, demonstrate that this distribution is not reached. The
difference reflects the rate of step retreat relative to the rate of pit growth. At 700 K, pits that develop
on terraces are erased before their length distributions reach quasi-steady-state values even though
the overall morphology does not change. The surface morphologies that are reached at long times
can then be understood by considering the temporal evolution of linear chains on terraces where
quasi-steady-state conditions are not fulfilled. © 1996 American Vacuum Society.


I.  INTRODUCTION 

Halogen etching is widely used in the dry processing of
semiconductors. It is therefore important to understand the
mechanisms of etching so that they can be better
controlled. Scanning tunneling microscopy (STM) can be
used to investigate adsorption and etching phenomena with
atomic resolution, yielding insight into surface structures and
the kinetics of etching. Chander et al. have shown that etch
pit growth on Si(100)-2×1 can be quantified, that the size
distribution and shape anisotropy of the pits can be deduced,
and that differences in activation energies for Si removal
along and perpendicular to the dimer row direction can be
determined. From analysis of STM images of etched
surfaces, it has been demonstrated that the lengths of
Cl-induced linear etch pits on Si(100)-2×1 follow the most
probable Flory-Schulz size distribution when
quasi-steady-state conditions are met and under conditions
where the probability of linear growth remains constant
irrespective of pit length.

For a very wide terrace, it is expected that the longer the
time of exposure to an etchant, the greater the damage on the
surface. However, it is observed in our experiments that
terraces reach a steady-state etched morphology and that this
morphology is temperature dependent. In this article, we
report analysis of STM results for bromine etching of
Si(100)-2×1 at 700 and 800 K. Images were obtained under
conditions of halogen flux and fluence such that surface
morphologies would reach steady state, i.e., they would not
change with further exposure. The resulting distributions
of terrace pits were, however, quite different. Through
analysis of the pit size distributions at 700 K, we demonstrate
the consequences of having a finite dimension for terraces and
the connection between step retreat and pit growth.
Significantly, steps and their separation play a crucial role in
surface layer etching and the resulting morphologies.

II.  EXPERIMENT 

The  experiments  were  performed  in  an  ultrahigh  vacuum 
chamber  equipped  with  a  Park  Scientific  Instruments  STM. 
The base pressure of the system was 5×10^-11 Torr. Silicon
wafers oriented within 0.2°-0.5° of (100) and miscut toward
[110] were rinsed in ethanol prior to introduction into
vacuum. The wafers were degassed at 600 °C for several
hours and then flashed to 1200 °C for 1-2 min. This produces
clean, well-ordered Si(100)-2×1. Sample temperatures
were monitored with an optical pyrometer. Electrochemically
etched tungsten tips were cleaned using electron
bombardment. Scan dimensions were calibrated using the
Si(100)-2×1 lattice and single height steps. All micrographs
were acquired in a constant current mode at room temperature.
An electrochemical cell was used to provide molecular
bromine for dosing. The samples were ~3 cm from the
mesh electrode during exposure. During cell operation, the
chamber pressure remained below 1×10^-10 Torr. Stable cell
currents of 10 μA were used, releasing ~3.1×10^13 Br2
molecules/s. Bromine exposures were done under conditions of
constant flux for a predetermined amount of time (160 and
320 s), so exposures are quoted in units of mA s. The
samples were cooled rapidly to room temperature
immediately upon terminating the flux.
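
The conversion between cell current, molecular flux, and the exposure quoted in mA s can be checked with a short calculation. The sketch below assumes that two electrons pass through the cell for each Br2 molecule released, which is the assumption that reproduces the flux quoted above; the function names are illustrative.

```python
E_CHARGE = 1.602176634e-19  # elementary charge in coulombs


def br2_flux(cell_current_A: float) -> float:
    """Br2 molecules released per second, assuming two electrons per molecule."""
    return cell_current_A / (2.0 * E_CHARGE)


def br_atoms(cell_current_A: float, time_s: float) -> float:
    """Total number of Br atoms delivered during an exposure."""
    return 2.0 * br2_flux(cell_current_A) * time_s


current = 10e-6      # 10 microamperes, as used in the experiments
exposure_time = 160  # seconds, giving a 1.6 mA s exposure

flux = br2_flux(current)
atoms = br_atoms(current, exposure_time)
print(f"flux  = {flux:.2e} Br2 molecules/s")
print(f"atoms = {atoms:.2e} Br atoms for {current * exposure_time * 1e3:.1f} mA s")
```

With these inputs the flux comes out near 3.1×10^13 molecules/s and the 1.6 mA s exposure near 10^16 Br atoms.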

III.  RESULTS  AND  DISCUSSIONS 

Figure 1 shows two occupied-states images of Si(100)-2×1
obtained after exposure to 1.6 mA s Br2 (flux 10 μA,
time 160 s). They correspond to etching at 700 and 800 K,
respectively. The 2×1 nature of the substrate is evident from
dimer rows oriented along the image diagonals. Bromine
molecules adsorb dissociatively to produce Br atoms bonded
to the dimer dangling bonds. While etching is minimal at low
temperatures, Br can break dimer bonds at 700 and 800 K to
form volatile species. Etching involves Si removal from
steps and from terraces. Si atoms along steps are more
susceptible than their terrace counterparts since Si atoms are
bound less tightly at steps than on pristine terraces and since
steps act as accumulation sites for Br. Step etching then
constitutes a "layer by layer" mechanism for Si removal. At the
same time, steps also play an important role in the final
terrace morphology since they erase pits that are growing
sufficiently near the step.

Pits are initiated by removal of a single terrace dimer,
creating a dimer vacancy. The desorption of a SiBr_x unit is
the limiting process since barriers for surface diffusion are
much smaller than energies for desorption. This dimer
vacancy destabilizes adjacent dimers. Given sufficient time, the
pit can grow laterally to produce features that are one atom
layer deep. As a consequence of dimer breaking, some Si
atoms are ejected onto the surface where they can form
regrowth islands that are one layer high. Alternatively, a freed
Si atom can be removed from the surface as a volatile
species, contributing to the overall rate for dimer removal. In
principle, dimer vacancies can be mobile, as on the clean
surface, but this mobility is not sufficient on halogen-rich
surfaces to account for the observed morphologies.

Figure 1 shows that the etch pits, whose formation is the
main focus in this article, have different sizes and shapes.
The smallest pits are the single dimer vacancies. Larger pits
can be characterized as linear pits, L, that are one row wide,
linear pits that have a dimer vacancy branch on an adjacent
row, L+B, pits that are two rows wide, L+L, and so on.
(They could be further characterized according to the
location of the branch site.)

Fig. 1. Occupied-state STM image of Si(100)-2×1 after 1.6 mA s exposure
to Br2 at (a) 700 K and (b) 800 K, corresponding to 10^16 atoms or about 16
monolayer equivalents of Br. Terrace etching produces one layer deep etch
pits and Si regrowth islands. The pits can be characterized based on their
shapes as being linear and one row wide, L, linear with a dimer vacancy on
an adjacent row, L+B, and linear and two rows wide, L+L.

Fig. 2. Scheme depicting the growth of linear pits from the generation of
single vacancies. Linear pits are generated by linear growth of shorter linear
pits and are consumed by branching. Competition between different
pathways produces the observed distribution.


A.  Kinetics  of  linear  growth 

Figure 2 depicts pit growth. A linear pit denoted VVV, for
example, can grow by the removal of a dimer from either
end of the pit (with a rate constant $k_l$). Alternatively, a linear
pit can branch at either end (rate constant $k_{EB}$) or can branch
at middle sites (rate constant $k_{MB}$). Note that a first-neighbor
dimer on a row adjacent to a single vacancy has an extra
repulsive interaction, due to the presence of an extra second
neighbor, compared with the corresponding dimer next to a
longer linear pit. That is why the branching rate constant for a
single vacancy, denoted $k_{EB1}$, is different from the general
rate constant for branching at the end, $k_{EB}$ (in fact,
$k_{EB1} > k_{EB}$).

After taking into account the statistical factors related to
each event, the following equations can be written:

$$\frac{d[V_1]}{dt} = G - 2\,(k_l + k_{EB1})\,[V_1], \tag{1}$$

$$\frac{d[V_i]}{dt} = 2k_l\,[V_{i-1}] - 2\,\bigl(k_l + 2k_{EB} + (i-2)\,k_{MB}\bigr)\,[V_i], \qquad i = 2, 3, \ldots, \tag{2}$$

where $[V_i]$ is the surface density of linear vacancies and $G$
is the generation rate of single vacancies. Since we are dealing
with small amounts of dimer removal (a small fraction of
a monolayer), $G$ can be considered to be independent of
time. For simplicity, we consider that branching occurs only
at pit ends. This has been shown to be the dominant branching
process for linear pits of lengths $i < 10$ (Ref. 10). These
simplifications allow us to write analytical solutions to Eqs.
(1) and (2), namely

$$[V_1] = [V_1]_{ss}\,\bigl\{1 + (r-1)\exp[(f^{-1}-1)\,ct]\bigr\}, \tag{3}$$

$$[V_2] = p_l\,[V_1]_{ss}\,\bigl\{[1-\exp(-ct)] + (r-1)\,f\,\bigl[\exp[(f^{-1}-1)\,ct]-\exp(-ct)\bigr]\bigr\}, \tag{4}$$

$$[V_i] = p_l^{\,i-1}\,[V_1]_{ss}\,\Bigl\{1-\exp(-ct)\sum_{k=0}^{i-2}\frac{(ct)^k}{k!} + (r-1)\,f^{\,i-1}\Bigl(\exp[(f^{-1}-1)\,ct]-\exp(-ct)\sum_{k=0}^{i-2}\frac{(ct/f)^k}{k!}\Bigr)\Bigr\}, \qquad i = 3, 4, \ldots, \tag{5}$$

where $[V_1]_{ss}$ is the density of single vacancies in
quasi-steady state (term used here to denote the solution of the
above equations for very long times). The term $p_l$ represents
the probability for linear growth, $r$ takes into account the
initial density of defects, and $c$ and $f$ are constants. These
quantities are defined as

$$p_l = \frac{1}{1 + 2k_{EB}/k_l}, \tag{6}$$

$$r = \frac{[V_1]_{t=0}}{[V_1]_{ss}}, \tag{7}$$

$$c = 2k_l + 4k_{EB}, \tag{8}$$

$$f = \frac{k_l + 2k_{EB}}{2k_{EB} - k_{EB1}}. \tag{9}$$

Since the surface morphology does not change with further
exposure, the effects due to initial surface defects have
disappeared as step etching has created fresh terraces. Thus,
given sufficient time, step edges have moved to erase the
original exposed surfaces, and the effects of possible initial
defects will not appear in the mathematical expressions. If no
defects are assumed on the freshly created terraces, we can
consider the initial density of defects to be zero. Figure 3(a)
shows the calculated evolution of linear vacancy densities as
a function of time, $ct$, for $T = 700$ K, where $f$ was determined
using the interaction energy values from Ref. 10.

If the time required for the step to retreat across the width
of the terrace is greater than the time needed to reach
quasi-steady state, then Eq. (5) will reduce to
$[V_i]_{ss} = p_l^{\,i-1}[V_1]_{ss}$,
which is the most probable Flory-Schulz size distribution. In
this case, the densities of linear pits reach their maximum
values and a plot of size distribution establishes a straight
line with slope equal to $\ln p_l$. On the other hand, if the time
required for the step to retreat across the width of the terrace
is lower than the time needed to reach quasi-steady state,
then the linear pit densities do not reach their maximum
values (according to the Flory-Schulz distribution). In this
case, the slope in a plot of size distributions will present a
steeper aspect, indicating that long pits are unlikely. Figure
3(b) shows the resulting size distribution as a function of
$(i-1)$ for representative values of $ct$. For large enough $ct$,
the distribution is a straight line. For smaller values of $ct$, it
adopts the form of a smooth curve with a slight downward
concavity. Accordingly, measurements that give size
distributions can be used to assess the relative importance of
step retreat.
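
The approach to the Flory-Schulz limit described above can be illustrated by integrating the rate equations numerically. The sketch below is not the authors' calculation. It assumes branching at pit ends only, a defect-free starting terrace ($r = 0$), and the dimensionless forms $du_1/d(ct) = (1 - f^{-1})(1 - u_1)$ and $du_i/d(ct) = p_l u_{i-1} - u_i$ with $u_i = [V_i]/[V_1]_{ss}$, which follow from Eqs. (1) and (2) if $c = 2k_l + 4k_{EB}$ and $p_l = 2k_l/c$. The value $f = 2$ is purely illustrative, since the paper takes $f$ from Ref. 10.

```python
import math


def evolve_pits(p_l, f, ct_end, n_max=10, steps=6000):
    """Integrate the end-branching rate equations in dimensionless time ct.

    Returns a list u with u[i-1] = [V_i]/[V_1]_ss for i = 1..n_max, starting
    from a defect-free terrace. Classical fourth-order Runge-Kutta is used.
    """
    a = 1.0 - 1.0 / f  # decay rate of the single-vacancy transient, in units of c

    def rhs(u):
        du = [a * (1.0 - u[0])]
        du += [p_l * u[i - 1] - u[i] for i in range(1, n_max)]
        return du

    u = [0.0] * n_max
    h = ct_end / steps
    for _ in range(steps):
        k1 = rhs(u)
        k2 = rhs([x + 0.5 * h * k for x, k in zip(u, k1)])
        k3 = rhs([x + 0.5 * h * k for x, k in zip(u, k2)])
        k4 = rhs([x + h * k for x, k in zip(u, k3)])
        u = [x + h * (q1 + 2.0 * q2 + 2.0 * q3 + q4) / 6.0
             for x, q1, q2, q3, q4 in zip(u, k1, k2, k3, k4)]
    return u


P_L = 0.73  # probability of linear growth reported for 800 K
F = 2.0     # illustrative value only, not taken from the paper

for ct in (4.0, 12.0, 60.0):
    u = evolve_pits(P_L, F, ct)
    # ln([V_i]/[V_1]) against i - 1; a straight line of slope ln(p_l)
    # corresponds to the Flory-Schulz distribution.
    curve = [math.log(u[i] / u[0]) for i in range(1, len(u))]
    print(f"ct = {ct:5.1f}:", " ".join(f"{y:7.3f}" for y in curve[:5]))
print("Flory-Schulz slope ln(p_l) =", round(math.log(P_L), 3))
```

For small $ct$ the computed curve falls below the straight line, and for large $ct$ successive points differ by $\ln p_l$, which is the qualitative behavior attributed to Fig. 3(b).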

Fig. 3. (a) Plot showing the temporal evolution for linear pits of
representative length $i$ normalized to their final values. The term $c$ is the sum of the
rate constants for linear and branched growth, Eq. (8), and $t$ is time. For
insufficient values, $ct = 4$, the linear pit distribution is far from the
quasi-steady-state value, $ct = \infty$. (b) Plot of size distribution $\ln([V_i]/[V_1])$ vs $i-1$
for linear pits in terms of $ct$. By $ct = 4$, only the areal distributions of very
short pits are close to their final values. By $ct = 12$, pits up to nine units in
length are within a few percent of their final values. The distribution
corresponding to $ct = \infty$ is the most probable Flory-Schulz size distribution for
which quasi-steady-state conditions are met.

B. Etching at 800 K

Figure 4(a) shows a plot of $\ln([V_i]/[V_1])$ vs $i-1$ for linear
pits on Si(100)-2×1 after etching at 800 K. Linear regression
shows that most points fit a straight line. The slope, at a
95% confidence level, gives $p_l = 0.73 \pm 0.05$. Assuming, as
in Refs. 3 and 4, that the rate constants can be related to
activation energies via an Arrhenius relationship,
expressions of the form $k = a\exp[-E/kT]$ can be written. The
preexponentials should be equal since similar bond-breaking
processes are involved in each etching event, namely the
removal of a Si atom and the ejection onto the surface of the
now-unpaired Si atom. Hence,
$k_{EB}/k_l = \exp[-(E_{EB}-E_l)/kT]$. From the experimental value of $p_l$ and Eq. (6),
we find $E_{EB}-E_l = 0.12 \pm 0.02$ eV. This analysis parallels that
of Ref. 4 [etching of Si(100)-2×1 by Cl at 850 K], where the
same difference in dimer removal energy, $E_{EB}-E_l$, was
deduced. The good fit obtained with a straight line indicates
that branching occurs preferentially at the ends of the pits,
implying that $k_{MB}$ is small, and that $p_l$ is not greatly affected
for $i < 10$.

Fig. 4. (a) Plot of areal distribution of linear pits of size $i$, $[V_i]$, normalized
to the areal density of a single vacancy, $[V_1]$, for Si(100)-2×1 after etching
at 800 K with a Br2 fluence of 1.6 mA s at a flux of 10 μA. The straight line
indicates the fit obtained using the Flory-Schulz equation with the
probability $p_l = 0.73 \pm 0.05$. Linearity indicates that the probability of linear growth
can be considered independent of pit length for $i < 10$. (b) Plot of areal
distribution of linear pits of size $i$ after etching at 700 K [same fluence and
flux as in Fig. 4(a)]. The curve for $ct = \infty$ shows the Flory-Schulz size
distribution while those corresponding to $ct = 3$ and 4 are distributions for
which quasi-steady-state conditions are not fulfilled. The failure to reach
quasi-steady state at 700 K reflects erasure of terrace pit distributions by
retreating steps.
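
The energy difference quoted for 800 K can be reproduced with a short calculation. The sketch assumes that Eq. (6) has the form $p_l = 1/(1 + 2k_{EB}/k_l)$ and that the two rate constants share the same preexponential factor, as argued in the text; the function name is illustrative.

```python
import math

K_B_EV = 8.617333e-5  # Boltzmann constant in eV/K


def branching_energy_gap(p_l: float, temperature_K: float) -> float:
    """E_EB - E_l in eV, from p_l = 1 / (1 + 2 k_EB / k_l) with equal prefactors."""
    ratio = (1.0 - p_l) / (2.0 * p_l)  # k_EB / k_l
    return -K_B_EV * temperature_K * math.log(ratio)


# Central value and the ends of the quoted +/- 0.05 interval for p_l at 800 K
for p in (0.68, 0.73, 0.78):
    print(f"p_l = {p:.2f} -> E_EB - E_l = {branching_energy_gap(p, 800.0):.3f} eV")
```

Under these assumptions the central value of $p_l$ gives an energy difference close to 0.12 eV, and the ends of the confidence interval shift it by roughly 0.02 eV in either direction.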

C.  Etching  at  700  K 

In Fig. 4(b) we plot $\ln([V_i]/[V_1])$ vs $i-1$ for linear pits
after etching at 700 K. The etching conditions, halogen flux,
and exposure time were the same as those for etching at 800
K. Doubling the fluence to 3.2 mA s did not alter the pit
distribution. It is apparent that etch pits and terrace chains
are, in general, much shorter than those observed after
etching at 800 K.

Analysis similar to that carried out above predicts a value
of $p_l$ of about 0.76 for etching experiments at 700 K. This
assumes quasi-steady-state conditions, $ct \to \infty$. As seen in Fig.
4(b), however, the experimental results indicate that the
linear pit distribution is far from the quasi-steady-state
condition. Indeed, a better fit is produced with $ct$ of about 4. Thus,
a given terrace is exposed to etching conditions for too
small a $ct$ to reach quasi-steady state, the linear pits do
not reach their final lengths, and we find behavior
corresponding to $ct < \infty$.
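
The extrapolation of $p_l$ to 700 K can be sketched in the same way. The calculation below assumes the form $p_l = 1/(1 + 2\exp[-(E_{EB}-E_l)/kT])$, which combines the Arrhenius ratio with the assumed form of Eq. (6), and scans the 0.12 ± 0.02 eV interval found at 800 K; the function name is illustrative.

```python
import math

K_B_EV = 8.617333e-5  # Boltzmann constant in eV/K


def linear_growth_probability(gap_eV: float, temperature_K: float) -> float:
    """p_l from the branching energy gap, assuming equal Arrhenius prefactors."""
    ratio = math.exp(-gap_eV / (K_B_EV * temperature_K))  # k_EB / k_l
    return 1.0 / (1.0 + 2.0 * ratio)


for gap in (0.10, 0.12, 0.14):
    p = linear_growth_probability(gap, 700.0)
    print(f"E_EB - E_l = {gap:.2f} eV -> p_l(700 K) = {p:.2f}")
```

Over the quoted uncertainty in the energy difference, the predicted values bracket the figure of about 0.76 given in the text, and in every case $p_l$ is larger at 700 K than at 800 K for the same energy difference.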

We  stress  that  the  surface  etched  at  700  K  is  in  a  steady  or 
unchanging-morphology  state  in  the  sense  that  exposures  are 
sufficiently  long  that  further  etching  does  not  alter  the  pit 
distributions.  Although  a  particular  feature  will  change  its 
appearance,  the  overall  morphologies  are  indistinguishable 
after  reaching  this  steady  state.  However,  the  condition  for 
(mathematical)  quasi-steady  state  for  the  areal  densities  of 
linear  pits  cannot  always  be  applied  because  many  pits  do 
not  reach  their  final  values. 


IV.  CONCLUSIONS 

Step atoms are more susceptible to removal than their terrace counterparts. At low enough
temperature, we expect that the ratio in etching rates for step and terrace atoms would be so
high that step etching would be the only relevant mechanism. An increase in temperature has a
more profound effect on the rate of the higher energy process than on the rate of the lower
energy process (as well as on the creation of the initial dimer vacancies). Accordingly, the
ratio in their rates diminishes with temperature: the higher the temperature, the greater the
relative amount of terrace etching. The time available for terrace etching between step
erasures is then insufficient at 700 K, and linear pits do not reach their quasi-steady-state
values. At 800 K, the largest analyzed pits can be considered to be in the quasi-steady-state
condition. At 900 K, the rate of pit formation and growth is so high that pits grow into one
another to form large structures that function like steps. Analysis of this situation is more
complex.

Measurements at a fixed temperature as a function of terrace width should show the interplay
of steps and terraces quite clearly. Increasing the width of a terrace etched at 700 K, for
example, should give the pits a greater ct between erasure events and their population should
increase toward quasi-steady state. For very narrow terraces, step retreat should always
overwhelm pit formation. Indeed, studies of Si(100)-2×1 with a 4° miscut show very few terrace
pits, although in that case the final state morphology is influenced by step-step
interactions.

An ultrahigh vacuum compatible planar corrugated millimeter (mm) waveguide structure
(410-µm-deep) possessing bi-fold symmetry and a precision beam aperture (800 µm) has been
fabricated using silicon processing technology, modeled with numerical analysis software,
geometrically characterized, and compared to a similar waveguide fabricated using deep x-ray
lithography (DXL) techniques. The waveguide was fabricated to operate at 60 GHz (λ=5 mm) with
fields suitable for 2π/3 phase advance operation. Multichip alignment technology was used to
provide a semiclosed conducting surface with aperture-coupled periodic resonator cavities. A pair of
Si/Pyrex composite metallized substrates patterned with corrugated geometries has been vertically
stacked with 980-µm-diam Pyrex capillaries. Geometrical analysis of the muffin-tin waveguide was
divided into two classifications: substrate feature error and die-to-die orientation error. Both types of
error were characterized with the following results: feature accuracy was maintained to 0.1%-1.0%
tolerances in all directions (5 µm or less in most cases) and die-to-die aperture distance agreed to
within ~3% of theoretical calculation. Methods of improving these geometrical tolerances are
suggested and critical issues are addressed. Electromagnetic testing of the mm waveguide has been
investigated and a bead was fabricated for use in a bead-pull perturbation measurement of
acceleration properties. The concluding section compares deep-etch silicon and DXL approaches for
the fabrication of the "micro-linac." It is concluded that, through further refinement of thermal and
conductive properties, the silicon waveguide is a viable method of constructing a micro-linac
mm waveguide, requiring less fabrication complexity, processing time, and capital equipment
investment than DXL. © 1996 American Vacuum Society.


I. INTRODUCTION

Electron acceleration at frequencies in the millimeter wavelength range requires conductive
structures on the scale of several hundred microns with precise geometries. Gaussian random
error analysis carried out by researchers at Argonne National Laboratory (ANL) indicates that
approximately 10 µm (~0.28% error) or better of transverse dimensional accuracy is required
to limit normalized electron beam emittance growth. A general dimensional tolerance of 5 µm
feature accuracy is necessary to maintain the structure's fabrication to within an order of
magnitude of the widely used 0.1% dimension accuracy rule applied to "macro" linear
accelerators.

Henke designed a planarized double-sided muffin-tin geometry for use in mm-wave particle
accelerator applications typically implemented with capacitive-plate loaded cylindrical
waveguides. Researchers at ANL have investigated 50 MeV linear acceleration systems at 120
and 60 GHz utilizing Henke's planar geometry that are implementable with microfabrication
techniques. A 60 GHz structure has been developed at the University of Illinois'
Microfabrication Applications Laboratory with adaptations of standard ultraviolet (UV)
silicon fabrication methods (Fig. 1, Table I).

Each waveguide die consists of anodically bonded silicon and Pyrex wafers. The Si [(110) wafer
orientation] is anisotropically etched to form the waveguide pattern. Concurrently, capillary
seatings 600 µm wide are patterned to hold capillaries for stacking. These seatings will
provide a precision 800 µm iris opening between die when used in conjunction with 1-mm-diam
cylindrical spacers. The (110) silicon etch terminates on (111) planes to form ~70.54°
parallelograms when viewed normal to the wafer surface. An array of parallelograms is etched
to form the basis for cavity structures separated by aperture flutes [Fig. 2(a)]. The
nonorthogonal short sides of the parallelograms are removed with a wafer dicing saw, leaving
central aperture flutes with a calibrated transverse width [Fig. 2(b)]. A wafer is processed
in parallel to provide bulk material for cavity sidewalls. Two silicon strips with
near-optically smooth side edges are placed adjacent to the aperture structures and anodically
bonded in place, establishing a corrugated pattern [Fig. 2(c)]. A uniform aluminum layer is
applied to a minimum thickness of three rf skin depths (~1.2 µm at 60 GHz) for improved
electrical conduction. Placing precision-drawn (±1-µm-diam accuracy) outer-diameter (o.d.)
Pyrex capillaries into the lower die's antiparallel seatings and forming a stack with a mirror
image version of the die accurately controls die-to-die muffin-tin orientation.

The objective of this work is to demonstrate a bulk silicon micromachining-based alternative
to deep x-ray lithography (DXL) useful for fabricating mm-wave muffin-tin waveguide
structures. The fabrication method is presented as well as an outline of an alternative x-ray
fabrication process. Geometrical characterization data are examined for the 60 GHz silicon
muffin-tin structures and DXL-fabricated 240 GHz muffin-tin structures to determine each
fabrication method's merits. Anisotropic etching of single-crystal silicon does not permit
bulk micromachining of the exact muffin-tin geometry, and so the electrical impact of using an
approximate geometry in the waveguide design is examined with the aid of numerical analysis.

Fig. 1. Muffin-tin waveguide with beam aperture and interleaved side openings. (a) An ideal
machined geometry; (b) the (110) silicon-fabricated equivalent.

II.  SILICON  WAVEGUIDE  FABRICATION 


Two identical but mirrored versions of the waveguide pattern were used for the top and bottom
die. A 1.85 µm SiO2 film was grown on double-polished 3-in.-diam (410-µm-thick) (110) silicon
wafers. The design goal calls for 1.3-mm-deep structures but 410-µm-deep patterns were
fabricated for proof-of-principle. The front-side SiO2 layer was patterned with waveguide and
capillary groove features while the back-side SiO2 was completely removed in preparation for
high-temperature anodic bonding (HTB) to Pyrex.


Fig. 2. Fabrication of right-angled muffin-tin corrugation pattern from (110) silicon. (a) A
dicing saw determines the width of the vertically etched walls. The vertical walls become the
irises in the muffin-tin cavity and are 1.266 mm wide, 0.40 mm thick, and 0.41 mm high. (b) Top
view of the iris walls bonded to a Pyrex wafer. The silicon is black and the Pyrex is gray.
(c) Side strips have been bonded next to the iris walls to complete the muffin-tin cavities.


Table I. Geometry specifications for the muffin-tin waveguides (see also Fig. 1).

Specification              60 GHz silicon waveguide   240 GHz DXL waveguide
Free-space aperture gap    800 µm                     300 µm
Muffin-tin cavity depth    410 µm (fab. target)       160 µm (fab. target)
Muffin-tin cavity width    3.6 mm                     1.8 mm
Muffin-tin cavity length   1.266 mm                   300 µm
Iris wall thickness        400 µm                     100 µm

Figure  3  illustrates  the  setup  for  the  HTB  process.  A 
double-polished  3-in.-diam  18  mil  Pyrex  wafer  was  placed 
on  an  unpolished  Si  wafer.  The  unpolished  Si  surface  pre¬ 
vented  accidental  bonding  between  these  two  wafers  while 


Fig.  3.  High-temperature  anodic  bonding  setup.  A  schematic  of  the  bonding 
circuit  is  shown  in  (a)  with  a  photograph  of  the  setup  in  use  (b). 


providing  a  low  resistance  current  return  path.  The  backside 
of  the  patterned  silicon  wafer  was  pressed  (with  small 
weights)  into  intimate  contact  with  the  Pyrex  wafer  and 
bonded  at  —50  °C  by  the  application  of  a  500  V  potential. 

The anodic bond is attributed to the mobility of sodium ions in the glass. At an elevated
temperature the ions in the glass drift to the Si ground plane, leaving behind a negatively
charged depletion layer at the Si/Pyrex interface. The surfaces are pulled into tight contact
and a permanent chemical bond takes place. Intimate interface contact and similar thermal
expansion coefficients of 3.25×10⁻⁶/°C (0-300 °C) for Pyrex and 2.6×10⁻⁶/°C for Si minimize
shear stress during cool-down and postbond surface deformation that can introduce error into
the structure's geometry.

After cool-down the bond was tested for strength (~350 psi) and the presence of defects.
Following a brief BOE etch to remove unwanted oxide, a silicon etch in 40% by weight potassium
hydroxide (KOH) (balance made of H2O with isopropyl alcohol) at 85 °C was performed with
megasonic agitation. The etch was continued until the (111) plane edges terminated on the
Pyrex plane and surface profilometry revealed no remaining Si in the recesses.

As illustrated in Fig. 4, the proper iris width is obtained by guiding a dicing saw in line
with two etched fiducial markings separated approximately by the wafer's diameter. The
parallelogram fiducials' left and right obtuse angles define the iris width to
photolithographic accuracy. A Micro Automation® dicing saw with a 0.9 mil blade traversed the
structure 1 mil below the Pyrex plane in a rastering fashion that slightly overlapped
neighboring tracks until the channel was ~1 mm wide (66 cuts), and then the procedure was
repeated to define the second iris edge. Sidewall strips were constructed by patterning a
third (110) wafer with a simple mask to define (111) plane edges prior to KOH etching. A pair
of sidewall strips was held in contact with the central waveguide pattern as the second HTB
process was performed; the structure was then one modular composite.


Fig. 4. AutoCAD® mask drawing used to generate one of two waveguide pattern photomasks. The
silicon is etched in the disbonded region to prevent it from bonding to the Pyrex wafer during
the high-temperature anodic bonding operation illustrated in Fig. 3(a). After bonding, a
dicing saw is used to free the unetched silicon in the disbonded regions. The dicing saw also
removes the silicon to the left and right of the dicing markers. This step determines the
width of the iris shown in Fig. 2(b).

Aluminum was used as the conductor in the prototype structure. To achieve 1.38 µm of Al
(1.38 µm is 3 rf skin depths with 15% overdeposition at 60 GHz) on the high-profile structures
while minimizing deposition rate dependence on the angle of incidence, sputtering was used
(300 Å/min). After the sputter process a slight gap existed at the sidewall/iris interface
that would affect the current conduction path. This issue is addressed in the discussion
section. One composite of each image was stacked using glass capillaries (o.d.=980±1 µm to
theoretically yield a ~775 µm aperture gap) cut to 0.6-0.8 cm lengths. Because of material
availability, 980-µm-diam capillaries were used in place of the 1 mm capillaries required for
the design goal. The capillaries were seated into rectangular antiparallel grooves without
bonding for the purpose of alignment evaluation (Fig. 5).
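
The quoted aperture gaps can be cross-checked with a simple geometric sketch. It assumes, purely as an illustration, that each capillary rests on the two top edges of a 600-µm-wide rectangular seating in both die and does not touch the seating floor; the function name `aperture_gap_um` is hypothetical and not from the paper. Under that assumption the model gives the 800 µm design gap for 1 mm capillaries and about 775 µm for the 980 µm capillaries actually used.

```python
import math


def aperture_gap_um(capillary_od_um: float, seat_width_um: float) -> float:
    """Die-to-die gap for a cylinder resting on the top edges of a rectangular
    seating of the given width in each die (assumed contact geometry)."""
    radius = capillary_od_um / 2.0
    half_width = seat_width_um / 2.0
    if radius <= half_width:
        raise ValueError("capillary is narrower than the seating")
    # Height of the capillary axis above each die surface.
    axis_height = math.sqrt(radius**2 - half_width**2)
    return 2.0 * axis_height


SEAT_WIDTH_UM = 600.0  # capillary seating width quoted in the text

design_gap = aperture_gap_um(1000.0, SEAT_WIDTH_UM)  # 1 mm design capillaries
actual_gap = aperture_gap_um(980.0, SEAT_WIDTH_UM)   # 980 um capillaries used

print(f"design gap: {design_gap:.1f} um")
print(f"gap with 980 um capillaries: {actual_gap:.1f} um")
```

In this model a 20 µm change in capillary diameter shifts the gap by roughly 25 µm, so errors in the seating width or capillary diameter are amplified rather than transferred one-to-one.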

III.  DXL  WAVEGUIDE  FABRICATION 

Four waveguide sections were fabricated out of nickel (~160 µm thick) using a variation of
LIGA DXL. The structures were designed to be stacked using 484 µm o.d. capillaries. Cavity
cell dimensions were 1.8 mm in width by 0.3 mm in length and each cell was separated by a
0.1 mm iris wall (Table I). A silicon wafer with SiO2/Si3N4 thin films was used as a base
substrate. A Ti/Cu/Ti thin film at 100 Å/100 Å/100 Å was deposited as an electroplating base.


J.  Vac.  Sci.  Technol.  B,  Vol.  14,  No.  4,  Jul/Aug  1996 




T.  L.  Wilke  and  A.  D.  Feinerman:  Deep-etch  Si  mm  waveguide  structure 




Fig. 5. Two views of a completed (110) silicon waveguide stack. 980 µm o.d. Pyrex capillaries
were used to stack the die.


Poly(methyl methacrylate) (PMMA) was cast to a thickness of 200 µm and cured. A high-contrast
x-ray projection mask was used at the 1 GeV Aladdin accelerator (Stoughton, WI) to expose
negative features into the PMMA. Nickel was plated into the PMMA pattern to form one-half of
the planar waveguide. The PMMA was removed to reveal the final structure (Fig. 6).




Fig. 6. SEM micrograph of a nickel DXL LIGA-fabricated muffin-tin structure. The plated nickel
is approximately 160 µm thick.


Table II. MAFIA simulation results for rf parameters.

Radio frequency parameter   No slope   25:1 slope   Parameter change
Frequency (GHz)             61.46      62.08        +1.0%
Q                           2957       2799         -5.3%
r0 (MΩ/m)                   278.9      267.1        -4.2%
r0/Q (kΩ/m)                 93.1       95.4         +2.5%
α (1/m)                     4.879      5.224        +7.1%

IV.  MODELING  AND  CHARACTERIZATION  OF  THE 
WAVEGUIDES 

Since anisotropic etches of silicon have finite selectivity, some amount of cavity sidewall
slope was expected in the etched silicon waveguides. MAFIA was used to test the
electromagnetic effects of a sloped (111) iris wall. Cavity sidewall slope was considered to
have less bearing on the electrical properties than the iris wall slope.

KOH (110):(111) etch selectivity is typically between 100:1 and 200:1, though selectivity as
high as 600:1 has been reported. To magnify the effect of finite iris wall slope (m) on
electrical performance degradation, a worst-case geometric model with m=25 was analyzed.
Important rf parameters for the baseline structure, possessing infinite m, and for the m=25
structure are presented in Table II.

In the table, r0 is the shunt impedance and α is the attenuation parameter. The shunt
impedance is defined by $r_0 = -E_0^2/(dP/dz)$, where dP/dz is the rf power dissipated per
unit length and E0 is the amplitude of the axial electric field. A high r0 indicates efficient
acceleration properties, since the E field contributes to acceleration while the dissipated
power is wasted in the form of heat. The quality factor Q is a measure of the ability of an rf
cavity to resonate. It is defined as the ratio of energy stored to energy lost, per radian of
the rf wave. The r0/Q value is an important measure of the effectiveness of an accelerating
structure. Due to the highly variable ordinate projections of linear mesh densities in the
MAFIA model, the percentages of parameter changes shown in Table II are not considered
significant. It can be concluded from these data that the finite sidewall slope in the silicon
structure will not significantly degrade the electrical performance of the waveguide.
Variations in the impedance and resonant frequency due to the sloped walls can be compensated
for by adjusting the mask geometry. The validity of the modeling computation may be verified
by comparing the numerically computed r0/Q value to empirical r0/Q data gathered from field
perturbation experiments. This comparison was carried out on a 12 GHz model of the linear
accelerator and is reported elsewhere.
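
As a quick check on the "Parameter change" column of Table II, the short sketch below recomputes each relative change from the tabulated no-slope and 25:1-slope values. The dictionary keys and the helper name `percent_change` are illustrative choices, not names from the MAFIA code or the paper.

```python
# Table II entries as (no slope, 25:1 slope); values copied from the table.
TABLE_II = {
    "frequency_GHz": (61.46, 62.08),
    "Q": (2957.0, 2799.0),
    "r0_MOhm_per_m": (278.9, 267.1),
    "r0_over_Q_kOhm_per_m": (93.1, 95.4),
    "alpha_per_m": (4.879, 5.224),
}


def percent_change(baseline: float, sloped: float) -> float:
    """Relative change of the sloped-wall value with respect to the baseline."""
    return 100.0 * (sloped - baseline) / baseline


changes = {
    name: round(percent_change(base, sloped), 1)
    for name, (base, sloped) in TABLE_II.items()
}

for name, pct in changes.items():
    print(f"{name:>22s}: {pct:+.1f}%")
```

All five changes stay within single-digit percentages, which is the basis for the statement that the sloped iris wall does not significantly degrade the rf parameters.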

Due to the high operating frequencies of this mm-wave structure, the measurement of the
electromagnetic fields using conventional perturbing objects was impossible. A microfabricated
bead on a supporting insulator was processed to test the silicon waveguide. Commonly used
perturbing structures include conductive spheres or ellipsoidal needles which are suspended in
space by a relatively thin insulating string. These structures are typically used because they
can be solved analytically and are easily fabricated. The microfabricated bead approximation
to an ellipsoidal needle is an optical fiber coated with an aluminum film to form a hollow
cylinder with a length of ~550 µm [Fig. 7(a)]. The bead was fabricated on 80-µm-diam silica
optical fiber by sputtering an Al film through a capillary shadow mask [Fig. 7(b)]. The
sputter mask consists of two small-bore glass capillaries ground flat, mounted to a backing
plate, and cut to provide ~550 µm precision gaps. Bead fibers were threaded into the large
capillaries and metal was sputtered onto the regions exposed by the saw-defined slits. The
mask allows two fibers to be sputtered simultaneously. A first generation bead was used in the
electromagnetic field measurement on a 10× scale model of the 120 GHz version of the mm
waveguide. The results showed the expected shape of the magnetic field but the bare silica
fibers were quite fragile. Also, the form factor for the bead depended on the bead's
rotational angle due to the nonuniform thickness of the bead. The form factor is a scaling
factor that is used to determine how the measured frequency perturbation relates to the field
in the cavity for a particular bead geometry and material. Nylon thread provided a more rugged
support for the microfabricated bead and subsequent bead experiments with 25-125 µm-diam nylon
insulators gave accurate form factors. These beads were able to accurately map out the
electric field in the scale model.


Fig. 7. A microfabricated bead conductor (a) for use in electromagnetic field perturbation
experiments and the shadow mask (b) used in its fabrication. Aluminum was sputtered onto an
80-µm-diam glass fiber through a mask opening approximately 550 µm in length to create the
hollow conductive cylinder.


Fig. 8. SEM micrograph of the muffin-tin iris sidewall demonstrating boundary defects due to
mask/crystal plane misalignment. The surface roughness between misalignment defects appears to
be <0.2 µm.

Physical characterization using scanning electron microscopy (SEM), optical, and profilometry
instrumentation was performed on eight silicon waveguide samples. Surface roughness
requirements for the mm waveguide can be evaluated by comparing the electrical skin depth (δ)
to the average surface roughness; a roughness on the order of or greater than δ would indicate
that the conduction length that the current sees is greater than the physical length of the
particular segment. Since δ=0.27 µm at 60 GHz for a copper conductor (the material of choice),
a degraded conduction path would possess a visibly cloudy/dull surface finish. SEM and optical
data suggest that all internal exposed surfaces possess a near-optical finish (≤2000 Å average
surface roughness). A shingle step defect is present when the optical mask-to-crystal plane
alignment is poor, but the use of well placed rotational alignment marks can keep the spatial
frequency of these steps to a minimum. Figure 8 illustrates this defect on the vertical iris
wall of a silicon waveguide. Note that despite these shingle defects the surface roughness
between the misalignment defects is <0.1-0.2 µm.
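
The skin depth used in this roughness criterion follows from the standard good-conductor expression δ = sqrt(ρ/(π f μ0)). The sketch below evaluates it for copper at 60 GHz; the resistivity value is an assumed room-temperature bulk figure, not a number taken from the paper, and the helper name `skin_depth_m` is illustrative.

```python
import math

MU0 = 4.0e-7 * math.pi  # vacuum permeability in H/m


def skin_depth_m(resistivity_ohm_m: float, freq_hz: float, mu_r: float = 1.0) -> float:
    """Skin depth of a good conductor: sqrt(rho / (pi * f * mu0 * mu_r))."""
    return math.sqrt(resistivity_ohm_m / (math.pi * freq_hz * MU0 * mu_r))


RHO_CU = 1.68e-8   # assumed bulk copper resistivity (ohm m), not from the paper
FREQ_HZ = 60e9     # operating frequency quoted in the text

delta_cu_um = skin_depth_m(RHO_CU, FREQ_HZ) * 1e6

# Compare the upper end of the measured roughness range with the skin depth.
ROUGHNESS_UM = 0.2
roughness_ratio = ROUGHNESS_UM / delta_cu_um

print(f"copper skin depth at 60 GHz: {delta_cu_um:.3f} um")
print(f"roughness / skin depth: {roughness_ratio:.2f}")
```

With these assumptions the result is close to the 0.27 µm quoted above, and a 0.2 µm roughness sits somewhat below one skin depth.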

Cavity length accuracy (at the Pyrex/silicon interface) was measured with an inspection
microscope and found to be within ~±3 µm of the target dimension. Feature radii in the top
plane were found to be <5 µm, though MAFIA analysis of the cavity structures has not proven
that feature radii greater than 5 µm have a deleterious effect on the guide's acceleration
properties. The feature height accuracy depends only on the accuracy with which the (110)
wafers are sliced; high-precision custom thickness wafers are commercially available that
offer ±1 µm thickness accuracy. Feature height runout >1 µm was not detected and sample bow
(over a 2 cm length) was ~6 µm. Cavity width accuracy (limited by the dicing saw positioning
and grinding accuracy) was found to be marginal, varying by ±10 µm with an additional error of
up to +20 µm due to sidewall placement error.

SEM characterization of four DXL-fabricated samples suggested <0.1 µm of surface roughness
with the exception of the top Ni plated surface, which had a 1-2 µm average surface roughness
confirmed by profilometry data. Feature width was controlled to ~2 µm accuracy with a ~2 µm
corner radius in the top plane (Fig. 9). The average feature radius in the profile cross
section was <0.25 µm. All features possessing widths ≤15 µm exhibited edge-bead plating
effects. Surface profilometry revealed an average feature height of ~160 µm with a runout of
16 µm (adjusted to account for substrate bow).


Fig. 9. High-magnification SEM micrograph of a muffin-tin cell corner in a LIGA-fabricated
waveguide structure. The waveguide structure's sidewall roughness appears to be ~0.1 µm.

The second source of geometrical error, that of top-to-bottom die orientation, has been
evaluated for a silicon waveguide stack using optical microscopy. Wafer-to-wafer aperture
distance was measured to be 750±8 µm (for a target aperture of 775 µm), representing a 3.2%
dimensional error attributed to capillary seating sidewall slope [from (111) direction
etching] and feature overetch of a few microns per die.

Die-to-die alignment was evaluated in two DXL waveguide stacks constructed with 484±1 µm o.d.
capillaries and an adhesive Apiezon wax (Fig. 10). SEM measurements indicated ±5 µm
translational accuracy (in the beam aperture plane), which correlates with the calculated
geometrical error due to the unfinished top Ni surface. Aperture size measurements were
unreliable due to the large amount of nickel thickness variation in the DXL samples.


Fig. 10. SEM micrograph of a 240 GHz LIGA muffin-tin waveguide stack. 484 µm o.d. Pyrex
capillaries were used to stack the two die.


V. DISCUSSION

The silicon processing method does not yield the high precision that DXL offers but it has the
benefit of being inexpensive and relatively simple. The 120 GHz linear accelerator proposed
would require 4.95 m of waveguide sections, or 140 3-in. wafer substrates for 70 sections. It
can be assumed that the 60 GHz mm waveguide described in this paper would require at least the
same number of 3 in. substrates. DXL manufacture of this many devices would be a very costly
and time-consuming procedure, a tradeoff that probably will not justify the accuracy
enhancement. Numerical analysis, to be verified by bead-pulling measurements, has demonstrated
that iris wall slope does not significantly degrade the electric field characteristics. Other
geometrical criteria described by error analysis have been met marginally for all
topographical measurements.

Die-to-die iris separation (i.e., aperture height) and transverse iris width control require
improvement, being unacceptable and marginal, respectively. The dicing saw step, which uses
photolithographically defined markers to locate where the iris extents should terminate (i.e.,
determine iris width), introduces the largest error in the system. The 20±10 µm iris width
variation (calculated over multiple wafer batch runs) corresponds to a 0.83% maximum feature
width error, which exceeds the error limit suggested by electron beam analysis.
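
The percentage errors quoted here can be reproduced with a one-line ratio. The sketch below assumes that the reference dimension is the 3.6 mm muffin-tin cavity width from Table I; the text does not state this explicitly, but the assumption recovers both the 0.83% worst-case figure and the ~0.28% limit associated with a 10 µm transverse error. The name `percent_error` is illustrative.

```python
CAVITY_WIDTH_UM = 3600.0  # Table I cavity width; assumed reference dimension


def percent_error(deviation_um: float, reference_um: float = CAVITY_WIDTH_UM) -> float:
    """Deviation expressed as a percentage of the reference dimension."""
    return 100.0 * deviation_um / reference_um


worst_case_iris = percent_error(20.0 + 10.0)  # 20 +/- 10 um iris width variation
beam_limit = percent_error(10.0)              # 10 um transverse accuracy limit
feature_goal = percent_error(5.0)             # 5 um general feature tolerance

print(f"worst-case iris width error: {worst_case_iris:.2f}%")
print(f"beam-analysis limit:         {beam_limit:.2f}%")
print(f"5 um feature tolerance:      {feature_goal:.2f}%")
```

On this basis the measured iris width variation is about three times the limit, while the 5 µm feature tolerance lies within roughly a factor of 1.4 of the 0.1% rule.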

A second shortcoming of the iris design is lack of electrical continuity with the etched
sidewall. A finite sidewall slope of 100:1 would yield a gap of ~13 µm in the full-scale
waveguide; since sputtering would not bridge this opening, a conductive filler, such as indium
solder, must be considered. By adding a filler the surfaces may conduct freely but the corner
geometry is not well defined. A better solution may be to use sidewalls defined by dicing saw
cuts polished to a ~0.1 µm finish that would have an average slope >5000:1. This sidewall
structure would meet the skin depth surface smoothness requirement and permit a
sidewall-to-iris interface bridged by sputtered material. The gap would be ~1 µm and easily
filled with a conductor that would not distort corner definition beyond ±5 µm.
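
The ~13 µm gap follows from dividing the structure depth by the wall slope. The sketch below assumes the gap is simply the lateral run of a sloped wall over the full depth, and uses the 1.3 mm design depth and the 410 µm prototype depth quoted earlier; the function name `sidewall_gap_um` is hypothetical.

```python
def sidewall_gap_um(depth_um: float, slope: float) -> float:
    """Lateral offset over the full depth of a wall with slope = rise / run."""
    if slope <= 0:
        raise ValueError("slope must be positive")
    return depth_um / slope


FULL_SCALE_DEPTH_UM = 1300.0  # 1.3 mm design-goal depth
PROTOTYPE_DEPTH_UM = 410.0    # depth of the proof-of-principle structures

gap_full_scale = sidewall_gap_um(FULL_SCALE_DEPTH_UM, 100.0)
gap_prototype = sidewall_gap_um(PROTOTYPE_DEPTH_UM, 100.0)

print(f"full-scale gap at 100:1: {gap_full_scale:.1f} um")
print(f"prototype gap at 100:1:  {gap_prototype:.1f} um")
```

The same 100:1 slope leaves a gap of only a few microns in the 410-µm-deep prototype, so the continuity problem grows in proportion to depth when moving to the full-scale structure.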

Thermal load caused by rf power losses in the waveguide is an unaddressed issue in the
silicon/Pyrex composite. The low 11 mW/cm °C thermal conductivity of Pyrex will not permit the
10 W/cm² °C of cooling required for thermal stability during operation of the accelerator.
Complicating proper cooling is a poor thermal conduction path beneath the sidewalls adjacent
to the iris (Fig. 11). The 1 mil raster cuts, which are a fundamental consequence of the
silicon process presented, form small free-space pockets in an ultrahigh vacuum environment,
thereby reducing thermal conductivity away from the central iris. Minimization of the Pyrex
thickness by lapping and subsequently bonding the thinned Pyrex layer to silicon or diamond
with microchannel cooling paths may provide an effective film conductivity coefficient of 10
W/cm² °C.
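
To see why the Pyrex must be thinned, the sketch below treats the Pyrex layer as a one-dimensional conducting slab, for which the effective film coefficient is h = k/t. This slab model is an assumption made for illustration (it ignores the silicon, the interfaces, and lateral spreading), and the function names are hypothetical. The 18 mil wafer thickness and the conductivity are the values quoted in the text.

```python
MIL_TO_CM = 2.54e-3  # 1 mil = 0.001 in = 0.00254 cm

K_PYREX = 0.011       # W/(cm degC), 11 mW/cm degC quoted in the text
H_REQUIRED = 10.0     # W/(cm^2 degC) cooling requirement quoted in the text


def film_coefficient(k_w_per_cm_c: float, thickness_cm: float) -> float:
    """Effective film coefficient of a 1D slab: h = k / t."""
    return k_w_per_cm_c / thickness_cm


def max_thickness_um(k_w_per_cm_c: float, h_required: float) -> float:
    """Largest slab thickness (in um) that still provides h_required."""
    return k_w_per_cm_c / h_required * 1e4


h_as_bonded = film_coefficient(K_PYREX, 18.0 * MIL_TO_CM)  # 18 mil Pyrex wafer
t_required_um = max_thickness_um(K_PYREX, H_REQUIRED)

print(f"h for 18 mil Pyrex: {h_as_bonded:.2f} W/cm^2 degC")
print(f"Pyrex thickness for h = 10: {t_required_um:.0f} um")
```

Under this simplified model the unthinned wafer falls short of the requirement by a factor of about 40, and the Pyrex would need to be lapped to roughly 11 µm.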

VI.  CONCLUSION 

Fabrication of a high aspect muffin-tin silicon waveguide has been developed as an alternative
to a LIGA-fabricated mm waveguide structure. The waveguide tolerances were to be held to ≤5 µm
for features and die-to-die alignment, with iris width controlled to ~10 µm accuracy. The LIGA
structures were found to have excellent (better than 1-2 µm) feature accuracy in the feature
plane but would require further mechanical finishing in order to meet total internal
reflection and height tolerances.


Fig. 11. Cutaway transverse view of the muffin-tin structure. The hatched areas represent
optically smooth interfaces. Anodic bonding is inhibited at the dicing saw rastered regions
where the Si/Pyrex contact interface is minimal.

It is believed that 2 µm feature accuracy (0.5% dimensional accuracy) and ~5-6 µm die-to-die
alignment (0.7%-1.3% dimensional accuracy) are achievable in the (110) silicon waveguide
through photomask/etch compensation and photolithographic process refinement. The current
results are approximately a factor of 2 worse than this, which is considered a success for
initial process development. Iris width control could be improved to ~±10 µm through
controlled polishing, reducing the maximal transverse error to 0.28%.

To its benefit, the (110) waveguide is more cost effective in supplies and labor required for
fabrication and can be produced with standard UV photolithography equipment, while the LIGA
process requires time on a relatively expensive synchrotron radiation storage ring equipped
with a coherent x-ray exposure station. It is estimated that the silicon waveguide requires
~1/5th of the finances on a per-waveguide section basis and has a fabrication cycle time
~1/4th that of the LIGA process. The deep-etch silicon waveguide will require more refinement
but is a viable alternative to the LIGA fabrication of a micro-linear accelerator waveguide.
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The  response  times  of  optical  emission  spectroscopy  (OES)  and  mass  spectrometry  (MS)  have  been 
measured for plasma etching of III-V heterostructures. For the Ga optical
emission signal at 417.2 nm, a response time as fast as 0.2 s was obtained.
The minimum response time of the ¹¹⁰AsCl⁺ partial pressure, measured by MS,
was found to be 0.9 s. The saturation times of the optical emission signal
and the partial pressure signal have been shown to be related to the
residence time of etch gases by varying the total gas flow rate and the
chamber pressure. Decreasing residence time by reducing the pressure from 6
to 2 mTorr and maintaining a constant flow rate caused the saturation time
of the Ga emission signal at 417.2 nm to decrease from 7 to 3 min. The
¹¹⁰AsCl⁺ partial pressure signal saturated before the Ga emission signal.
Endpoint detection for etching an AlInAs emitter and stopping on a GaInAs
base of a heterojunction bipolar transistor was studied. Algorithms which
monitor the change in Ga emission intensity have been developed to
automatically stop the emitter etch with ~2 nm of the GaInAs base layer
removed. Additionally, etching of GaInAs on an InP substrate was studied
and the signal from OES detected the endpoint before the MS signal did due
to the faster response time. © 1996 American Vacuum Society.


I. INTRODUCTION

Dry etching is often used to fabricate structures of submicrometer
dimension with vertical profiles. However, the selectivity between
different materials is usually quite low. In these cases, large overetch
times to clear a layer and stop without removing the next layer are
impractical, and an endpoint detection scheme is necessary for reliable
device fabrication. The ability to distinguish between separate layers has
been shown using several techniques, including optical emission
spectroscopy (OES), mass spectrometry (MS), laser reflectometry, and
grazing angle optical emission interferometry. AlGaAs layers as thin as 5
nm have been resolved in GaAs/AlGaAs heterostructures. After detection and
identification of the etch products, a scheme for endpoint detection is
needed to stop the etch once the underlying layer has been reached. An
optimized endpoint detection system should include sensors with the best
response time and signal-to-noise ratio. This is essential for precisely
stopping an etch when the thicknesses of the device layers are in the
nanometer range. One such device is the GaInAs/AlInAs-based heterojunction
bipolar transistor (HBT) with a GaInAs base layer as thin as 60 nm. Using
OES to monitor the Ga etch products, we have shown that a HBT emitter etch
could be stopped with <10 nm of the GaInAs base layer removed. In this
article, the response time and saturation time of these in situ sensors are
optimized to stop the emitter etch more precisely.

The response time of an in situ sensor, generally defined as the delay
between changes in etch conditions and sensor signals, has been studied.
This parameter determines how quickly the control scheme can respond to
changes in the plasma and stabilize the plasma conditions. It also sets a
lower limit on how precisely an etch can be stopped once a layer has been
completely etched. Another characteristic of sensor-based etch process
monitoring which has been studied is saturation time. Once the change is
detected, the saturation time represents the delay before the intensity of
the monitored signal reaches steady state. The saturation time can be
influenced by the residence time of gases in the plasma and chamber wall
effects. Decreasing the saturation time increases the sensitivity of the in
situ sensors to changes in etch rates relative to changes in residual etch
product concentrations. The response time and saturation time have been
studied for optical emission and mass spectrometric signals, and methods to
reduce these are presented.

The effectiveness of OES for endpoint detection has been examined using
scanning electron microscopy (SEM) for reactive ion etching of Si/SiGe
heterostructures. The SEM analysis showed that a significant amount of SiGe
was etched before the Ge emission intensity increased. The response time of
the Ge emission intensity was approximately 3 s. In our study,
reflectometry was used to evaluate the effectiveness of the endpoint
detection scheme. By using reflectometry to measure the thickness of the
GaInAs base layer before and after etching, the amount of base layer
removed can be found. It has been demonstrated that modifications of the
device material could improve in situ monitoring of dry etching in III-V
compound heterostructures to an etch depth accuracy of ±8 nm. However, if
the response time can be reduced so that OES can be used to detect the
endpoint with <8 nm of the base removed, then modifications of the device
material are not necessary to improve etch depth accuracy and to minimize
the removal of the base layer.

II.  EXPERIMENT 

The etching system, consisting of an electron cyclotron resonance (ECR)
source and a rf-powered electrode, has been described previously. Microwave
power at 2.45 GHz is coupled to the plasma using a copper input probe.
Tuning of the plasma is obtained by adjusting the microwave power input
probe length and the cavity dimension to minimize the reflected power. The
power coupled to the stage can be controlled by fixing either the rf power
from 0 to 500 W or the self-induced dc bias voltage (|Vdc|). The stage
position can be adjusted so that the ECR source-to-sample distance can be
varied from 8 to 27 cm. Gases are fed through mass flow controllers and the
pressure is controlled by a throttled 1500 l/s turbomolecular pump. The
volume of the chamber is 80 l.

One end of an optical fiber was placed at the quartz dome where the ECR
condition was met to collect optical emission from the plasma. A Spex 500
scanning monochromator was used to detect emission intensity as a function
of wavelength ranging from 300 to 900 nm. The resolution of the
monochromator was 0.02 nm and the focal length was 0.5 m. The grating used
was blazed for maximum efficiency at 630 nm. The signal was measured by a
photomultiplier tube, amplified, and then sampled by a computer running
LabVIEW data acquisition and control software.

Mass spectrometric intensities were measured using a NGS partial pressure
transducer (PPT). The mass spectrometer was designed to be portable, which
necessitated the positioning of the PPT 66 cm downstream from the plasma
chamber. For a dedicated system, the PPT could be closer to the chamber,
potentially decreasing response time. The electron energy of the ionizer
was 70 eV and the base pressure of the PPT was kept <5×10^(-?) Torr by
using a 0.6 mm orifice in conjunction with an 80 l/s turbomolecular pump.
The mass spectrometer provides the ability to monitor two mass
spectrometric signals simultaneously. The two signals can be updated as
fast as 8 Hz, or one signal can be monitored alone at 10 Hz. All signals of
interest, including microwave power, rf power, pressure, gas flows, and
signal intensities from the OES and MS systems, can be collected and
monitored in real time on the LabVIEW platform.

III.  RESULTS  AND  DISCUSSION 

It has previously been shown that when etching III-V materials such as GaAs
and InP, optical emission from the group III elements, Ga and In, is
readily detected using OES, while the partial pressures of the group V
related etch products, ¹¹⁰AsCl⁺ and ⁶⁶PCl⁺, are detected by MS. The
¹¹⁰AsCl⁺ and ⁶⁶PCl⁺ signals are proportional to the partial pressure of the
chlorides formed in the ionizer of the PPT, and can be related to changes
in the etch rates of GaAs and InP. Both OES and MS are able to monitor
signals related to the etch gases. To make use of these signals for
real-time control, it is important to measure the response of the sensors
to changes in plasma conditions. Figure 1 shows the optical emission and
mass spectrometric signals related to etch products of GaAs etched in a
Cl2/Ar plasma when the plasma is first ignited. The plasma conditions were
50 W microwave power, 125 W rf power, Cl2/Ar flow at 4/16 sccm, chamber
pressure of 2 mTorr, and 13 cm ECR source-to-sample distance. The microwave
power was increased from 0 to 50 W and the plasma was ignited. The Ga
emission intensity at 417.2 nm, which is sampled at 40 Hz, begins to
increase as soon as the microwave power is enabled. Defining the response
time as the interval between the rise in microwave power and the maximum of
the time derivative of the Ga emission intensity gives a response time of
0.2 s. There is a delay of 1 s before the ¹¹⁰AsCl⁺ mass spectrometric
signal increases from the background value shown prior to etch initiation.
This delay could be due to either of two effects: the integration time of
the mass spectrometer or the transit time from the wafer surface to the
detector. The integration time represents the desired rate of filtering and
determines how frequently the signal is updated. For this study, the
integration time was varied between 0.2 and 1 s. The transit time of the
etch products is composed of two components, the residence time in the
chamber and the time to travel from the gate valve to the ionizer of the
mass spectrometer. Under typical etch conditions, the transit time was
estimated to be 0.6 s.

Fig. 1. Response of Ga optical emission at 417.2 nm and ¹¹⁰AsCl⁺ partial
pressure to ignition of plasma, as shown by the increase in microwave power
from 0 to 50 W. The plasma conditions were 50 W microwave power, 125 W rf
power, Cl2/Ar flow at 4/16 sccm, chamber pressure of 2 mTorr, and the ECR
source-to-sample distance was 13 cm.
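The response-time definition used above (delay between the power step and the maximum of the signal's time derivative) can be illustrated with a minimal sketch. The function name `response_time`, the finite-difference derivative, and the synthetic 40 Hz trace are assumptions made for illustration only; they are not the authors' LabVIEW implementation or their data.

```python
import math


def response_time(times, signal, t_step):
    """Return the delay from t_step to the steepest rise of the signal.

    The derivative is estimated by finite differences between consecutive
    samples and assigned to the midpoint of each sampling interval.
    """
    best_slope = float("-inf")
    t_best = None
    for i in range(1, len(times)):
        if times[i - 1] < t_step:
            continue  # only consider intervals that start at or after the step
        slope = (signal[i] - signal[i - 1]) / (times[i] - times[i - 1])
        if slope > best_slope:
            best_slope = slope
            t_best = 0.5 * (times[i] + times[i - 1])
    if t_best is None:
        raise ValueError("no samples after t_step")
    return t_best - t_step


# Hypothetical 40 Hz trace: power step at 1.0 s, emission rising fastest
# about 0.2 s later (a smooth logistic edge stands in for real data).
times = [i / 40.0 for i in range(200)]
signal = [1.0 / (1.0 + math.exp(-(t - 1.2) / 0.05)) for t in times]
delay = response_time(times, signal, t_step=1.0)
print(f"response time = {delay:.3f} s")
```

With a 40 Hz sample rate the estimate is quantized to the sampling interval, so the result can only be resolved to within about 0.025 s.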

Before sampling the Ga optical emission signal, the signal from the
photomultiplier tube is amplified. The amplifier has a built-in low-pass
filter to reduce high-frequency noise. Figure 2 shows how the response time
and signal-to-noise ratio vary with the filter cutoff frequency. In this
case, the signal referred to is the time derivative of the Ga emission
intensity. Increasing the cutoff frequency improves the time response of
the signal by allowing the high-frequency components to contribute to the
signal. Thus, the minimum response time of 0.2 s occurred when the cutoff
frequency was highest, at 10 Hz. The disadvantage of increasing the cutoff
frequency is that high-frequency noise is no longer damped, leading to a
noisier signal. This is shown in Fig. 2 by a decrease in the
signal-to-noise of the Ga time derivative with increasing cutoff frequency.
As the cutoff frequency decreases, the signal itself is also damped and the
signal-to-noise again decreases. The maximum value of the signal-to-noise
was 20, and it occurred for a filter cutoff frequency of 0.3 Hz. The
response time of the signal at this frequency, however, was 0.7 s.
Therefore, for endpointing, a compromise is made between response time and
sensitivity. At a filter frequency of 1 Hz, the signal-to-noise is 16 and
the response time is 0.3 s. It is assumed that the response time will set a
lower limit on the precision of the endpoint detection scheme.

Fig. 2. Dependence of Ga optical emission response time on cutoff frequency
of low-pass filter for the amplifier. Also shown is the signal-to-noise
ratio of the time derivative of the Ga signal. The etch conditions were the
same as in Fig. 1.

Fig. 4. Effects of pressure on the saturation time of etch product signals
monitored by OES and MS. The etch conditions were 50 W microwave power, 100
W rf power, Cl2/Ar flow at 3/27 sccm, and 13 cm source distance.

Figure 3 shows how the response time of the ¹¹⁰AsCl⁺ signal depends on the
update frequency. For short integration times, the mass spectrometer
responds quickly to changes in the ¹¹⁰AsCl⁺ partial pressure and a minimum
response time of 0.9 s is measured for an update frequency of 5 Hz. For
processes that operate at higher pressure, the response time can be limited
by transit times through differential pumping stages. Using in situ MS
sampling of poly-Si deposited at 5 Torr, a response time limited by viscous
flow was measured to be ~3 s. The gas residence time is proportional to the
gas pressure and inversely proportional to the gas flow rate. The residence
time has been shown as a function of the gas flow at different gas
pressures. For our system, the residence time for a pressure of 2 mTorr and
a total flow of 20 sccm is 0.6 s. This suggests that the minimum response
time of 0.9 s is partially caused by the time needed to travel from the
wafer surface to the mass spectrometer. The drawback to using a short
integration time is the increased sensitivity to noise, which is not
averaged out effectively during the short integration time. This can be
seen by the low signal-to-noise ratio of only 6.4 measured at an update
frequency of 5 Hz. For longer integration times, the signal-to-noise is
increased significantly, by 10× to 64 at an update frequency of 1 Hz. At a
frequency of 1 Hz, the response time of the mass spectrometer increased to
1.8 s. For endpoint detection, a frequency of 2 Hz is appropriate, since
the response time is still limited by the transit time of etch products
from the wafer surface to the sensor and a strong signal-to-noise of 28 is
obtained.

Fig. 3. Response time and signal-to-noise for the time derivative of the
¹¹⁰AsCl⁺ partial pressure signal as a function of update frequency for the
partial pressure measurement. The etch conditions were the same as in
Fig. 1.
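The quoted residence time can be checked with a short calculation. The sketch below assumes the usual estimate τ = pV/Q for a well-mixed chamber and takes 1 sccm as 1 cm³/min of gas at 760 Torr, ignoring any temperature correction; the function name and the constant are illustrative and do not come from the original text.

```python
# Throughput of 1 sccm expressed in Torr*l/s, assuming 1 sccm is
# 1 cm^3/min of gas at 760 Torr (no temperature correction applied).
TORR_L_PER_S_PER_SCCM = 760.0 / 1000.0 / 60.0


def residence_time_s(pressure_mtorr, volume_l, flow_sccm):
    """Estimate gas residence time as tau = p * V / Q (well-mixed chamber)."""
    throughput = flow_sccm * TORR_L_PER_S_PER_SCCM  # Torr*l/s
    return (pressure_mtorr * 1e-3) * volume_l / throughput


# Values quoted in the text: 80 l chamber, 2 mTorr, 20 sccm total flow.
tau = residence_time_s(pressure_mtorr=2.0, volume_l=80.0, flow_sccm=20.0)
print(f"residence time = {tau:.2f} s")
```

Under these assumptions the estimate comes out near 0.63 s, in line with the 0.6 s stated for this system, and it scales linearly with pressure and inversely with flow.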

It should be noted from Fig. 1 that the signals do not saturate initially.
This has been presented previously for the Ga intensity during etching of
GaAs. The saturation time was defined as the time required before the
signal reached 80% of the steady state value. The saturation time of the Ga
emission intensity at 417.2 nm decreased as the total gas flow rate
increased. The saturation times for the optical emission and mass
spectrometric signals are compared in Fig. 4. The plasma is ignited at time
0 min and the signals begin to increase as the GaAs is etched. The ¹¹⁰AsCl⁺
partial pressure behaves in a similar manner to the Ga optical emission at
417.2 nm. The ¹¹⁰AsCl⁺ signal intensity at a pressure of 6 mTorr is 2.7×
larger than that at 2 mTorr. However, it takes 4 min for the ¹¹⁰AsCl⁺
signal at 6 mTorr to saturate, while it only takes 10 s to saturate at 2
mTorr. Also shown in Fig. 4 is the Ga emission intensity for the two
different pressures. The saturation time of the Ga emission is higher at 6
mTorr than it is at 2 mTorr. At 2 mTorr, the Ga emission saturates in 3
min, while at 6 mTorr it takes 7 min to reach 80% of the saturated value.
The saturation times for the optical emission and mass spectrometric
signals follow the same trend, but MS provides the lower saturation time.
This could be due to differences between the two monitored species, or it
could be related to the mass spectrometer being located downstream from the
etching chamber while optical emission is taken directly from the plasma.

Fig. 5. Ga emission intensity at 417.2 nm as a function of etch time. The
etches were stopped when the signal reached 25%, 50%, and 100% of the
maximum expected signal. The etch conditions were 50 W microwave power, 100
W rf power, 2 mTorr, Cl2/Ar at 3/27 sccm, and 13 cm source distance.
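The 80% criterion for saturation time can be written as a small routine. This is a sketch under stated assumptions: the steady-state value is taken as the mean of the last few samples, and the exponential trace is synthetic. Neither choice is described in the original text.

```python
import math


def saturation_time(times, signal, fraction=0.8, tail_points=10):
    """Return the first time the signal reaches `fraction` of steady state.

    Steady state is approximated (an assumption of this sketch) by the mean
    of the final `tail_points` samples of the trace.
    """
    steady_state = sum(signal[-tail_points:]) / tail_points
    target = fraction * steady_state
    for t, value in zip(times, signal):
        if value >= target:
            return t
    return None


# Hypothetical trace: exponential approach with a 60 s time constant,
# sampled once per second for 10 min after plasma ignition at t = 0.
times = [float(i) for i in range(601)]
signal = [1.0 - math.exp(-t / 60.0) for t in times]
t_sat = saturation_time(times, signal)
print(f"saturation time = {t_sat:.0f} s")
```

For an exponential approach the 80% point falls at about 1.6 time constants, so a routine like this gives a quick way to compare traces taken at different pressures or flow rates.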

The endpoint detection algorithm for the emitter etch step of
AlInAs/GaInAs-based HBTs samples the Ga optical emission signal at 417.2
nm, since this signal will rise once the GaInAs base layer begins etching.
For calibration, an AlInAs on GaInAs sample was etched to determine the
maximum Ga emission intensity. This test structure consisted of 200 nm
AlInAs on 60 nm GaInAs over an InP substrate. For the following etches,
when the Ga intensity reached a specified percentage of the maximum value,
a signal was sent to the etch system controller to disable the power.
Figure 5 shows three traces of the Ga emission intensity as a function of
time. For these three etches, the endpoints were defined as occurring when
the Ga intensity reached 25%, 50%, and 100% of the maximum value. It can be
seen that once the intensity reached the prescribed level, the plasma was
turned off and the intensity dropped to zero immediately. The etch
condition for all three was 50 W microwave power, 100 W rf power, 2 mTorr
pressure, Cl2/Ar at 3/27 sccm, and 13 cm source distance.
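The calibrated-threshold scheme described above amounts to a simple comparison against a fraction of the previously measured maximum. The following sketch uses a hypothetical function name and an invented noise-free trace; a real trace would need filtering or a consecutive-sample check before triggering.

```python
def threshold_endpoint(times, intensities, max_intensity, fraction):
    """Return the first time the intensity reaches fraction * max_intensity.

    `max_intensity` is the value from a separate calibration etch; None is
    returned if the threshold is never reached.
    """
    threshold = fraction * max_intensity
    for t, value in zip(times, intensities):
        if value >= threshold:
            return t
    return None


# Hypothetical trace sampled at 4 Hz: Ga emission is zero until 255 s and
# then rises linearly; the calibrated maximum is taken as 2.0 (arbitrary units).
times = [250.0 + 0.25 * i for i in range(200)]
intensities = [max(0.0, (t - 255.0) * 0.125) for t in times]

stop_25 = threshold_endpoint(times, intensities, max_intensity=2.0, fraction=0.25)
stop_50 = threshold_endpoint(times, intensities, max_intensity=2.0, fraction=0.50)
stop_100 = threshold_endpoint(times, intensities, max_intensity=2.0, fraction=1.00)
print(stop_25, stop_50, stop_100)
```

A lower fraction stops the etch earlier, which is why the 25% setting removes the least base-layer material, but it also leaves less margin against noise.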

Prior to etching, the AlInAs and GaInAs layer thicknesses were measured
using reflectometry. The AlInAs layer was found to be 209 nm and the GaInAs
layer was 57 nm thick. After etching, the samples were remeasured to
determine the remaining layer thicknesses. Figure 6 shows the remaining
thickness of the GaInAs layer for the three different etches. The AlInAs
layer had been completely removed in all three cases. When the etch was
stopped after the Ga emission signal reached 25% of the maximum expected
value, the remaining GaInAs thickness was 55 nm. Therefore, the etch had
been stopped after only 2 nm of the base had been removed. For a 60 nm base
layer, this represents only a 3% change in the base layer thickness due to
overetching.

Fig. 6. Remaining thickness of GaInAs base layer when stopping at three
different Ga signal intensities, plotted against the percent of maximum
signal when stopped. The etch conditions were the same as in Fig. 5.

The rise in Ga emission intensity shown in Fig. 5 occurs at different times
for each of the etches. The etch that was stopped when the intensity
reached 25% of the maximum value was the first to increase. The etch time
to reach 25% was only 4 min 19 s. This can be compared to the slowest etch,
in which the intensity did not reach 25% of the maximum until 4 min 38 s
had elapsed. The difference in the AlInAs layer thicknesses for the two
samples was measured to be <1 nm, so the etch time difference is attributed
to etch rate variation as opposed to thickness variation. The average etch
rate for 10 samples was 47.1 ± 1.6 nm/min. The fastest etch rate was 48
nm/min while the slowest etch rate was 45 nm/min, a variation of only 6%.
This variation is large enough that, had the etching for the run with the
fastest etch rate been carried out for the same time as the run with the
slowest etch rate, an additional 15 nm of the GaInAs base layer would have
been removed. Clearly, endpoint detection is required even when small
run-to-run etch rate variations are anticipated.
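The 15 nm figure follows directly from the numbers quoted above, as the short check below shows. The helper name is illustrative, and the calculation assumes the fastest etch rate (48 nm/min) also applies to the GaInAs base layer for the extra 19 s, which is how the text's estimate appears to be formed.

```python
def extra_depth_nm(etch_rate_nm_per_min, extra_time_s):
    """Depth removed by etching for extra_time_s at a constant etch rate."""
    return etch_rate_nm_per_min * extra_time_s / 60.0


# Times to reach 25% of the maximum Ga signal for the fastest and slowest runs.
fast_run_s = 4 * 60 + 19
slow_run_s = 4 * 60 + 38
time_diff_s = slow_run_s - fast_run_s

# Overetch if the fastest run had been etched for the slowest run's time.
overetch_nm = extra_depth_nm(48.0, time_diff_s)

# Run-to-run spread relative to the mean etch rate of 47.1 nm/min.
relative_spread = (48.0 - 45.0) / 47.1

print(f"time difference = {time_diff_s} s")
print(f"additional GaInAs removed = {overetch_nm:.1f} nm")
print(f"etch rate spread = {relative_spread:.1%}")
```

Against a 57 nm base layer, an overetch of this size would remove roughly a quarter of the layer, which illustrates why a fixed-time etch is not adequate here.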

In order to eliminate the need for a calibration step to determine the
maximum Ga emission intensity, a more advanced endpoint algorithm was
developed. This algorithm depends only on data from the current run and an
estimated etch time. One minute before the approximate endpoint, Ga
emission intensity data are collected for 30 s. A least-squares fit through
the optical data is used to model the optical trend of the current run, and
the standard deviation of the noise is calculated. At a sample rate of 4
Hz, the LabVIEW platform can make these calculations without missing any
data collections. After modeling the trend, the algorithm looks for a rise
in the Ga signal. The endpoint is determined when m consecutive data points
are n standard deviations of noise above the calculated trend. Both m and n
will be optimized to minimize the etching of the GaInAs base layer. LabVIEW
sends a signal to the plasma system controller once the endpoint is reached
and the etch is stopped automatically. Because this algorithm does not
depend on data from other runs, it is robust to run-to-run changes in Ga
emission intensity, noise level, and etch rate, as long as the etch rate
does not increase so much that the etch ends more than 30 s before the
expected time. Figure 7 shows this endpoint algorithm tested on an
AlInAs/GaInAs/InP sample described previously. The endpoint was set when 4
consecutive data points of the Ga emission intensity were more than 1.5
standard deviations of the noise above the trend. The approximate etch time
was 4 min and the sample rate was 4 Hz. The endpoint was found at 4 min
23.5 s, after etching only ~1 nm into the GaInAs layer. This is indicated
by 4 consecutive points lying above the upper bound. The trend and standard
deviation of noise are calculated using emission intensity data from 180 to
210 s. The microwave power, which was shut off at 4 min 24 s, is also
shown.

Fig. 7. Endpoint detection of an AlInAs/GaInAs structure using the changes
in the Ga emission intensity as the GaInAs layer begins to etch. The model
trend and the upper bound, 1.5 standard deviations above the trend, are
shown with the microwave power. The etch conditions were the same as in
Fig. 5.

Fig. 8. Comparison of the Ga optical emission from OES and the ¹¹⁰AsCl⁺
partial pressure from MS when endpointing GaInAs on InP substrate. The
plasma conditions were 50 W microwave power, 100 W rf power, 1.5 mTorr
chamber pressure, Cl2/Ar flow at 3/27 sccm, and 13 cm source-to-sample
distance.
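The calibration-free algorithm can be sketched in a few lines. The sketch below assumes a straight-line least-squares trend (the text does not state the order of the fit), estimates the noise from the fit residuals, and uses an invented 4 Hz trace; the function names are hypothetical and this is not the authors' LabVIEW code.

```python
import math


def fit_line(ts, ys):
    """Ordinary least-squares straight line; returns (slope, intercept)."""
    n = len(ts)
    mean_t = sum(ts) / n
    mean_y = sum(ys) / n
    sxx = sum((t - mean_t) ** 2 for t in ts)
    sxy = sum((t - mean_t) * (y - mean_y) for t, y in zip(ts, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_t


def detect_endpoint(ts, ys, fit_start, fit_end, m=4, n_sigma=1.5, rising=True):
    """Return the time of the m-th consecutive point beyond the noise bound.

    The trend and noise level come only from samples in [fit_start, fit_end).
    With rising=False the test is for points below the lower bound instead.
    """
    window = [(t, y) for t, y in zip(ts, ys) if fit_start <= t < fit_end]
    slope, intercept = fit_line([t for t, _ in window], [y for _, y in window])
    residuals = [y - (slope * t + intercept) for t, y in window]
    sigma = math.sqrt(sum(r * r for r in residuals) / (len(residuals) - 2))

    run = 0
    for t, y in zip(ts, ys):
        if t < fit_end:
            continue  # only search after the baseline window
        deviation = y - (slope * t + intercept)
        if not rising:
            deviation = -deviation
        if deviation > n_sigma * sigma:
            run += 1
            if run >= m:
                return t
        else:
            run = 0
    return None


# Hypothetical 4 Hz trace: flat baseline with small deterministic "noise",
# then a linear rise starting at 262 s as the lower layer begins to etch.
ts = [0.25 * i for i in range(1201)]
ys = [1.0 + 0.01 * math.sin(1.7 * i) + max(0.0, 0.05 * (t - 262.0))
      for i, t in enumerate(ts)]
endpoint = detect_endpoint(ts, ys, fit_start=180.0, fit_end=210.0)
print(f"endpoint detected at {endpoint} s")
```

Requiring m consecutive points trades a delay of at least (m - 1) sample periods for immunity to isolated noise spikes; setting `rising=False` gives the lower-bound variant used when a signal is expected to fall.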

Figure 8 shows the comparison of OES and MS signals when endpointing a
GaInAs layer on an InP substrate. The plasma conditions were 50 W microwave
power, 100 W rf power, 1.5 mTorr chamber pressure, Cl2/Ar flow at 3/27
sccm, and 13 cm sample-to-source distance. As the GaInAs layer is removed
and the InP substrate is exposed to the plasma, both the Ga emission and
¹¹⁰AsCl⁺ signals are expected to decrease. The Ga optical emission
intensity decreased at the same rate as the ¹¹⁰AsCl⁺ partial pressure did.
The time it took for the OES signal to drop by 25% of the maximum is about
the same as for the MS signal. However, since the Ga optical emission
responds faster than the ¹¹⁰AsCl⁺ partial pressure does, one would expect
the endpoint predicted by OES to occur before that predicted by MS. This
has been verified by applying the endpoint algorithm described earlier. In
this case, the endpoint is reached when 4 consecutive points lie below the
lower bound, since the Ga emission and ¹¹⁰AsCl⁺ partial pressure decrease
once the InP substrate begins etching. Using the OES signal, the algorithm
predicted that the endpoint occurred ~2 s before that predicted when using
the MS signal. Therefore, OES is a more effective method for endpointing
this structure.

IV.  CONCLUSIONS 

Signals from the etching system and the in situ sensors were collected
simultaneously so that an accurate determination of the sensor response
times could be made. The response time for OES varied from 0.2 to 1.3 s
when different low-pass filter cutoff frequencies were used. A low-pass
filter with a cutoff frequency of 1 Hz on the output of the photomultiplier
tube provided a strong signal-to-noise and a short response time of 0.3 s.
For short integration times, the response time of MS was limited to 0.9 s
by the transit time of the etch products from the wafer surface. By
increasing the integration time, the signal-to-noise improved at the
expense of response time. The saturation time was reduced by decreasing the
pressure and increasing the total flow rate. The ¹¹⁰AsCl⁺ partial pressure
measured by MS had a lower saturation time than did the Ga optical
emission. For endpointing, once the sensor detected a change in the plasma,
a signal was sent from LabVIEW to the etch system controller and the plasma
was stopped. Etching of an AlInAs/GaInAs structure was stopped once Ga
optical emission was detected, and reflectometry measurements showed that
only ~2 nm of GaInAs was removed. OES is more effective for endpointing a
GaInAs layer on an InP substrate, since the endpoint predicted by OES
occurred ~2 s before that predicted by MS.

Reactive ion etching of sloped sidewalls for surface emitting structures
using a shadow mask technique
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A  novel,  very  simple  technique  for  the  direct  adjustment  of  the  slope  of  facets  in  a  reactive  ion 
etching  process  is  presented.  The  etching  process  is  performed  by  screening  the  sample  partially 
with  an  aluminum  shadow  mask,  which  is  located  inside  the  dark  space  of  the  plasma.  The  sidewall 
angle  of  etched  lines,  defined  by  a  pattern  mask  on  the  sample,  depends  strongly  on  the  distance 
between  the  shadow  mask  and  the  sample.  The  investigations  are  carried  out  with  a 
methane-hydrogen  plasma.  The  angle  of  inclination  of  the  sidewalls  can  be  varied  in  a  controllable 
manner  by  more  than  45°.  This  technique  is  well  suited  to  fabricate  90°  reflectors,  which  are 
necessary  for  three-dimensional  optical  interconnects  with  planar  waveguide  structures.  To  analyze 
the  optical  quality  of  the  etched  mirror  plane,  the  beam  transformation  was  investigated  by 
reflectivity  measurements  using  a  single  aluminum-evaporated  mirror  structure,  which  is 
illuminated  by  a  focused  laser  beam  at  a  wavelength  of  633  nm.  ©  1996  American  Vacuum 
Society. 


I.  INTRODUCTION 

Reactive ion etching (RIE) is a well-established technique for the
microfabrication of semiconductors. Its inherent high resolution and
anisotropy permit a precise pattern transfer of features with geometrical
sizes in the submicron region, which are often required in optoelectronic
devices. In contrast to an ion beam etching (IBE) process, the main
advantage of RIE is the simplicity of the etching process, which can easily
be integrated in a cost-efficient mass production line of semiconductor
devices. However, serious problems arise in the application of RIE if
angled sidewall faces have to be generated for special optical components
such as 45° mirrors for surface emitting lasers or optical diffraction
gratings. The sidewall profile can be influenced indirectly by controlled
mask erosion or, with only a small variation range, by undercut etching. A
strongly asymmetric profile with different shapes on opposite sides of one
structure can only be realized by angled RIE, where the substrate surface
is tilted with respect to the incident ion flux, in a similar way to the
arrangement in an IBE process. Only a few papers concerning angled etching
in a RIE chamber have been published up to now. All these investigations
were carried out with special sample holders, where the substrate was
mounted at a tilt angle on the cathode of the reactor chamber. However, the
position of the sample disturbs the electric field in the plasma sheath,
which causes an inhomogeneity of the ion flux across the surface of the
tilted sample. The influence of such an inhomogeneity on the etching
behavior has already been studied, and different solutions have been
proposed for the improvement of the homogeneity.

In  this  study  we  present  another  way  to  control  the  angle 
between  ions  and  substrate  by  deflecting  the  ions  during  their 
path  through  the  plasma  sheath  directly  by  a  defined  field 
distortion with a conductive mask. The range of influence
depends  on  the  special  design  of  the  mask.  We  have  recently 
constructed  a  spot  size  transformer  with  a  vertically  tapered 
waveguide  by  using  for  the  first  time  such  a  simple  mask 
technique  in  a  reactive  ion  etching  process.^  The  variation  of 
the  etch  rate  along  the  waveguide,  which  leads  to  the  vertical 
tapering over a distance of 500 μm, is caused by spreading
the  ion  flux  below  the  mask.  Waveguide  structures  which  are 
orientated  perpendicular  to  the  spreading  direction  show  a 
strongly  asymmetric  sidewall  profile,  which  is  a  clear  indi¬ 
cation  for  angled  etching  in  the  vicinity  of  the  mask.  Over  a 
distance  of  several  millimeters  we  observed  only  a  weak  de¬ 
pendence  of  the  sidewall  profile  on  the  lateral  position  of  the 
structure.  Although  in  the  present  work,  only  a  portion  of  the 
wafer  was  etched  with  asymmetric  sidewalls,  it  may  be  pos¬ 
sible  in  future  work  to  design  a  mask  with  a  periodically 
repeated  design  that  allows  processing  of  total  wafers. 

We  present  in  detail  the  first  results  of  angled  etching  of 
InP in a methane-hydrogen-based plasma using a conductive
shadow  mask  in  the  RIE  process.  The  methane-hydrogen- 
based  mixture  is  an  efficient  etchant  for  InP  and  related  com¬ 
pounds,  because  it  produces  very  smooth,  mirrorlike  surfaces 
and  allows  the  fabrication  of  features  with  high  aspect  ratio 
due  to  a  good  selectivity  between  etch  mask  and  substrate 
material.^  We  investigated  the  homogeneity  of  the  etching 
behavior as a function of the sample position relative to
the  mask  and  studied  the  effect  of  the  mask  geometry  on  the 
shape  of  the  etched  facets.  Finally,  a  deeply  etched  45°  mir¬ 
ror  with  a  high  aspect  ratio,  suitable  for  producing  surface 
emitting  planar  waveguide  structures,  was  fabricated  and  op¬ 
tically  tested. 

II.  EXPERIMENT 

Fig. 1. Setup of the angled etching mode with a shadow mask and
photographs of the profile of etched structures, placed below the
mask in different positions.

Figure 1 shows the setup with the cross section of the
applied mask for angled etching. The mask, which is formed
from a 170 μm aluminum sheet, is directly fixed on the
cathode. During etching the widely extended plasma above
the  mask  provides  a  homogeneous  ion  density  distribution  at 
the  beginning  of  the  plasma  sheath.  The  principle  of  ion 
deflection  in  the  presence  of  the  mask  is  illustrated  by  the 
different  ion  trajectories  through  the  sheath.  Far  away  from 
the  mask  the  etching  ions  are  accelerated  in  the  plasma 
sheath directly to the negatively biased cathode. Close to the
mask,  the  positive  ions  are  also  attracted  by  the  negatively 
biased  mask,  so  that  the  inclination  angle  of  the  resulting  ion 
flux  on  the  cathode  differs  clearly  from  the  vertical  direction. 
To  enhance  the  distortion  of  electrical  field  lines  in  the  hori¬ 
zontal  direction,  a  vertical  sheet  is  added  on  the  top  of  the 
mask. 

We  started  our  investigations  with  undoped  InP  substrates 
with  a  (100)  surface.  Line  patterns  with  geometrical  widths 
between 1 and 3 μm were defined on the samples by
photolithography or electron beam lithography. We used two
different mask materials: an organic resist AZ1512, and an
evaporated Al2O3 film, structured by the lift-off technique. Both
masks were 300 nm thick. The pattern transfer was
performed by RIE with an electron cyclotron resonance
(ECR)  source  at  a  process  pressure  of  10“^  mbar.  We  used  a 
commercially  available  etching  system,  type  Z401,  from 
Leybold,  which  is  described  elsewhere  in  more  detail.^  The 
anode  of  a  parallel  plate  reactor  is  replaced  by  an  ECR 
plasma  source  and  the  distance  from  the  cathode  to  the 
plasma  chamber  of  the  ECR  source  is  around  20  cm.  Around 
this  ECR  plasma  chamber  permanent  magnets  are  arranged 
in  a  configuration  that  results  in  a  toroidal  magnetostatic 
field.  The  microwave  power  for  the  ECR  discharge  coupled 
into  the  plasma  amounts  to  200  W.  Due  to  the  additional 
ECR  discharge  the  etch  rate  is  increased  to  a  sufficiently  high 
value  of  10  nm/min.  The  RF  power  was  40  W  resulting  in  a 
bias  voltage  of  300  V.  The  plasma  consists  of  a  mixture  of 
methane, hydrogen and argon (CH4:H2:Ar=8:50:1). Under
these conditions the sheath thickness of the etch plasma,
which, as discussed later, appears to be an important
parameter for angled etching, is roughly 2 cm.

III.  RESULTS  AND  DISCUSSION 

First we characterized the etching properties (etch rate,
sidewall faces and surface morphology) as a function of
the position of the analyzed structure relative to the mask.
The  lower  part  of  Fig.  1  illustrates  the  characteristic  shape  of 
the  etched  sidewalls  as  observed  at  different  positions  on  the 
sample.  The  line  patterns  are  oriented  perpendicular  to  the 
picture  plane.  In  this  direction  the  mask  extending  over  30 
mm  is  long  enough  to  ensure  homogeneity  of  the  ion  flux. 
The mask is characterized by the height of the sidewall h1
(9.4 mm), the length of the cover l (8.6 mm) and its total
height h2 (11 mm). The micrographs in the lower part of Fig.
1 show the significant influence of the shadow mask on the
faceting of the structures in three different positions. Even in
the range outside the covered region (x<0), both sidewalls
of the structures are strongly sloped, as discussed later. Over
a distance of 8 mm the wall angle on the left side, opposite to
the ion beam direction, turns by more than 20°. The sidewall
itself remains nearly straight, so for all positions the
realized left facets of the structures are acceptable
for application as a mirror plane. In particular, an angle of
45° is shown in the middle micrograph. Only at the top of
the profile may a slightly convex shape arise for extremely
tilted sidewalls, as illustrated in the left photo. In this
case, close to the fixed end of the mask, the etch rate is
significantly reduced. The right sides of the line patterns,
directly facing the ion beam, are not so well structured. There is
a curved transition to the etched floor at the bottom of the
facet and the tilting angle of this side depends only weakly
on the lateral position.

Taking an appropriate masking material on the chip
(Al2O3, the stripes being defined by electron beam
lithography), the sidewalls of the etched structures themselves are
very smooth, as shown in Fig. 2. The roughness is well below
50 nm and thus the mirrors are even applicable as reflectors
for visible light.

Fig. 3. Characterization of the profile of etched structures as a
function of the lateral position.

To quantify the etching behavior below the mask, the etch
depth and the tilting angles Φ1 and Φ2 of both sidewall faces are
plotted in Fig. 3 versus the lateral position of the structure.
The definition of the angles is given inside the graph of
Fig. 3. The covered region begins at position x=0. From this
point, the mask extends up to the right margin of the figure at
x=8.6 mm. In the lower part of Fig. 3, the curve (open
squares) represents the etch depth over this range of x. The
depth remains constant along a distance of 6 mm. So in the
first 6 mm under the mask no shadowing of ions, which
would lead to a reduction of the etch rate, is noticeable.
Only along the last 2 mm up to the face of the aluminum
mask does the etch rate drop. In the upper part of Fig. 3, the
changes of the two wall angles Φ1 and Φ2 of the structure
are shown. Φ1 (open circles) corresponds to the wall which
is turned away from the ion direction. There is only a slight
linear decay of Φ1 with increasing x around a value of
53°. The wall angle Φ2 on the opposite side of the profile
(filled circles) also decreases linearly; however, it is shifted
towards an angle 15° higher. So the sidewalls are not parallel
to each other. For patterns etched under the same conditions
at normal incidence without using a shadow mask, both
sidewalls of the resulting structures are symmetrical to the center
of the line structure; however, they are also not parallel to
each other. As defined in Fig. 3, this corresponds to a
difference between the angles, ΔΦ. This difference is
slightly lower than that obtained by using a shadow mask, as shown in
an earlier publication.^ Corresponding to the gradient of both
curves, a homogeneous etching range of 5 mm length under
the mask is achievable with an uncertainty of the facet angle
of less than 2°. This is better than the technologically useful
etching range which results from the published properties of
angled RIE with tilted sample holders.^ In Fig. 3, the useful
homogeneous range of 5 mm is marked by a horizontal line.
The adjustment of the facet angle is mainly determined by
the mask geometry.

We  want  to  point  out  that  a  sidewall  angle  slightly  differ¬ 
ent  from  45°  is  still  attractive  for  the  fabrication  of  90°  de¬ 
flecting  mirrors  as  small  beam  corrections  can  be  done  ex¬ 
ternally  by  a  lens  array.  In  a  three-dimensional  architecture 
of  optical  interconnects  with  surface  emitting  lasers,  a  lens 
array  between  two  optical  chips  usually  serves  to  focus  the 
single  beam  on  a  small  area  of  the  next  stage.  By  asymmetric 
adjustment  of  the  lens  array,  a  compensation  for  a  deviation 
from the ideal angle of around 10° can be easily achieved.
Consequently, the angles achieved here for Φ1 are in fact useful
for deflecting mirrors.

Let  us  now  consider  the  influence  of  the  mask  geometry 
(e.g.,  its  total  height)  on  the  resulting  pattern  profiles.  To 
illustrate  the  limit  of  an  observable  mask  influence  on  the  ion 
flux,  first  only  vertical  walls  serve  as  “shadow  masks.”  The 
total  height  of  the  wall  was  varied  between  3  mm  and  9  mm. 
In Fig. 4(a) the sidewall face angles are shown
versus the lateral positions of the structures. The straight,
solid, horizontal line in the figure marks the wall angle of
structures etched without a shadow mask but using identical
etching parameters as described above. For a height h2=3
mm (represented by the circles) the asymmetric etching
behavior due to the field influence of the conductive aluminum
disappears already at a distance of 2 mm away from the wall.
So the effect of this mask on the etching behavior is almost
negligible. However, for h2=9 mm, all structures have an
identical slope of ≈74° over the same distance.

Fig. 4. (a) Influence of a vertical aluminum sheet on the facet angle
in a RIE process. (b) Photographs of the corresponding etched
structure at x=−1.6 mm (height of the shadow mask: 9 mm).

Fig. 5. Wall angle of etched structures as a function of the mask height.

For  the  definition  of  the  structures,  which  are  discussed 
above  [Fig.  4(a)],  the  organic  resist  AZ1512  was  applied. 
The  slightly  inclined  profile  of  such  an  etched  structure  as 
well  as  the  surface  quality  of  the  sidewalls  are  shown  in  Fig. 
4(b).  As  shown  in  the  upper  photo,  very  smooth,  mirrorlike 
facets  can  be  obtained  also  with  an  organic  resist.  Only  on 
the  top  of  the  sidewall  does  a  weak  roughness  occur.  As  can 
be  seen  in  the  lower  photo,  the  erosion  of  the  resist  is 
strongly  asymmetric.  Especially  the  right  side  of  the  resist  is 
removed  by  the  sloped  incident  ions.  So  the  etch  selectivity 
achieved  by  using  such  a  resist  may  not  be  sufficient  for  the 
required etch depth of several μm for mirrors.

Besides the total mask height h2, the height of the mask
cover, defined by h1 in Fig. 1, mainly determines the slope of
the etched structures. To illustrate the dependence between
sidewall slope and mask geometry, the wall angles Φ1 of
structures, placed in comparable positions, are plotted versus
the cover height h1 in Fig. 5. The angle of the considered
structures is related to the position where the etch rate is
reduced to 80%, indicating that the ion flux starts to be
shadowed noticeably by the aluminum cover. Up to this point the
wall angle varies only weakly, as shown in Fig. 3. The
graph shows a linear decay of Φ1 with increasing height
from 74° to 43° for the mask 9 mm in height. The height of
the plasma sheath is roughly 2 cm. So, in principle, the wall
angle should be adjustable over a much larger range than shown,
including below 40°, as long as the dimension of the mask does
not exceed the plasma sheath. However, in our case the
variation range of the mask was limited by the opening
between transfer chamber and reactor of our equipment.

Fig. 6. Profile of a deep etched structure with a facet angle of 45°.

Considering  the  described  height  restriction  to  the  appli¬ 
cable  mask  geometry,  we  finally  want  to  demonstrate  that  the 
presented  technique  enables  us  to  etch  sloped  facets  with  an 
ideal  45°  angle  for  perpendicular  deflection  with  a  sufficient 
tolerance  of  0.5  mm  for  the  lateral  adjustment,  according  to 
the  assumed  uncertainty  of  the  angle  as  described  above. 
Figure 6 shows the cross section of a deeply etched 90°
deflecting mirror with high aspect ratio. An Al2O3 film was
used as a pattern mask for this structure. The useful range for
deflecting extends about 8 μm. To study the deflecting
quality of the facet, a 50 nm aluminum layer was evaporated
onto the surface.

The  optical  quality  of  the  mirror  can  be  characterized  by 
the  distortion  and  the  loss  of  a  light  beam  after  reflection. 
Therefore  we  measured  the  intensity  distribution  of  the  far 
field  of  a  Gaussian  beam,  reflected  at  one  of  the  deep  etched 
micro  mirrors.  The  measurement  setup  is  shown  in  Fig.  7. 
We operate with visible light of a HeNe laser at 633 nm. The
incident beam is focused onto the mirror by a microscope
objective with a numerical aperture of 0.1. The width of the
beam waist of 4 μm is much smaller than the useful
deflecting range on the mirror of 8 μm, so that the edges of the
mirror do not affect the shape of the reflected intensity
distribution. To simulate the propagation of a beam just leaving
a waveguide, as in a surface emitting laser structure, the
focus is pulled back from the mirror plane by a distance
z_focus, shown in the enlarged part of Fig. 7. The incident
beam is tilted against the sample by an angle of about 15° to
prevent shading of the beam by the bottom of the sample.
The far field is measured 110 mm away from the mirror by a
scanning photodiode.
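
The beam geometry quoted above can be checked with standard Gaussian beam optics. The following is a minimal sketch, assuming the paraxial relation w0 = λ/(π·NA) for the 1/e² waist radius and assuming that the quoted 4 μm "width" is the waist diameter 2·w0; the function and variable names are illustrative and do not come from the paper.

```python
import math


def waist_radius(wavelength_um: float, numerical_aperture: float) -> float:
    """Paraxial 1/e^2 waist radius w0 = lambda / (pi * NA), in micrometers."""
    return wavelength_um / (math.pi * numerical_aperture)


def beam_radius(w0_um: float, wavelength_um: float, z_um: float) -> float:
    """Beam radius w(z) = w0 * sqrt(1 + (z / z_R)^2), z_R = pi * w0^2 / lambda."""
    rayleigh_range = math.pi * w0_um ** 2 / wavelength_um
    return w0_um * math.sqrt(1.0 + (z_um / rayleigh_range) ** 2)


# Values taken from the text: HeNe laser at 633 nm, NA = 0.1,
# useful deflecting range of 8 um, focus pulled back by 10 um.
WAVELENGTH_UM = 0.633
NA = 0.1
MIRROR_RANGE_UM = 8.0

w0 = waist_radius(WAVELENGTH_UM, NA)
waist_diameter = 2.0 * w0
diameter_at_10um = 2.0 * beam_radius(w0, WAVELENGTH_UM, 10.0)

print(f"waist diameter: {waist_diameter:.2f} um")
print(f"beam diameter 10 um from focus: {diameter_at_10um:.2f} um")
print(f"fits on mirror: {diameter_at_10um < MIRROR_RANGE_UM}")
```

Under these assumptions the waist diameter comes out close to the quoted 4 μm, and the beam still fits well within the 8 μm deflecting range when the focus is pulled back by 10 μm.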

Fig. 7. Setup for the measurement of the far field intensities of a
reflected Gaussian beam.

In Fig. 8 the far field intensity in the directions parallel (w∥)
and perpendicular (w⊥) to the long side of the mirror is
plotted for a distance between focus and mirror z_focus of 10
μm, corresponding to the dimensions realized in surface
emitting structures. In comparison to the intensity
distribution taken directly in the mirror plane by a video camera,
as shown in the inset of Fig. 8, there is no significant
distortion. The reflection loss relative to an undisturbed reference
beam amounts to 0.6 dB. Considering the aluminum
reflectivity of 90% at λ=0.63 μm, the additional loss, probably
due to surface roughness, is 0.14 dB.
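
The loss budget above follows from converting the reflectivity to decibels. A short sketch of the arithmetic, using only the 0.6 dB measured loss and the 90% aluminum reflectivity quoted in the text (the function name is illustrative):

```python
import math


def reflectivity_loss_db(reflectivity: float) -> float:
    """Power loss in dB for a mirror with the given power reflectivity."""
    return -10.0 * math.log10(reflectivity)


measured_loss_db = 0.6        # reflection loss quoted in the text
aluminum_reflectivity = 0.90  # aluminum reflectivity at 0.63 um quoted in the text

ideal_loss_db = reflectivity_loss_db(aluminum_reflectivity)
excess_loss_db = measured_loss_db - ideal_loss_db

print(f"loss of an ideal aluminum mirror: {ideal_loss_db:.2f} dB")
print(f"additional loss: {excess_loss_db:.2f} dB")
```

An ideal 90% reflector accounts for about 0.46 dB, which leaves roughly 0.14 dB for other mechanisms such as surface roughness.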

Fig. 8. Far field intensity of the reflected beam; inset: field
intensity in the mirror plane.

Fig. 9. Ratio of the beam diameter perpendicular and parallel to the
long side of the micro mirror as a function of z_focus.

To characterize precisely the curvature of the mirror
plane, causing a potential distortion of the beam, we have
measured the 1/e widths w⊥ and w∥ of the far field in both
directions of the phase front as a function of z_focus. The
ratio w⊥/w∥ is plotted in Fig. 9 versus z_focus. By fitting the
calculated relationship, derived from Gaussian beam optics,
to the measured data, the radius of curvature is
determined to be 400 μm. Corresponding to this slight deviation
from  an  ideal  straight  plane,  no  serious  distortion  of  the  re¬ 
flected  beam  can  be  observed  up  to  a  distance  Zfocus”^^ 
yam,  when  the  beam  diameter  on  the  mirror  becomes  larger 
than  the  extension  of  the  mirror  plane. 

IV.  CONCLUSIONS 

In  summary,  we  have  presented  investigations  of  angled 
etching  with  a  RIE  process  by  deflecting  the  ions  inside  the 
plasma  sheath  directly  with  a  conductive  mask.  We  have 
shown  that  such  a  simple  addition  in  the  usual  RIE  equip¬ 
ment  allows  strongly  sloped  structures  to  be  etched  with  wall 
angles as small as 40°. Besides this, the methane-based etch
chemistry provides mirrorlike sidewall faces. Deep etched
structures with 45° angles show a nearly straight profile,
which is suitable for application as a deflecting mirror. From
optical measurements, performed with aluminum-evaporated
structures, a minimum radius of curvature of the deflecting
plane of 400 μm is determined. The study of the optical
quality of such micro mirrors has shown that the
angle-dependent intensity distribution of the far field of the reflected
beam is nearly identical to that of the incoming beam. The
reflection loss amounts to 0.6 dB. In principle, the wall angle of
the  facets  can  be  adjusted  by  the  height  of  the  mask.  The 
extension  of  the  region  with  homogeneous  etching  condi¬ 
tions  depends  also  on  the  total  mask  height.  For  a  mask 
height  of  9  mm  the  slope  of  etched  structures  varies  with  an 
uncertainty  of  2°  around  an  angle  of  53°  over  a  distance  of  5 
mm.  This  is  better  than  the  homogeneity  obtained  by  angled 
RIE  with  a  tilted  sample  holder. 


JVST  B  -  Microelectronics  and  Nanometer  Structures 




B.  Jacobs  and  R.  Zengerle:  RIE  of  sloped  sidewalls 




¹J. A. Bondur and R. G. Frieser, in Plasma Processing, edited by R. G.
Frieser and C. J. Mogab (Electrochemical Society, Pennington, NJ, 1981),
Proc. Vol. 81-1, p. 180.
²C. M. Horwitz, Appl. Phys. Lett. 44, 1041 (1984).
³L. A. Coldren and J. A. Rentschler, J. Vac. Sci. Technol. 19, 225 (1981).
⁴T. Takamori, L. A. Coldren, and J. L. Merz, Appl. Phys. Lett. 53, 2549
(1988).
⁵J. M. Kim, W. N. Carr, R. J. Zeto, and L. Poli, J. Electrochem. Soc. 139,
1700 (1992).
⁶B. Jacobs and R. Zengerle, Electron. Lett. 31, 794 (1995).
⁷C. Constantine, D. Johnson, S. J. Pearton, U. K. Chakrabarti, A. B.
Emerson, W. S. Hobson, and A. P. Kinsella, J. Vac. Sci. Technol. B 8, 596
(1990).
⁸K. H. Kretschmer, K. Matl, G. Lorenz, I. Kessler, and B. Dumbacher,
Solid State Technol. 33, 53 (1990).
⁹R. Zengerle, H. J. Bruckner, B. Hübner, and W. Weiershausen, J. Vac. Sci.
Technol. B 11, 2641 (1993).
¹⁰J. H. Kim, R. J. Lang, and A. Larsson, Appl. Phys. Lett. 57, 2048 (1990).
¹¹G. A. Porkolab and E. D. Wolf, J. Vac. Sci. Technol. B 11, 2552 (1993).


J.  Vac.  Sci.  Technol.  B,  Vol.  14,  No.  4,  Jul/Aug  1996 
A comparison was made of thin films of silicon dioxide deposited, at floating temperatures, using
electron  cyclotron  resonance  plasma  enhanced  chemical  vapor  deposition  (ECR  PECVD)  and 
distributed  electron  cyclotron  resonance  plasma  enhanced  chemical  vapor  deposition  (DECR 
PECVD).  The  refractive  index,  composition,  and  chemical  bonding  of  the  plasma  oxides  were 
determined  by  null  and  spectroscopic  ellipsometry,  nuclear  reaction  analysis,  and  Fourier  transform 
infrared spectroscopy and were compared with thermal oxides. The damaged layer at the Si/SiO2
interface  resulting  from  ECR  and  DECR  techniques  was  evaluated  by  spectroscopic  ellipsometry. 

Finally,  high  frequency  and  quasi-static  capacitance  voltage  characteristics  and  ramped  current 
voltage  measurements  were  performed  to  determine  the  electrical  properties  of  the  ECR  and  DECR 
deposited silicon oxide. Device quality SiO2 thin films have been prepared using both deposition
techniques:  low  interface  state  density  [5X10^®  eV“^cm”^  (ECR)  or  2.5X10^^  eV“^  cm“^ 

(DECR)],  and  high  critical  field  [5.2  MV/cm  (ECR)  or  6  MV/cm  (DECR)]  have  been  achieved. 

©  1996  American  Vacuum  Society. 


I.  INTRODUCTION 

There  exists  a  need  for  a  low  temperature  process 
(<200 °C) for depositing high quality SiO2 thin films for
microelectronic  applications^  such  as:  passivation  coatings, 
interlevel  dielectrics,  and  gate  dielectrics  in  metal-oxide- 
semiconductor  field  effect  transistors  (MOSFETs)  and  thin 
film  transistors  (TFTs). 

Many  attempts  to  develop  an  acceptable  low  temperature 
oxide  deposition  process  have  been  made  using  both  radio 
frequency  plasma  enhanced  chemical  vapor  deposition 
(rf-PECVD)^  and  electron  cyclotron  resonance  PECVD 
(ECR-PECVD).^  The  rf-PECVD  technique  incorporates  con¬ 
siderable  hydrogen  (5-30  at.  %)^  and  causes  ion  beam  dam¬ 
age  due  to  the  high  energy  ions  (100-1000  eV).  The  ECR 
technique  uses  a  microwave  frequency  of  2.45  GHz  to  excite 
the  plasma  and  the  resonance  condition  is  established  by 
applying an 875 G magnetic field in the region of the plasma.
The  ECR-PECVD  conditions  enable  the  achievement  of  a 
high  ion  density  plasma  (10^^  cm“^)  at  lower  pressures 
(0.05-0.5  Pa)  than  conventional  rf-PECVD  techniques. 
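
The pairing of 2.45 GHz with 875 G follows from the electron cyclotron resonance condition f = eB/(2π·m_e). A minimal numerical check, with the physical constants entered by hand (CODATA values) and illustrative names:

```python
import math

ELEMENTARY_CHARGE_C = 1.602176634e-19  # CODATA value, coulombs
ELECTRON_MASS_KG = 9.1093837015e-31    # CODATA value, kilograms
TESLA_PER_GAUSS = 1.0e-4


def electron_cyclotron_frequency_hz(field_gauss: float) -> float:
    """Electron cyclotron frequency f = e * B / (2 * pi * m_e)."""
    field_tesla = field_gauss * TESLA_PER_GAUSS
    return ELEMENTARY_CHARGE_C * field_tesla / (2.0 * math.pi * ELECTRON_MASS_KG)


resonance_ghz = electron_cyclotron_frequency_hz(875.0) / 1.0e9
print(f"cyclotron frequency at 875 G: {resonance_ghz:.3f} GHz")
```

The result is within about 0.1% of the 2.45 GHz microwave frequency, which is why this field strength marks the resonance zone.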

This investigation considers the deposition of SiO2 films
by  ECR-PECVD  with  both  a  classical  reactor  configuration 
and  with  the  distributed  system.^“^  The  goal  of  this  research 
is  to  compare  the  physical  and  electrical  properties  of  ECR 
and DECR deposited SiO2 films. In a previous study,^ we
compared ECR and DECR oxidation mechanisms of Si and
the resulting SiO2 properties, and we reported that the ECR
and  DECR  deposition  techniques  led  to  similar  physical  and 
chemical  properties. 


II.  EXPERIMENTAL  PROCEDURES 

ECR-PECVD  was  performed  in  a  custom  built  ECR 
plasma  system.  The  system  consists  of  an  independent 
vacuum  processing  chamber  equipped  with  an  ECR  plasma 
source  yielding  2.45  GHz  microwaves,  which  are  guided 
through  a  quartz  window  to  the  ECR  plasma  processing 
chamber.  The  vacuum  chamber  was  evacuated  to  10“^  Pa 
prior  to  processing  using  a  turbomolecular  pump.  The  DECR 
reactor,  used  for  this  study,  was  provided  by  Alcatel  and  has 
been  described  previously.^" ^  Briefly,  the  reactor  consists  of 
a  300  mm  diam  chamber,  equipped  with  a  load-lock;  the 
residual  pressure  in  both  chambers  is  about  5X10“^  Pa.  The 
microwave  power  is  coupled  to  the  plasma  through  eight 
antennas  evenly  distributed  at  the  periphery  of  the  deposition 
chamber. 

Si  samples  used  were  commercially  available  [n-type 
(100) c-Si (resistivity: 1-2 Ω cm) for ECR and p-type (100)
c-Si (resistivity: 8-10 Ω cm) for DECR]. Si samples were
cleaned  by  a  modified  RCA  procedure^^  with  a  final  HF  dip 
and  de-ionized  water  rinse.  After  the  cleaning  procedures,  the 
Si  samples  were  blown  dry  using  dry  nitrogen  and  then 
loaded  immediately  into  the  load  lock,  and  subsequently 
transferred  to  the  main  processing  chamber.  Ge  and  GaAs 
wafers  were  used  for  the  determination  of  Si  atomic  compo¬ 
sition. 

For  ECR  experiments,  diluted  SiH4  (3%  in  Ar)  was 
used  for  the  Si  source.  In  all  the  work  reported  here  concern¬ 
ing  the  ECR  deposition,  the  microwave  power  was  300  W 
and the total gas flow rate was 30 sccm. For the DECR work,
the total gas flow of undiluted SiH4 and O2 (20 sccm) and the
microwave power (800 W) were also kept identical for all
films. We studied the effect of oxygen/silane flow ratios (Rq)
(between 2 and 9) on physical and electrical properties of the
deposited SiO2 films, using the two microwave deposition
techniques. 

The  ionic  current  for  the  oxygen  plasma,  as  shown  in  Fig. 
1(a)  for  ECR  and  Fig.  1(b)  for  DECR,  was  determined  as  a 
function  of  pressure.  To  obtain  these  characteristics,  the  sub¬ 
strate  holder  acted  as  a  large  planar  Langmuir  probe.  Differ¬ 
ent  behavior  was  observed:  in  the  case  of  the  ECR  plasma 
the  ion  flux  decreases  as  a  function  of  the  pressure  whatever 
was  the  value  of  the  microwave  power,  while  for  the  case  of 
the  DECR  plasma  a  maximum  in  the  ion  flux  is  found  for  a 
pressure  in  the  range  0.1 -0.6  Pa.  In  order  to  compare  the 
properties  of  ECR  and  DECR  plasma  deposited  films,  the 
value  of  the  ionic  current  should  be  the  same.  Since  it  is  not 
possible  to  obtain  ionic  current  characteristic  curves  for  the 
02/SiH4  mixtures  (due  to  reaction),  we  use  the  oxygen 
plasma  characteristics  as  an  approximation.  Thus,  we  have 
chosen  a  total  pressure  of  0. 1  Pa  and  a  microwave  power  of 
300  W  for  the  ECR  method  (2.35  mA/cm^),  and  800  W  for 
the  DECR  method  (2.1  mA/cm^). 

Film thicknesses and refractive indexes were measured at
near half an ellipsometric period (80-100 nm) using null
ellipsometry at 632.8 nm for the ECR samples and
546.1 nm for the DECR samples. The refractive index was
deduced using a single film model and assuming no
absorption (k=0). The optical properties of the SiO2 films have
also been studied using spectroscopic ellipsometry (SE)
operating between about 2 and 4 eV for ECR films and DECR
films. The measured dielectric spectra in terms of ε1 and ε2
were modeled using the Bruggeman effective medium
approximation (BEMA), which Aspnes and co-workers and
Collins and co-workers^^ have shown to be useful to
characterize damage at the Si surface. To limit the number of
parameters, we have assumed that the oxide film can be
described by no more than two layers (interface layer and oxide
layer) and that each layer was composed of no more than two
of three possible components: a-Si, SiO2, and voids, as we
had done previously for the ECR-DECR oxidation study.^
Fig. 1. Ionic current density as a function of the pressure for O2
plasma for (a) the ECR technique and (b) the DECR technique.

The layer thicknesses and the chemical compositions for
each model were fitted using a linear regression analysis to
provide the optimum agreement with experiment. The
relative quality of the fit for each model is defined by an
unbiased estimator S. The S formulas, S_E for ECR samples and
S_D for DECR samples, are

$$S_E = \frac{1}{N-P-1}\sum_{i=1}^{N}\left[(\tan\psi_{\mathrm{exp}}-\tan\psi_{\mathrm{cal}})^2+(\cos\Delta_{\mathrm{exp}}-\cos\Delta_{\mathrm{cal}})^2\right], \tag{1a}$$

$$S_D = \frac{1}{N-P-1}\left[\sigma_\Delta\sum_{i=1}^{N}(\Delta_{\mathrm{exp}}-\Delta_{\mathrm{cal}})^2+\sigma_\psi\sum_{i=1}^{N}(\psi_{\mathrm{exp}}-\psi_{\mathrm{cal}})^2\right], \tag{1b}$$

where N is the number of data points and P the number of
unknown parameters, and σ_Δ and σ_ψ are the weighting
coefficients for Δ and ψ, respectively. Thus, the values for S are
only useful for comparison within a set of samples.
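
As an illustration of how such a fit-quality figure can be evaluated, the sketch below computes an estimator of the kind described for the ECR samples. It assumes the form often used in spectroscopic ellipsometry, namely the sum of squared residuals in tan ψ and cos Δ divided by N − P − 1; the function name, argument names, and the use of radians are assumptions made for this example, not details given in the paper.

```python
import math
from typing import Sequence


def unbiased_estimator(
    psi_exp: Sequence[float],
    delta_exp: Sequence[float],
    psi_cal: Sequence[float],
    delta_cal: Sequence[float],
    n_params: int,
) -> float:
    """Residual sum of squares in (tan psi, cos Delta) divided by N - P - 1.

    All angles are in radians. N is the number of data points and
    P (n_params) the number of fitted parameters.
    """
    n_points = len(psi_exp)
    if not (len(delta_exp) == len(psi_cal) == len(delta_cal) == n_points):
        raise ValueError("all input sequences must have the same length")
    degrees_of_freedom = n_points - n_params - 1
    if degrees_of_freedom <= 0:
        raise ValueError("need more data points than fitted parameters + 1")

    total = 0.0
    for pe, de, pc, dc in zip(psi_exp, delta_exp, psi_cal, delta_cal):
        total += (math.tan(pe) - math.tan(pc)) ** 2
        total += (math.cos(de) - math.cos(dc)) ** 2
    return total / degrees_of_freedom


# Hypothetical data: four spectral points, two fitted parameters.
perfect_fit = unbiased_estimator([0.3, 0.4, 0.5, 0.6], [1.0, 1.1, 1.2, 1.3],
                                 [0.3, 0.4, 0.5, 0.6], [1.0, 1.1, 1.2, 1.3], 2)
single_miss = unbiased_estimator([math.pi / 4, 0.0, 0.0, 0.0], [1.0] * 4,
                                 [0.0, 0.0, 0.0, 0.0], [1.0] * 4, 2)
print(perfect_fit, single_miss)
```

Because the value depends on the number of points and parameters, it is meaningful only for ranking models fitted to the same data set, which matches the caveat stated above.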

To confirm the ellipsometry results, the relative
composition of Si and O was evaluated by nuclear reaction analysis
(NRA) [28Si(d,p) and 16O(d,p), respectively], while
the hydrogen contamination was determined by the elastic
recoil detection analysis technique (ERDA).^^

Infrared measurements were performed using a Fourier
transform infrared spectrophotometer operated with a
mercury cadmium telluride (MCT) detector. Normal incidence
transmission was measured on the bare Si substrate, as
background, and on the SiO2 covered Si with a resolution of 2
cm⁻¹.

The  etch  rate  of  the  films  was  examined  using  “P-etch” 
solution [HF(40%):HNO3(65%):H2O of 3:2:60] as reported
by  Pliskin.'* 

Al/Si02/Si  MOS  structures  were  prepared  for  I-V  and 
C-V  characterizations.  Following  film  deposition,  alu¬ 
minium  electrodes  typically  200  nm  thick  with  an  area  of  9 
lO”"^  cm^  for  ECR  films,  and  500  nm  thick  with  an  area  of  3 
10“^  cm^  for  DECR  films,  were  evaporated  through  a 
shadow mask. For the ECR films, the backside contact was
made by abrading the wafer and applying a GaIn paste. For
the DECR samples, Al was evaporated on the back side of
the substrate. After metallization, the samples were subjected
to postmetallization annealing in forming gas (10% H2 in Ar
for DECR samples and 10% H2 in N2 for ECR samples) at
450 °C for 30 min. The quasistatic I-V characteristics were
obtained with an HP 4140B picoammeter with the device biased
in accumulation (voltage ramp of 50 mV/s). High frequency
(1 MHz) and quasistatic (voltage ramp of 10 mV/s
and 5 mV/s for ECR and DECR samples, respectively) C-V
curves were measured using an impedance analyzer and a
picoammeter.

III.  EXPERIMENTAL  RESULTS  AND  DISCUSSION 
A.  Oxide  physical  properties 

The deposition rate, plotted as a function of the
O2/SiH4 flow ratio, is shown in Fig. 2(a) for ECR and Fig. 2(b)
for DECR. The same behavior is observed for the two deposition
methods: the deposition rate decreases when the O2/SiH4 flow
ratio increases from 2 to 9. The depositions in the ECR and
DECR-PECVD reactors involve chemical processes occurring
in the gas phase and/or on the substrate surface. Under
our experimental conditions of low operating pressure (0.1
Pa), the mean free path of reactive species is of the same
order of magnitude as the dimensions of the reactor. This
suggests that the chemical reactions between the silane and
oxygen species occur on the substrate surface rather than in
the plasma. The insets of Figs. 2(a) and 2(b) show a nearly
linear relationship between the deposition rate and the silane
flow rate, which indicates that the deposition is mass-transport
controlled.
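The order-of-magnitude argument about the mean free path can be checked with the hard-sphere kinetic-theory expression λ = k_B T / (√2 π d² p). The sketch below is only an estimate: the gas temperature (300 K) and the molecular diameter (0.36 nm, a typical value for O2) are assumptions of this example and are not given in the text.

```python
import math

K_B = 1.380649e-23  # Boltzmann constant in J/K


def mean_free_path(pressure_pa, temperature_k=300.0, diameter_m=3.6e-10):
    """Hard-sphere mean free path in metres.

    temperature_k and diameter_m are assumed illustrative values.
    """
    cross_section = math.pi * diameter_m ** 2
    return K_B * temperature_k / (math.sqrt(2.0) * cross_section * pressure_pa)


if __name__ == "__main__":
    lam = mean_free_path(0.1)  # operating pressure quoted in the text
    print(f"mean free path at 0.1 Pa: {lam * 100:.1f} cm")
```

Under these assumptions the mean free path at 0.1 Pa comes out at several centimetres, which is indeed comparable to typical reactor dimensions.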

Figure 3 shows the refractive index versus O2/SiH4 flow
ratio for (a) ECR-PECVD samples, and (b) DECR-PECVD
samples. The dotted lines are from a thermal SiO2 sample
grown in dry O2 at 1000 °C, which has a refractive index
n = 1.465 ± 0.002. For both ECR and DECR deposition, the
refractive index values are close to that of thermal SiO2. When
the O2/SiH4 flow ratio is 9, the value of the refractive index
is the closest to that of thermal SiO2. As the O2/SiH4 flow
ratio decreases, the refractive index increases, indicating
excess Si in the films.

For both the ECR and DECR depositions, we have found
that the best fit optical model is a one layer model. Figure 4
shows the best optical models for ECR and DECR-PECVD
samples deposited at different O2/SiH4 flow ratios. For


Fig. 2. Deposition rate as a function of the O2/SiH4 flow ratio and SiH4 flow
(inset) for (a) the ECR technique and (b) the DECR technique.


higher $R_0$ values, SE indicates nearly 100% SiO2, while for
$R_0 = 2$ both DECR and ECR display measurable excess Si
content. In addition, SE indicates a small amount of excess Si
in the DECR samples even at high $R_0$ values. This is confirmed
below with the Si analyses using NRA. Also, the
volume percentage of a-Si decreases as the O2/SiH4 ratio
increases. This is in agreement with null ellipsometry results

Fig. 3. Refractive index as a function of the O2/SiH4 flow ratio for (a) the
ECR technique (632.8 nm) and (b) the DECR technique (546.1 nm).

which show that as the O2/SiH4 ratio decreases, the refractive
index increases.

In  order  to  confirm  the  film  composition  deduced  from  SE 
measurements,  we  have  studied  the  composition  by  NRA  and 
Fig. 4. Optical models calculated from SE data for different O2/SiH4 flow
ratios for the ECR and DECR techniques.

J. Vac. Sci. Technol. B, Vol. 14, No. 4, Jul/Aug 1996


Fig. 5. Atomic densities of Si, O, and H in silicon oxide as a function of
the O2/SiH4 flow ratio (dotted lines: atomic densities of Si and O in the
thermal oxide).

ERDA. Films were prepared with a wide range of oxidant
ratios and the results are illustrated in Fig. 5, where the atomic
densities of Si, O, and H in the films are reported as a function
of the gas phase composition. These densities in at. cm$^{-3}$
are deduced from NRA (accuracy: ±5 at. %) or ERDA
(accuracy: ±5 at. %) and null ellipsometry measurements. NRA
measurements are typically performed on GaAs or Ge substrates,
and equivalent O content is found using Si substrates.
It can be seen that the O and Si atomic densities are close to
those for thermal oxide (dotted lines). NRA confirms the null
and spectroscopic ellipsometry results, which indicate that the
DECR samples are slightly silicon rich (Fig. 6). In addition,
Figs. 5 and 6 indicate that the ECR stoichiometry is also less
than 2 for O, but this is due to a slight oxygen deficiency. SE
shows stoichiometry for high $R_0$ samples. The formation of
slightly silicon-rich DECR films could be due to an incomplete
reaction, or to the fact that oxygen is more volatile than
silicon. From two sets of experimental data, the thicknesses
of the films measured by ellipsometry and the silicon,
oxygen, and hydrogen contents of the films measured by
NRA or ERDA, we can deduce the mass density d, which is
shown in Fig. 6. The ECR and DECR film density is close to
that for thermal oxide and displays no trend with composition.
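The way a mass density and a stoichiometry follow from the measured atomic contents and the film thickness can be sketched as follows. This is a minimal illustration, assuming that the ion-beam analyses deliver areal densities (atoms per cm²) and that the ellipsometric thickness converts them to volume densities. The numerical values in the example are hypothetical and are not data from this work.

```python
AVOGADRO = 6.02214076e23  # atoms per mole
MOLAR_MASS = {"Si": 28.086, "O": 15.999, "H": 1.008}  # g/mol


def mass_density(areal_density, thickness_cm):
    """Mass density in g/cm^3.

    areal_density maps element symbol -> atoms/cm^2 (assumed input form),
    thickness_cm is the film thickness in cm.
    """
    mass_per_area = sum(
        atoms * MOLAR_MASS[element] / AVOGADRO
        for element, atoms in areal_density.items()
    )
    return mass_per_area / thickness_cm


def stoichiometry(areal_density):
    """x in SiOx, taken as the O/Si atomic ratio."""
    return areal_density["O"] / areal_density["Si"]


if __name__ == "__main__":
    # Hypothetical 100 nm film with thermal-oxide-like contents.
    film = {"Si": 2.2e17, "O": 4.4e17, "H": 0.0}
    print(f"density = {mass_density(film, 100e-7):.2f} g/cm^3")
    print(f"x = {stoichiometry(film):.2f}")
```

With the hypothetical contents above, the result is close to the density usually quoted for thermal SiO2 (about 2.2 g/cm³), which is the kind of comparison made in Fig. 6.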

Except for H, no other impurities were detected in any of
the oxides. H is a frequent contaminant in low temperature
deposited SiO2 films. The ERDA measurements gave a
low hydrogen concentration (2 at. %) whatever the value of
the O2/SiH4 ratio.


Fig.  6.  Density  and  stoichiometry  of  ECR  and  DECR  films  as  a  function  of 
gas  flow  ratio. 


There are three characteristic infrared (IR) absorption
bands for thermal SiO2 near 450, 810, and 1075 cm$^{-1}$, which
correspond to the rocking, bending, and stretching vibration
modes of Si-O-Si, respectively.^^ The Si-O stretching band
position changes with the stoichiometry (x) of silicon oxide
(SiO$_x$) deposited by plasma. When x is 1.9, the Si-O band
frequency is about 1060 cm$^{-1}$ and it decreases as x does.^®
The position of this band is usually associated with the
change of the oxygen bridge angle ($\theta$). In force constant (k)
models for vibrational properties, the frequency ($\nu$) of the
Si-O-Si stretching mode can be expressed as^^’^^

$$\nu^2 = \frac{k}{m_O}\sin^2\!\left(\frac{\theta}{2}\right), \tag{2}$$

where k is the nearest neighbor effective force constant,
$m_O$ is the mass of an oxygen atom, and $\theta$ is the Si-O-Si bond
angle, which is assumed to be 144° for thermal SiO2. When $\theta$
decreases, the associated vibrational frequency also
decreases.^^
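To make the link between band position and bridge angle concrete, the sketch below scales the stretching frequency as sin(θ/2), anchored to the thermal oxide values quoted above (1075 cm⁻¹ at 144°). Keeping the force constant k fixed between films is an assumption of this example, and the function names are hypothetical.

```python
import math

THERMAL_FREQ_CM1 = 1075.0   # thermal SiO2 stretching band, from the text
THERMAL_ANGLE_DEG = 144.0   # assumed Si-O-Si angle of thermal SiO2, from the text


def stretch_frequency(theta_deg):
    """Stretching frequency (cm^-1) for a bridge angle, with k held fixed."""
    ref = math.sin(math.radians(THERMAL_ANGLE_DEG) / 2.0)
    return THERMAL_FREQ_CM1 * math.sin(math.radians(theta_deg) / 2.0) / ref


def bridge_angle(freq_cm1):
    """Bridge angle (degrees) implied by a stretching frequency, k held fixed."""
    ref = math.sin(math.radians(THERMAL_ANGLE_DEG) / 2.0)
    s = freq_cm1 / THERMAL_FREQ_CM1 * ref
    if not 0.0 < s <= 1.0:
        raise ValueError("frequency outside the range covered by the model")
    return 2.0 * math.degrees(math.asin(s))


if __name__ == "__main__":
    print(f"angle at 1060 cm^-1: {bridge_angle(1060.0):.1f} deg")
```

Within this simple picture, a shift of the band from 1075 to 1060 cm⁻¹ corresponds to a reduction of the bridge angle by a few degrees.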

The position of the Si-O stretching vibration band is
shown as a function of the O2/SiH4 ratio in Fig. 7, for both
deposition processes. By way of comparison, the frequency
of the Si-O band observed for thermal SiO2 is also reported. For
both processes, the Si-O band frequency does not change
with the O2/SiH4 flow ratio. However, this band for DECR
samples appears at a lower frequency than that observed for
thermal and ECR SiO2 spectra. This result is in good agreement
with previous measurements (SE and NRA) in that the
stoichiometry of SiO2 deposited by DECR is 1.8 (Fig. 6). But
ECR and DECR samples have the same stoichiometry. Thus,

Fig. 7. Si-O-Si stretching frequency as a function of the $R_0$ gas flow ratio
for the ECR and DECR techniques. The dotted line indicates the value of
the thermal oxide Si-O-Si stretching frequency.

the  difference  between  ECR  and  DECR  data  could  be  due  to 
the  difference  in  film  density  as  was  shown  in  Fig.  6. 

Overall, our results show no trend of the physico-chemical
properties of the DECR and ECR films with the
O2/SiH4 flow ratio. However, we do find a trend in the etch
rate of DECR oxides in an HF solution with the gas phase
ratio, and a comparatively smaller etch rate for ECR
films (Fig. 8). The etch rate of DECR oxides decreases when
the O2/SiH4 flow ratio increases and it is lowest for films
deposited at $R_0 = 9$. The P etch rate is often associated with the
deposition rate. Higher P etch rates are usually an indication
of lower density for the same material. For $R_0 < 4$, the
deposition rate for DECR films is much higher than for ECR
films. However, from Fig. 6, we see that the DECR films
display a larger density, but also a larger P etch rate (for
$R_0 < 4$). Figure 5 shows that the lower x (SiO$_x$) for DECR is
due to higher Si, but for ECR samples to lower O. Hence the
point defect structure is decidedly different, and this may be
implicated in the etch rate differences for our plasma prepared
samples.

Fig. 8. P etch rate dependence on $R_0$ flow ratio for the ECR and DECR
techniques. The dotted line indicates the value of the thermal oxide P etch
rate.

Fig. 9. I-V characteristics of the ECR and DECR films deposited with an
O2/SiH4 ratio of 9.

B.  Oxide  electrical  properties 

The electrical quality of the Si/SiO2 interface and the SiO2
film in metal-oxide-semiconductor (MOS) structures is
traditionally defined by several measurements including the
critical field ($E_c$), the resistivity ($\rho$), the interface state
density ($D_{it}$), and the fixed oxide charges ($Q_{ox}$). $E_c$ and $\rho$ are
deduced from quasistatic I-V characteristics.^"^ $E_c$ is defined
as the field required to pass 1 nA/cm$^2$ through the Al gate.
High frequency and quasistatic C-V curves are used to determine
$D_{it}$ and $Q_{ox}$ from the Castagne and Vapaille
method.^^

Figure 9 shows typical I-V characteristics for MOS
structures where the SiO2 was deposited with an O2/SiH4 ratio
of 9. For an electric field E<5 MV/cm for
ECR films and E<6 MV/cm for DECR films, the characteristics
are linear, indicating ohmic behavior. These characteristics
allow a determination of the resistivity ($\rho$) by

$$\rho = \frac{\Delta V}{\Delta I}\,\frac{A}{t}, \tag{3}$$

where A is the area of the MOS electrode and t is the thickness
of the SiO2 film. Within the limit of accuracy in determining
the values of $\rho$, this parameter, which characterizes the
bulk SiO2 film, can be considered to be the same for the
two techniques (1-2×10^^ Ω cm). This result is in good
agreement with the physico-chemical properties, since we have
also detected practically no difference between ECR and
DECR films for high O2/SiH4 gas ratio films. The current
injection from the silicon substrate starts at an average field of
E = 5-5.5 MV/cm for ECR films and E = 6-6.5 MV/cm for
DECR films. The critical field characterizes the beginning of


Fig. 10. High frequency (1 MHz) and quasistatic C-V characteristics for
O2/SiH4 flow ratio of 9 for (a) the ECR technique and (b) the DECR
technique.

this  injection:  DECR  films  exhibit  a  slightly  higher  critical 
field  (6  MV/cm)  than  the  ECR  films  (5.2  MV/cm). 
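The resistivity extraction from the ohmic part of the I-V curve, i.e., the slope ΔV/ΔI scaled by the electrode area and the film thickness, can be sketched as below. The least-squares slope and the function names are choices made for this example, and the numbers in the usage lines are hypothetical rather than measured values.

```python
def ohmic_conductance(voltages, currents):
    """Least-squares slope dI/dV (in siemens) over the linear region."""
    n = len(voltages)
    if n < 2:
        raise ValueError("need at least two points")
    mean_v = sum(voltages) / n
    mean_i = sum(currents) / n
    sxx = sum((v - mean_v) ** 2 for v in voltages)
    sxy = sum((v - mean_v) * (i - mean_i) for v, i in zip(voltages, currents))
    return sxy / sxx


def resistivity(voltages, currents, area_cm2, thickness_cm):
    """Resistivity in ohm cm: (dV/dI) * A / t."""
    resistance = 1.0 / ohmic_conductance(voltages, currents)
    return resistance * area_cm2 / thickness_cm


if __name__ == "__main__":
    # Hypothetical ohmic data: R = 2e12 ohm, A = 1e-3 cm^2, t = 100 nm.
    volts = [1.0, 2.0, 3.0, 4.0]
    amps = [v / 2e12 for v in volts]
    print(f"rho = {resistivity(volts, amps, 1e-3, 100e-7):.2e} ohm cm")
```

Only data points below the onset of injection should be passed in, since the linear relation no longer holds once the critical field is approached.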

High frequency (1 MHz) C-V characteristics of the MOS
capacitors have been investigated to determine the fixed oxide
charges. The flatband voltage as measured from these
curves shows a shift towards positive values. By assuming
that this shift is caused mainly by the work function difference
and negative oxide fixed charges ($Q_{ox}$), we have estimated
$Q_{ox}$ at about 7×10^^ cm$^{-2}$ for ECR films and
−2×10$^{10}$ cm$^{-2}$ for DECR films for a gas flow ratio of 9.

Figure 10 shows the high frequency (1 MHz) and quasistatic
C-V curves for an O2/SiH4 flow ratio of 9. Further
analysis^^ of these curves yields a midgap interface state
density ($D_{it}$) of 5×10^^ and 2.5×10^^ eV$^{-1}$ cm$^{-2}$ for ECR and
DECR films, respectively. A value of 2-3×10$^{10}$ eV$^{-1}$ cm$^{-2}$ is generally
considered as the value found in production for Si/SiO2
interfaces formed by the thermal oxidation of Si. Samples of
SiO2 prepared by other plasma deposition techniques, including
rf plasma^^ and remote rf plasma deposition,^^ have also
yielded good interfacial MOS electronic properties. Our
electronic results for both ECR and DECR are comparable in
terms of $D_{it}$ and $Q_{ox}$.

In  terms  of  electrical  integrity,  the  previous  results  show 
that the ECR and DECR SiO2 films are similar. These
PECVD  films  present  a  critical  field,  a  resistivity  and  an 
interface  state  density  comparable  to  those  of  thermal  oxides 
grown  at  much  higher  temperature  (1000  °C). 

IV.  CONCLUSION 

A comparison of ECR and DECR plasma CVD of SiO2
processes using a variety of techniques shows that high quality
deposited dielectric films are achievable with low interface
damage and suitable electronic properties for MOS
applications. No detectable plasma damage was found at the
Si-SiO2 interface. Nearly stoichiometric films were prepared
at the higher oxidant ratios, although even these films were
slightly off stoichiometry with x<2 for SiO$_x$. It appears that
for DECR there was a slight Si excess, but for ECR there
was a slight O deficiency when compared to thermal SiO2. A
high P etch rate was found, which can be rationalized based on
the defect nature of the films rather than density. Overall the
DECR-PECVD SiO2 films were slightly electronically superior
in terms of critical field and interface trap density, but
this may be due to the better processing facilities for the
DECR process.
Reactive-ion etching of WSi$_x$ was assessed as a function of the CF4 to O2 flow rate ratio.
Based upon the evaluations of etching rates, side-wall profiles, surface roughness,
and damage, we observed that the optimum etching condition was at a ratio of 10:1. The recovery
of reactive-ion-etching-treated GaAs damaged layers through thermal treatment was also
investigated as a function of the annealing temperature and duration time. These parameter
evaluations were for the purpose of achieving a high performance GaAs metal-semiconductor
field-effect transistor. © 1996 American Vacuum Society.


I.  INTRODUCTION 

Reactive-ion etching (RIE) has been extensively applied
to semiconductor device fabrication during the last decade.
Compared with other etching processes, a desired side-wall
profile together with a reasonable etching rate and selectivity
can be achieved by this technology after optimization,
which is not always obtainable from either wet
chemical etching or ion milling. Side-wall profile control is
essential in maintaining a desired critical dimension, which
is especially important for submicron devices. On the other
hand, a higher etching rate is necessary for a larger throughput,
while a better selectivity avoids overetching of
the underlying layer. In this study, we used a RIE process to
etch away the WSi$_x$ material, which is commonly
used for the gate terminal in self-aligned ion-implanted
GaAs metal-semiconductor field-effect transistor
(MESFET) fabrication. WSi$_x$ films are highly resistant to the
conventional wet-etching method, and the formation of WSi$_x$
gates is generally realized by RIE.^"^

Among the various etching chemistries for WSi$_x$ films in
RIE, CF4 plus O2 is the most widely used etchant gas for this
material. In the first part of this study, we focused
on the optimization of etching conditions for WSi$_x$
with different CF4 to O2 flow ratios. These evaluations
include the etching rate, side-wall profile, plasma damage,
and surface roughness. In the second part of this work, the
removal of plasma damage by a subsequent thermal treatment
was systematically investigated. This treatment proved
to be an effective way to remove the damage in GaAs active
layers. Finally, WSi$_x$ gate GaAs MESFETs were fabricated
based on the optimum etching conditions and the
following thermal treatment.

II.  OPTIMUM  CONDITION  OF  CF4/O2  FLOW  RATE 
RATIO 

The WSi$_x$ material was cosputtered on top of n-GaAs
layers in an Ar-ambient chamber, and the x value is around
0.3. Owing to the high vapor pressure of WF6 and SiF4 at room
temperature, fluorine-based etchants such as CF4 or SF6 are
suitable for WSi$_x$ etching. Moreover, adding O2 to the
CF4 plasma up to a certain amount is accompanied by an
increase of the F radical density, resulting in an increase of the WSi$_x$
etching rate."^ Figure 1 illustrates the WSi$_x$ etching rate and
the density of F radicals as a function of the O2
flow rate. The flow rate of CF4 was 10 sccm, and this rate
was fixed through the whole course of this work. F radicals
were detected by optical emission spectroscopy in the RIE
chamber at a wavelength of 703.7 nm. The etching rate
first increased with increasing O2 content, reaching the
highest value of 1600 Å/min at 4 sccm of O2. On further
increasing the O2 amount, the F radical density is diluted,
which causes a decrease in the etching rate. The F emission at 703.7
nm followed almost the same trend as the etching rate
curve, and can serve as a good signal to monitor the WSi$_x$
etching process. The etching selectivity between WSi$_x$ and
GaAs was 28 at a CF4:O2 flow rate ratio of 10:1. The etching
rate of GaAs is insensitive to the O2 concentration under
such plasma conditions, and the rate is around 20 Å/min.
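The practical meaning of the selectivity figure can be illustrated with a short calculation. The sketch below uses the selectivity (28) and the GaAs etching rate (about 20 Å/min) quoted above to infer the WSi$_x$ rate at the 10:1 ratio, and then estimates how much GaAs is lost during an overetch. The gate film thickness and the overetch fraction in the example are hypothetical values chosen only for illustration.

```python
def film_rate_from_selectivity(selectivity, substrate_rate):
    """Film etching rate implied by selectivity = film rate / substrate rate."""
    return selectivity * substrate_rate


def etch_time_min(thickness_angstrom, rate_angstrom_per_min):
    """Time in minutes needed to clear a film of the given thickness."""
    return thickness_angstrom / rate_angstrom_per_min


def substrate_loss(overetch_fraction, clear_time_min, substrate_rate):
    """Substrate thickness (in angstrom) removed during the overetch step."""
    return overetch_fraction * clear_time_min * substrate_rate


if __name__ == "__main__":
    gaas_rate = 20.0                                          # A/min, from the text
    wsi_rate = film_rate_from_selectivity(28.0, gaas_rate)    # A/min at 10:1
    t_clear = etch_time_min(4000.0, wsi_rate)                 # hypothetical 4000 A film
    loss = substrate_loss(0.2, t_clear, gaas_rate)            # hypothetical 20% overetch
    print(f"WSix rate {wsi_rate:.0f} A/min, clear time {t_clear:.1f} min, "
          f"GaAs loss {loss:.0f} A")
```

A high selectivity therefore keeps the GaAs recess small even when a generous overetch is used to guarantee that the gate film is cleared.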

As to the side-wall profile investigation, Figs. 2(a) and
2(b) show scanning electron microscopy (SEM) cross
sections of WSi$_x$ gates at O2 flow rates of 4 and 7 sccm,
respectively, under a RF power of 30 W and a pressure of 50
mTorr. The etching time of each individual process was
determined by the etching rate of WSi$_x$. Therefore, the etching
time at an O2 flow rate of 4 sccm was the shortest.

The O2 flow rate of 1 sccm gave side-wall profiles similar
to those at 7 sccm. The top Al metal was used
as an etching mask to replace the photoresist, which was
heavily attacked by the plasma bombardment. In the case of the
4-sccm O2 flow rate, which also showed the highest etching
rate of WSi$_x$ in Fig. 1, a large undercut profile with an angle
of 28° was obtained. At both 1 and 7
sccm, this undercut was reduced, and an ideal vertical
side-wall profile can potentially be obtained. This change in
side-wall profile with O2 concentration is consistent
with the results presented in Fig. 1; namely, a higher F
Fig. 1. WSi$_x$ etching rate and associated F atom in RIE chamber as a
function of O2 flow rates.


radical density attacks the WSi$_x$ side walls more severely, resulting
in a larger undercut angle. The dc bias was insensitive to
the O2 content in the RIE chamber, so an effect caused
purely by ion bombardment can be ruled out. Therefore,
either a lower or a higher O2 flow rate maintains
a better WSi$_x$ side wall.

Plasma damage caused in a RIE chamber is a potential
problem, which will degrade the device performance. This
damage may correspond to structural deformation,
stoichiometric changes, or impurities.^"^ The next characterization
concerns the plasma damage at the GaAs surface. We
evaluated the sheet resistance ($R_{sh}$) in a GaAs active layer by
means of a transmission-line method (TLM). The TLM patterns
were defined by photolithographic techniques, and
ohmic contacts were realized by evaporating Ge-Au-Ni
metal, followed by a 350 °C, 1-min hot-plate annealing.
Figure 3 shows the change in $R_{sh}$ of n-GaAs layers as a function of
O2 content after being exposed in the RIE chamber for 5
min. Each data point represents an average of at least five
points, located in different areas. The original value of $R_{sh}$
was 490 Ω/sq without the RIE treatment. The increase of $R_{sh}$
indicates the enhancement of surface damage, resulting from
the plasma bombardment in GaAs active layers. This damage
reached the highest value at an O2 flow rate of 4 sccm, and
either increasing or decreasing the O2 flow rate caused a
decrease of surface damage. $R_{sh}$ was 628, 756, and 500 Ω/sq
at O2 flow rates of 1, 4, and 7 sccm,
respectively. At the 4-sccm O2 flow rate, the WSi$_x$ etching rate is
the fastest and may induce a rougher surface
morphology. A rougher surface degrades the electron mobility,
which gives a higher $R_{sh}$. Using atomic force
microscopy (AFM), the GaAs surface etched under 4 sccm of O2
did show a rougher morphology, and this issue is
further discussed in the next paragraph.
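For readers unfamiliar with the TLM, the sheet resistance is obtained from the slope of the total resistance measured between contact pads versus pad spacing, using the usual relation R = 2R_c + R_sh·L/W. The sketch below is a generic illustration of that fit. The pad geometry and resistance values are hypothetical and are not the data of this work.

```python
def tlm_fit(spacings, resistances, width):
    """Least-squares TLM extraction.

    spacings and width must share the same length unit; resistances in ohm.
    Returns (sheet resistance in ohm/sq, contact resistance in ohm), based on
    R_total = 2 * R_c + R_sh * L / W.
    """
    n = len(spacings)
    if n < 2:
        raise ValueError("need at least two pad spacings")
    mean_l = sum(spacings) / n
    mean_r = sum(resistances) / n
    sxx = sum((l - mean_l) ** 2 for l in spacings)
    sxy = sum((l - mean_l) * (r - mean_r) for l, r in zip(spacings, resistances))
    slope = sxy / sxx                       # ohm per unit length
    intercept = mean_r - slope * mean_l     # equals 2 * R_c
    return slope * width, intercept / 2.0


if __name__ == "__main__":
    # Hypothetical pads: 100 um wide, spacings in um, R_sh = 490 ohm/sq, R_c = 5 ohm.
    gaps = [5.0, 10.0, 20.0, 40.0]
    measured = [2 * 5.0 + 490.0 * g / 100.0 for g in gaps]
    r_sh, r_c = tlm_fit(gaps, measured, 100.0)
    print(f"R_sh = {r_sh:.0f} ohm/sq, R_c = {r_c:.1f} ohm")
```

Because the contact resistance appears only in the intercept, the slope isolates the sheet resistance of the active layer, which is why the method is sensitive to etch-induced damage in that layer.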

Based on the previous investigation, the O2 content in a
RIE chamber will affect the $R_{sh}$ and may be related to the
GaAs morphology. Therefore, the final part of this section is


Fig. 2. SEM cross section of WSi$_x$ gate profiles on GaAs substrate after
RIE. (a) CF4: 10 sccm, O2: 4 sccm and (b) CF4: 10 sccm, O2: 7 sccm.


the investigation of surface roughness associated with the O2
flow rate. Surface roughness was determined by AFM.
Figure 4 shows the average surface roughness of GaAs after
being treated in a RIE chamber for 90 s at a RF power of 30
W. The original GaAs surface roughness without the RIE
treatment was around 10 Å. Surface roughness reached the
highest value of 34 Å at an O2 flow rate of 4 sccm, which
corresponds to a higher $R_{sh}$. Either increasing or decreasing
the O2 content, as shown in this figure, resulted in an
improvement of the GaAs surface. The improvement in surface roughness
after RTA may be associated with the evaporation of
nonvolatile species, which were the side products after RIE. The
other possibility is the rearrangement of surface atoms due to
a higher surface mobility during the RTA treatment. Therefore,
we conclude that there exists an optimum window for
the O2 content in a CF4+O2 gas mixture. This window
avoids the generation of too many F radicals,
which causes a higher surface damage and a larger undercut
Fig. 3. Sheet resistance change of GaAs active layer as a function of O2 flow
rates after RIE treatment.

WSi$_x$ profile. Either a higher or a lower O2 content
substantially improves these undesired properties.

O2 flow rates of both 1 and 7 sccm seem to be optimum
conditions for etching away WSi$_x$ materials. However,
for practical reasons, a higher O2 content is not preferred
since it increases the etching rate of photoresist, which
is generally used as an etching mask. Figure 5 illustrates
the photoresist etching rate as a function of the O2 flow
rate. The etching rate increased monotonically with
increasing O2 flow rate. An etching rate of 3300 Å/min was
obtained at a flow rate of 7 sccm. Therefore, it is clear that
under a higher O2 flow rate condition, although reasonable
etching properties can still be achieved, photoresist is not
sufficient to protect the underlying materials. Consequently,
we chose the O2 flow rate of 1 sccm as the best etching
parameter, and used this condition for the following damage
recovery study.


Fig.  4.  Surface  roughness  evaluated  by  AFM  of  RIE  treated  GaAs  before 
and  after  the  RTA  heat  treatment. 


Fig.  5.  The  change  of  etching  rate  in  photoresist  vs  O2  flow  rate  in  a 
CF4+O2  RIE  chamber. 

III.  RECOVERY  OF  RIE  PLASMA  DAMAGE 

Since the exposed GaAs active layer in a RIE chamber is
unavoidably attacked by the plasma bombardment, in this
section we focus on the technique to restore these damaged
layers. A post-RIE thermal treatment is the typical technique
to remove this damage, and we used rapid thermal
annealing (RTA) to heal surface crystalline defects.
RTA provides a rapid, clean heat treatment, and the surface
decomposition in GaAs can therefore be eliminated. The
evaluation of damage and recovery of the GaAs surface was
carried out by measuring the sheet resistance ($R_{sh}$) of active
layers, which is determined by the TLM. Ohmic contacts
were formed after the RTA recovery process, and the alloy
condition was 350 °C, 1 min. Therefore, the recovery effect
resulting from the ohmic alloy is limited.

A 1000 Å thick n-GaAs (n = 1×10^^ cm$^{-3}$) active layer
was grown by a molecular-beam epitaxy (MBE) system on
semi-insulating GaAs substrates. The original $R_{sh}$ was 490
Ω/sq. These layers were subsequently placed in a CF4+O2
(10 sccm:1 sccm) RIE chamber to investigate the change in
$R_{sh}$ of the n-GaAs layers. The $R_{sh}$ increased significantly as a
function of the exposure time and became saturated after a 2
min plasma treatment. The $R_{sh}$ after a 10 min exposure became
940 Ω/sq under a dc bias of 740 V at 30 W. The higher
value of $R_{sh}$ indicates a reduction in mobility and sheet carrier
density,^ which corresponds to a degradation of the active
layers.

Fig. 6. The recovery of sheet resistance of n-GaAs associated with (a) RTA
temperatures and (b) duration times.

As to the damage recovery evaluation, we used the
sample treated in a 30-W, CF4+O2 plasma for 2 min, and the
corresponding $R_{sh}$ value was 900 Ω/sq. The RIE-treated GaAs
wafers were surface unpassivated in the RTA chamber, and
Ar was the only gas used in the thermal cycle. $R_{sh}$, shown
in Fig. 6(a), decreased systematically as a function of RTA
temperature. The $R_{sh}$ became 590 Ω/sq after a 500 °C, 90-s
treatment, which is quite close to that of the as-grown sample (490
Ω/sq). However, on further increasing the RTA temperature,
the improvement of $R_{sh}$ was limited. The time dependent
RTA evaluation, shown in Fig. 6(b), demonstrated the existence
of an optimum duration time for RTA. For a longer
duration time, the surface decomposition may become severe
and cause an increase of $R_{sh}$. Therefore, we concluded
that a 500 °C, 90-s RTA heat treatment is an optimum condition
to remove the damage in GaAs active layers. Since
Ar was the only gas used in the RTA chamber, conduction
recovery resulting from hydrogen passivation can, therefore,
be ruled out. The effect of this thermal treatment can
also be confirmed by the AFM investigation. GaAs surface
roughness, shown in Fig. 4, became smaller after the RTA,
which may be associated with the healing of crystalline
defects.
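One convenient way to put a number on "quite close to the as-grown sample" is a recovery fraction, defined here as the part of the RIE-induced increase in sheet resistance that the anneal removes. This metric is not defined in the text and is introduced only as an illustration. The input values are the sheet resistances quoted above (490, 900, and 590 Ω/sq).

```python
def recovery_fraction(r_as_grown, r_damaged, r_annealed):
    """Fraction of the damage-induced sheet resistance increase that is removed.

    1.0 means full return to the as-grown value, 0.0 means no recovery.
    """
    increase = r_damaged - r_as_grown
    if increase <= 0:
        raise ValueError("damaged value must exceed the as-grown value")
    return (r_damaged - r_annealed) / increase


if __name__ == "__main__":
    frac = recovery_fraction(490.0, 900.0, 590.0)  # ohm/sq values from the text
    print(f"recovered fraction: {frac:.0%}")
```

With the quoted values, roughly three quarters of the RIE-induced increase is removed by the 500 °C, 90-s anneal.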

After fully characterizing the recovery of damaged
n-GaAs layers, we applied this RTA process to the MESFET
fabrication. The channel of this MESFET was
1000 Å thick with a carrier density of 2×10^^ cm$^{-3}$.
WSi$_x$ gates 1.2 μm long were first defined by RIE with a
CF4/O2 flow ratio of 10:1, and the RF power was 30 W at 50




Fig. 7. (a) $I_{ds}$-$V_{ds}$ output and (b) $I_{ds}$-$g_m$-$V_{gs}$ transfer characteristics of RIE
+RTA gates in GaAs MESFETs (gate length = 1.2 μm). RIE gate GaAs
MESFETs (dashed lines) are included in (b) for comparison.


mTorr. This etching process was followed by a 500 °C, 90 s
RTA heat treatment to recover the damaged layers.
Samples that did not receive the RTA treatment were also
included to verify the effect of this recovery process. Both
RIE+RTA and RIE treated MESFETs were completed by
mesa etching and an ohmic formation process. The $R_{sh}$
evaluated from the on-chip TLM test patterns revealed a
value of 880 Ω/sq for the RIE-treated layer. For the RIE
+RTA treated layer, the $R_{sh}$ dropped to 570 Ω/sq, which is
quite close to its original value (550 Ω/sq).

Figure 7 shows the I-V characteristics of RIE+RTA treated GaAs MESFETs with a gate length of 1.2 μm. The peak transconductance ($g_m$) was 120 mS/mm. For the RIE treated GaAs MESFETs, included in Fig. 7(b), this value was 110 mS/mm under the same bias condition. The associated threshold voltages ($V_{th}$) were -1.75 V for the RIE+RTA gate and -1.56 V for the RIE gate. The less negative $V_{th}$ of the RIE gate indicates a decrease of carrier density in the MESFET channel. Electrons, under such circumstances, will
be  trapped  inside  the  defect  and  cause  a  decrease  of 
These trapped electrons are subsequently released by the RTA treatment, which shifts $V_{th}$ toward more negative voltage.

IV. CONCLUSIONS

RIE etching of WSi$_x$ has been optimized with respect to the CF4/O2 flow ratio. As far as the sidewall profiles and surface damage are concerned, we conclude that a flow-rate ratio of 10:1 is the best etching condition for WSi$_x$ materials. Plasma damage caused by the RIE treatment can be restored by a subsequent RTA treatment. This thermal treatment recovers the electrical properties and also reduces the surface roughness. GaAs MESFETs fabricated using the optimum WSi$_x$ etching and thermal process conditions demonstrated the importance of these parameters.

Characterization of electrical damage induced by CH4/H2 reactive ion

In this article, we report investigations on the effects of methane/hydrogen (CH4/H2) reactive ion etching (RIE) of InGaAs/InAlAs/InP heterostructure materials for high electron mobility transistors, and especially on the electrical properties of the InAlAs layer after dry recess etching of the InGaAs cap layer. The ion etching induced damage in the InAlAs barrier layer is evaluated by diode current-voltage and capacitance-voltage measurements and deep-level transient spectroscopy (DLTS). The I-V data indicate that RIE lowers the Schottky barrier height and increases the ideality factor. Low pressure (10 mTorr) RIE processes with various self-bias voltages show that low ion energy is necessary to obtain good Schottky contacts. Rapid thermal annealing at 400 °C is shown to induce a limited decrease of the dry etching induced defects. However, full recovery of the electrical properties is not achieved. A significant improvement of the Schottky diode electrical characteristics and DLTS spectra is observed after wet etching a 60 Å thick InAlAs layer before metal deposition, indicating that the main damage is concentrated within a short distance from the surface. © 1996 American Vacuum Society.


I.  INTRODUCTION 

The increasing importance of InP-based materials in the fabrication of microelectronic and optoelectronic integrated circuits requires the development of well-controlled processes for the fabrication of microstructure devices.

Extensive work on selective wet and dry etching processes for these materials has been reported, allowing the fabrication of devices such as high electron mobility transistors (HEMTs), heterostructure field effect transistors (HFETs), and heterojunction bipolar transistors (HBTs). For example, in HBT technology, etching of the emitter must be done very precisely and has to stop at the thin base region. For HFETs, a controlled process giving accurate reproducibility and etch selectivity of the InGaAs contact layer versus the InAlAs barrier layer is necessary to ensure low threshold voltage dispersion. Although conventional wet chemical etching does not induce damage in the layers, it does not offer the same capabilities as dry etching in terms of control of fine patterns, uniformity, anisotropy, and etching profiles.

The recent demand for extremely high performance InAlAs/InGaAs HEMTs for microwave applications or optical communication circuits, in the 1.3-1.5 μm wavelength range, has created a need for selective dry etching techniques. Reactive ion etching (RIE) with CH4/H2 has been developed and used by several groups for InP-based materials, but relatively few reports have been published which demonstrate the application of this technique in transistor fabrication. Indeed, RIE may induce surface damage caused by ion bombardment, stoichiometry changes due to preferential depletion of group V elements, redeposition of hydrocarbon polymer, and hydrogen passivation of donors. Such defects result in poor Schottky contacts, which induce excessive leakage current and low breakdown voltage that limit HEMT device performance. Such defects can also increase the low frequency noise of such devices.

In this article, we study the influence of the RIE of the InGaAs layer and exposure of InAlAs to the rf plasma on the electrical properties of TiAu-InAlAs Schottky contacts. Conventional I-V and C-V measurements were used to evaluate the electrical characteristics of the Schottky diode (barrier height, ideality factor, and leakage current) after etching the InGaAs cap layer. Deep traps induced by interaction of the plasma with the semiconductor surface were measured by deep-level transient spectroscopy (DLTS). Annealing studies on these diodes were carried out to remove damage. The spatial extension of the damaged layer was evaluated by wet etching thin layers at the InAlAs surface in order to recover good Schottky diode characteristics.

II.  EXPERIMENT 

The epitaxial structures used in this work were grown by molecular beam epitaxy (MBE) on n InP substrates. They consist of a 0.25 μm thick n$^+$ InAlAs buffer layer, a 0.5 μm InAlAs barrier, and a 0.01 μm undoped InGaAs cap layer. Si was used as a dopant, and the carrier concentration was typically 2X10^^ cm$^{-3}$ in the InAlAs barrier layer.

Diodes were processed on these layers using conventional technology. Alloyed AuGeNiAu was used as a backside ohmic contact. After etching the InGaAs cap layer with CH4/H2 using a SiN mask, the TiAu (50-300 nm) Schottky contact was deposited by e-beam evaporation. Native oxides on the InAlAs surface were removed by a dilute HCl dip prior to metal evaporation. Schottky electrodes with a 2X10~^ cm$^2$ area were used for electrical characterization. Etching was performed in a planar diode RIE system (ALCATEL GIR 100) operating at a frequency of 13.56 MHz. As a result of a previous optimization of the etching parameters, a gas mixture of methane and hydrogen was used at a total pressure of 10 mTorr, with a 1CH4/6H2 ratio and a total flow rate of 30 sccm. Several etching processes were compared, with rf powers applied to the cathode of 20, 60, 100, and 140 W. The mean self-bias on the cathode was 200, 450, 600, and 800 V, respectively. Selective etching of InGaAs over InAlAs was obtained when the flux of the reactive hydrocarbon species was higher than the removal rate of the reacted InAlAs. The (1CH4/6H2) mixture at a pressure of 10 mTorr results in a low deposition rate of hydrocarbon polymers on the sample surface. Under these conditions, both InGaAs and InAlAs were etched for all rf powers. For example, the etch rates of InGaAs and InAlAs at 200 V are 15 and 3 nm/min, respectively, giving an etching selectivity of InGaAs over InAlAs of 5:1. Selectivity between InGaAs and InAlAs was observed to increase with plasma rf power. We used similar etch times (5 min) and gas residence times for these RIE processes. On some samples, rapid thermal annealing in an Ar/H2 atmosphere for 20 s before Schottky metal deposition was used to remove damage from the etched layer. The annealing temperature (400 °C) is typical of ohmic contact formation.

Fig. 1. Log I vs V characteristics of TiAu-InAlAs Schottky diodes at various self-bias voltages. [Vertical axis: current (μA).]

DLTS measurements were performed using a DL 8000 digital DLTS system from Bio-Rad. This system digitizes the capacitance or current transient. It then performs a numerical Fourier transformation, in which parameters are analyzed with the significant advantage that the deep-level characteristics can be obtained during measurements using a single temperature scan. The first sine coefficient spectrum $b_1$ has been used in this study to characterize the transient evolution with temperature. For all measured samples, a reverse bias of -0.8 V, a filling pulse of -0.2 V, and a pulse width of 1 ms were used. Temperature was scanned in the 100-450 K range.
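
As an illustration of the Fourier analysis described above, the following Python sketch computes the first sine coefficient $b_1$ of a digitized exponential transient and compares it with the closed-form value. The function names, the sampling layout (N equally spaced samples over one period $T_w$), and the numerical values are assumptions made for this example; they are not taken from the DL 8000 system.

```python
import numpy as np


def first_sine_coefficient(samples, period):
    """Approximate b1 = (2/T_w) * integral_0^T_w f(t) sin(2*pi*t/T_w) dt.

    `samples` holds f(t) at N equally spaced times t_k = k*T_w/N (assumed layout).
    """
    samples = np.asarray(samples, dtype=float)
    n = samples.size
    t = np.arange(n) * period / n
    return 2.0 / n * np.sum(samples * np.sin(2.0 * np.pi * t / period))


def analytic_b1(amplitude, tau, period):
    """Closed-form b1 for f(t) = amplitude * exp(-t/tau) on [0, T_w]."""
    omega = 2.0 * np.pi / period
    return (2.0 * amplitude / period) * omega * (1.0 - np.exp(-period / tau)) / (
        1.0 / tau**2 + omega**2
    )


if __name__ == "__main__":
    # Hypothetical values: 10 ms analysis window, 2 ms emission time constant.
    period, tau = 10e-3, 2e-3
    t = np.arange(4096) * period / 4096
    print(first_sine_coefficient(np.exp(-t / tau), period))
    print(analytic_b1(1.0, tau, period))
```

Because the emission time constant changes with temperature, evaluating $b_1$ at each temperature of the scan yields a spectrum whose extremum marks the temperature at which the time constant matches the analysis window.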

III.  RESULTS  AND  DISCUSSION 
A.  Characterization  of  the  RIE  samples 
1. I-V characteristics

Figure 1 shows a typical semilog plot of the I-V characteristics of TiAu-InAlAs diodes as a function of self-bias voltage. The I-V plot of a chemically etched sample is also shown for comparison.

These data are analyzed assuming thermionic emission of electrons over the potential barrier. For a Schottky diode dominated by thermionic emission, the current versus voltage characteristics are given by

$$I = I_0\left\{\exp\left[\frac{q(V - I R_s)}{nkT}\right] - 1\right\}, \tag{1}$$

where $I_0$ is the saturation current, $n$ is the ideality factor, and $R_s$ is the series resistance. Ideality factors are calculated under forward bias, from the slope of the linear region of log I-V plots, and effective barrier heights from the extrapolated saturation current, using relations (2) and (3), respectively:

$$n = \frac{q}{kT}\,\frac{dV}{d(\ln I)} \tag{2}$$

and

$$\phi_b = \frac{kT}{q}\,\ln\left(\frac{S A^{*} T^{2}}{I_0}\right), \tag{3}$$

where $S$ is the diode area and $A^{*}$ is the effective Richardson constant (~10 A cm$^{-2}$ K$^{-2}$ for InAlAs).

Table I. Effective $\phi_b$ and real $\phi_{b0}$ Schottky barrier height, ideality factor $n$, series resistance $R_s$, and reverse current $I_r$ at -1 V calculated from I-V data.

Type of processing   φ_b (eV)   φ_b0 (eV)   n      R_s (Ω)   I_r (A)
                     (I-V)      (I-V-T)
Chemical etched      0.60       0.62        1.07   2.5       3X10-''
RIE 200 V            0.50       0.48        1.13   3.12      6X10-*^
RIE 450 V            0.44       0.39        1.30   3.5       2X10"’
RIE 600 V            0.42       0.39        1.40   3.9       5X10-^
RIE 800 V            0.51       0.41        1.60   10        7X10-*
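
The extraction of the ideality factor and effective barrier height from relations (1)-(3) can be sketched in Python as follows. This is a minimal illustration on synthetic data, assuming the fit is restricted to the forward region where $V \gg kT/q$ and the $IR_s$ drop is negligible; the function name, the 1e-4 cm$^2$ contact area, and the voltage window are hypothetical choices for the example, not values from the measurements.

```python
import numpy as np

K_B_OVER_Q = 8.617333262e-5  # Boltzmann constant over elementary charge, V/K


def fit_forward_iv(voltage, current, area_cm2, temp_k=300.0, a_star=10.0):
    """Return (n, I0, phi_b) from the linear region of ln(I) vs V.

    Assumes V >> kT/q and negligible I*R_s in the selected range, so that
    Eq. (1) reduces to ln(I) = ln(I0) + qV/(nkT).
    a_star is in A cm^-2 K^-2 (about 10 for InAlAs according to the text).
    """
    slope, intercept = np.polyfit(voltage, np.log(current), 1)
    kt_q = K_B_OVER_Q * temp_k
    n = 1.0 / (kt_q * slope)                                   # Eq. (2)
    i0 = float(np.exp(intercept))                              # extrapolated saturation current
    phi_b = kt_q * np.log(a_star * area_cm2 * temp_k**2 / i0)  # Eq. (3), in eV
    return n, i0, phi_b


if __name__ == "__main__":
    # Synthetic diode: n = 1.07, phi_b = 0.60 eV, hypothetical 1e-4 cm^2 contact.
    area, temp = 1e-4, 300.0
    kt_q = K_B_OVER_Q * temp
    i0_true = 10.0 * area * temp**2 * np.exp(-0.60 / kt_q)
    v = np.linspace(0.15, 0.30, 16)
    i = i0_true * (np.exp(v / (1.07 * kt_q)) - 1.0)
    print(fit_forward_iv(v, i, area, temp))
```

On real data the fit window matters: at low bias the "-1" term and excess currents distort the slope, and at high bias the series resistance bends the curve, which is the behavior noted for the reactive ion etched samples.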

For the reference diode, transport characteristics dominated by thermionic emission ($n \approx 1.10$) are observed; the effective barrier height calculated from $I_0$ is 0.60 eV. For the reactive ion etched samples, the I-V characteristics could not be accurately fitted by a thermionic emission model. Comparing the I-V characteristics, two observations can be made:

(i) The reverse current as well as the extrapolated forward saturation current increase for reactive ion etched samples when compared to the wet etched samples. This effect may be related to a reduction of the effective Schottky barrier height.

(ii) Under forward bias, the current increases in the low voltage region, whereas the slope of the linear region of the log I-V plots decreases in the high bias region.

Measured values of barrier height, ideality factor, series resistance, and reverse current at -1 V calculated from I-V data are summarized in Table I.

The decrease in barrier height after reactive ion etching may be related to near-surface damage induced by ion bombardment during etching. This may be responsible for the observed current increase at low forward bias and also for the increase in the series resistance of the diodes. From Table I it appears that $\phi_b$ does not decrease monotonically with increasing self-bias: $\phi_b$ for 800 V is higher than for 450 and 600 V. Similar variations of Schottky contact parameters have also been reported for IBE and ion sputtering of Si and GaAs.

2. I-V-T characteristics

Analysis of the temperature dependence of the saturation current density was also conducted in order to evaluate the real barrier height from the Richardson plot of $\ln(J_s T^{-2})$ vs $T^{-1}$.

a. Forward bias. Figure 2 shows the Richardson plot $\ln(J_s T^{-2})$ vs $T^{-1}$ of the Schottky diodes. The corresponding barrier heights are reported in Table I. We obtain a "real" barrier height of 0.62 eV for the chemically etched sample, which compares well with the "effective" barrier height deduced by the thermionic emission model of the forward current. The values of the real barrier height for samples etched with 10 mTorr CH4/H2 at 200, 450, 600, and 800 V are 0.48, 0.39, 0.39, and 0.41 eV, respectively. From these results, it appears that the effective barrier height is higher than the real barrier height for the samples etched at 450, 600, and 800 V. The effective barrier height, as estimated from the saturation current of room temperature I-V characteristics, includes the effects of the presence of a thin insulating layer between metal and semiconductor. Larger ideality factors (1.3, 1.4, and 1.6) are also observed on these Schottky diodes processed at higher ion energy (≥450 V), probably resulting from the presence of this interfacial layer at the Schottky junction. Such interfacial layers may result from the formation of a heavily damaged region near the sample surfaces during RIE at high ion energy (≥450 V), as will be discussed later.

Fig. 2. Richardson plot $\ln(J_s T^{-2})$ vs $T^{-1}$ of the Schottky diodes. [Vertical axis: ln[$I_s/T^2$ (mA K$^{-2}$)].]

Fig. 3. Reverse I-V-T plots for diodes processed with chemical etching and RIE at a self-bias of 200 V. [Axes: reverse current (μA) vs voltage (V).]

The  effects  of  CH4/H2  reactive  ion  etching  on  the  electron 
conduction  through  the  TiAu-InAlAs  Schottky  junction 
were  analyzed  using  I-V-T  measurements  under  forward 
and  reverse  bias  on  both  chemically  etched  samples  and 
samples  etched  by  RIE  with  a  200  V  self-bias  voltage. 

From  the  Richardson  plot  of  the  saturation  current  density 
at  forward  bias,  we  deduce  the  values  of  the  real  Schottky 
barrier  height  as  given  in  Table  I  (0.6  eV  for  the  reference 
sample  and  0.48  eV  for  the  RIE  sample  at  200  V). 
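
The Richardson-plot analysis used to obtain the real barrier height can be sketched as a straight-line fit of $\ln(I_s/T^2)$ against $1/T$, whose slope is $-q\phi_{b0}/k$. The Python example below is a minimal illustration on synthetic data; the function name, temperature range, and contact area are assumptions made for the example.

```python
import numpy as np

K_B_EV = 8.617333262e-5  # Boltzmann constant in eV/K


def richardson_fit(temps_k, i_sat, area_cm2):
    """Fit ln(I_s/T^2) vs 1/T; return (phi_b0 in eV, A* in A cm^-2 K^-2)."""
    temps_k = np.asarray(temps_k, dtype=float)
    i_sat = np.asarray(i_sat, dtype=float)
    slope, intercept = np.polyfit(1.0 / temps_k, np.log(i_sat / temps_k**2), 1)
    phi_b0 = -slope * K_B_EV                       # slope = -q*phi_b0/k
    a_star = float(np.exp(intercept)) / area_cm2   # intercept = ln(S * A*)
    return phi_b0, a_star


if __name__ == "__main__":
    # Synthetic saturation currents for phi_b0 = 0.62 eV, A* = 10 A cm^-2 K^-2,
    # and a hypothetical 1e-4 cm^2 contact.
    temps = np.linspace(250.0, 350.0, 11)
    i_s = 1e-4 * 10.0 * temps**2 * np.exp(-0.62 / (K_B_EV * temps))
    print(richardson_fit(temps, i_s, 1e-4))
```

Unlike the room-temperature estimate from relation (3), this fit does not require a prior value of $A^{*}$, which is why the two barrier heights can differ when an interfacial layer is present.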

b. Reverse bias. Figure 3 shows the temperature dependence of the reverse current of these Schottky diodes. For the reference sample, the reverse current shows a sharp increase with increasing reverse bias, especially at low temperature. The plot of $\ln(I_r)$ vs $1/T$ shows two different slopes, giving activation energies of the reverse current of 0.61 and 0.29 eV for low and high reverse bias, respectively. The observed reverse I-V characteristics at low bias can be explained by a thermionic emission model, as reported by many authors, and the corresponding activation energy is similar to the barrier height deduced from the Richardson plot of the forward saturation current. For higher bias voltage, an activation energy around 0.29 eV is deduced from the $\ln(I_r)$ plot. This lower activation energy can be attributed to a defect-assisted tunneling mechanism which governs the current at high reverse bias. It should be noted that this value is close to the activation energy of defect $E_1$ (≈0.2 eV) observed by DLTS.

For the RIE sample, an excess current flow is observed, and two slopes are found on the Richardson plot of the reverse current, as shown in Fig. 4. At low bias, an activation energy of 0.5 eV is attributed to a generation current via plasma-generated defects (see Sec. IV). At larger bias, a tunneling current through defects located at an energy level of 0.45 eV is probably the main mechanism observed in RIE InAlAs. These results are in good agreement with the DLTS results discussed in Sec. III A 4.


3. C-V characteristics

Figure 5 displays the 1 MHz capacitance-voltage (C-V) characteristics of the devices. For the chemically etched sample, the $1/C^2$ vs $V$ plot produces a straight line whose slope can be used to calculate the donor concentration ($N_D \approx$ 1X 10^^ cm$^{-3}$), and whose intercept with the voltage axis gives the barrier height. We find a barrier height of 0.65 eV, which is in good agreement with that calculated from I-V and I-V-T measurements. For the RIE sample, the diodes have a much reduced capacitance, and $1/C^2$ vs $V$ is no longer linear, with $dC^{-2}/dV$ decreasing at low bias. Due to the nonlinearity of such a plot, no useful information on the barrier height can be obtained.
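
For a uniformly doped depletion region, the $1/C^2$ analysis rests on the standard relation $1/C^2 = 2(V_i - V)/(q\varepsilon_s N_D S^2)$, where $V_i$ is the voltage-axis intercept. The Python sketch below fits this line to synthetic data. The function name, the relative permittivity of 12.5 assumed for InAlAs, the contact area, and the doping value are all assumptions made for the example rather than values from the measurements.

```python
import numpy as np

Q = 1.602176634e-19       # elementary charge, C
EPS0 = 8.8541878128e-14   # vacuum permittivity, F/cm


def fit_inverse_c_squared(voltage, capacitance, area_cm2, eps_r=12.5):
    """Fit 1/C^2 vs V; return (N_D in cm^-3, voltage-axis intercept in V).

    eps_r = 12.5 is an assumed relative permittivity for InAlAs.
    """
    voltage = np.asarray(voltage, dtype=float)
    inv_c2 = 1.0 / np.asarray(capacitance, dtype=float) ** 2
    slope, intercept = np.polyfit(voltage, inv_c2, 1)
    n_d = -2.0 / (Q * eps_r * EPS0 * area_cm2**2 * slope)
    v_intercept = -intercept / slope
    return n_d, v_intercept


if __name__ == "__main__":
    # Synthetic diode: N_D = 1e17 cm^-3, intercept 0.60 V, hypothetical 1e-4 cm^2 area.
    area, n_d_true, v_i = 1e-4, 1e17, 0.60
    v = np.linspace(-1.0, 0.0, 11)
    c = area * np.sqrt(Q * 12.5 * EPS0 * n_d_true / (2.0 * (v_i - v)))
    print(fit_inverse_c_squared(v, c, area))
```

When the plot is curved, as reported for the RIE samples, a single slope and intercept are no longer meaningful, which is consistent with the statement that no barrier height could be extracted.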

The decrease of the measured capacitance after RIE is attributed to the additional series capacitance of the interfacial layer (polymer film and/or damaged layer). These variations of the reverse-bias capacitance of Schottky diodes after dry etching have already been reported in the case of GaAs. It was concluded that ion bombardment creates donorlike damage near the metal/semiconductor interface: the presence of this interfacial layer and of damage defects in the near-surface region of the semiconductor induces a nonuniform charge density in the depletion region. The existence of such defects perturbs the parabolic conduction band and thus affects the intercept of the $1/C^2$ vs $V$ plot.
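
The effect of an interfacial layer on the measured capacitance can be illustrated with the series combination of two capacitors. The short Python sketch below is only a schematic picture; the function names and the layer thickness, permittivity, and area used in the example are hypothetical.

```python
EPS0 = 8.8541878128e-14  # vacuum permittivity, F/cm


def parallel_plate_capacitance(eps_r, area_cm2, thickness_cm):
    """Capacitance of a uniform layer treated as a parallel-plate capacitor."""
    return eps_r * EPS0 * area_cm2 / thickness_cm


def series_capacitance(c_interfacial, c_depletion):
    """Measured capacitance of an interfacial layer in series with the depletion region."""
    return c_interfacial * c_depletion / (c_interfacial + c_depletion)


if __name__ == "__main__":
    # Hypothetical 5 nm layer with eps_r = 3 on a 1e-4 cm^2 contact,
    # in series with a 10 pF depletion capacitance.
    c_layer = parallel_plate_capacitance(3.0, 1e-4, 5e-7)
    print(c_layer, series_capacitance(c_layer, 10e-12))
```

The series value is always smaller than either capacitance, so even a thin layer lowers the measured capacitance.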

4.  DLTS  measurements 

Fig. 4. Richardson plot of the chemically etched diode and RIE (200 V) etched diode. [Vertical axis: ln[$I_r$ (μA)].]

Fig. 5. C vs V (reverse bias) characteristics of TiAu-InAlAs Schottky diodes at various self-bias voltages. [Vertical axis: capacitance (pF).]

The DLTS system used in this work is the same as that described by Weiss. Assuming that the traps are fully occupied by the filling pulse, the time dependence of the space charge capacitance $C(t)$ (the capacitance transient) is approximately exponential for low trap concentrations:

$$C(t) = C_R - C_R\,\frac{N_T}{2N_D}\,\exp\left(-\frac{t}{\tau_e}\right), \tag{4}$$

where $C_R$ is the space charge capacitance at the reverse bias voltage, $N_T$ is the trap concentration, $N_D$ is the doping level, and $\tau_e$ is the emission time constant of electrons at the actual temperature of the diode. Extrapolation of the measured transient values to $t = 0$ gives the value of $C(t=0)$:

$$\frac{C(t=0) - C_R}{C_R} = -\frac{N_T}{2N_D}. \tag{5}$$


Thus  all  parameters  in  the  above  formula  are  known  and  the 
trap  concentration  can  be  calculated.  When  a  transient  is 
recorded,  we  obtain  one  point  on  the  Arrhenius  plot  from  a 
Fourier  analysis  (at  the  measurement  temperature). 

The $N_T$ value is obtained with bias values selected such that the depletion width is significantly larger than the transition region. Then, to make a more exact trap concentration calculation, an active volume has to be considered, corresponding to the region extending from the position ($L_P$) where the Fermi level ($E_F$) crosses the trap level ($E_T$) during the filling pulse to the position ($L_R$) where the Fermi level crosses the trap level during the emission phase. Then the capacitance can be written as

$$C = C_R\left[1 - \frac{N_T}{2N_D}\,\frac{L_R^2 - L_P^2}{W_R^2}\right], \tag{6}$$

where $W_R$ is the depletion region width. This equation takes into account the width of the transition region under both pulse and reverse bias conditions.
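
The trap concentration calculation can be sketched in Python as follows. The sketch assumes that relation (5) has the form $|C(t=0) - C_R|/C_R = N_T/(2N_D)$ and that the active-volume correction of relation (6) multiplies the amplitude by $(L_R^2 - L_P^2)/W_R^2$; the function name and all numerical values (capacitances, doping, positions) are hypothetical.

```python
def trap_concentration(delta_c, c_r, n_d, l_r=None, l_p=None, w_r=None):
    """Trap concentration N_T from the transient amplitude |C(0) - C_R|.

    Without l_r, l_p, and w_r the simple relation N_T = 2*N_D*|dC|/C_R is used.
    With them, the amplitude is corrected for the active volume between the
    crossing positions L_P (filling pulse) and L_R (emission phase).
    Lengths only need to share the same unit.
    """
    n_t = 2.0 * n_d * abs(delta_c) / c_r
    if l_r is not None and l_p is not None and w_r is not None:
        n_t *= w_r**2 / (l_r**2 - l_p**2)
    return n_t


if __name__ == "__main__":
    # Hypothetical transient: 0.1 pF amplitude on a 10 pF reverse-bias capacitance,
    # doping 1e17 cm^-3, positions in micrometers.
    print(trap_concentration(0.1e-12, 10e-12, 1e17))
    print(trap_concentration(0.1e-12, 10e-12, 1e17, l_r=0.15, l_p=0.05, w_r=0.19))
```

The corrected value is larger because only part of the depletion region contributes to the transient.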

For DLTS measurements, the Schottky diodes were reverse biased at -0.8 V, and the filling pulse voltage was -0.2 V to overcome the heavily damaged region caused by exposure of the InAlAs surface to the plasma. The present bias condition allows us to detect relevant traps located from 0.1 μm (depletion width at -0.2 V) to 0.19 μm (depletion width at -0.8 V) in depth from the InAlAs surface.

Figure 6(a) shows the typical DLTS spectra for an as-grown InAlAs MBE sample. Three traps are evidenced, namely, $E_1$, $E_2$, and $E_3$, as previously reported.$^{9,10}$ Activation energies are close to 0.20, 0.42, and 0.62 eV for levels $E_1$, $E_2$, and $E_3$, respectively. Trap concentrations are 5.1X10^"^ cm$^{-3}$ for $E_1$ and 7.0X10^^ cm$^{-3}$ for $E_2$. The defect concentration for $E_3$ is estimated to be 4.9X10^^ cm$^{-3}$. Trap concentrations of our InAlAs MBE samples are of the same order of magnitude as those obtained by other groups.$^{9,10}$

Figure 6(b) shows the DLTS spectra for the same sample after RIE. The trap activation energies, capture cross sections, and concentrations evaluated from an Arrhenius plot of the DLTS maxima are listed in Table II. These data show that RIE samples can be classified into two categories, depending on the self-bias. The first one is for samples etched at 200 and 800 V, whereas the second pertains to self-biases of 450 and 600 V. Reactive ion etching at 200 or 800 V results in devices having increased densities of defect $E_2$ (activation energy 0.42-0.50 eV), while the concentration of defect $E_3$ (0.6 eV) is found to decrease. DLTS spectra of samples etched at 450 and 600 V show a single maximum corresponding to defect $E_2$ (≈0.45 eV). The trap concentration of this level is comparable to that obtained at 200 and 800 V. $E_1$ (0.20 eV) disappears for all etched samples.
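
The Arrhenius analysis of DLTS data is commonly based on the emission-rate relation $1/\tau_e = \sigma\gamma T^2 \exp(-E_a/kT)$, so that $\ln(\tau_e T^2)$ plotted against $1/kT$ is a straight line of slope $E_a$ and intercept $-\ln(\sigma\gamma)$. The Python sketch below fits this line to synthetic data. The function name and the value of the material constant $\gamma$ used in the example are assumptions; the appropriate $\gamma$ depends on the effective mass of the material and is not given in the text.

```python
import numpy as np

K_B_EV = 8.617333262e-5  # Boltzmann constant in eV/K


def arrhenius_fit(temps_k, tau_s, gamma):
    """Fit ln(tau*T^2) vs 1/(kT); return (E_a in eV, sigma in cm^2).

    gamma (cm^-2 s^-1 K^-2) is the material constant in
    1/tau = sigma * gamma * T^2 * exp(-E_a/kT); its value must be supplied.
    """
    temps_k = np.asarray(temps_k, dtype=float)
    tau_s = np.asarray(tau_s, dtype=float)
    x = 1.0 / (K_B_EV * temps_k)
    y = np.log(tau_s * temps_k**2)
    slope, intercept = np.polyfit(x, y, 1)
    return slope, float(np.exp(-intercept)) / gamma


if __name__ == "__main__":
    # Synthetic trap: E_a = 0.45 eV, sigma = 1e-14 cm^2, with a hypothetical
    # gamma = 2.5e20 cm^-2 s^-1 K^-2 chosen only for illustration.
    gamma = 2.5e20
    temps = np.linspace(200.0, 300.0, 11)
    tau = 1.0 / (1e-14 * gamma * temps**2 * np.exp(-0.45 / (K_B_EV * temps)))
    print(arrhenius_fit(temps, tau, gamma))
```

Because the capture cross section comes from an extrapolated intercept, small errors in the slope translate into large errors in $\sigma$, which is worth keeping in mind when comparing cross sections between samples.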

Fig. 6. DLTS spectra of typical (a) as-grown MBE InAlAs and (b) RIE InAlAs at various self-bias voltages. [Vertical axis: DLTS signal (a.u.).]

DLTS spectra in Fig. 6(b) also show a shift of the peaks related to traps $E_2$ and $E_3$. This indicates a change in activation energies for levels $E_2$ and $E_3$ from those of the control sample. In Fig. 6(b), it can be observed that the peak amplitude of defects $E_2$ and $E_3$ does not increase with increasing self-bias. Furthermore, in all cases, the concentration of the traps at level $E_2$ is increased after reactive ion etching. However, the height of the DLTS peak related to the

Table II. Activation energies, capture cross sections, and trap densities $N_T$ of the deep levels obtained from the Arrhenius plot data.

Type of processing   E_c - E_T (eV)   σ (cm^2)      N_T (X10'* cm^-3)
Chemical etched      0.20             1.5X10“'“     0.515
                     0.42             6X10“'^       0.701
                     0.62             2X10“"'       4.86
RIE 200 V            0.45             5.34X10“'“    2.65
                     0.69             6.43X10“'^    1.95
RIE 400 V            0.49             1.27X10“"*    2.0
RIE 600 V            0.45             2.93X10“'“    2.0
RIE 800 V            0.50             2.26X10“'^    2.06
                     0.62             4.88X10“'*    1.28

Table III. Effective $\phi_b$ and real $\phi_{b0}$ Schottky barrier height, ideality factor $n$, and reverse current $I_r$ at -1 V for the annealed samples.

Type of processing       φ_b (eV)   φ_b0 (eV)   n      I_r (A)
                         (I-V)      (I-V-T)
RIE 200 V + RTA 10 s     0.54       0.46        1.10   2X10"^
RIE 200 V + RTA 20 s     0.57       0.48        1.10   1X10“^
RIE 200 V + RTA 30 s     0.54       0.40        1.07   5X10‘®
RIE 450 V + RTA 20 s     0.46       0.39        1.15   1X10'“
RIE 600 V + RTA 20 s     0.49       0.38        1.40   2X10'“
RIE 800 V + RTA 20 s     0.43       0.30        2.00   1X10'^

deep trap $E_2$ shows a saturation between 2 and 3X10^^ cm$^{-3}$. This suggests that defects $E_2$ are not simple point defects but, rather, complex defects, which is also supported by the widening of the $E_2$ peak.

B.  Annealing  effects  after  RIE 
1. I-V and I-V-T characteristics

We have investigated the effects of a thermal treatment in removing the damage created by RIE. Rapid thermal annealing was performed in an Ar/H2 atmosphere at 400 °C for 10, 20, and 30 s.

In Table III, we report the values of the real and effective barrier height, ideality factor, and reverse current at a -1 V bias for samples etched at 200 V and annealed for different times. An increase of the effective barrier height and a reduction of the leakage current at low reverse bias were observed after annealing. The temperature dependence of the saturation current density shows that the real barrier height did not change after rapid thermal annealing (RTA). These variations are typical of a metal-insulator-semiconductor Schottky junction. Analysis of the reverse current of the annealed diode at different temperatures shows an increase of the tunneling current at high reverse bias. This is probably due to the formation of a native oxidelike insulating layer at the Schottky junction. At low reverse bias, a reduction of the leakage current is observed. These variations can be explained in terms of effective Schottky barrier enhancement and generation current reduction, which is probably associated with the effectiveness of annealing in reducing the plasma-induced defects. However, full recovery was not achieved, since there is only a limited decrease of the surface defects.

Rapid thermal annealing was also performed on diodes etched at 450, 600, and 800 V. The corresponding parameters are also reported in Table III. The results of the annealing on these diodes were different from those obtained with 200 V (low ion energy). Samples processed at 450 and 600 V (see Table III) did not show any significant change in effective and real barrier height. Moreover, the I-V characteristics of samples etched at 800 V degraded after annealing. An increase of the reverse current was observed after annealing of these diodes processed at higher ion energy (≥450 V).

However, the values of the effective barrier height for all the etched and annealed samples were higher by about 0.1 eV than the real barrier height determined by the Richardson plot. The enhancement of the effective Schottky barrier after annealing for the diodes etched at 200 V (low ion energy) can be attributed to the presence of a native interfacial oxidelike insulating layer. On the other hand, annealing of the samples etched at higher ion energy (≥450 V) introduces additional interface states, which increase the reverse current by generation via plasma-induced defects and also by tunneling through interface defects.

Fig. 7. DLTS spectra of InAlAs samples etched at 200 V and annealed for 10, 20, and 30 s at 400 °C.

2.  DLTS  measurements 

Figure 7 shows a slight effect of thermal annealing on the reduction of the concentration of trap $E_2$ for the samples etched at 200 V. The trap concentration of level $E_2$ is estimated to be (0.9-1)X10^^ cm$^{-3}$ after annealing. Traps $E_1$ (0.2 eV) and $E_3$ (0.6 eV) reappear after RTA, with lower concentrations than for the reference sample. In Fig. 7, we can also observe the effectiveness of annealing in suppressing the shift in peak temperature for the deep trap $E_2$.

Figure 8 gives the measured DLTS signal for a sample etched at 200 V and annealed for 20 s. The activation energy and capture cross sections obtained from the Arrhenius plot data for each trap ($E_1$, $E_2$, and $E_3$) are used as parameters for a simulation of the theoretical DLTS signal (exponential capacitance transient). A broadening of the experimental peaks at 220 and 300 K is observed in Fig. 8, as compared to the simulated signals. Fourier analysis of the transient indicates that it is not a true exponential, which points to an overlap of peaks corresponding to different activation energies. This effect was also observed by Omling et al. in GaAsP. They attributed this behavior to an alloy broadening effect.

Fig. 8. Measured and simulated DLTS spectra of a sample etched at 200 V and annealed for 20 s.

From these results, it appears that annealing samples etched at 200 V results in a small decrease of the trap concentration of level $E_2$, but the value remains larger than for the reference sample. On the other hand, after RTA, $E_1$ reappears at a low level (small DLTS peak height) and the $E_3$ concentration remains constant, around 1X10^^ cm$^{-3}$. However, the main effect on the DLTS spectra for samples etched at 200 V and annealed at 400 °C is a reduction of the broadening of the DLTS peaks at 220 and 300 K, suggesting a narrowing of the energy distribution of the deep level.

Figure 9 compares the effect of thermal annealing on the DLTS signal versus temperature for the samples etched at various self-biases. The parameters of the observed traps are listed in Table IV. All traps reappear after thermal annealing. However, as for the sample etched at 200 V, no well-defined DLTS peaks related to deep traps $E_2$ and $E_3$ are observed after thermal annealing. The concentration of the deep level $E_2$ after RTA is also higher (between 8X10^"^ and 1X10^^ cm$^{-3}$).


Fig. 9. DLTS spectra of InAlAs samples etched at various self-biases and
annealed for 20 s.

C.  Damage  removal  by  wet  chemical  etching 
1. I-V characteristics

The  damaged  layer  thickness  was  investigated  only  for 
the  samples  etched  at  200  V  (low  self-bias)  by  removing  the 
damaged  layer  using  wet  chemical  etching.  Figure  10  shows 


Table IV. Activation energies Ec-Et, capture cross sections σ, and trap
densities Nt of the deep levels after thermal annealing.

Type of processing     Ec-Et (eV)   σ (cm^2)       Nt (X10'^ cm^-3)
RIE 200 V+RTA 10 s     0.26         3.57X10‘‘^     0.225
                       0.42         2.36X10“*^     1.07
                       0.20         3.68x10‘'2     0.148
RIE 200 V+RTA 20 s     0.49         1X10’’*        0.927
                       0.64         5.87X10“‘5     0.907
                       0.22         3.78X10^'^     0.300
RIE 200 V+RTA 30 s     0.47         1.68X10"‘2     1.52
                       0.61         2.38X10"*^     1.48
                       0.24         1.51X10"'"*    0.356
RIE 450 V+RTA 20 s     0.45         9.51X10"’''    1.08
                       0.67         3.61X10"”      1.38
                       0.25         5.52X10"'^     0.227
RIE 600 V+RTA 20 s     0.50         1.97X10"’^     0.912
                       0.61         1.37X10"“'     1.39
                       0.24         1.95X10"'^     0.62
RIE 800 V+RTA 20 s     0.42         1.11X10"'^     0.812
                       0.60         1.97X10"'^     2.17
Fig. 10. Log I vs V characteristics of samples after RIE at 200 V and wet-chemical etching.


the changes in I-V characteristics after CH4/H2 dry etching
at 200 V and after the damaged surface is removed by step
etching. The corresponding Schottky parameters are summarized
in Table V, where it can be seen that a wet etch about
20 Å thick induces an increase of the barrier height from
0.50 to 0.54 eV, a decrease of the ideality factor (to 1.07), and a
small  decrease  of  leakage  current  (2X10“®  A  at  —1  V).  On 
the  other  hand,  a  wet  etch  of  60  A  improves  the  electrical 
characteristics  of  the  diodes  and  essentially  the  leakage  cur¬ 
rent,  which  is  about  6X10“’  A  at  -1  V.  A  wet  etch  of  the 
InAlAs surface after RIE at 200 V also induces a decrease of
the series resistance of the diodes down to 2.5 Ω. After a shallow
wet etch of the InAlAs layer, the current transport through the
Schottky barrier can again be described by a thermionic
emission model (n<1.1). This result supports the
presence of an interfacial layer near the semiconductor surface
after dry etching, which is eliminated by wet chemical
etching.
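
As an illustration of how a barrier height and an ideality factor follow from a forward I-V curve under the thermionic emission model, the sketch below fits a straight line to ln(I) versus V. The diode area and the effective Richardson constant are inputs the text does not give, so any values passed in are assumptions, and the series resistance is neglected.

```python
import math

K_B_EV = 8.617333e-5  # Boltzmann constant in eV/K; kT/q in volts is K_B_EV * T


def saturation_current(phi_b_ev, temp_k, area_cm2, richardson):
    """I_s = area * A* * T^2 * exp(-phi_b / kT) for thermionic emission."""
    return area_cm2 * richardson * temp_k ** 2 * math.exp(-phi_b_ev / (K_B_EV * temp_k))


def diode_current(volts, phi_b_ev, ideality, temp_k, area_cm2, richardson):
    """Forward current I = I_s * (exp(qV / nkT) - 1), series resistance neglected."""
    i_s = saturation_current(phi_b_ev, temp_k, area_cm2, richardson)
    return i_s * (math.exp(volts / (ideality * K_B_EV * temp_k)) - 1.0)


def extract_schottky(voltages, currents, temp_k, area_cm2, richardson):
    """Least-squares line through ln(I) versus V; returns (phi_b in eV, n).

    Valid only where V is several kT/q, so that the -1 term is negligible,
    and low enough that the series resistance drop can be ignored.
    """
    logs = [math.log(i) for i in currents]
    count = len(voltages)
    v_mean = sum(voltages) / count
    l_mean = sum(logs) / count
    slope = (sum((v - v_mean) * (l - l_mean) for v, l in zip(voltages, logs))
             / sum((v - v_mean) ** 2 for v in voltages))
    intercept = l_mean - slope * v_mean          # ln(I_s)
    thermal_voltage = K_B_EV * temp_k
    ideality = 1.0 / (slope * thermal_voltage)   # slope = q / (n k T)
    phi_b = thermal_voltage * (math.log(area_cm2 * richardson * temp_k ** 2) - intercept)
    return phi_b, ideality
```

The barrier obtained this way is the effective (room temperature) barrier; it depends logarithmically on the assumed area and Richardson constant, whereas the ideality factor depends only on the slope.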


Table V. Effective Schottky barrier height Φb, ideality factor n, and reverse
current I at -1 V for the samples wet-chemically etched after the RIE treatment
at 200 V.

Type of processing           Φb (eV) (I-V)   n      Rs (Ω)   I (A)
RIE 200 V                    0.50            1.13   3.12     6X10-^
RIE 200 V+wet etch 20 Å      0.54            1.07   2.5      2X10"*
RIE 200 V+wet etch 60 Å      0.57            1.05   2.5      6X10-’


2.  DLTS  measurements 

Figure 11 shows a set of DLTS spectra where the samples
were chemically etched after RIE at 200 V. A reverse bias of
-0.8 V and a filling pulse voltage up to 0 V were used in
order to analyze the InAlAs layer. These data show that a
chemical etching of 20 Å after RIE induces a decrease of the
height of the DLTS peak related to the deep trap E2, whereas
the concentration of the trap E3 becomes higher than that of
E2. This DLTS result confirms the association of the increased
density of E2 traps after RIE with plasma-induced damage.
Following a wet etching of 60 Å, we found a DLTS spectrum
similar to that obtained on the reference sample. The corresponding
I-V characteristics indicate the disappearance of
residual damage. We can conclude that the thickness of the
heavily damaged region for the samples etched by RIE at 200 V
is about 60 Å.

Even though DLTS is unable to detect defect levels close
to the surface, the variations of the dominant deep level E2
after step-by-step wet etching show its dominance in the
near-surface region. This is consistent with observations
made on GaAs after ion implantation, for which a
strong increase of defect density toward the surface is also
observed.’’  Thus,  surface  Fermi  level  pinning  by  the  deep 
defect  states  should  be  taken  into  consideration. 

IV.  DISCUSSION 

I-V  measurements  performed  on  InAlAs  Schottky  diodes 
after  RIE  show  a  decrease  in  barrier  height  and  an  increase  in 
Fig. 11. DLTS spectra of samples after RIE at 200 V and wet-chemical
etching.


ideality  factor,  when  compared  with  a  reference  wet  etched 
sample.  This  can  be  attributed  to  the  creation  of  a  damaged 
layer  near  the  surface,  that  leads  to  a  higher  density  of  inter¬ 
face  states  (metal/semiconductor  interface).  As  indicated  by 
temperature  measurements  performed  on  Schottky  diodes, 
the  real  barrier  height  of  the  samples  etched  at  200  V  (low 
ion  energy)  compares  well  with  those  obtained  from  the 
room  temperature  I-V  characteristics  (0.5  eV).  For  samples 
etched  at  a  self-bias  voltage  of  450  V  or  greater,  the  effective 
barrier  height  is  higher  than  the  real  barrier  height.  This  ef¬ 
fect  can  be  attributed  to  the  existence  of  an  interfacial  layer 
at  the  Schottky  junction.  It  should  be  noted  here  that  for  the 
fabrication  of  the  Schottky  diodes,  RIE  was  performed  using 
identical etch times and gas residence times. Since hydrocarbon
polymer deposition and surface damage formation increase
with increasing rf power, it can be inferred that the


thickness  of  the  heavily  damaged  region  present  at  the 
InAlAs surface increases with rf power. Structural investigations
of the induced defects are not presented in this work,
but we found, in the case of InP after RIE,^'* a surface damage
which consists of a contamination (C, H, O) layer on top,
then an In2O3 oxide layer and a defect layer. The interfacial
layer present at the Schottky junction is probably due to this
heavily damaged region. The higher ideality factors obtained
on samples etched at high self-bias voltage ≥450 V (1.3, 1.4,
and 1.6) suggest the presence of a thin insulating
layer at the interface.

Taylor  and  Morgan^^  have  examined  the  electrical  prop¬ 
erties  of  irradiated  Ni/n-GaAs  Schottky  barriers  and  have 
shown  that  an  excess  current  flows,  the  extent  of  which  is 
dependent  on  the  concentration  of  radiation  induced  damage 
centers  generated  at  the  metal-semiconductor  interface.  This 
proposal  is  consistent  with  the  observed  behavior  of  the 
current/voltage  characteristics  in  our  Schottky  diodes.  The 
proposed model is based on the existence of defect states
near the interface whose density is enhanced by irradiation;
due to the interface state field, the conduction and valence
bands are bent downward at the interface. In reverse
bias,  the  current  transport  mechanism  is  essentially  a  trap 
assisted  tunneling  process.  When,  however,  trapping  levels 
are  present  within  the  forbidden  energy  gap,  then  tunneling 
electrons  may  be  captured  by  these  levels  (trap  assisted  tun¬ 
neling  process)  and,  if  the  temperature  is  high  enough,  they 
will  be  thermally  emitted  to  the  conduction  band.  At  low 
reverse  bias,  a  generation  mechanism  can  develop  through 
band  center  levels,  thus  resulting  in  a  generation  current;  this 
phenomenon  will  be  increased  by  radiation  damage,  produc¬ 
ing  a  greater  number  of  defect  levels. 

Let  us  now  consider  the  effects  of  a  thermal  treatment. 
Samples  etched  at  200  V  show  an  increase  in  the  effective 
barrier  height  and  reverse  I-V-T  current  after  thermal  an¬ 
nealing.  This  effect  can  be  explained  simply  by  considering 
the incorporation of an oxidelike insulator (possibly containing
Al2O3) at the Schottky junction. It has been noted by
Wada et al.^^ that oxidation of InP increases the effective
barrier height. They proposed an expression for the barrier
height Φb which takes into account the existence of an interfacial
oxide layer. This analytical model is based on the oxide
charge effect, which depends on the physicochemical properties
of the oxide and/or the oxide-semiconductor interface
and on the surface state density related to the existence of
defects near the interface. Many authors^^’'^' have discussed
the nature of the oxide present at the InAlAs surface after
reactive ion etching, and they found significant concentrations
of metal oxides such as InOx and AlOx using Auger
electron spectroscopy and x-ray photoelectron spectroscopy.
They found that group V elements also form oxides like
As2O3 and As2O5. In this case, a MIS Schottky junction is
formed  and  tunneling  transport  through  the  thin  insulator 
(oxide)  occurs,  leading  to  an  increase  of  the  current  at  high 
reverse  bias. 

For  the  samples  etched  at  450  and  600  V,  the  effective 
barrier  height  did  not  change  after  annealing.  This  effect  can 
be attributed to a highly damaged surface layer with a large
density  of  defects  induced  by  higher  plasma  self-bias.  In  this 
case,  the  native  interfacial  oxidelike  insulator  incorporates  a 
high  density  of  traps  which  favor  trap  assisted  tunneling, 
resulting  in  an  increased  leakage  current.  Diodes  processed 
at  800  V  degrade  after  annealing. 

Temperature  measurements  on  Schottky  diodes  fabricated 
on  all  these  samples  show  an  effective  barrier  typically 
higher  by  0.1  eV  than  the  real  barrier  after  thermal  treatment, 
confirming  the  possible  presence  of  an  oxidelike  interfacial 
layer  after  RTA.  Furthermore,  the  etched  samples  show  an 
increase  of  the  reverse  current  after  thermal  annealing,  prob¬ 
ably  by  tunneling  through  the  oxide  at  the  interface  and  also 
by  tunneling  through  the  traps  at  0.45  eV,  as  supported  by 
DLTS  measurements. 

In order to evaluate the thickness of the damaged region
of the samples for which we obtain the smallest electrical
degradation (RIE at 200 V), a set of I-V measurements was
recorded after step-by-step chemical etching. After etching
a 60 Å layer, we found an improvement in the electrical characteristics
of the Schottky diodes in terms of barrier height,
ideality factor, and leakage current. These data confirm our
assumption  about  the  presence  of  a  heavily  damaged  interfa¬ 
cial  layer  at  the  Schottky  junction  after  RIE. 

The  DLTS  data  after  RIE  showed  an  increase  of  the  con¬ 
centration  of  the  level  E2  (0.42  eV)  which  was  accompanied 
by  a  broadening  of  the  peak  related  to  these  defects.  The 
presence  of  this  wide  peak  shows  that  a  band  of  defects  has 
been  created  in  the  upper  half  of  the  band  gap.  This  obser¬ 
vation  has  been  already  reported  in  the  case  of  GaAs  after 
ion  implantation,  and  it  has  been  associated  with  the  so- 
called  U  band.^^ 

The reverse current measured on the Schottky diodes processed
on the etched samples can be explained by a tunneling
current mechanism through these defect levels, as deduced from
the Richardson plot, which suggests that the trap E2 is associated
with plasma induced damage. In the same way, the
concentration  of  trap  E2  was  found  to  increase  in  InAlAs 
after  CHF3/SF6  RIE  of  a  SiN-UVCVD  film  deposited  on 
InGaAs/InAlAs/InP  heterostructures. 

The extent of the damaged region after RIE has been reported
by many authors for GaAs and InP.^^ They concluded
that the RIE induced damage is distributed over a region
deeper  than  the  projected  ion  range.  This  may  be  related  to 
phenomena  such  as  channeling,  vacancy  migration,  and  en¬ 
hanced  diffusion  induced  by  ion  bombardment.^^  On  the 
other  hand,  the  main  sources  of  defects  induced  by  plasma 
processing  at  low  ion  energy  in  InP  based  materials  are  lo¬ 
calized  in  the  surface  region,  probably  as  a  result  of  prefer¬ 
ential  loss  of  group  V  elements  leading  to  nonstoichiometric 
surface.  Therefore,  the  Fermi  level  pinning  at  the  surface  by 
such defects should be taken into consideration.^^ A unified
defect model has been proposed by Spicer et al.^^ which suggests
a possible correlation between the Fermi level pinning
and the dominant defect centers in GaAs, especially the As_Ga
and Ga_As antisites. Then, the change in Schottky barrier height is
governed  by  the  movement  of  the  Fermi  level,  which  is  re¬ 


lated  to  defect  states.  Our  results  support  such  assumptions, 
since  the  real  barrier  height  obtained  on  Schottky  diodes  af¬ 
ter  RIE  corresponds  to  the  energetic  position  of  the  dominat¬ 
ing  deep  levels. 

The  increase  of  the  concentration  in  the  E2  trap  after  RIE 
and  its  association  with  plasma  induced  damages  are  con¬ 
firmed  using  wet  chemical  etching  after  RIE:  a  progressive 
decrease  of  E2  concentration  is  observed  when  etching  the 
surface  down  to  the  value  found  in  the  reference  sample. 

It  should  be  noted  here  that  the  concentration  of  the  level 
E2  after  RIE  does  not  increase  with  increasing  self-bias,  and 
at  the  same  time  a  broadening  of  the  corresponding  DLTS 
spectra  is  observed  after  RIE.  This  suggests  that  the  level  E2 
certainly  corresponds  to  complex  defects,  since  it  is  quite 
different  from  traps  created  by  other  types  of  technological 
processes  such  as  electron  irradiation^^  which  create  simple 
defects. It has been proposed by Luo et al.^^ that the trap E2
is related to an arsenic-related complex defect such as a
di-interstitial. Then, after RIE, an increase of such complex
defects (E2) may result from the association of the impurities
already present in the starting material and the created
defects. Such phenomena have been reported in the case of
boron  implantation  in  GaAs,^^  where  complex  defects  result 
from  the  association  of  donors  and/or  impurities  present  in 
GaAs  together  with  the  created  defects. 

The levels E3 (0.6 eV) and E1 (0.2 eV) behave differently.
A decrease of the concentration of the level E3 (0.6
eV) is observed after RIE when compared to the wet etched
sample (where it is the dominant defect), while the level E1
(0.2 eV) disappears after RIE.

Annealed samples show a small decrease of the concentration
of the level E2, while levels E1 and E3 reappear with
lower concentration. However, a reduction of the broadening
of the E2 peak is the main effect of thermal annealing,
suggesting a reduction of its energy distribution. The
small decrease of the concentration of the level E2 after RTA
supports the suggestion that it is related to a complex defect,
since simple point defects in general can be easily annealed
out.

However, variations of the levels E1 and E3 should be
compared with observations made by Pang et al. on GaAs
after ion implantation.^^ They found a dramatic reduction of
the EL2 level after Ar ion implantation, and they attributed
EL2 to an antisite As_Ga defect or a complex involving the
antisite As_Ga defect. Then, the lower EL2 concentration after
Ar implantation implies a reduction in the density of As_Ga.
After RTA, the EL2 peak increases with temperature between
400 and 600 °C, but with lower concentrations when compared
to their reference sample. They attributed this effect to
As loss during RTA. In InAlAs, the mechanisms of
defect formation are still unknown, but we found variations
after RIE similar to those for GaAs. Further studies are therefore
needed to fully understand the variations of levels E1 and E3
after RIE and the effects of a thermal treatment on the deep
electron traps.
V. CONCLUSION

CH4/H2 RIE induced damage in InGaAs/InAlAs/InP heterostructures
has been characterized by means of I-V,
C-V, and DLTS measurements. We have observed a decrease
in Schottky barrier height and an increase in ideality
factor  for  the  etched  samples.  The  corresponding  DLTS  spec¬ 
tra  have  shown  an  increase  of  the  defect  at  0.42-0.50  eV,  a 
decrease  of  the  defect  density  at  0.6  eV  while  the  deep  traps 
at  0.2  eV  disappear. 

I-V measurements on samples etched at 200 V show a
decrease of the real barrier height from 0.6 eV (reference
sample) to 0.48 eV and a tunneling current through traps at
0.45 eV, which is found by DLTS measurements to be the
dominant defect level. Annealing of these samples at 400 °C
induces a small reduction of the trap density at Ec-Et≈0.4
eV. However, the samples cannot fully recover their initial
characteristics since there is only a small reduction of the
defect density and possibly a modification of their depth distribution.
Wet etching of these samples after RIE improves
the electrical characteristics of the Schottky diodes, and we
found that the thickness of the heavily damaged region extends
to ~60 Å from the surface. The damage induced by
plasma  processing  at  450,  600,  and  800  V  are  not  removed 
by  a  thermal  treatment. 

Finally,  RIE  with  CH4/H2  at  low  self-bias  can  allow  a 
good  control  of  the  etching  depth  but  at  the  expense  of  a 
significant induced surface damage. Characterization of this
RIE induced damage suggests that the use of CH4/H2 reactive
ion etching in transistor fabrication (gate recessing)
should be done at even lower self-bias (<200 V).
A simple Cl2/Ar plasma chemistry can provide smooth, high-rate etching of InP, InAs, InGaAs,
AlInAs, and InGaAsP at room temperature under conditions in which there is a balance between
formation and sputter desorption of the normally involatile InCl3. When the neutral/ion ratio is
either too high or too low, surface roughening is apparent due either to the presence of InCl3, or to
preferential loss of the group V element. The etching has been investigated as a function of
microwave power (600-1000 W), rf power (0-300 W), process pressure (1.5-10 mTorr), and
Cl2:Ar ratio under electron cyclotron resonance conditions. Use of N2 or H2, rather than Ar, as gas
additives to the chlorine, did not produce smooth, stoichiometric etched surfaces. © 1996
American Vacuum Society.


I.  INTRODUCTION 

In-based compound semiconductors form the basis for a
variety of technologically important devices, including
InGaAsP/InP diode lasers for optical communication systems
operating at 1.3 and 1.5 μm,^ InGaAs/AlInAs heterojunction
bipolar transistors for high bit-rate digital applications,^ and
InAs infrared detectors.^ One of the future directions will be
continued miniaturization and higher device densities, which
requires development of uniform, high-yield etching techniques
for pattern transfer. Fabricating the devices mentioned
above requires a wide range of etch depths,
from relatively shallow mesas (<1000 Å) (Ref. 4) to through-wafer
via holes up to 100 μm deep.^“^ Since CH4/H2-based
plasma  chemistries  provide  maximum  etch  rates  of  ^1000 
A/min  for  InP  and  related  materials,  they  are  suitable  only 
for  a  limited  number  of  these  applications.^  Traditionally, 
Cl2-based  plasma  chemistries  have  required  concurrent 
sample  heating  in  order  to  aid  desorption  of  the  InCl3  etch 
product.^” This heating adds processing complexity
and requires the addition of a sidewall polymer forming gas
such as CH4 to prevent undercutting.

Recently  it  has  been  reported  that  simple  Cl2/Ar  plasmas 
can  produce  very  high  etch  rates  for  In-based  semiconductors 
at  room  temperature  when  operated  under  electron  cyclotron 
resonance  (ECR)  conditions.  A  plausible  explanation  is 
that  the  high  ion  flux  produces  efficient  sputter  desorption  of 
the  InCl3  before  it  forms  a  thick  selvage  layer.  It  is  necessary 
to  more  completely  investigate  the  parameter  space  for  this 
plasma  chemistry  to  establish  the  usable  process  window.  In 
this  article  we  report  on  ECR  etching  of  InP,  InAs,  InGaAs, 
AlInAs,  and  InGaAsP  in  Cl2/Ar  as  a  function  of  plasma 
composition,  microwave  and  rf  power,  process  pressure,  and 
gas additive (Ar, N2, or H2). Acceptably smooth etched surface
morphologies are obtained for specific sets of these parameters.
We also confirm that the chlorine etch residue concentration
is much lower on ECR Cl2/Ar etched InP
compared to that measured for samples exposed to reactive
ion etch (RIE) discharges.

II.  EXPERIMENT 

Epitaxial layers of In0.53Ga0.47As, Al0.48In0.52As, or InGaAsP
(λ=1.3 μm) lattice matched to InP were grown on
semi-insulating (Fe-doped) substrates by either metal organic
chemical vapor deposition^^ or metal organic molecular
beam epitaxy. Wafers of InP or InAs from undoped Czochralski
boules, together with the epitaxial materials, were
lithographically patterned with AZ5209E photoresist to produce
features with dimensions between 0.5 and 100 μm. Etch


Fig. 1. Etch rates of In-based materials as a function of Cl2 composition in
Cl2/Ar ECR discharges (1.5 mTorr, 1000 W microwave, 150 W rf, and total
gas flow rate 15 sccm).

Fig. 2. SEM micrographs of features etched into InP using 1.5 mTorr, 1000 W microwave, 150 W rf discharges of pure Ar (top left), 2Cl2/13Ar (top right),
10Cl2/5Ar (bottom left), or pure Cl2 (bottom right).


rates  were  measured  by  stylus  profilometry  of  the  largest 
features  after  removal  of  the  resist  in  acetone.  Scanning  elec¬ 
tron  microscopy  (SEM)  and  atomic  force  microscopy  (AFM) 
were  used  to  examine  the  surface  morphology  of  the  etched 
samples, while Auger electron spectroscopy (AES) was employed
to monitor the near-surface stoichiometry at three
spots across the 5×5 mm^2 samples.

The  etching  was  performed  in  a  Plasma-Therm  SLR  770 
system.  The  samples  enter  through  a  load  lock  and  are  me¬ 
chanically  clamped  to  a  He  backside-cooled  chuck  held  at 
25  °C  that  is  biased  with  rf  (13.56  MHz)  power  (0-300  W). 
We  believe  from  observation  of  resist  behavior  that  the 
sample  temperature  did  not  rise  above  70-80  °C  during  the 
short  (1  min)  exposures  employed.  Thus,  we  do  not  believe 
sample  heating  plays  a  significant  role  in  our  data.  The 
plasma  is  generated  with  application  of  600-1000  W  of  2.45 
GHz  microwave  power  to  an  Astex  low  profile  ECR  source. 
Electronic grade Cl2, Ar, N2, or H2 was injected into this
source through mass flow controllers. The process pressure
was varied from 1.5 to 10 mTorr by throttling the turbomolecular
pump that evacuates the reactor chamber. We held the


rf  power  constant,  rather  than  dc-induced  voltage.  The  latter 
varies  from  approximately  -60  V  at  50  W  rf  to  -220  V  at 
200  W  rf,  as  measured  at  the  sample  electrode.  The  biases 
are higher under RIE conditions (-80 V at 50 W, -290 V at
200  W).  The  magnetic  field  was  not  used  when  only  the 
bottom  rf  power  was  applied. 

III.  RESULTS  AND  DISCUSSION 

Figure 1 shows the etch rates of the five different materials
as a function of plasma composition in Cl2/Ar discharges.
The total gas flow rate was held constant at 15 standard cubic
centimeters per minute (sccm). The general trend of higher
etch  rates  with  increased  chlorine  content  is  expected  be¬ 
cause  of  the  greater  active  species  concentration  available  for 
reaction  with  the  semiconductors.  Note  at  this  low  pressure 
(1.5  mTorr)  and  high  microwave  power  (1000  W)  condition, 
the  sputter  rates  with  pure  Ar  are  ^1200  A/min  for  all  the 
materials.  This  is  a  result  of  the  high-ion  flux  under  these 
conditions,  but  represents  a  severe  limitation  if  one  is  trying 
to achieve selectivity for one material over another. At Cl2
percentages ≥67% the etching was significantly isotropic,
while below this value the sidewalls were basically vertical
(sidewall angles ≥80°).

Fig. 3. AFM scans from InP samples etched in 1.5 mTorr, 1000 W microwave,
150 W rf discharges of pure Ar (a), 5Cl2/10Ar (b), or pure Cl2 (c).
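
The gas mixtures in this work are quoted as flow pairs such as 2Cl2/13Ar or 10Cl2/5Ar at a fixed total of 15 sccm, while the anisotropy threshold is quoted as a chlorine percentage. The short sketch below converts between the two, reading the threshold as 67% and above (10 sccm of Cl2 in 15 sccm total, rounded). The function names are hypothetical.

```python
def cl2_percentage(cl2_sccm, additive_sccm):
    """Chlorine share of the total gas flow, in percent."""
    total = cl2_sccm + additive_sccm
    if total <= 0:
        raise ValueError("total flow must be positive")
    return 100.0 * cl2_sccm / total


def sidewall_regime(cl2_sccm, additive_sccm, threshold_percent=67):
    """Classify the profile using the rounded chlorine percentage.

    The threshold is taken from the text; rounding mirrors the fact that
    10/15 is quoted there as 67%.
    """
    rounded = round(cl2_percentage(cl2_sccm, additive_sccm))
    return "isotropic" if rounded >= threshold_percent else "vertical"
```

For example, `sidewall_regime(10, 5)` falls on the isotropic side of the threshold, while `sidewall_regime(5, 10)` falls on the vertical side.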

The  surface  morphology  of  the  In-based  materials  was  a 
strong  function  of  the  Cl2:Ar  ratio  in  the  discharge.  As  an 
example,  Fig.  2  shows  SEM  micrographs  of  features  etched 
into  InP  with  1.5  mTorr,  1000  W  microwave  and  150  W  rf 
discharges  of  pure  Ar  (top  left),  2Cl2/13Ar  (top  right), 
10Cl2/5Ar (bottom left), or pure Cl2 (bottom right). The morphology
with pure Ar is quite rough due to preferential sputtering
of P, as reported previously. Similarly, with pure Cl2
the large differential in volatility between the InCl3 and PCl3
etch products also leads to a rough morphology. However, at
a 10:5 ratio of Cl2:Ar the surface is very smooth, which
confirms that it is possible to balance the removal rates of the
etch products without heating the sample by optimizing the
ion/neutral flux ratio. To quantify the surface roughness,
AFM was performed in the tapping mode. Figure 3 shows
AFM scans from InP etched in pure Ar (a), 5Cl2/10Ar (b), or
pure Cl2 (c). The root-mean-square roughness measured over
10×10 μm areas was 7.3 nm for the Ar-etched materials, and
14.4 nm for the pure Cl2 sample, while the Cl2/Ar (5/10)
sample had an rms value of only 0.64 nm. The latter is comparable
to an untreated substrate. We were able to achieve
this surface quality for Cl2 flow rates between 5 and 10 sccm
in the total Cl2+Ar flow rate of 15 sccm. The near surface of
these  samples  retained  their  stoichiometry,  even  for  etch 
depths  of  several  microns.  Figure  4  shows  an  AES  surface 
scan  (top)  and  depth  profile  (bottom)  from  an  InP  sample 
etched for 2 min in a 10Cl2/5Ar, 1.5 mTorr, 1000 W microwave,
150 W rf discharge. There are several noteworthy aspects
to these data. First, there are no chlorine residues
remaining on the surface, suggesting that sputter-enhanced
removal  of  the  InCl3  is  very  efficient  under  these  conditions. 
In  addition,  only  adventitious  carbon  and  oxygen  from  the 
native  oxide  are  present  as  contaminants  on  the  surface. 
These  result  from  the  fact  that  the  samples  are  exposed  to 
atmosphere  upon  removing  them  from  the  reactor  to  transfer 
to  the  AES  system. 
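
The rms roughness values quoted above for the AFM scans follow a simple definition, sketched here for a height map given as rows of heights in nanometers. This is a minimal illustration that subtracts only the mean height; any plane leveling or flattening applied by the AFM software used in the study is not described in the text and is not reproduced here.

```python
import math


def rms_roughness(height_map_nm):
    """Root-mean-square deviation of the heights from their mean, in nm.

    height_map_nm is a list of rows, each a list of heights in nanometers.
    """
    values = [z for row in height_map_nm for z in row]
    if not values:
        raise ValueError("height map is empty")
    mean = sum(values) / len(values)
    return math.sqrt(sum((z - mean) ** 2 for z in values) / len(values))
```

A perfectly flat map gives zero, and a checkerboard alternating between 0 and 2 nm gives 1 nm, which makes the function easy to sanity-check before applying it to real scan data.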

Figure 5 shows the dependence of material etch rate on
microwave power in 2Cl2/13Ar, 1.5 mTorr, 150 W rf discharges.
The rates increase at higher powers due to a higher
atomic  density  (as  evidenced  from  monitoring  the  optical 
emission  line  intensity)  and  a  higher  ion  flux.  The  transition 
from  rough-to-smooth  morphologies  shifts  to  lower  Cl2:Ar 
ratios  at  lower  microwave  powers  because  of  the  need  to 
maintain  the  balance  between  InCl3  formation  and  removal. 

The  vast  difference  in  etch  rates  for  the  five  materials 
between  the  ECR  mode  and  conventional  reactive  ion  etch¬ 
ing  (RIE)  is  shown  in  Fig.  6,  where  the  rf  power  dependence 
of  the  rates  has  been  determined  both  with  and  without  1000 
W  microwave  power.  Again  this  is  a  result  of  the  much 
higher  active  chlorine  densities  and  the  higher  ion  density 
(—5X10^^  cm“^  for  ECR  versus  3X10^  cm  for  RIE,  as  ob¬ 
tained  from  microwave  interferometry  measurements.). In¬ 
deed,  under  RIE  conditions  it  is  necessary  to  heat  the  InP  to 
achieve  rates  >500  A/min.  Temperatures  of  200-250  °C  are 
generally  employed  under  these  conditions. 

The  difference  between  the  two  etch  modes  is  not  simply 
the  rates.  Figure  7  shows  AES  surface  scans  from  two  InP 
samples  etched  in  2Cl2/13Ar,  1.5  mTorr,  300  W  rf  discharges 
with  either  1000  W  microwave  power  (top)  or  0  W,  i.e.,  RIE 
mode  (bottom).  The  In/P  ratio  is  much  higher  in  the  latter 
sample  and  it  has  a  significant  amount  of  residual  chlorine. 
Fig. 7. AES surface scans from InP samples etched in 2Cl2/13Ar, 1.5 mTorr, and 300 W rf discharges with either 1000 W (top) or 0 W (bottom) microwave
power.


This  is  consistent  with  the  presence  of  an  InCl3  selvage  layer 
under  RIE  conditions,  which  normally  requires  elevated 
sample  temperatures  to  desorb.  The  higher  ion  flux  under 
ECR  conditions  appears  to  prevent  formation  of  this  “poi¬ 
soning”  layer. 

The  pressure  dependence  of  etch  rate  at  fixed  microwave 
(1000  W)  and  rf  (150  W)  powers  is  shown  in  Fig.  8.  While 
the  etch  anisotropy  degrades  somewhat  above  7.5  mTorr,  the 
etch  rates  are  well  suited  to  via-hole  formation  processes, 
where rates of at least 1 μm/min are necessary.^"^ Moreover,
the rms surface roughness of InP and the other materials
improves dramatically at higher pressure, as shown in Fig. 9.
We again assume that for this plasma composition of
2Cl2/13Ar (which gives rough morphologies at 1.5 mTorr) if
one increases the neutral/ion ratio, then it is possible to balance
the formation and removal of the InCl3.

Fig. 8. Etch rates of In-based materials as a function of pressure in
2Cl2/13Ar, 150 W rf, and 1000 W microwave discharges.

Fig. 10. Etch rates of InP as a function of Cl2 percentage in 1000 W
microwave, 150 W rf, and 1.5 mTorr discharges of Cl2/Ar, Cl2/N2, or Cl2/H2.

There has been recent interest in the use of Cl2/N2 mixtures
for ECR etching of InP.^^’^^ Smoother surfaces were
obtained relative to either pure Cl2 or Cl2/Ar under some
conditions.^^ Figure 10 shows the InP etch rate as a function
of chlorine percentage in Cl2/Ar, Cl2/H2, or Cl2/N2 at fixed
pressure (1.5 mTorr), rf (150 W), and microwave (1000 W)
power. The addition of either H2 or N2 reduces the etch rate
relative to Ar. This appears to be due both to the lighter mass
of these species, which reduces the sputtering yield, and to a
reduction in atomic chlorine density. Figure 11 shows optical
emission spectra (raw intensities) from both Cl2/N2 (top)
and Cl2/H2 (bottom) discharges under the same conditions.
The reduction in intensity of the chlorine neutral radical lines
between 700 and 850 nm was also observed by Miyakuni
et al.^^ Since the excitation cross section of active species
depends on electron temperature, which will change with
source power, pressure, and other parameters, it is necessary
to be cautious in inferring ground state [Cl] densities, but in
each case we observed a decrease in neutral radical line
intensity when measured against standard Ar lines. We did not
observe  InP  morphologies  as  smooth  for  these  two  chemis- 


Fig. 9. rms surface roughness of InP samples etched in 2Cl2/13Ar, 1000 W
microwave, and 150 W rf discharges, as a function of process pressure.

Fig. 11. Optical emission spectra from 1000 W microwave, 150 W rf, and
1.5 mTorr discharges of 10Cl2/5N2 (top) or 10Cl2/5H2 (bottom).

Fig. 12. AFM scans from InP samples etched in 1000 W microwave, 150 W
rf, and 1.5 mTorr discharges of 10Cl2/5N2 (top) or 10Cl2/5H2 (bottom).

tries  as  obtained  with  the  simple  Cl2/Ar  mixtures.  AFM  sur¬ 
face  scans  are  shown  in  Fig.  12  from  samples  etched  in  1.5 
mTorr,  1000  W  microwave,  and  150  W  rf  discharges  of 
10Cl2/5N2 (top) or 10Cl2/5H2 (bottom). The rms roughness
values were 31.9 and 34.7 nm, respectively, much worse than
that obtained for Cl2/Ar (0.64 nm). Clearly the addition of Ar
is the best choice in terms of etch rates and surface
morphology. The drawback of using H2 or N2 is that they add an
additional chemical component to the plasma chemistry,
which complicates the control of the near-surface
stoichiometry.
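
The rms roughness figures quoted above follow the usual definition, the root-mean-square deviation of the surface height from its mean. A minimal sketch of that calculation is given here for orientation; the two height profiles are hypothetical illustrations, not the AFM data of Fig. 12, and the function name is an assumption of this sketch.

```python
import math


def rms_roughness(heights_nm):
    """Root-mean-square deviation of heights from their mean (same unit as input)."""
    n = len(heights_nm)
    if n == 0:
        raise ValueError("need at least one height sample")
    mean = sum(heights_nm) / n
    return math.sqrt(sum((h - mean) ** 2 for h in heights_nm) / n)


# Hypothetical line profiles in nm (illustrative only, not measured data).
smooth = [0.0, 0.5, -0.5, 0.5, -0.5, 0.0]
rough = [0.0, 30.0, -30.0, 30.0, -30.0, 0.0]

print(f"smooth: {rms_roughness(smooth):.2f} nm")
print(f"rough:  {rms_roughness(rough):.2f} nm")
```

Because the definition is quadratic in height, a few deep pits or tall spikes dominate the result, which is one reason rms values can differ by orders of magnitude between chemistries.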

IV.  CONCLUSIONS 

High etch rates near room temperature with smooth,
stoichiometric surfaces are possible for In-based III-V
semiconductors under ECR conditions by controlling the
formation and desorption of the InCl3 etch product. This is
achieved at particular combinations of Cl2-to-Ar flow rates
for different pressures and microwave or rf powers. AES
surface analysis shows that under RIE conditions a thick InCl3
selvage layer forms that leads to slow rates and rough
morphologies.

Metal stack etching using a helical resonator plasma

A low-pressure etching process for advanced aluminum metallization stacks was developed using a
high-density helical resonator plasma source (Prototech model ESRF 600) mounted on a Lucas Labs
cluster tool. The metallization stacks consisted of a 300 Å TiN antireflection layer on 6000 Å of Al
(1% Cu) with a 1000 Å TiN diffusion barrier and a 100 Å Ti film to enhance adhesion to the
underlying SiO2. The feature widths were as small as 0.45 μm. The films were etched using gas
mixtures of Cl2/BCl3. The BCl3 proved to be an important additive to reduce notching of the Al film
at the interface between the Al and the top layer of TiN. Best feature profiles were obtained using
80-90 sccm Cl2 and 10-20 sccm BCl3 at the following reactor conditions: 2.0 mTorr, wafer platen
temperature T=0 °C, 100 W rf-bias power, and 1500 W source power. More anisotropic profiles are
obtained by either decreasing the wafer platen temperature or increasing the rf-bias power. The
photoresist is also stripped in the same process chamber using an oxygen plasma at 5 mTorr, 50 W
rf-bias power, and 1500 W source power at a chuck temperature of 25 °C. Extensive application of
real-time process diagnostics, including optical emission spectroscopy and full wafer interferometry,
aided process development by identifying end points, etching rates, and etching rate uniformities.
© 1996 American Vacuum Society.


I.  INTRODUCTION 

As the manufacturing of integrated circuits moves toward
0.25 μm design rules, the requirements for metal etching are
increasingly difficult to meet using conventional
parallel-plate reactive ion etchers (RIEs). An alternative to
these etchers is the class of low-pressure, high ion-density
plasma sources such as helical resonators, helicons,
transformer/inductively coupled plasma (TCP/ICP) sources, and
electron cyclotron resonance
(ECR)  sources.  These  new  plasma  sources  are  capable  of 
highly  anisotropic  etching  due  to  their  low-pressure  opera¬ 
tion  (2-5  mTorr),  can  maintain  high  throughput  due  to  the 
high  ion  densities  achievable  (10^ ^-10^^  cm~^),  exhibit  high 
etching  rate  uniformity  via  single-wafer  processing,  and  per¬ 
mit  independent  control  of  the  ion  energy  through  wafer 
biasing. For  metal  etching,  this  independent  control  of  ion 
energy  is  essential  to  achieving  vertical  wall  profiles  over 
multilayer  stacks,  thus  overcoming  the  natural  proclivity  of 
different  layers  to  etch  isotropically  at  different  rates. 

State of the art metallization layers are stacks of several
metal films engineered to minimize electrical resistance and
maximize thermal stability and reliability.^ Aluminum is the
conductor of choice because of its low resistivity and ability
to adhere to SiO2, but its low melting temperature, ability to
dissolve Si, tendency to corrode, and low resistance to
electromigration pose processing challenges."^ The top layer is a
thin (typically 100-500 Å) antireflective coating (ARC) of
Ti-W or TiN to improve resist patterning on the highly
reflective aluminum. The ARC layer may be oxidized by the
preceding lithography, with much of the surface Ti converted
to oxide. Beneath the ARC layer is the main conductor layer


of aluminum. The Al often contains 0.5%-2.0% Cu to
increase resistance to electromigration. Since Si is soluble in
Al, 0.5%-2.0% Si is often also added to the Al film or,
alternatively, a barrier layer can be added. The diffusion
barrier is often TiN with an underlayer of Ti to promote
adhesion of the barrier layer to the underlying oxide film. The
processing challenge is to etch the metallization stack to
produce vertical or slightly sloping sidewalls which facilitate
conformal coating by the subsequent dielectric layer. The
etching must be uniform across the wafer and must not sever
lines, creating electrical opens, or leave metallic deposits
across fields, creating shorts.

Current affiliation: Department of Chemical Engineering, Massachusetts
Institute of Technology, Cambridge, MA 02139.

Gas mixtures of BCl3 and Cl2 are typically used to etch
metallization stacks because fluorinated-aluminum species
are not volatile except at extremely high temperatures. One
role of BCl3 is to scavenge free oxygen, enabling the
removal of oxidized Al.^ Copper is difficult to remove by dry
etching, because none of its halides have an appreciable
vapor pressure. Most metal etching processes rely on
subsequent wet chemistry to remove residual copper.^ As Cl and
Cl2 react spontaneously with Al, steps must be taken to
prevent lateral etching of the Al film. The undercutting of the
film can be prevented by the presence of a recombination
species,^’^ such as BClx (x = 1 or 2), at or near the sidewalls
to react with and neutralize Cl and Cl2. Photoresist erosion in
the plasma is also believed to play a role in anisotropic metal
etching.^

Metallization stacks have been successfully etched in
high-density plasma sources in both magnetized plasmas,
such as helicon^ and ECR reactors,^ and in inductively
coupled reactors such as the TCP.^^ We have used a helical
resonator plasma source from Prototech Research^^ mounted
on a two-chamber Lucas Labs cluster tool^^ to etch 0.45 μm
metal patterns (0.35 μm VLSI technology) at low pressure.
Efforts were made to achieve a vertical profile using the
simplest chemistry. Both pre- and postetch processing steps
were established to prevent corrosion after metal etching and
resist ashing.

II.  EXPERIMENT 

A.  Etching  platform  and  helical  resonator  plasma 
source 

The etcher consists of a four-port central transfer chamber
housing a robot arm for wafer handling, and a loadlock for
loading cassettes of 100-200 mm diam wafers. The reactor
consists of two sections: a main processing section where the
plasma source and the wafer platen are mounted and a high
conductance elbow leading to a 3000 l/s magnetically
levitated Osaka turbomolecular pump. The effective pumping
speed is approximately 1200 l/s. A throttling gate valve,
installed directly above the entrance to the turbomolecular
pump, controls the reactor pressure independently from the
mass flow rate. A temperature-controlled capacitance
manometer with sensitivity between 0.5 and 100 mTorr
measures the neutral pressure during processing. Eight cartridge
heaters imbedded in the anodized aluminum walls of each
reactor allow the chamber walls to be baked to 100 °C.
Normal operating procedure is to continuously maintain a
“soft-bake” temperature of 65 °C, even during processing, to
minimize adsorption of etch by-products on the reactor walls.
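
The pumping figures above can be cross-checked with two textbook vacuum relations: conductances in series add reciprocally (1/S_eff = 1/S_pump + 1/C), and the speed needed to hold a pressure P at gas throughput Q is S = Q/P. The sketch below is only an estimate under stated assumptions: it uses the standard conversion 1 sccm = 760 Torr × 1 cm³ per minute, ignores any gas-temperature correction, and the 100 sccm and 2 mTorr operating point is taken from the etching conditions reported in this article. The function names are illustrative.

```python
# 1 sccm = 1 atm*cm^3/min = 760 Torr * 1e-3 l / 60 s (temperature correction ignored).
TORR_L_PER_S_PER_SCCM = 760.0 * 1e-3 / 60.0


def series_conductance(pump_speed_l_s, effective_speed_l_s):
    """Conductance C implied by 1/S_eff = 1/S_pump + 1/C, in l/s."""
    return 1.0 / (1.0 / effective_speed_l_s - 1.0 / pump_speed_l_s)


def required_speed(flow_sccm, pressure_mtorr):
    """Pumping speed S = Q / P (l/s) needed to hold the pressure at this flow."""
    throughput = flow_sccm * TORR_L_PER_S_PER_SCCM  # Torr*l/s
    return throughput / (pressure_mtorr * 1e-3)


conductance = series_conductance(3000.0, 1200.0)
speed = required_speed(100.0, 2.0)
print(f"implied elbow/valve conductance: {conductance:.0f} l/s")
print(f"speed needed for 100 sccm at 2 mTorr: {speed:.0f} l/s")
```

Under these assumptions the required speed is below the quoted effective speed, which is consistent with the throttling gate valve being used to set the pressure rather than the pump running wide open.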

The  wafer  can  be  rf-biased  through  the  wafer  platen  with 
a  600  W  power  supply.  The  wafer  is  mechanically  clamped 
to  the  wafer  chuck,  and  helium  backside  cooling  removes 
heat  from  the  wafer  to  the  platen.  Liquid  nitrogen  circulates 
through  the  wafer  platen  to  remove  the  heat.  The  platen  tem¬ 
perature  can  be  controlled  from  -100  to  25  °C. 

The helical resonator, Prototech model ESRF 600,
consists of a quartz tube with an internal diameter of 150 mm
that necks to 25 mm at the top where the feedgas is injected
(Fig. 1). RF power at 27.12 MHz is inductively coupled into
the plasma via a helical coil wound around the quartz tube. A
Faraday shield is inserted between the coil and the quartz
tube to minimize capacitive coupling that would sputter the
quartz. Cooling water is circulated inside the coil as well as
through the matching network and top metal plate. A
perfluoropolyether circulates around the outside of the coil and
Faraday shield in addition to cooling the quartz tube directly.

B.  Diagnostic  techniques 

1. Optical emission spectroscopy

A  rotating-grating  optical  emission  spectroscopy  (OES) 
system  from  Monolite  Instruments,^^  model  MIO  6100  with 
a  200-850  nm  photomultiplier  tube  detector,  enables  in  situ 
measurement  of  plasma  induced  emission.  The  entire  spec¬ 
tral  range  can  be  scanned  at  12  Hz,  and  the  resolution  of  the 
monochromator is observed to be 1-2 nm. Aluminum
chloride (AlCl) emission at 261.4 nm and nitrogen (N2) emission
at 337.0 nm are simultaneously monitored during the etching
at 2 Hz. The presence of the AlCl line corresponds to etching
of Al while the N2 line corresponds to etching of TiN in the
multilayer metal stack described in detail below.

Fig. 1. Schematic of the Prototech Research 150 mm diam helical resonator
plasma source, model ESHR-600 (drawn to scale).
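
Since the AlCl and N2 emission lines are sampled at 2 Hz, breakthrough and end point can be read off as the times at which a trace rises or falls sharply. The following is a minimal sketch of such an edge detector, assuming a simple sample-to-sample threshold; the trace is synthetic (arbitrary units), and the threshold value and function name are illustrative choices, not the procedure used by the authors.

```python
def find_edges(trace, threshold):
    """Return (index, kind) pairs where the sample-to-sample change exceeds threshold.

    kind is "rise" for a positive step and "drop" for a negative step.
    """
    edges = []
    for i in range(1, len(trace)):
        step = trace[i] - trace[i - 1]
        if step > threshold:
            edges.append((i, "rise"))
        elif step < -threshold:
            edges.append((i, "drop"))
    return edges


SAMPLE_RATE_HZ = 2.0  # monitoring rate quoted in the text

# Synthetic AlCl-like trace: baseline, plateau while Al etches, baseline again.
trace = [1.0] * 8 + [10.0] * 60 + [1.0] * 10

for index, kind in find_edges(trace, threshold=3.0):
    print(f"{kind} at t = {index / SAMPLE_RATE_HZ:.1f} s")
```

A real trace is noisy and its edges are spread over several samples, so smoothing or a windowed slope would normally precede the threshold test.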


2.  Full  wafer  imaging  interferometer 

A  full  wafer  interferometer  (FWI),  model  1000-IS,  from 
Low  Entropy  Systems, monitors  the  etching  in  real  time. 
Plasma  emission  is  the  light  source  for  the  interferometry 
while  a  charge-coupled  device  (CCD)  camera  acts  as  the 
detector  by  imaging  the  entire  wafer,  in  effect  turning  each 
pixel  into  an  independent  interferometer.  An  interference  fil¬ 
ter  (bandwidth  ~10  nm)  in  the  camera  is  centered  at  481  nm. 
Ideally,  one  should  choose  the  center  wavelength  of  the  in¬ 
terference  filter  to  be  at  a  wavelength  of  interest  for  the  ex¬ 
periment,  i.e.,  at  an  emission  line  from  either  a  reactant  or  an 
expected  product  species.  The  filter  used  in  this  experiment 
was not initially intended for monitoring Al metallization
etching, but it proved to be the more useful of the two
available filters (481 and 750 nm). The highest resolution of the
camera is on the order of 100 μm per pixel depending on the
imaging  optics.  Fast  Fourier  transform  analysis  of  the  inter¬ 
ference  fringes  is  used  to  calculate  the  etching  rates  of  trans¬ 
parent  films.  Etching  rate  uniformities  can  be  calculated  from 
various  preselected  regions  on  the  wafer. 
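
The conversion from fringe frequency to etching rate can be sketched as follows. At normal incidence, one full interference fringe corresponds to a thickness change of λ/(2n) in a transparent film of refractive index n, so the rate is the fringe frequency times λ/(2n). The example uses a plain discrete Fourier transform in place of an FFT to stay dependency-free; the single-pixel signal is synthetic, and the refractive index of 1.6 and the 4 Hz frame rate are placeholder assumptions, not values from this work.

```python
import cmath
import math


def dominant_frequency(signal, sample_rate_hz):
    """Frequency (Hz) of the strongest nonzero bin of a plain O(N^2) DFT."""
    n = len(signal)
    mean = sum(signal) / n
    centered = [s - mean for s in signal]  # remove the DC offset
    best_bin, best_magnitude = 1, -1.0
    for k in range(1, n // 2):
        coeff = sum(x * cmath.exp(-2j * math.pi * k * t / n)
                    for t, x in enumerate(centered))
        if abs(coeff) > best_magnitude:
            best_bin, best_magnitude = k, abs(coeff)
    return best_bin * sample_rate_hz / n


def etch_rate_nm_per_s(fringe_hz, wavelength_nm, refractive_index):
    """Each fringe period corresponds to wavelength / (2 n) of film removed."""
    return fringe_hz * wavelength_nm / (2.0 * refractive_index)


# Synthetic trace for one pixel: 0.25 Hz fringes sampled at 4 Hz for 64 s.
frame_rate_hz, n_samples, true_fringe_hz = 4.0, 256, 0.25
signal = [1.0 + 0.5 * math.cos(2 * math.pi * true_fringe_hz * i / frame_rate_hz)
          for i in range(n_samples)]

fringe_hz = dominant_frequency(signal, frame_rate_hz)
rate = etch_rate_nm_per_s(fringe_hz, wavelength_nm=481.0, refractive_index=1.6)
print(f"fringe frequency: {fringe_hz:.3f} Hz, etch rate: {rate:.1f} nm/s")
```

Running the same analysis on pixels from several preselected regions and comparing the resulting rates gives a uniformity estimate across the wafer.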

For opaque films, interference fringes will not be
observed. However, changes in the reflected light intensity as a
film clears will still provide valuable end-point information.
As is discussed below, reflectometry is the mode of operation
during metal etching, while interferometry is the mode of
operation during resist stripping.

Fig. 2. Composition of metal stack prior to etching.


Finally,  the  full  wafer  interferometry  system  also  provides 
optical  emission  capabilities,  albeit  only  at  the  wavelength  of 
the  filter  and  integrated  over  the  filter’s  bandwidth,  by  align¬ 
ing  the  camera  such  that  one  of  the  points  images  a  portion 
of  the  reactor  that  is  not  etching.  In  the  Lucas  Labs  etcher, 
this  is  easily  accomplished  by  choosing  a  point  that  images 
the  wafer  clamp. 

C.  Metal  stack 

Figure 2 is a cross-sectional schematic of the metal stack.
The following layers are deposited on Si wafers: 5400 Å
thermal SiO2, 100 Å Ti, 1000 Å TiN, 6000 Å Al containing
1% Cu, 300 Å of TiN, and 1.0 μm of hard-baked Novolac
resist. The resist is patterned using a trilevel process which
deposits 600 Å of SiO2 and 0.5 μm of chemically amplified
(CAMP) resist on top of the 1.0 μm of hard-baked Novolac
resist. The CAMP is exposed using deep ultraviolet (DUV)
light and developed with pattern fidelity down to 0.25 μm.
The CAMP patterns the oxide, which in turn patterns the
hard-baked resist. The metal etching process etches through
the metal stack while minimizing etching into the underlying
SiO2 layer. The open area of the pattern is approximately 50% of
the wafer.
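
As a rough illustration of how the layer thicknesses translate into a main-etch time, the sketch below sums thickness divided by etching rate over the four metal layers. The thicknesses are those listed above; the rates are taken from run 37 of Table II, with two assumptions that should be read as such: that "TiN 1" and "TiN 2" in that table denote the upper and lower TiN layers, and that the thin Ti layer etches at the lower-TiN rate (no Ti rate is reported).

```python
# Layer thicknesses in angstroms, top to bottom, as listed in the text.
STACK = [
    ("TiN (ARC)", 300.0),
    ("Al(1% Cu)", 6000.0),
    ("TiN (barrier)", 1000.0),
    ("Ti", 100.0),
]

# Etching rates in angstroms per minute (run 37 of Table II).
# Assumption: the Ti rate is set equal to the lower-TiN rate.
RATES = {
    "TiN (ARC)": 5100.0,
    "Al(1% Cu)": 11000.0,
    "TiN (barrier)": 3800.0,
    "Ti": 3800.0,
}


def etch_time_s(stack, rates_a_per_min):
    """Sum of thickness / rate over all layers, converted from minutes to seconds."""
    return sum(60.0 * thickness / rates_a_per_min[name] for name, thickness in stack)


total_thickness = sum(thickness for _, thickness in STACK)
main_etch_s = etch_time_s(STACK, RATES)
print(f"stack thickness: {total_thickness:.0f} A")
print(f"estimated main-etch time: {main_etch_s:.1f} s")
```

The estimate neglects breakthrough delays, loading effects, and overetch, so it should be taken only as an order-of-magnitude check on measured etching times.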

D.  Pre-  and  postetch  processing 

Table I summarizes the processing steps. Before etching,
the samples are placed in an ethylene glycol buffered oxide
etching (EG-BOE) bath, 7:1 aqueous dilution, for 3 min to
remove the CAMP and thin oxide layers and to expose the
layer of hard-baked resist to the plasma.

After etching through the metal stack, an O2 plasma at 50
W rf-bias power, 50 sccm O2, 5.0 mTorr, T=23 °C, and 1500
W rf source power is used to remove the remaining
photoresist. Removing the photoresist in situ prevents corrosion,
which occurs when Cl2-laden resist is exposed to air. As a
final corrosion prevention step, wafers are placed in an
ACT-CMI^^ bath for 10 min immediately after removal from
the etcher loadlock. This bath removes any remaining trace
resist and passivates the surface. In all wet chemical
treatments after etching, the wafers were run through a
seven-cycle rinse using de-ionized H2O bubbled with CO2 and then
spin dried. The processing conditions and resulting etching
rates are listed in Table II.
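
The Al:TiN selectivities in Table II appear to be simple ratios of the tabulated etching rates. A short check on four runs is sketched below; the rates and reported selectivities are copied from Table II, while the interpretation of the selectivity as the Al rate divided by the "TiN 2" rate is an assumption of this sketch.

```python
# (run, Al rate, TiN 2 rate, reported Al:TiN 2 selectivity); rates in A/min.
ROWS = [
    (21, 8500.0, 2400.0, 3.5),
    (30, 10100.0, 4100.0, 2.4),
    (37, 11000.0, 3800.0, 2.9),
    (40, 11400.0, 3900.0, 2.9),
]


def selectivity(rate_a, rate_b):
    """Ratio of two etching rates."""
    return rate_a / rate_b


computed = {run: selectivity(al, tin) for run, al, tin, _ in ROWS}
for run, al, tin, reported in ROWS:
    print(f"run {run}: computed {computed[run]:.2f}, reported {reported}")
```

Agreement to within rounding on these rows supports reading the column as a rate ratio, although the other selectivity columns may have been derived differently.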

III.  RESULTS  AND  DISCUSSION 
A. Etching with pure Cl2
1. Best profiles

The best profiles with a pure Cl2 plasma are obtained with
the following main etching conditions: 100 sccm Cl2, 2.0
mTorr, wafer platen temperature T=0 °C, 100 W rf-bias
power, and 1500 W source power. The total etching time is
51 s, including 5 s of overetch. The amount of overetch was
intentionally kept to a minimum to study the effect of the
main etching step on profile control. Figure 3 is a scanning
electron microscope (SEM) micrograph of two 0.45 μm lines
separated by 0.85 μm. The Al layer is undercut by
approximately 150 Å on each side directly beneath the top TiN layer.
All four metal layers (upper TiN, Al, lower TiN, and Ti) have
vertical profiles. While the isolated regions are completely
cleared, regions between nested lines need a longer overetch
to completely clear the lower TiN and Ti layers. In the open
areas, less than 100 Å of the underlying oxide is lost,
indicating that a longer overetch will not be a problem, as a loss
of up to approximately 500 Å of oxide is acceptable for 0.25
μm technology processes.
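
The profile numbers above can be put on a common scale using the conventional anisotropy measure A = 1 - (lateral etch depth)/(vertical etch depth), where A = 1 is a perfectly vertical wall. The sketch below applies it to the quoted Al undercut and also computes the remaining oxide budget. It assumes the undercut is compared against the full 6000 Å Al thickness and treats the "less than 100 Å" oxide loss as its upper bound.

```python
def anisotropy(lateral_a, vertical_a):
    """A = 1 - lateral/vertical etch depth; A = 1 means a perfectly vertical wall."""
    return 1.0 - lateral_a / vertical_a


AL_THICKNESS_A = 6000.0         # Al(1% Cu) layer thickness
AL_UNDERCUT_A = 150.0           # undercut per side beneath the top TiN
OXIDE_LOSS_MAX_A = 100.0        # upper bound on oxide lost in open areas
OXIDE_LOSS_ALLOWED_A = 500.0    # acceptable oxide loss quoted in the text

al_anisotropy = anisotropy(AL_UNDERCUT_A, AL_THICKNESS_A)
oxide_margin_a = OXIDE_LOSS_ALLOWED_A - OXIDE_LOSS_MAX_A

print(f"Al anisotropy: {al_anisotropy:.3f}")
print(f"oxide budget left for overetch: at least {oxide_margin_a:.0f} A")
```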

The  corresponding  OES  traces  show  clear  Al  and  TiN  end 
points  [Fig.  4(a)].  The  breakthrough  and  end-point  times  are 
marked  by  a  sharp  rise  and  drop,  respectively,  in  the  emis¬ 
sion  intensity.  These  sharp  changes  indicate  uniform  film 
clearing  and  thus  high  etching  rate  uniformity.  Traces  ob¬ 
tained  with  the  FWI  for  one  point  on  and  one  point  off  the 
wafer  are  shown  in  Fig.  4(b).  For  the  point  on  the  wafer,  only 
slight  changes  are  observed  at  the  Al  layer  end  point  and  at 
the  second  TiN  layer  end  point.  In  contrast,  the  point  off  the 
wafer  shows  an  increase  in  intensity  at  TiN  breakthrough  and 


Table I. Summary of processing steps.

1. 3 min 7:1 EG-BOE bath to remove oxide layer of trilevel resist stack.
2. Etch metal stack in helical resonator.
3. Remove resist in helical resonator with O2 plasma using the following conditions:
   50 sccm O2, 50 W rf-bias power, 5.0 mTorr, T=23 °C, 1500 W source power.
4. 10 min ACT-CMI bath to remove trace resist and passivate wafer surface.
5. Seven-cycle water rinse and spin dry.

Table II. Experimental conditions for all wafers and resulting etching rates and selectivities. The rf source power was 1500 W for all etching experiments.
T is the wafer platen temperature and p the pressure. "Resist left" is the resist remaining after the full etch; "Resist etched" is the resist removed during the Al etch. The TiN 1, Al, and TiN 2 columns are etch rates in Å/min; the last three columns are selectivities. A dash marks an entry with no value.

Run  Feed (sccm)        T (°C)  p (mTorr)  rf bias (W)  Resist left (Å)  Resist etched (Å)  TiN 1   Al      TiN 2   Al:TiN 2  Al:resist  Stack:resist
21   100 Cl2            23      2          50           3380             1500               3200    8500    2400    3.5       2.7        1.2
24   100 Cl2            21      2          75           2250             2250               3500    9100    3300    2.7       2.3        1.0
26   100 Cl2            23      2          100          4400             -                  6700    10600   3300    3.2       -          1.4
27   80 Cl2, 20 BCl3    23      2          100          1900             3000               5600    7000    2800    2.5       1.7        0.98
28   45 Cl2, 45 N2      23      2          100          0                6000               1300    2300    1300    1.7       0.82       0.79
29   75 Cl2, 25 N2      23      2          100          750              4500               4800    5100    2500    2.0       1.2        0.85
30   100 Cl2            -25     2          100          2250             2250               6000    10100   4100    2.4       2.1        1.0
31   100 Cl2            0       2          100          -                -                  5400    11100   4000    2.8       -          -
32   100 Cl2            0       1          100          1500             3700               4800    8100    3600    2.3       1.3        0.93
34   80 Cl2, 20 BCl3    0       2          100          1500             3350               7600    7500    3000    2.6       1.7        0.93
35   80 Cl2, 20 BCl3    0       1          100          2800             3000               4400    6700    2000    3.4       1.6        1.1
36   75 Cl2, 25 HBr     0       2          100          2250             3000               4500    7300    2800    2.6       1.7        1.0
37   100 Cl2            0       2          100          3100             1800               5100    11000   3800    2.9       2.8        1.1
38   80 Cl2, 20 BCl3    0       2          100          2250             2600               4800    7800    3100    2.5       1.8        1.0
40   90 Cl2, 10 BCl3    0       2          100          3500             1500               5700    11400   3900    2.9       3.4        1.2

a corresponding decrease at TiN end point, for both the upper
300 Å and lower 1000 Å TiN layers. This behavior at 481
nm  is  similar  to  the  N2  emission  at  337  nm  and  may  be  due 
to  a  relatively  weak  emission  of  N2  at  481.5  nm.^^  However, 
since  the  FWI  integrates  the  optical  emission  intensity  over  a 
fairly  wide  range  (10  nm),  it  is  highly  speculative  to  assign  a 
source  for  the  emission.  Additionally,  only  by  comparing  the 
FWI  signals  to  OES  traces  can  we  accurately  identify  the 
different  breakthroughs  and  end  points. 

2. Effect of rf-bias power

Experiments  run  using  different  rf-bias  powers  reveal  that 
high  biases  are  necessary  in  order  to  obtain  vertical  profiles. 
Lower  biases  result  in  significant  undercutting  of  the  Al,  and 
the  Al  layer  can  separate  from  the  lower  TiN  layer,  resulting 
in  lifting  of  the  metal  lines.  The  etching  rates  of  both  Al  and 
TiN  increase  with  increasing  rf-bias  power. 


Fig. 3. SEM micrograph of metal lines etched using a pure Cl2 plasma.
Etching conditions: 100 sccm Cl2, 2.0 mTorr, T=0 °C, 100 W rf-bias
power, and 1500 W source power. Total etching time is 51 s. 0.45 μm wide
lines separated by 0.85 μm space.


3.  Effect  of  wafer  platen  temperature 

Lowering the wafer platen temperature improves the
feature profile; Fig. 3 is a cross-sectional SEM of the
features etched at 0 °C. However, it was also found that
lowering the temperature to -25 °C (Fig. 5) leaves metal etch
residue on the wafer surface.

Fig. 4. Etching diagnostics for wafer etched with 100 sccm Cl2. (a) OES
signals at 261.4 nm (AlCl) and 337.0 nm (N2); (b) FWI signals at 481 nm at
points on and off the wafer.

Fig. 5. SEM micrograph of 0.45 μm wide lines etched with 100 sccm Cl2 at
the following conditions: 2.0 mTorr, 100 W rf-bias power, 1500 W source
power, and T=-25 °C.

Figure 6 shows the dependence of etching rate and resist
selectivity on wafer platen temperature. The Al etching rate
is fairly constant with platen temperature, while the TiN
etching rate decreases slightly as the platen temperature
increases. At low temperatures, the increase in TiN etching
rate may contribute to less Al undercutting since the overall
etching time is shorter, allowing less time for the lateral
etching of the Al layer. We also observe a decrease in
Al:resist selectivity at lower temperatures, leading to a
significantly increased resist consumption rate. However, the
increase in the concentration of resist etch products in the
plasma chemistry may contribute to the more vertical profiles
obtained at lower temperatures.

Fig. 6. Dependence of (a) Al and TiN etching rates and (b) resist selectivity
on wafer platen temperature. Etching conditions: 100 sccm Cl2, 2.0 mTorr,
100 W rf-bias power, and 1500 W source power. Error bars indicate
estimates of the measurement errors.

Fig. 7. SEM micrograph of 0.45 μm metal lines etched with a
BCl3-containing plasma. Etching conditions: 90 sccm Cl2, 10 sccm BCl3,
2.0 mTorr, T=0 °C, 100 W rf-bias power, and 1500 W source power. Total
etching time is 48 s. Line spacing is 0.85 μm.

4.  Effect  of  plasma  pressure 

Decreasing the neutral pressure from 2.0 to 1.0 mTorr
results in poor feature profile control. TiN and Al etching
rates decrease approximately 25% with the drop in pressure,
and the Al:resist selectivity drops by half.

B. Etching with BCl3

1.  Best  feature  profile 

When using a mixture of Cl2 and BCl3, the best feature
profiles were obtained at the following main etching
conditions: 80-90 sccm Cl2, 10-20 sccm BCl3, 2.0 mTorr,
T=0 °C, 100 W rf-bias power, and 1500 W source power.
Aluminum undercut is almost nonexistent (Fig. 7) for the
two 0.45 μm lines separated by 0.85 μm, but additional
overetching is still needed to remove residual TiN and Ti
between the lines. Overall, these feature profiles are as good
as or better than those obtained using pure Cl2.

Figures 8(a) and 8(b) show the corresponding OES and
FWI traces, respectively, for a wafer etched with 90 sccm
Cl2/10 sccm BCl3. Again, Al and TiN breakthroughs and end
points are clear with both diagnostics. However, notice that
in Fig. 4(b) the off-wafer point has a clearer end-point signal
than does the on-wafer point, while in Fig. 8(b) the
opposite is true. Apparently, the change in gas chemistry has
changed the nature of the end-point signals.

2.  Effect  of  wafer  platen  temperature 

When etching with a mixture of 80 sccm Cl2 and 20 sccm
BCl3, decreasing the wafer platen temperature improves
feature profiles by reducing Al undercutting. Etching rates
decrease marginally for both Al and TiN with increasing
temperature. The Al:resist selectivity decreases slightly, while
the overall resist selectivity increases slightly with increasing
wafer platen temperature. These trends correspond to those
observed when using a pure Cl2 plasma (Fig. 6).

Fig. 8. Etching diagnostics for wafer etched with 90 sccm Cl2 and 10 sccm
BCl3. (a) OES signals at 261.4 and 337.0 nm. (b) FWI signals at 481 nm at
points on and off the wafer.

3.  Effect  of  plasma  pressure 

For an 80 sccm Cl2/20 sccm BCl3 plasma, as with pure
Cl2, decreasing the neutral pressure from 2.0 to 1.0 mTorr
noticeably degraded feature profiles. Both Al and TiN
etching rates decrease with decreasing plasma pressure.


4. Effect of BCl3 concentration in plasma

Feature profiles improve with the addition of BCl3, as the
cross-sectional SEMs show for wafers etched with mixtures
of 10 sccm BCl3/90 sccm Cl2 and 20 sccm BCl3/80 sccm Cl2
[Figs. 9(a) and 9(b), respectively]. The Al and TiN etching
rates are almost unchanged between the 0% and 10% BCl3
plasmas, while both etching rates drop significantly as the
BCl3 concentration is increased from 10% to 20% [Fig.
10(a)]. The same changes occur for resist selectivity [Fig.
10(b)].

Fig. 9. SEM micrographs of metal lines etched with BCl3-containing
plasmas. Etching conditions: 2.0 mTorr, T=0 °C, 100 W rf-bias power, 1500 W
source power, and (a) 90 sccm Cl2, 10 sccm BCl3, and (b) 80 sccm Cl2, 20
sccm BCl3.

Additions of BCl3 are believed to suppress lateral etching
through the presence of subchlorinated BClx species (x = 1 or 2)
at or near the sidewalls, which tie up free Cl radicals to form
BCl3.^’^^ As more BCl3 is added to the plasma, the Al
etching rate is suppressed, and since BCl3 etches Al much more
slowly than Cl2,^^ this leads to poor Al:resist selectivity.

Also note that RIE etching of Al alloys typically uses
BCl3-rich etchant mixtures, whereas Cl2-rich mixtures
seem to be appropriate for the helical resonator. This
difference in etchant concentrations may be due to the higher
species dissociation found in high-density plasma sources,
which reduces the amount of BCl3 needed to provide a
sufficient amount of BClx radicals. This is also true for the
TCP at low pressures.^^

C.  Etching  with  other  additives 

The addition of other gases to a Cl2 plasma has been
reported to have significant effects on the feature profiles.
As discussed above, BCl3 is a helpful additive. Experiments
were also run using different proportions of N2 and HBr,
with varying results.

Fig. 10. Dependence of (a) Al and TiN etching rates and (b) resist selectivity
on % BCl3 in a BCl3/Cl2 plasma. Etching conditions: 2.0 mTorr, T=0 °C,
100 W rf-bias power, and 1500 W source power. Error bars indicate
estimates of measurement error.

Fig. 11. Dependence of (a) Al and TiN etch rates and (b) resist selectivity on
% N2 in plasma. Etching conditions: 2.0 mTorr, T(wafer platen)=23 °C, 100 W
rf-bias power, and 1500 W source power.

1.  Etching  with  N2 

Etching  wafers  with  N2-containing  plasmas  produces  poor 
profile  control  and  decreases  both  the  etching  rates  and  resist 
selectivities.  Etching  times  increase  dramatically  from  51  s 
with  0%  N2  to  210  s  with  50%  N2.  Most  of  the  resist  etches 
away  during  the  main  etch.  Most  features  uplift  or  are  etched 
away,  and  all  remaining  features  are  extremely  tapered. 

Figure 11(a) shows the dependence of Al and TiN etching
rates on the percentage of N2 in the gas flow. Both etching
rates decrease dramatically with increasing %N2. These decreasing
etching rates account for the tapered feature profiles:
as the etching time increases, more isotropic etching
can occur, and thus the Al undercuts and tapers.

Resist  selectivity  decreases  with  increasing  %N2  as  well 
[Fig.  11(b)].  In  the  case  of  50%  N2,  the  resist  selectivities  are 
extremely  low,  accounting  for  the  lack  of  resist  remaining  on 
the  surface  after  the  main  etch.  With  these  N2  chemistries, 
the  plasma  is  actually  more  proficient  at  etching  resist  than  it 
is  at  etching  the  metal  stack. 
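The etching times quoted above allow a rough comparison of average etch rates. The following is only a sketch: it assumes that both runs clear the same stack thickness (not stated in the text), so that the average rate scales inversely with etching time, and it takes selectivity as the simple ratio of metal to resist etch rates. The function names are illustrative.

```python
def relative_etch_rate(t_reference_s: float, t_new_s: float) -> float:
    """Average etch rate of the new run relative to the reference run.

    Assumes (not stated in the text) that both runs remove the same
    thickness, so the rate is inversely proportional to the etching time.
    """
    if t_reference_s <= 0 or t_new_s <= 0:
        raise ValueError("etching times must be positive")
    return t_reference_s / t_new_s


def selectivity(metal_rate: float, resist_rate: float) -> float:
    """Metal:resist selectivity, taken here as the ratio of the etch rates."""
    if resist_rate <= 0:
        raise ValueError("resist etch rate must be positive")
    return metal_rate / resist_rate


# Times quoted in the text: 51 s with 0% N2 and 210 s with 50% N2.
ratio = relative_etch_rate(51.0, 210.0)
print(f"average etch rate at 50% N2 is {ratio:.2f} of the 0% N2 value")
```

Under this definition, a selectivity below 1 corresponds to the situation described above, in which the plasma removes resist faster than it removes the metal stack.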

2.  Etching  with  HBr 

A single trial was run using 25% HBr in the Cl2 plasma
mix. The resulting feature profiles were relatively vertical,
but significant undercut of the Al layer occurred at the interface
of the Al and the upper TiN (Fig. 12). The total etching time
increased to 62 s, as opposed to 51 s when using pure Cl2.
Both Al and TiN etching rates as well as resist selectivities
decreased with the addition of HBr to the plasma mixture.

IV.  SUMMARY  AND  CONCLUSIONS 

The best feature profiles are obtained at the following
conditions: 80-90 sccm Cl2, 10-20 sccm BCl3, 2.0 mTorr,
Twafer platen=[value illegible] °C, 100 W rf-bias power, and 1500 W source
power. Wafers etched under these conditions have features
with vertical profiles, with almost no Al undercut at the upper
TiN layer. These features are slightly better than those
obtained when using 100 sccm Cl2 and the same plasma
chamber conditions. The anisotropic profile achievable with
a pure Cl2 plasma in the high-density helical resonator source
is better than that obtainable with pure Cl2 in a conventional
RIE plasma, but the remaining Al notch at the upper TiN
interface would make it unacceptable for a manufacturing
process. A small addition of BCl3 eliminates the Al notch,
and the mixture is therefore the preferred chemistry. Independent
control of the rf-bias power, in particular, also
proved to be important in controlling the amount of resist
erosion and in obtaining more anisotropic profiles.

Fig. 12. SEM micrograph of wafer etched with HBr-containing plasma.
Etching conditions: 75 sccm Cl2, 25 sccm HBr, 2.0 mTorr,
Twafer platen=[value illegible] °C, 100 W rf-bias power, and 1500 W source power.

Future work is intended to focus on a multistage process
where different amounts of BCl3 are added to the plasma
mixture during different parts of the etch. It is hoped that
through this type of processing, all Al undercut can be eliminated,
and the residual TiN and Ti between pairs of lines can
be cleared.

The microstructural properties and interdiffusion reactions of some common metallization schemes
(Au/Ge/Ni, Ti/Pt/Au, WSix, and AuBe) on GaN and In0.5Ga0.5N have been examined using scanning
electron microscopy and Auger electron spectroscopy. The objective of this study was to investigate
thermally stable and uniform contacts to both materials. The WSix contacts were found to possess
excellent thermal stability and retained good structural properties at annealing temperatures as high
as 800 °C on GaN. These contacts on In0.5Ga0.5N had a minimum specific contact resistivity of
1.48×10^-? Ω cm^2 and an excellent surface morphology following annealing at 700 °C. The increase
in contact resistance observed at higher temperatures was attributed to intermixing of metal and
semiconductor. By contrast, the Ti/Pt/Au and Au/Ge/Ni contacts showed lower stabilities, but also
showed little Au penetration into the underlying GaN. AuBe contacts had the poorest thermal
stability of all four schemes investigated, with substantial reaction with GaN occurring even at
400 °C. The WSix contact appears to be an excellent choice for application in high temperature GaN
electronics. © 1996 American Vacuum Society.


I.  INTRODUCTION 

Owing to their large band gaps and their mechanical and thermal
stability, III-V nitrides are very attractive for high temperature
electronics and blue and UV optoelectronic device applications.
Improved material properties have recently led to
a variety of devices being demonstrated. Blue light-emitting
devices (LEDs) and metal-semiconductor field-effect transistors
(MESFETs) have been successfully fabricated. However,
forming low resistance, thermally stable, and uniform
ohmic contacts to a wide band gap semiconductor, such as
GaN with a band gap of about 3.4 eV, constitutes a major obstacle
to the advancement of nitride-based devices. These devices
provide high efficiency and acceptable reliability only
if their contacts are stable and have low resistance ohmic
characteristics.

In an earlier attempt to achieve ohmic contacts on GaN
epilayers, Foresi et al. used Al and Au contacts with a 575 °C
anneal cycle. However, the specific contact resistivity of
these contacts was relatively poor (10^-? Ω cm^2). Khan et al.
used Ti/Au to contact n-type GaN and measured a contact
resistance of 7.8×10^-? Ω cm^2 after annealing at 250 °C for
30 s. Nakamura et al. have used Au (and later Au/Ni and Al)
as p- and n-type contacts, respectively, in their LED structures.
While the contact resistances were not reported for their
LED structures, an operating voltage of 4 V at 20 mA
forward bias is clear evidence that reasonable contact resistances
were obtained. Recently Lin et al. have obtained extremely
good ohmic contacts on n-type GaN layers grown on
sapphire substrates. Using a Ti/Al metallization scheme, they
were able to obtain specific contact resistivities as low as
8×10^-? Ω cm^2 after annealing at 900 °C for 30 s. Lin et al.
also demonstrated a novel ohmic contact scheme to GaN
using an InN/GaN short-period superlattice (SPS) and an
InN cap layer. Ohmic contact resistivities as low as 6×10^-?
Ω cm^2 were achieved even without any postannealing.

An ohmic contact study of four standard metallization
schemes in III-V technology (Ti/Pt/Au, Au/Ge/Ni, WSix,
and AuBe/Au) has been undertaken in this work. We also
investigated a scheme that had an InGaN layer on top of
GaN. InGaN has a lower band gap than GaN and should
produce lower contact resistance. Ultimately, we would like
to use such a scheme for improved contact properties on
devices, much as InAs is used on GaAs. WSix was used for
metallization to the InGaN layer. Electrical characterization
of the contacts was done using standard transmission line
measurements (TLMs), and materials characterization included
scanning electron microscopy (SEM) and Auger electron
spectroscopy (AES).


II.  EXPERIMENT 

The nominally undoped GaN and In0.5Ga0.5N were grown
on GaAs at 800 °C using (CH3)3Ga and an electron cyclotron
resonance plasma-generated N2 flux in a metalorganic molecular
beam epitaxy system. The samples predominantly
consisted of the cubic phase, with typical x-ray full width at
half-maximum (FWHM) of 300-500 arcsec. The n-type
doping level in the ternary was ~10^? cm^-3, while the GaN
was typically ~10^? cm^-3 due to the presence of native shallow
donors. At this point, it has been impossible to grow
p-type InGaN at InN mole fractions above 0.07.

Four different metallization schemes common in III-V
technology were investigated: Ti/Pt/Au, Au/Ge/Ni,
AuBe, and WSix. The purpose was to establish their
relative thermal stabilities on GaN. Once this was established,
we wanted to use the most stable scheme on InGaN, which as
mentioned earlier is our intended final structure for device
applications. The contact metals were deposited using two
techniques, namely, electron beam evaporation and sputtering.
The deposition sequences and the layer structures were
as follows: (1) Ti-250 Å/Pt-500 Å/Au-1000 Å; (2) Ge-50
Å/Ni-50 Å/Au-Ge-268 Å-132 Å/Au-1100 Å; (3)
AuBe-200 Å/Au-1000 Å; and (4) WSix-1000 Å. Based on
initial studies of the stability of these systems, WSix contacts
with a thickness of 1000 Å were deposited on an In0.5Ga0.5N
sample. The Ti/Pt/Au contacts have proven to yield excellent
contact resistance (about 10^-? Ω cm^2) on extremely heavily
doped III-V semiconductors. The AuBe-based metallization
is a standard alloyed ohmic contact and produces excellent
contact resistance on heavily doped p-type materials (~10^-?
Ω cm^2). While we did not investigate p-type material in this
work, it is still worth establishing the stability of this system.
The Au/Ge/Ni contact is also a standard alloyed contact and
produces very low contact resistances to heavily doped
n-type III-V materials (about 10^-? Ω cm^2). Typically, alloyed
contacts exhibit problems such as poor thermal stability,
spiking, and poor reproducibility and reliability. The
above three metallizations were performed using electron
beam evaporation in a Temescal system. Prior to loading in
a UHV system, the wafers were rinsed in a solution of
H2O:NH4OH=20:1 for 1 min to remove the native oxide
from the GaN samples in order to ensure uniform contact
properties upon subsequent deposition and alloying. After a
pump down to a pressure of about 2×10^-? Torr and in situ
Ar+ ion beam sputtering for oxide removal, the metallization
schemes were evaporated with an electron gun.

The WSix (x~0.45) contact was deposited using Ar+-assisted
sputtering. Prior to loading the samples in the deposition
chamber, a 1 min rinse in a H2O:NH4OH (20:1) solution
was done, after which the contacts were deposited by sputtering
with an Ar discharge at a bias voltage of 90-100 V.
Refractory metal contacts have traditionally shown excellent
thermal stability and are used as a gate in the processing of
self-aligned MESFET devices. The electrical measurements
were performed on WSix contacts deposited over InGaN layers
on GaN. The contact metal was deposited onto square
openings (100×100 µm^2) linearly spaced (with intervals of
2-16 µm) in a photoresist layer. Subsequently, InGaN mesas
were etched to give the required one-dimensional current
flow. In these samples, the GaN was semi-insulating and
only patterning of the InGaN was necessary. In real device
structures, one would employ conducting GaN (such as for
channel layers in transistors) and then TLM measurements
would require patterning of the GaN as well. After the metallizations
were performed, each sample was split into pieces
and thermally annealed at temperatures from 400 to 900 °C
for 30 s in a N2 ambient using a rapid thermal annealing
technique (AG Heatpulse 410 system).

III.  RESULTS 

A.  Scanning  electron  microscopy 

The Au/Ge/Ni samples were annealed at temperatures of
400, 500, 600, and 700 °C. The as-deposited sample and that
heat treated at 500 °C, as seen from their micrographs in Fig.
1, are rather smooth because a Ni overplate is used
(the micrograph of the sample annealed at 400 °C is not included
because its surface morphology is not very different
from that of the as-deposited sample and the sample annealed at
500 °C). The Au/Ge/Ni sample heat treated at 600 °C
showed signs of degradation. The surface was no longer
smooth, and as the annealing temperature was increased
further to 700 °C the SEM micrograph at a magnification
of 10 000× revealed many shallow small pits. Comparing
the micrographs of all the samples shows that annealing at 500 °C does
not really affect the surface morphology, whereas heat treatments
at 600 and 700 °C do degrade the sample morphology.

The Ti/Pt/Au samples were heat treated at temperatures of
500, 700, 800, and 900 °C for 30 s. The morphology of the
as-deposited sample (shown in Fig. 2) is quite smooth. The
sample that was annealed at 500 °C showed signs of degradation,
and at 700 °C the surface became more degraded. The
formation of small islands was more apparent, and their size
and density increased at 800 °C. The maximum temperature
for annealing these contacts would be around 450-500 °C,
which is approximately the temperature used for obtaining
low temperature Ti/Pt/Au contacts on GaAs
samples.

The WSix contacts on GaN samples were heat treated at
temperatures of 400, 700, 800, and 900 °C for 30 s. The
SEM micrographs of the as-deposited sample and the sample
annealed at 900 °C are shown in Fig. 3. The micrographs of
the other samples annealed in the temperature range of 400-800 °C
were not included because the surface morphology
was similar to that of the as-deposited sample. The WSix
contacts were also deposited on an In0.5Ga0.5N film grown on
GaAs, which was further split into pieces and annealed in the
same temperature range as the WSix/GaN system. The SEM
studies performed on these samples revealed that the as-deposited
sample exhibited a very smooth surface and that
there was no change in the surface morphology of samples
annealed at temperatures of 400 and 700 °C. The surface
morphology of the samples annealed at 900 °C showed only
a small amount of surface roughness. The maximum annealing
temperature to obtain WSix contacts with good surface morphology
on InGaN samples would therefore be in the range
700-800 °C.

The AuBe samples were annealed at temperatures of 400,
500, 600, and 700 °C for 30 s. The as-deposited AuBe
sample was characterized by a smooth surface, but the sample
annealed at 400 °C already showed a degree of roughness. It
was clearly evident from the sample annealed at 700 °C that
the surface was badly degraded. Thus, for the AuBe contacts
on GaN the maximum annealing temperature should be less
than 400 °C for the contact to retain its surface morphology.


B.  Auger  electron  spectroscopy 

The Auger survey scans of the as-deposited contacts indicated
that the surface layer of the metal had the usual contamination
from atmospheric O and C. These surface contaminants
were quickly sputtered away during the Auger depth
profiling.

Fig. 1. Surface morphology of Au/Ge/Ni/GaN contacts (magnification ×10 000): (top left) as-deposited, (top right) annealed at 500 °C, (bottom left) annealed
at 600 °C, and (bottom right) annealed at 700 °C.

The Auger depth profile of as-deposited Au/Ge/Ni contacts
on GaN indicated a slight interdiffusion between Au
and Ni, moderate interdiffusion between Au and Ge, and
extensive interdiffusion between Ni and Ge.
There was also some interdiffusion between Ni, Ge, and the
GaN, with outdiffusion of Ga into the Ni and Ge layers. The
penetration of Au into the GaN was insignificant. The suggestion
of interdiffusion of Ni and Ge agrees with results for
Au/Ge/Ni contacts on GaAs samples.

The Auger depth profile of the as-deposited Ti/Pt/Au
sample seen in Fig. 4(a) did not show much interdiffusion
between the metals compared to the Au/Ge/Ni as-deposited
sample. There was a slight interdiffusion between Au and Pt
and moderate interaction between Ti and Pt, with extensive
interaction between Ti and Ga. There was essentially no interaction
between Au and GaN. In comparison to the as-deposited
sample, the sample heat treated at 600 °C showed
extensive interdiffusion between Ti and Pt [Fig. 4(b)]. There
appeared to be more Ga outdiffusion into the metal layers in
the heat treated sample compared to the as-deposited sample.
TiN is a semimetal, but we do not know whether TiGaN
forms in these samples; no evidence was found for such a
phase in double crystal x-ray diffraction measurements.

The depth profiles of the as-deposited and annealed
samples of WSix shown in Figs. 5(a) and 5(b) indicated that
the 900 °C annealed sample showed more intermixing of the
metal and GaN at the interface compared to the as-deposited
sample. A similar interdiffusion was reported in the
WSix/InGaAs system, although it can be seen that the interdiffusion
in the WSix/GaN structure is much less than
in the metallization schemes described above.

Since  the  AuBe  contacts  exhibited  poor  thermal  stability 
from  the  SEM  photographs  we  did  not  perform  AES  studies 
on  them. 


C.  Electrical  measurements 

The inference drawn from the analysis of the SEM and
AES results of the different metallization schemes was that
the WSix contact was more thermally stable than the
other schemes, and hence this metallization scheme was employed
on In0.5Ga0.5N to study lower band gap contacts. The
samples were annealed in the temperature range of 400-900 °C.

Fig. 2. Surface morphology of Ti/Pt/Au/GaN contacts (magnification ×10 000): (top left) as-deposited, (top right) annealed at 700 °C, (bottom left) annealed
at 800 °C, and (bottom right) annealed at 900 °C.

The contact resistance and the sheet resistance can be
derived from a plot of measured resistance versus gap spacing.
A least-squares method was used to obtain a
linear fit to the experimental data, which typically yielded
a correlation coefficient of 0.9972 or better.

The specific contact resistivity (ρc) is calculated from a
measurement of the effective contact resistance (Rc), the
contact width (W), and the transfer length (LT):

$\rho_c = R_c W L_T.$
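The TLM extraction described above can be sketched in a few lines. This is a minimal illustration, not the authors' analysis code: it assumes the usual linear TLM model, in which the measured resistance is 2Rc plus the sheet resistance times gap over width, and it uses synthetic, hypothetical resistance values. Only the 100 µm pad width and the 2-16 µm gap range are taken from the text; the function name and the particular gap values are illustrative.

```python
import math


def tlm_extract(gaps_cm, resistances_ohm, width_cm):
    """Least-squares line through R(d), assuming R = 2*Rc + Rsheet*d/W."""
    n = len(gaps_cm)
    if n < 2 or n != len(resistances_ohm):
        raise ValueError("need at least two matching (gap, resistance) points")
    mean_d = sum(gaps_cm) / n
    mean_r = sum(resistances_ohm) / n
    sxx = sum((d - mean_d) ** 2 for d in gaps_cm)
    syy = sum((r - mean_r) ** 2 for r in resistances_ohm)
    sxy = sum((d - mean_d) * (r - mean_r)
              for d, r in zip(gaps_cm, resistances_ohm))
    slope = sxy / sxx                      # ohm per cm of gap, equals Rsheet/W
    intercept = mean_r - slope * mean_d    # ohm, equals 2*Rc
    r_c = intercept / 2.0
    r_sheet = slope * width_cm             # ohm per square
    l_t = r_c * width_cm / r_sheet         # transfer length, cm
    return {
        "r_c_ohm": r_c,
        "r_sheet_ohm_sq": r_sheet,
        "l_t_cm": l_t,
        "rho_c_ohm_cm2": r_c * width_cm * l_t,  # rho_c = Rc * W * LT
        "correlation": sxy / math.sqrt(sxx * syy),
    }


# Synthetic example (hypothetical Rc = 2 ohm and Rsheet = 500 ohm/sq).
WIDTH_CM = 100e-4                                   # 100 um pad width
gaps = [g * 1e-4 for g in (2.0, 4.0, 8.0, 16.0)]    # gaps in cm
measured = [2 * 2.0 + 500.0 * d / WIDTH_CM for d in gaps]
for name, value in tlm_extract(gaps, measured, WIDTH_CM).items():
    print(f"{name}: {value:.4g}")
```

With real data the correlation value gives the same goodness-of-fit check quoted above, and a small or negative intercept signals that the contact resistance is below what the pad geometry can resolve.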

The specific contact resistivity dropped from 5.95×10^-?
Ω cm^2 on the as-deposited sample to 3.92×10^-? Ω cm^2 after annealing
at 400 °C, and reached a minimum value of
1.48×10^-? Ω cm^2 after annealing at 700 °C (Fig. 6). The
trends are similar to those of WSix on InGaAs.

IV.  DISCUSSION 

First, we discuss the result for Ti/Pt/Au metallization,
since this has been one of the more common schemes investigated.
It has been previously noted that Ti/Au contacts on
GaN have large contact resistance even after thermal annealing.
These contacts generally suffer from the problem of
spiking into the underlying semiconductor upon annealing.
To mitigate this problem, Pt, which is a very good diffusion
barrier, has been used between Ti and Au to prevent Au spiking.
In our work, a similar kind of observation was made.
The AES studies of the as-deposited sample did not show
intermixing of metals, and the SEM studies confirmed these
results since the sample surfaces were smooth. The
AES profile of the sample annealed at 600 °C
confirms that the Pt barrier has prevented Au spiking into
the underlying GaN. A solid phase reaction between Ti
and GaN to form TiN can help the contact operate through a
tunneling mode. From the AES depth profile, it can be seen
that for the sample annealed at 600 °C, Ga has penetrated
through most of the Ti film. According to Stareev et al., who
made similar observations of Ti/Pt/Au contacts on GaAs, the
Ga accumulation at the Ti/Pt boundary causes an underestimate
of the redistribution of As atoms and, therefore, they
performed complementary studies on the species behavior at
the Ti/Pt interface using RBS spectra of Ti/Pt samples deposited
on SiO2. No experimental evidence of a Ti-Pt intermixing
reaction was observed, even for annealing temperatures up to 600 °C.
Moreover, the accompanying ternary intermixing
reaction of Ti-Pt-Ga at the Ti/Pt boundary takes place only
if the Ti film is entirely saturated with As. That means the
changes at the Ti/Pt boundary observed in the Pt/Ti/GaAs
system after annealing at 600 °C would be attributed to participation
of As in a complicated ternary reaction with Ti and
Pt. Thus, in our case, if we suppose N is extracted from the
GaN without decomposing GaN, then an accumulation of N
vacancies would be created in the GaN near the metallurgical
junction. Since N vacancies most likely act as donors, this
region would be heavily doped n-GaN, which provides the
configuration needed for tunneling contacts. Thus annealing
temperatures of about 500 °C would be optimal for obtaining
thermally stable Ti/Pt/Au ohmic contacts, but temperatures
higher than that may lead to extensive reaction at the
interface, which implies degradation of the sample.

Fig. 3. Surface morphology of WSix/GaN contacts (magnification
×30 000): (top) as-deposited, and (bottom) annealed at 900 °C.

Turning to the Au/Ge/Ni and AuBe metallization
schemes, we find their extensive reaction with GaN at relatively
low temperatures makes them unsuitable for applications
such as electronic devices operating in the 300 °C
range. By contrast, WSix provides fairly stable contact to
both GaN and InGaN, with decent resistances after annealing
at 700 °C on the latter material, and no visible change in
morphology even at 900 °C. Although we have no long-term
aging results, there is generally a correlation between good
thermal stability and reliability since both are dependent on
minimizing metal-semiconductor reactions. As expected
from past data on other III-V materials, it appears that refractory
metals are promising candidates for stable contacts
on nitrides.

Fig. 4. AES depth profiles of Ti/Pt/Au contacts on GaN (a) as-deposited and
(b) after annealing at 600 °C for 30 s.

Fig. 5. AES depth profiles of WSix contacts on GaN (a) as-deposited and (b)
after annealing at 900 °C for 30 s.

Fig. 6. WSix specific contact resistivity on In0.5Ga0.5N vs annealing temperature.

J. Vac. Sci. Technol. B, Vol. 14, No. 4, Jul/Aug 1996

Durbha et al.: Stability of ohmic contacts

V. CONCLUSIONS

It was evident from the microstructural and interdiffusion
studies of the as-deposited and annealed samples of all these
metallization schemes that the WSix contacts exhibited the
best thermal stability and retained good structural properties
at annealing temperatures as high as 800 °C on GaN. Processing
of FET devices involves a high temperature annealing
step for implant activation, typically ~900 °C. The stability
of the WSix/In0.5Ga0.5N contacts is high enough to allow
dopant activation in heterojunction bipolar transistor or FET
structure processing. The contact degradation at higher annealing
temperature was related to increases in sheet resistance,
which in turn resulted from the degradation of the
metal-semiconductor interface.

The AuBe/GaN contact system showed poor thermal stability,
while the Au/Ge/Ni/GaN contact system was thermally
stable up to temperatures of 500 °C. The Ti/Pt/Au contacts
were found to remain substantially stable at 450-500 °C.
There was no penetration of Au into GaN in this scheme,
owing to the Pt diffusion barrier. These contacts exhibited
better structural properties than Ti/Au contacts deposited
on GaN.

Future work on the WSix/InGaN system will probably
involve varying the In concentration to
lower the band gap of InGaN. The contact resistance could
be measured as a function of annealing temperature, and the
conduction mechanisms and thermal stability established,
since the contact resistance should improve for higher InN
concentrations.

In this article, we report our results using the aluminum/germanium eutectic to create high quality
patterned bonds between two silicon dice. The bonds are formed using thin metal layers and with
essentially no pressure applied. We have measured bond strength by fabricating and bonding
patterned dice. Pull tests were conducted and the force required to separate the bonds was measured
and found to be about 1.6×10^? Pa. When bonds break, portions of the substrate are removed.
Testing of the hermeticity of the bond demonstrated that leak rates below the detection limit of the
leak tester (10^-? sccs) are possible. © 1996 American Vacuum Society.


I. INTRODUCTION

Many microelectromechanical systems (MEMS) that are
fabricated using bulk micromachining consist of several dice
that are bonded together to create a single unit. Examples
include pressure sensors, accelerometers, interferometers,
and microvalves. Over the past several years, a number of
unique bonding techniques have been developed, including
electrostatic bonding, wafer bonding, low temperature glass
bonding, diffusion bonding, and alloy bonding. Alloy bonding
has the advantage of providing a means for simultaneously
creating an electrical interconnection and a mechanical
bond.

The use of alloy bonding by the semiconductor industry is
not new. Thin foils of low melting point gold alloys are used
to bond dice to packages. These gold alloys have high
strength and produce a good bond that provides both electrical
and thermal conductivity to the package. For MEMS, the
application of foils has limited usefulness because the bond
layer cannot be lithographically patterned.

In an alloy bonding process, the metal constituents of the
alloy are deposited as thin films with different thicknesses.
The bonding temperature can be minimized by choosing the
eutectic composition. Three bonding modes can be achieved.
In the first, the constituents of the alloy are deposited on two
separate substrates such that one substrate contains one component
of the binary alloy and the other contains the second
component. These are held in close contact with each other,
usually under pressure. When heated, solid state diffusion
occurs, forming a bond between the two surfaces at a temperature
below the melting point of the eutectic alloy (diffusion
bonding). In the second approach, a binary eutectic
alloy is chosen. The components of the eutectic alloy are
deposited on two substrates, either individually on separate
wafers as before or in pairs on each substrate. When the
eutectic melting point is reached, a liquid of the eutectic
composition is formed. Joining occurs when the liquid surfaces
of the parts combine. Surface tension may help to draw
the parts together. Upon solidification, the individual elements
forming the eutectic separate into a solid containing
regions of each. If the ratio of the deposited materials is
adjusted away from the eutectic then, upon exceeding the eutectic
temperature, melting at the interface begins. More material
is consumed by the melt as long as the eutectic composition
can be maintained. As one of the components becomes
depleted, the liquidus point effectively increases and the
mixture partially solidifies. In thermal equilibrium, the lever
rule can be used to determine the relative amounts of the
liquid and solid phases. Since diffusion in the liquid state is
many times faster than in the solid state, more rapid joining
with less pressure applied is possible for eutectic bonding. In
the third method, a binary system consisting of a low and a
high melting point material that also contains a high melting
point compound is sought. Peritectics often satisfy these requirements.
A layer of the low melting point material is deposited
on a substrate which is formed from the higher melting
point material or has a thick layer of it on its surface. When
heated, liquid-phase bonding occurs. The liquid reacts with
the surface, dissolving the high temperature material until the
composition of the liquid matches the liquidus at the bond
temperature and the mixture solidifies. Continued heating
promotes the formation of the high melting point phase
through solid state diffusion. This combination of both types
of bonding is referred to as solid-liquid diffusion bonding
(SLID).
Little  has  been  written  on  patterned  alloy  bonding  for 
MEMS  applications.  In  this  article,  we  describe  the  results  of 
a  series  of  experiments  conducted  to  optimize  eutectic  alloy 
bonding  for  MEMS.  Our  work  concentrates  on  the 
aluminum/germanium  binary  system.  We  deposit  layers  of 
aluminum  and  germanium  on  two  substrates  such  that  each 
substrate contains both Al and Ge at the appropriate thicknesses
to create the eutectic alloy. The eutectic composition
is  targeted  in  order  to  obtain  the  lowest  temperature  bond.  An 
underlying  objective  of  this  work  was  to  isolate  the  bond 
layers  from  the  substrate  with  an  insulator  (silicon  dioxide  or 
silicon  nitride)  rather  than  to  consume  the  substrate.^’^  Dice 
with  layers  deposited  on  their  surfaces  are  brought  together 
and  heated  to  form  a  bond.  We  have  varied  the  total  film 
thickness  at  two  levels  and  formed  bonds  at  several  tempera¬ 
tures  and  times.  Both  pull  tests  and  leak  tests  were  per¬ 
formed.  They  indicate  that  the  Ge/Al  eutectic  is  suitable  for 
application  to  MEMS. 

II. SAMPLE PREPARATION

In  this  work,  careful  attention  was  given  to  the  prepara¬ 
tion,  processing,  and  fabrication  of  the  samples.  In  particular, 
exposure  to  air  can  cause  the  alloy  constituents  to  oxidize.  In 
the  Al/Ge  system,  aluminum  readily  forms  a  tenacious  oxide 
that  is  difficult  to  remove,  whereas  germanium  oxide  can  be 
stripped off in a dilute HF. For this reason, germanium is
deposited  after  aluminum  in  a  single  deposition. 

In  our  process,  we  deposit  thin  films  of  each  layer  indi¬ 
vidually.  This  permits  precise  control  over  the  ratio  of  the 
elements  forming  the  binary  alloy.  The  layers  are  deposited 
successively  in  a  single  pumpdown,  patterned  photolitho- 
graphically,  and  etched.  After  resist  stripping,  the  wafers  are 
cleaned.  Just  before  bonding,  the  surface  oxide  on  the  ger¬ 
manium  is  removed  in  HF  and  the  wafers  or  die  are  brought 
together  and  clamped.  The  samples  are  then  heated  above  the 
eutectic  temperature  and  held  for  a  short  period  of  time  (sev¬ 
eral  minutes).  Upon  cooling,  the  samples  can  be  tested  to 
determine  the  properties  and  the  quality  of  the  bond. 

Prime  silicon  (100)  3  in.  wafers  were  used  without  regard 
to  their  resistivity.  All  the  wafers  were  subjected  to  a  stan¬ 
dard  pre-oxidation  clean  in  a  modified  process  originally  for¬ 
mulated  by  Kern  and  Puotinen  at  RCA.^^’^^  Thermally  grown 
SiO2 was used as a diffusion barrier on the Si surface and as
a masking layer for Si etching. A wet oxidation was performed
at 1000 °C for 1.5 hours. The thickness of the oxide
was measured with a Nanospec Optical Film Thickness Measurement
System and was typically about 8000 Å.

To achieve an Al and Ge bond at 424 °C, the wt % ratio
of  each  component  is  selected  to  match  that  of  the  eutectic. 
For  a  given  thickness  of  aluminum,  the  germanium  thickness 
can  be  calculated  as  follows: 


$$t_{\mathrm{Ge}} = \frac{t_{\mathrm{Al}}\,(\mathrm{wt\,\%_{Ge}})\,\rho_{\mathrm{Al}}}{\rho_{\mathrm{Ge}}\,(\mathrm{wt\,\%_{Al}})},$$

where $t_{\mathrm{Al}}$ is the thickness of the aluminum, wt %Ge is the
weight percentage of the germanium, wt %Al is the weight
percentage of the aluminum, $\rho_{\mathrm{Al}}$ is the density of the aluminum,
and $\rho_{\mathrm{Ge}}$ is the density of the germanium. Therefore
with 1 µm of aluminum, 0.573 µm of germanium is required
to obtain the eutectic alloy (53 wt % germanium).
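
The thickness relation above can be checked numerically. The following is a minimal sketch, not the authors' calculation: the function name is hypothetical and the bulk densities (2.70 g/cm^3 for Al, 5.32 g/cm^3 for Ge) are assumed handbook values that do not appear in the text. With these assumptions the result is roughly 0.57 µm of Ge per 1 µm of Al, close to the 0.573 µm quoted above.

```python
def germanium_thickness(t_al_um, wt_pct_ge, rho_al, rho_ge):
    """Ge thickness (µm) that yields wt_pct_ge for a given Al thickness (µm).

    Implements t_Ge = t_Al * wt%Ge * rho_Al / (rho_Ge * wt%Al) for a binary
    alloy, so wt%Al is taken as 100 - wt%Ge.
    """
    wt_pct_al = 100.0 - wt_pct_ge
    return t_al_um * wt_pct_ge * rho_al / (rho_ge * wt_pct_al)


# Assumed bulk densities in g/cm^3 (handbook values, not given in the text).
RHO_AL = 2.70
RHO_GE = 5.32

# Eutectic composition of 53 wt % Ge, as stated in the text.
t_ge = germanium_thickness(1.0, 53.0, RHO_AL, RHO_GE)
print(f"Ge thickness for 1 um of Al: {t_ge:.3f} um")
```

Because the relation is linear in the aluminum thickness, doubling the Al layer to 2 µm simply doubles the required Ge layer, which matches the 2 µm / 1.14 µm film pair used later in the article.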

The  adhesion  of  deposited  thin  films  used  in  the  bonding 
process  must  be  excellent.  In  an  ideal  bond,  the  bond  layer 
should  be  stronger  than  the  materials  being  bonded.  Poor 
adhesion  of  the  bond  layers  to  silicon  dioxide  could  lead  to 
reduced bond strength. To promote the adhesion of the Al and
Ge thin films, the wafers were cleaned for 10 minutes in a fresh
mixture of DI/H2O2/NH4OH (5:1:1) at a temperature of 80 °C,
followed by a 5 minute DI water rinse and drying in a rinser
dryer, prior to thin film deposition.

The thin films were deposited by e-beam evaporation.
Prior  to  deposition  the  chamber  was  evacuated  to  less  than 
5X10”^  Torr.  The  substrate  temperature  was  raised  to 
250 °C to improve the adhesion of Al to the wafers. The
germanium  layer  was  deposited  on  top  of  the  aluminum  at 
about 100 °C. This temperature was chosen to ensure minimal
interaction of the layers prior to bonding. A calibrated

Fig. 1. Two dice shown in (a) are prepared as described in the text and
diced.  The  dice  are  placed  face  to  face  and  aligned  using  their  edges.  A 
similar  procedure  is  used  for  hermetic  testing  with  the  die  shown  in  (b). 


quartz  crystal  monitor  was  used  to  determine  deposition 
thickness  in  situ. 

After  the  metals  were  deposited,  the  wafers  were  pat¬ 
terned  photolithographically.  The  Ge  layer  was  etched  in  a 
solution of 50:2 H2O2/NH4OH and the aluminum in phosphoric,
acetic, nitric (PAN) at 50 °C. Two types of samples
were  prepared.  For  bond  strength  measurements,  square  dice 
with  four  circular  dots  at  the  edges  as  shown  in  Figure  1(a) 
were  patterned.  Experiments  were  conducted  with  1.25  mm 
dots and 0.5 mm dots. A second study of the hermeticity of
the  bonds  was  conducted  with  square  annular  bond  rings 
prepared  on  two  separate  and  different  dice.  One  die  con¬ 
tained  just  the  bond  ring  and  the  second  die  contained  both  a 
bond  ring  and  an  anisotropically  etched  hole  at  its  center  as 
shown  in  Figure  1(b). 

A 25% tetramethylammonium hydroxide [TMAH, (CH3)4NOH]
solution is used to anisotropically etch silicon. The advantage
of TMAH is its high selectivity to Si over SiO2 (about 4000 at
90 °C). Therefore, an 8000 Å SiO2 layer easily survives the
TMAH etch of the silicon substrate. The solution of TMAH is
heated to 100 °C to obtain an etch rate of about 1.1 µm/min.
With 416-µm-thick silicon wafers, the time required to etch
completely through is estimated at 6 hours. The TMAH etching
is terminated after a total etching time of 5 hours and 49 min
to obtain a square hole depth of about 385 µm, leaving a thin
diaphragm that is breakable with tweezers when desired.
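
The etch-time arithmetic in this paragraph can be reproduced with a short calculation. This is only a sketch that assumes a constant etch rate of 1.1 µm/min over the whole etch; the function and constant names are hypothetical. Under that assumption the through-wafer time comes out slightly above 6 hours, and stopping at 5 hours 49 min leaves a diaphragm of a few tens of microns.

```python
ETCH_RATE_UM_PER_MIN = 1.1   # TMAH etch rate at 100 C, from the text
WAFER_THICKNESS_UM = 416.0   # wafer thickness, from the text


def etch_time_min(depth_um, rate_um_per_min=ETCH_RATE_UM_PER_MIN):
    """Minutes needed to etch depth_um at a constant rate (an assumption)."""
    return depth_um / rate_um_per_min


def etched_depth_um(time_min, rate_um_per_min=ETCH_RATE_UM_PER_MIN):
    """Depth etched after time_min minutes at a constant rate."""
    return time_min * rate_um_per_min


through_time_h = etch_time_min(WAFER_THICKNESS_UM) / 60.0
depth_um = etched_depth_um(5 * 60 + 49)          # 5 hours 49 min
diaphragm_um = WAFER_THICKNESS_UM - depth_um     # silicon left under the hole

print(f"time to etch through: {through_time_h:.2f} h")
print(f"depth after 5 h 49 min: {depth_um:.1f} um")
print(f"remaining diaphragm: {diaphragm_um:.1f} um")
```

In practice the rate drifts with bath temperature and concentration, so the stopping time would be confirmed by measurement rather than taken from this estimate alone.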

III.  EXPERIMENTAL  PROCEDURE 
A.  Sample  clamping  and  heating 

Sample  bonding  is  a  simple  matter  of  aligning  two  dice, 
clamping  them  together,  and  heating  them  to  above  the  eu¬ 
tectic  temperature.  Since  a  dicing  saw  is  used  to  separate  the 
samples,  the  dots  are  aligned  to  the  edges  of  the  die  with  an 
accuracy  of  a  few  microns.  Alignment  can  be  performed  us¬ 
ing  the  die  edge.  Infrared  alignment  is  not  necessary.  Sample 
alignment  was  checked  using  an  infrared  microscope. 

Clamping  is  accomplished  in  two  ways.  The  first  employs 
a  simple  set  of  clips.  Clips  were  prepared  from  some  refrac¬ 
tory  metals  including  titanium  and  molybdenum.  The  clips 

Fig. 2. Two dice clamped together with refractory metal springs.

are  placed  at  the  edges  of  the  die  as  shown  in  Figure  2.  Full 
wafers  were  bonded  with  just  three  clips.  No  other  means  of 
applying  pressure  to  the  samples  was  used.  In  our  tests  of 
individual  die  bonding,  we  found  that  the  use  of  only  a  single 
clip  can  lead  to  poor  bonds,  especially  when  bonding  is  con¬ 
ducted  at  one  atmosphere.  Therefore  the  results  presented  in 
this  article  for  atmospheric  bonding  were  all  performed  with 
multiple  clips. 

The  second  clamping  method  provides  both  alignment 
and  clamping  in  a  single  fixture.  The  fixture  was  fabricated 
in  graphite.  It  consists  of  two  pieces  as  shown  in  Figure  3. 
The  lower  piece  contains  a  well  in  which  the  die  can  be 
placed  and  aligned  using  the  sides  of  the  graphite  part.  A 
second  graphite  lid  is  placed  on  top  of  the  samples  and  a 
refractory  spring  is  used  to  clamp  the  fixture.  The  thickness 
of  the  lid  is  equivalent  to  the  thickness  of  the  base  unit  to 
ensure  uniform  heating  of  the  sample  from  the  top  and  the 
bottom.  Thorough  testing  was  not  performed  to  determine  if 
uniform  heating  is  indeed  necessary  for  a  good  bond.  Early 
experiments  using  a  rapid  thermal  annealer  as  the  heat  source 
(heating  from  only  one  side),  however,  produced  poor  re¬ 
sults. 

Samples were heated in two ambients: in vacuum and in a
reducing  atmosphere.  For  vacuum  bonding,  a  horizontal 
Pyrex  bell  jar  was  fitted  with  a  heating  apparatus  constructed 
from stainless steel sheet. This apparatus contains two graphite
meander strip heaters separated by about 1.5 in. (Figure 4).
The heaters are about 6 in. square. Two quartz 1/8-in.-diameter
rods are used to support the sample in the center
between  the  two  heaters.  A  thermocouple  is  suspended  be¬ 
tween  the  two  heaters  and  in  close  proximity  to  the  sample  to 
be bonded. The thermocouple and optical temperature readings
of the heaters agree to within 20 °C. The two meander heaters
create a blackbody cavity in


Fig.  3.  A  drawing  of  the  graphite  clamping  fixture. 

Fig. 4. Drawing of the vacuum bonding system showing two meander strip
heaters,  quartz  support  rods,  and  the  location  of  the  thermocouple.  These 
parts  are  surrounded  by  a  stainless  steel  heat  shield  and  installed  in  a 
vacuum  system. 

which  the  sample’s  temperature  rises  and  comes  into  equilib¬ 
rium  with  its  surroundings.  This  vacuum  bonding  method 
should  be  suitable  for  devices  requiring  hermetic  sealing. 

The  second  method  involves  the  use  of  a  standard  tube 
furnace  (Thermco  Minibrute)  in  which  forming  gas  is  intro¬ 
duced.  The  samples  are  placed  at  the  front  of  the  tube  that  is 
subsequently  capped.  Forming  gas  (N2/5%H2)  is  purged  for 
a  time  equivalent  to  twenty  complete  volume  changes.  Care 
is  taken  to  minimize  the  exposure  of  the  sample  to  high 
temperatures  before  the  atmosphere  inside  the  tube  has 
settled.  A  single  hole  in  the  end  cap  allows  the  sample  to  be 
pushed  into  the  tube  via  a  push  rod  and  allows  the  purge  gas 
to  escape. 

Initial  bonding  tests  yielded  poor  results  even  when  con¬ 
ducted  at  temperatures  as  high  as  575  °C.  We  attribute  this  to 
poor  cleanliness  of  the  metal-bond  surface  layer.  The  princi¬ 
pal  source  of  concern  is  the  surface  of  the  Ge  thin  film.  After 
photolithography,  chemical  cleans  are  employed  to  remove 
photoresist  from  the  wafer  surface.  The  wafer  is  then  rinsed 
in  water  and  dried.  We  speculate  that  during  this  procedure  a 
hydrous  oxide  film  forms  on  the  Ge  surface.  This  layer  can 
be  subsequently  removed  in  a  dilute  HF  solution.  Some  de¬ 
gree  of  reoxidation  prior  to  sample  insertion  in  the  furnace  is 
likely  to  occur. 

B.  Bond  nature 

Figures 5 and 6 show the results for bonding with and


Fig.  5.  Optical  micrograph  showing  a  portion  of  the  bond  ring  from  a 
sample  that  was  bonded  with  no  special  surface  preparation.  The  result  is  a 
poor  bond  with  only  scattered  areas  that  are  partially  bonded. 

Fig. 6. Photomicrograph of a sample that was carefully cleaned prior to
bonding.  Although  the  sample  was  not  well  aligned,  large  bonded  regions 
are  apparent.  In  these  regions,  the  silicon  has  either  transferred  from  the 
mating  sample  or  was  pulled  away  from  this  sample  during  bond  separation. 

without  a  pre-bond  clean.  The  bonding  appears  to  be  more 
uniform after cleaning, as expected. The large black areas
shown in Figure 6 are regions where pieces of the bulk Si substrate
were removed during sample separation.

Bonding  experiments  were  conducted  on  die  containing 
four dots as shown in Figure 1(a). The deposited layers consisted
of 1 µm of aluminum and 0.57 µm of germanium.
Experiments were conducted at one atmosphere in a tube
furnace. All dice were dipped in a solution of 50:1 H2O:HF
for  10  seconds,  rinsed  briefly,  and  blown  dry  in  dry  nitrogen 
prior  to  being  clipped  together.  Samples  from  each  experi¬ 
ment  were  broken  apart  by  separating  the  die  with  a  knife 
edge. Three bonding regimes can be qualitatively identified:
incomplete, partial, and complete.
An  incomplete  bond  would  be  one  for  which  the  sample 
either  fell  apart  after  removing  the  spring,  or  required  only  a 
modest  effort  to  break  apart.  Under  the  microscope,  an  in¬ 
complete  bond  showed  little  or  no  evidence  of  the  removal 
of  silicon  from  the  substrate.  In  a  partial  bond,  frequent  oc¬ 
currences  of  silicon  removal  after  die  separation  were  ob¬ 
served.  As  much  as  70%  of  the  bond  area  contained  either  a 
void,  indicating  transfer  of  material  from  the  substrate  to  the 
mating  chip,  or  a  chip  of  silicon,  indicating  silicon  transfer 
from  the  mating  chip.  In  a  complete  bond,  the  original  dot 
has  been  completely  replaced  by  a  void  or  a  chip. 

Bonding  was  performed  at  475,  525,  and  575  °C.  At 
475  °C  bonding  was  incomplete  after  30  min  and  improved 
marginally  at  45  min.  At  525  °C  complete  bonding  could  be 
achieved  in  30  min.  Increasing  the  temperature  to  575  °C  did 
not  improve  the  results  significantly.  These  results  are  re¬ 
ported  in  Table  I. 

A second set of samples was prepared with 2 µm of Al
and 1.14 µm of Ge on Si/SiO2 wafers (Table II). These were


Table II. Bonding results with 2 µm Al and 1.14 µm Ge (average of five
samples).

Temperature (°C)    Bond time (min)    Bond nature
474                 10                 Complete
474                 20                 Complete
575                 5                  Incomplete
575                 10                 Complete

Table I. Bonding results with 1 µm Al and 0.57 µm Ge (average of five
samples).

Temperature (°C)    Bond time (min)    Bond nature
475                 15                 Incomplete
475                 30                 Incomplete
475                 45                 Partial
525                 15                 Incomplete
525                 30                 Complete
575                 5                  Incomplete
575                 15                 Partial

processed  and  prepared  using  the  procedures  described  for 
the  previous  experiments.  Complete  bonding  could  be 
achieved  at  475  °C  after  only  10  min.  Samples  that  were 
placed  in  the  furnace  at  575  °C  for  5  min  did  not  bond, 
probably  because  they  did  not  reach  thermal  equilibrium. 

C. Pull tests

Pull  tests  were  conducted  in  a  simple  apparatus  that  was 
constructed  for  this  purpose.  Bonded  samples  were  mounted 
to  sample  holders  using  epoxy.  The  sample  holders  consisted 
of 1/4 in. rods, approximately 1 in. long, that had an
eyelet centered at one end and were machined flat at the
other.  The  sample  mounting  took  place  in  a  V  block  that  was 
used  to  ensure  alignment  between  the  two  sample  holders. 
Mounted  samples  were  loaded  into  a  fixture  (Figure  7)  con¬ 
taining  a  piston  at  one  end  in  line  with  a  strain  sensor  at  the 
other.  Steel  cable  was  used  to  connect  the  eyelets  on  either 
side  of  the  sample  to  the  piston  and  strain  gauge.  Air  pres¬ 
sure  applied  to  the  piston  provided  the  force  to  pull  the 
samples  apart.  The  strain  gauge  was  calibrated  using  dead 
weights. 

Tests  were  conducted  on  samples  patterned  with  dots  as 
shown  in  Figure  1(a).  Samples  with  1  mm  dots  would  not 
separate  even  at  a  force  of  25  N.  Five  samples  bonded  with 
0.5  mm  dots  separated  at  an  average  force  of  12.39  N  and  a 
standard  deviation  of  2.35  N.  Therefore,  the  yield  pressure, 
assuming  complete  bonding  over  the  entire  area,  was  about 
1.6×10^7 Pa. This pressure is equivalent to one-tenth of the
yield  strength  of  aluminum.  After  the  samples  had  been  sepa¬ 
rated,  they  were  examined  under  the  microscope  (see  Figure 
8).  In  all  cases,  the  substrate  had  been  removed  from  one 
side  of  the  bond.  This  suggests  that  the  silicon  substrate 
yielded  before  the  bond. 
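
The yield pressure quoted above follows from dividing the average separation force by the bonded area. The sketch below assumes that all four 0.5 mm dots on a die (Figure 1(a)) are circular and fully bonded, and that the load is shared equally among them; the function name is hypothetical. Under those assumptions the result is on the order of 10^7 Pa.

```python
import math


def bond_stress_pa(force_n, dot_diameter_m, n_dots):
    """Average tensile stress (Pa) over n_dots circular bond dots.

    Assumes every dot is fully bonded and carries an equal share of the load.
    """
    area_m2 = n_dots * math.pi * (dot_diameter_m / 2.0) ** 2
    return force_n / area_m2


# Values from the text: 12.39 N mean force, 2.35 N standard deviation,
# 0.5 mm dots, four dots per die.
stress_pa = bond_stress_pa(12.39, 0.5e-3, 4)
spread_pa = bond_stress_pa(2.35, 0.5e-3, 4)

print(f"mean stress: {stress_pa:.2e} Pa")
print(f"one standard deviation: {spread_pa:.2e} Pa")
```

Since the silicon failed before the bond did, this figure is best read as a lower bound on the bond strength rather than the strength itself.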


Fig.  7.  Schematic  of  the  pull  tester  used  in  this  work. 

Fig. 8. Optical micrograph of a 0.5 mm dot after pull testing. The black
material  is  silicon  that  has  been  removed  from  the  mating  piece. 


D. Bond hermeticity

For  many  MEMS  applications,  it  is  necessary  to  create 
vacuum sealed cavities. To test Al/Ge bonds for hermeticity
we  created  samples  in  which  one  die  has  an  anisotropically 
etched  port  through  which  the  cavity  between  two  bonded 
dice  can  be  evacuated.  Using  a  helium  leak  tester,  a  series  of 
bonded  samples  was  tested.  The  connection  of  the  sample  to 
the leak detector is shown schematically in Figure 9.
The sample is simply placed on an O-ring that seals the
sample  to  the  leak  port.  The  system  is  capable  of  detecting 
helium down to a leak rate of 10“^ sccs. An internal calibrated
leak was used to check the operation of the detector. It
should be noted that, at this sensitivity, a typical bulk micromachined
device with a volume of 10“^ cm³ would reach
one  atmosphere  in  two  weeks. 
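
The relation behind this estimate is simply the gas quantity needed to fill the cavity divided by the leak rate. The sketch below is illustrative only: it reads sccs as standard cm^3 per second, assumes the leak rate stays constant as the cavity fills (a real leak slows as the pressure difference falls), and uses placeholder values for the cavity volume and leak rate that are not taken from the text.

```python
SECONDS_PER_DAY = 86400.0


def days_to_reach_pressure(volume_cm3, leak_rate_sccs, pressure_atm=1.0):
    """Days for a constant leak to bring a cavity up to pressure_atm.

    volume_cm3 * pressure_atm is the gas quantity in standard cm^3;
    dividing by the leak rate (standard cm^3 per second) gives seconds.
    """
    seconds = volume_cm3 * pressure_atm / leak_rate_sccs
    return seconds / SECONDS_PER_DAY


# Placeholder values chosen only to illustrate the scaling (hypothetical).
example_days = days_to_reach_pressure(volume_cm3=1e-3, leak_rate_sccs=1e-9)
print(f"time to reach one atmosphere: {example_days:.1f} days")
```

The scaling is the useful part: halving the cavity volume or doubling the leak rate halves the time, which is why very small sealed cavities demand leak rates well below what a standard helium detector can resolve.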

Twenty  samples  were  prepared  for  testing.  These  were 
approximately evenly split for a 2×2 experimental matrix
designed to test the influence of time and temperature. Times
ranged from 10 to 30 minutes and the temperature was set at
525 and 575 °C. For samples with 1 µm Al and 0.57 µm Ge,
bonding at 525 °C for 30 minutes produced leaks at about
5X10“"^ sccs. Table III details these results. Increasing the
temperature to 575 °C produced samples with leak rates below
the detection limit. Sixty percent of the samples prepared
with 2 µm Al and 1.14 µm Ge and tested at 525 °C
for 30 minutes were below the detection limit (see Table IV).
All of the thicker samples bonded at 575 °C had undetectable
leak rates.


Fig.  9.  Cross  section  of  the  test  die  used  for  leak  testing. 


Table III. Leak test results with 1 µm Al and 0.57 µm Ge.

Temperature (°C)    Time (min)    Standard leak rate (sccs)
525                 30            Undetectable
525                 30            10-3
525                 30            5X10"''
525                 30            7X10“'
525                 30            5X10“''
525                 30            Over the range
575                 10            10"^
575                 30            Undetectable
575                 30            Undetectable
575                 30            Undetectable
575                 30            Over the range
575                 30            Over the range

IV.  OBSERVATIONS 

Several  observations  can  be  made  about  these  results. 
Very strong bonds can be made using the Al/Ge eutectic
alloy  system.  This  was  clearly  demonstrated  by  pull  tests. 
One  may  question  why  the  silicon  yielded  before  the  bond 
broke.  A  possible  weakness  in  the  test  was  the  use  of  mul¬ 
tiple  dots  patterned  in  each  die  that  was  bonded.  Clearly  the 
strain  in  the  silicon  has  a  maximum  at  the  dot  periphery.  A 
second  contributing  factor  is  that  the  epoxy  used  to  mount 
the  samples  provides  enough  elasticity  that  the  sample  defor¬ 
mation  is  greater  than  it  would  be  for  a  more  rigid  mounting 
scheme.  In  this  case,  the  silicon  deformation  added  stress  to 
the  silicon  surrounding  the  bond  causing  it  to  yield. 

A  second  observation  is  that  there  is  a  significant  effect  of 
time  and  temperature  on  bond  quality.  In  general,  longer 
bonding  times  and  higher  temperatures  produced  better 
bonds.  It  is  also  observed  that  thicker  films  bond  more 
quickly and at lower temperatures than thin films. In our
work, 2 µm was considered a practical limit for a single
layer and results in a total bond thickness of 6.8 µm for the
aluminum/germanium  system.  A  final  observation  is  that 
there  appeared  to  be  only  negligible  difference  between  the 
quality  of  bonds  formed  in  vacuum  and  those  in  reducing 
ambient. 

The  time  and  temperature  dependence  can  be  explained  in 
part  by  postulating  a  surface  oxide  layer,  as  has  been  re¬ 
ported  in  other  alloy  systems.^"^’^^  At  the  interface  between 
the  two  bond  metal  layers,  the  two  surface  layers  of  Ge  are  in 


Table IV. Leak test results with 2 µm Al and 1.14 µm Ge.

Temperature (°C)    Time (min)    Standard leak rate (sccs)
525                 20            Undetectable
525                 20            3X10"’
525                 20            3X10"’
525                 30            Undetectable
525                 30            Undetectable
525                 30            1.5X10"’
575                 20            Undetectable
575                 20            Undetectable

contact. In handling and in exposure to oxygen in the atmosphere,
a thin layer of germanium oxide grows. In addition,
any  oxide  trapped  during  film  deposition  accumulates  at  the 
surface  of  the  molten  metal.  The  presence  of  the  oxide  inhib¬ 
its  the  merger  of  the  two  molten  bond  layers.  Bonding  does 
occur  at  pinholes  and  cracks  in  this  thin  oxide  layer  that  form 
as  the  bond  is  heated.  For  incomplete  bonding,  a  pattern  of 
spots  where  bonding  occurred  is  surrounded  by  unbonded 
regions.  By  capillary  action  and  with  time,  the  bond  area 
increases  as  the  oxide  is  pushed  aside  or  is  dissolved  into  the 
molten material. Evidence for this is seen in the partially
bonded wafers, where a larger percentage of the area has
been  bonded.  The  eutectic  bonding  process  could  be  im¬ 
proved  by  finding  a  way  to  avoid  the  growth  of  native  ger¬ 
manium  oxide  or  by  causing  it  to  dissolve  into  or  flow  off  the 
melt  during  bonding.  To  avoid  problems  from  oxides  trapped 
in  the  films  during  deposition,  one  could  pre-melt  the  alloy. 
After  the  initial  melt,  the  surface  is  cleaned  in  preparation  for 
final  bonding. 

A  possible  contributing  factor  to  explain  the  higher  bond¬ 
ing  temperatures  required  to  achieve  a  good  bond  is  a  varia¬ 
tion  in  the  ratio  of  the  thicknesses  of  the  germanium  and  the 
aluminum.  If  the  ratio  is  offset  to  either  side  of  the  eutectic, 
the  liquidus  temperature  will  increase.  We  first  note  that  the 
dual e-gun system used for this work had a rotating substrate
holder.  The  guns  were  placed  at  equal  distances  from  the 
center  of  the  system  and  on  a  line  passing  through  the  center. 
Therefore,  evaporation  from  either  gun  is  essentially  equal. 
Both  depositions  are  performed  sequentially  and  therefore 
the  thickness  variation  across  the  wafer  is  essentially  identi¬ 
cal  for  both  depositions.  Therefore,  the  alloy  composition  is 
constant  across  the  wafer,  and  only  its  total  thickness  varies. 
The  only  other  source  of  variation  is  in  the  ratio  of  the  thick¬ 
nesses.  In  other  words,  while  the  alloy  is  the  same  every¬ 
where  on  the  wafer,  it  may  not  be  the  eutectic  alloy.  We 
observe  that,  around  the  eutectic  composition,  the  liquidus 
can  be  approximated  as  a  straight  line  with  a  slope 

where $T_l(\mathrm{wt\,\%_{Ge}})$ is the liquidus, wt %Ge is the weight percent
of Ge in a near-eutectic alloy composition that is slightly Ge
rich, and $S_{\mathrm{Ge}}$ is the slope of the liquidus.

It should be noted that the quartz crystal monitor measures
the deposited mass, or the product of the density, thickness,
and area. The ratio of measurements from two sequential
depositions removes the area, and we define $r_{\mathrm{alloy}}$ as the
ratio $\rho_{\mathrm{Ge}} t_{\mathrm{Ge}} / (\rho_{\mathrm{Al}} t_{\mathrm{Al}})$. Then it can be shown that


Fig. 10. An SEM cross section of a complete bond. To enhance the contrast
between the aluminum and the germanium, the germanium was etched in
HF.

where $r_{\mathrm{alloy}}$ is a ratio that is slightly germanium rich, and
$r_{\mathrm{eu}}$ is the eutectic ratio.

The  slope  of  the  liquidus  with  increasing  Ge  concentra¬ 
tion  is  about  15  °C/%  for  the  aluminum/germanium  system. 
This  slope  is  about  double  the  slope  of  the  liquidus  for  the 
Ge  poor  alloy.  From  the  above  we  can  estimate  that  even  if 
our  control  of  the  thickness  ratio  is  in  error  by  10%  (5%  for 
each  deposition),  the  increase  in  the  liquidus  will  be  37  °C. 
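
The size of this liquidus shift can be estimated with a short calculation. The sketch below is not the authors' derivation: it assumes a straight-line liquidus with the 15 °C per wt % slope given above, takes the eutectic at 53 wt % Ge, and treats the monitored quantity as the mass ratio r = m_Ge / m_Al, so that the Ge weight fraction is r / (1 + r). All function names are hypothetical. The first-order (linearized) estimate lands near the 37 °C quoted in the text, and the exact conversion gives a slightly smaller value.

```python
def ge_weight_fraction(r):
    """Ge weight fraction for a Ge-to-Al mass ratio r."""
    return r / (1.0 + r)


def liquidus_rise_exact_c(r_alloy, r_eu, slope_c_per_wt_pct):
    """Liquidus rise (C) for a Ge-rich ratio, assuming a linear liquidus."""
    d_wt_pct = 100.0 * (ge_weight_fraction(r_alloy) - ge_weight_fraction(r_eu))
    return slope_c_per_wt_pct * d_wt_pct


def liquidus_rise_linearized_c(rel_ratio_error, wt_frac_eu, slope_c_per_wt_pct):
    """First-order estimate using d(w) = w * (1 - w) * (dr / r)."""
    d_wt_pct = 100.0 * wt_frac_eu * (1.0 - wt_frac_eu) * rel_ratio_error
    return slope_c_per_wt_pct * d_wt_pct


SLOPE_GE_RICH = 15.0        # C per wt % Ge, from the text
WT_FRAC_EU = 0.53           # eutectic Ge weight fraction, from the text
R_EU = WT_FRAC_EU / (1.0 - WT_FRAC_EU)

exact_c = liquidus_rise_exact_c(1.10 * R_EU, R_EU, SLOPE_GE_RICH)
linear_c = liquidus_rise_linearized_c(0.10, WT_FRAC_EU, SLOPE_GE_RICH)

print(f"exact conversion: {exact_c:.1f} C")
print(f"first-order estimate: {linear_c:.1f} C")
```

Either way, a 10% error in the thickness ratio raises the liquidus by a few tens of degrees, which is a meaningful fraction of the roughly 100 °C overshoot the bonds were found to need.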

When  complete  bonding  occurs,  the  dice  cannot  be  bro¬ 
ken  at  the  original  bond  line.  We  have  cross  sectioned  well 
bonded  dice  and  examined  the  interface  using  a  scanning 
electron microscope (SEM). A micrograph is shown in Figure 10.
The silicon dioxide layers on both dice are clearly
visible  as  the  dark  thin  films.  The  bond  layer  shows  two 
differently  shaded  regions.  Analysis  of  these  regions  with 
x-ray  dispersion  indicates  that  the  light  region  is  an  alumi¬ 
num  rich  alloy  and  the  dark  layer  is  germanium  rich.  This 
result  is  characteristic  of  the  solidification  of  eutectic 
alloys. 

Aluminum/germanium  bonding  can  be  used  for  electrical 
interconnection.  In  our  early  efforts,  however,  we  observed  a 
large  number  of  devices  that  were  shorted  to  the  substrate.  To 
determine  the  origin  of  the  shorts,  we  fabricated  samples  that 
consisted of an oxidized silicon substrate, a metal layer, an
LTO capping layer, and the bond layer. The bond layer was
patterned  into  dots  for  probing;  these  were  subsequently 
heated  to  emulate  a  bonding  cycle.  We  found  a  large  number 
of  shorts  between  the  dots  and  the  underlying  metal  layer. 
Upon  closer  examination,  it  was  determined  that  the  Al/Ge 
alloy  diffused  through  the  oxide.  After  capping  the  LTO  with 
silicon  nitride,  the  tests  were  conducted  without  any  shorting 
observed. 

V. CONCLUSIONS

Eutectic bonding using Al and Ge was investigated.
Samples  were  prepared  with  individual  layers  of  both  metals 
with  thicknesses  adjusted  to  create  the  eutectic  when  fully 
melted.  Several  observations  were  made.  Proper  surface 
preparation  of  the  layers  is  essential  for  good  bonding.  Our 
process  leaves  the  germanium  as  the  upper  layer  that  can  be 
cleaned  prior  to  bonding  with  a  dilute  HF  dip  and  rinse  step. 
Second,  the  bonding  does  not  take  place  when  the  melting 
point is reached. In fact, we have found that it is necessary to
exceed  the  eutectic  melting  point  by  about  100  °C  to  ensure 
a  reliable  bond.  Third,  the  bond  does  not  form  immediately 
upon  reaching  the  bond  temperature.  A  time/temperature  de¬ 
pendence  exists  with  the  quality  of  the  bond  improving  as 
both increase. Finally, the bond improves as the layer thicknesses
increase. Bonds can be achieved with 1 µm of Al and
0.57 µm of Ge but, by doubling the thickness of both, the
quality of the bond is improved and bonding can be conducted
at a lower temperature. Using the techniques described
in the article, we have achieved high strength bonds
that yield above 1.6×10^7 Pa. We also found that hermetic
sealing  using  eutectic  bonding  is  feasible.  All  the  bonds 
tested  were  patterned  photolithographically  and  formed  with 
essentially  no  external  pressure. 

In very large scale integration metalization processes, thin diffusion barrier metal films deposited
over  vias  and  contacts  are  used  to  inhibit  the  diffusion  of  the  top  metal  layer  into  the  underlying 
junctions.  These  thin  films  are  normally  sputter  deposited  and  their  reliability  is  crucial  to  die  yield 
and  device  lifetime.  As  wafer  sizes  increase,  die  at  the  wafer  edge  receive  a  directionally 
asymmetrical  sputter  flux,  leading  to  nonuniform  and  asymmetrical  film  coverage  over  vias  and 
contacts.  Two  three-dimensional  (3D)  extensions  of  the  two-dimensional  (2D)  film  growth  program 
SIMBAD  are  presented.  The  first  extension  (quasi-3D)  assumes  a  radially  symmetrical  adatom  flux 
distribution  and  would  be  appropriate  for  the  simulation  of  film  deposition  on  regions  near  the 
central  axis  of  a  radially  symmetrical  target.  Due  to  the  assumption  of  a  symmetrical  flux 
distribution,  the  resulting  film  growth  is  symmetrical.  The  second  extension  (interpolated-3D)  does 
not  assume  that  the  flux  is  symmetrical  and  is  intended  for  the  simulation  of  asymmetrical  film 
growth  off-axis.  Both  extensions  provide  a  3D  surface  of  the  deposited  film  and  the  microstructure 
and  profile  of  2D  slices  through  the  via  or  contact.  In  order  to  determine  the  accuracy  of  the  two 
models,  simulations  were  compared  to  W  films  sputter  deposited  over  contacts  having  an  aspect 
ratio  of  3:1.  The  models  were  tested  for  a  substrate  placed  directly  under  the  center  of  the  target.  The 
interpolated-3D  model  was  also  tested  for  asymmetrical  fluxes  by  placing  the  substrate  off-axis  by 
2.2,  5.0,  and  8.1  cm.  The  comparisons  with  the  experimental  results  indicate  that  both  the  surface 
profiles  and  the  microstructure  of  the  deposited  films  were  predicted  well  by  the  two  models.  The 
interpolated-3D model was very successful at predicting deposition profiles for off-axis deposition
flux.  ©  1996  American  Vacuum  Society. 


I.  INTRODUCTION 

The  trend  in  the  microelectronics  industry  is  for  a  contin¬ 
ued  reduction  in  device  feature  sizes  in  order  to  increase  the 
level  of  integration  and  the  device  performance.^  This  has  led 
to  the  requirement  of  diffusion  barrier  layers  to  reduce  ma¬ 
terial  interactions  leading  to,  for  example,  shallow  junction 
spiking.  The  deposition  of  barrier  layers  is  usually  done  by 
sputtering,  which  is  a  well  established  process  in  the  micro¬ 
electronics  industry,  for  a  wide  variety  of  materials.^ 

A  number  of  factors  determine  the  effectiveness  of  a  bar¬ 
rier  layer.  It  is  important  that  the  deposited  film  be  as  uni¬ 
formly  deposited  as  possible  over  the  topography  to  be 
coated.  For  sputtering  processes,  the  uniformity  of  the  film  is 
primarily  determined  by  the  shadowing  of  the  adatom  flux  by 
the  initial  topography  and  the  depositing  film.  The  final  film 
profile  is  therefore  strongly  determined  by  the  angular  ada¬ 
tom  flux  distribution.  In  addition  to  the  surface  profile  of  the 
film,  the  internal  microstructure  of  the  film  plays  a  large  role 
in  determining  the  effectiveness  of  the  diffusion  barrier. 
Sputtered  thin  films  exhibit  a  particular  microstructure, ^ 
which  is  determined  to  a  large  extent  by  the  melting  point  of 
the  material  being  deposited,  the  angular  distribution  of  the 
adatom flux, and the orientation of this flux to the local surface
normal. This microstructure is important in determining
a  variety  of  film  properties,  including  local  density  and  dif¬ 
fusion  barrier  permeability. 

Traditionally,  the  development  of  sputtering  processes  has 
been  dominated  by  experimental  optimization  of  the  film 
properties.  However,  due  to  increased  complexity  of  the  in¬ 
tegrated  circuit  structures  and  the  tighter  tolerances,  this 
method  is  becoming  prohibitively  expensive  and  time  con¬ 
suming.  It  is  therefore  important  to  develop  process  simula¬ 
tors  to  model  and  help  optimize  these  processes.  Previous 
modeling  can  be  split  into  two  distinct  groups:  string  algo¬ 
rithm  models  that  represent  the  surface  as  a  connected  set  of 
line  segments'^ and  assume  the  internal  film  to  be  homoge¬ 
neous,  and  microstructural  models  which  provide  a  depiction 
of  the  internal  film  structure  in  addition  to  the  film  profile.^’^ 

Predictions  from  the  reported  models  have  been  exten¬ 
sively  compared  to  scanning  electron  micrographs  (SEM)  of 
cross  sections  of  films  deposited  over  trenches.  A  large  num¬ 
ber  of  physical  effects  have  been  incorporated,  including 
nonunity  sticking  coefficients,  ion  bombardment  effects,  and 
high  temperature  effects.^’^’^^  In  addition,  Monte  Carlo  flux 
transport  models  have  been  coupled  with  the  growth  simu¬ 
lator  to  study  the  effects  of  sputter  geometries,  deposition 
conditions,  and  the  use  of  collimators.^ 

Fig. 1. Incident sputter flux distribution at various angles off the central axis
of the target.

The modeling of sputter deposition of refractory barrier
layers has primarily been two dimensional (2D) and consequently
limited to deposition over long features such as
trenches.  In  this  situation  the  three-dimensional  (3D)  flux  can 
be  projected  onto  a  2D  plane  and  the  growth  simulation  can 
be  done  in  2D.  A  number  of  situations  arise  that  are  inher¬ 
ently  3D.  These  include  deposition  into  vias/contacts  and 
off-axis  deposition.  For  these  cases  both  the  film  surface  and 
the  flux  incident  on  this  surface  must  be  represented  in  three 
dimensions.  At  present  there  has  been  some  initial  work  on 
3D  film  growth/^”^^  however,  important  3D  effects  have  of¬ 
ten  been  neglected.  For  example,  the  effect  of  the  asymmetri¬ 
cal  sputter  flux  distribution  present  at  the  wafer  edge  needs 
to be addressed (see Fig. 1). This asymmetrical flux distribution
will cause asymmetries in the film profile and also affect
the local microstructure of the film. Both of these effects can
seriously  affect  the  reliability  of  the  metallization  at  the  edge 
of  the  wafer  and  must,  therefore,  be  minimized.  In  order  to 
simulate  3D  film  growth  of  thin  films  over  very  large  scale 
integration  (VLSI)  topography  it  is  necessary  to  have  a  3D 
angular  flux  distribution  and  3D  model  of  the  flux  shadowing 
due  to  the  topography. 

In  general,  it  might  be  felt  that  a  full  3D  representation  of 
the  film  is  also  needed,  but  this  is  often  very  inefficient  and 
unnecessary. For example, to represent a $(2\ \mu\mathrm{m})^3$ structure
with 10 nm resolution requires $\sim 10^7$ elements, resulting in
hundreds  of  megabytes  of  memory  usage  and  a  correspond¬ 
ingly  long  simulation  time.  Furthermore,  in  order  to  quanti¬ 
tatively  interpret  the  film  depiction,  selected  2D  slices  are 
usually  taken  through  the  structure,  thereby  ignoring  most  of 
the  3D  representation.  If  possible,  it  is  far  more  efficient  to 
represent  the  film  just  in  those  selected  2D  slices  provided 
the  3D  effects  discussed  above  can  still  be  accounted  for. 
(For example, the same scenario above would require
only $4\times10^4$ elements per 2D slice.)
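
The element counts quoted above follow from simple arithmetic, which a short sketch can make explicit. The helper name `grid_elements` is hypothetical and is not part of SIMBAD; it only counts cells for a region 2 μm on a side sampled at 10 nm.

```python
def grid_elements(side_um: float, resolution_nm: float, dims: int) -> int:
    """Count the cells needed to tile a square (dims=2) or cubic (dims=3) region."""
    cells_per_side = round(side_um * 1000.0 / resolution_nm)
    return cells_per_side ** dims


full_3d = grid_elements(2.0, 10.0, dims=3)    # 200**3 cells for the full volume
one_slice = grid_elements(2.0, 10.0, dims=2)  # 200**2 cells for a single 2D slice

print(full_3d, one_slice, full_3d // one_slice)  # prints: 8000000 40000 200
```

A single slice is therefore about 200 times smaller than the full volume at this resolution, so even a handful of slices stays far below the cost of a full 3D grid.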

This  article  describes  two  such  algorithms  for  providing 
important  3D  information  (including  internal  microstructure) 
using  efficient  2D  film  representations.  To  accomplish  this, 
two  programs  are  used.  SIMSPUD  (Simulation  of  sputter  flux 
distribution) is a three-dimensional Monte Carlo simulation
of the transport of the sputtered material to the substrate.
This  program  is  used  to  obtain  the  3D  spatial  and  angular 
distributions  of  the  depositing  sputter  flux.  SIMBAD  (simula¬ 
tion  by  ballistic  deposition)  is  a  2D  thin  film  growth  simula¬ 
tor  used  for  the  simulation  of  the  deposition  of  the  metals  by 
sputtering,  bias  sputtering,  and  chemical  vapor  deposition. 
As  discussed  above,  two  3D  extensions  for  SIMBAD  will  be 
presented.  The  first  extension  (quasi-3D)  exploits  cylindrical 
symmetry  to  reduce  the  system  dimension  to  two.  If  the  flux 
distribution  and  substrate  topography  have  this  symmetry 
(for example, a round contact or via hole), then a 3D position
$(\rho,\phi,z)$ in cylindrical coordinates can be reduced to a 2D one
$(\rho,z)$ suitable for use in SIMBAD. However, when the substrate
feature or flux distribution is not cylindrically symmetrical
(for example, inside a damascene structure or far
from  the  axis  of  deposition),  the  quasi-3D  approach  is  no 
longer  valid.  Under  these  conditions,  the  more  general 
SIMBAD  extension  (interpolated-3D)  can  be  used. 

The  interpolated-3D  algorithm  uses  a  small  number  of 
vertical  slices  at  different  angles  to  represent  the  key  views 
of  the  3D  features.  Between  these  slices,  a  smooth  interpo¬ 
lation  of  the  film  surface  is  used  to  properly  account  for  3D 
shadowing  effects. 

Both  of  these  SIMBAD  extensions  are  described  in  greater 
detail  below.  In  addition,  predictions  from  both  models  will 
be  compared  to  scanning  electron  micrograph  (SEM)  cross 
sections  of  W  films  deposited  over  circular  contact  cuts. 

II.  MODELING 

A. Sputter flux transport: SIMSPUD

SIMSPUD  is  a  three-dimensional  Monte  Carlo  vapor  trans¬ 
port  simulation.  In  this  model,  individual  sputter  flux  par¬ 
ticles  are  followed  from  the  time  they  are  emitted  from  the 
target,  through  collisions  in  the  sputter  gas,  to  their  final 
deposition onto the substrate. The position, angle, and energy
of each particle are tracked in order to collect the statistics on
the corresponding sputter distribution. Particles are emitted
from the target at a random position which reflects the erosion
profile of the sputter target. The emission angular distribution
can be input from experimental results or, alternatively,
a cosine distribution can be used. A hard sphere
scattering  model  gives  scattering  angles,  but  an  energy- 
dependent  collision  cross  section  is  used  to  determine  colli¬ 
sion  events.  Both  2D  and  3D  statistics  are  collected  for  any 
specified  region  of  the  substrate. 
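
Two of the sampling steps described above can be sketched in a few lines. This is only an illustration under stated assumptions, not the SIMSPUD implementation: the function names are hypothetical, the target normal is assumed to point toward the substrate along -z, and a single fixed mean free path stands in for the energy-dependent collision cross section.

```python
import math
import random


def sample_cosine_direction(rng):
    """Draw a unit emission vector from a cosine-law angular distribution.

    theta is the polar angle from the target normal (assumed along -z).
    """
    # For a cosine law the cumulative distribution of theta is sin^2(theta),
    # so inverting a uniform deviate gives theta = asin(sqrt(u)).
    theta = math.asin(math.sqrt(rng.random()))
    phi = 2.0 * math.pi * rng.random()
    s = math.sin(theta)
    return (s * math.cos(phi), s * math.sin(phi), -math.cos(theta))


def sample_free_path(mean_free_path, rng):
    """Distance to the next gas collision, assuming exponential free paths."""
    return -mean_free_path * math.log(1.0 - rng.random())


rng = random.Random(0)
directions = [sample_cosine_direction(rng) for _ in range(20000)]
paths = [sample_free_path(1.0, rng) for _ in range(20000)]

# For a cosine law the mean of cos(theta) should be close to 2/3.
mean_cos = sum(-d[2] for d in directions) / len(directions)
mean_path = sum(paths) / len(paths)
print(mean_cos, mean_path)
```

In a fuller model the mean free path would be recomputed from the particle energy before each step, and the hard sphere model would supply the post-collision direction.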

B. Film growth: SIMBAD

1.  Two-dimensional  simulation 

Fig. 2. (a) Schematic drawing of a standard two-dimensional SIMBAD simulation of a trench and (b) a quasi-3D SIMBAD simulation of a round contact assuming
rotational symmetry.

The basic SIMBAD program is a two-dimensional Monte
Carlo simulator developed to model the growth and microstructure
of thin films deposited over topography. The
basic model accounts for the effects of vapor flux shadowing
and surface diffusion by the adatoms. These are the most
important factors in determining the step coverage and the
columnar microstructure at low to moderate temperatures. In
SIMBAD, the film is represented by an aggregation of $10^4$-$10^5$
identical hard disks. Each of these disks represents the statistically
averaged behavior of a large number of film atoms
(typically  1000  per  disk).  These  disks  are  serially  launched 
from  just  above  the  growing  film  and  follow  straight  line 
trajectories  until  they  strike  the  substrate  or  the  film.  The 
disks  aggregate  into  the  film  after  a  short  surface  diffusion  to 
minimize  a  surface  curvature-dependent  chemical  potential. 
The  combination  of  ballistic  shadowing  and  short  range  sur¬ 
face  diffusion  successfully  accounts  for  the  formation  of  the 
columnar  thin  film  microstructure  characteristic  of  refractory 
metals  at  typical  deposition  temperatures. 

The  angular  distribution  computed  by  SIMSPUD  for  sputter 
flux  arriving  at  the  substrate  is  used  by  SIMBAD  to  generate 
the initial disk trajectories. The interaction between this distribution
and the substrate topography is largely what determines
film coverage. Without extensions, the basic 2D
SIMBAD model is suitable for simulating deposition in a long
feature such as a trench [see Fig. 2(a)]. This first requires
the projection of the 3D flux distribution from SIMSPUD into
a plane perpendicular to the trench.
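
The projection of a 3D arrival direction into the plane perpendicular to the trench can be illustrated as follows. The conventions are assumptions made for this sketch and are not taken from the SIMBAD source: the trench runs along y, the polar angle θ is measured from the substrate normal (z), and the azimuth φ is measured from the x axis. The function name is hypothetical.

```python
import math


def projected_angle_deg(theta_deg: float, phi_deg: float) -> float:
    """Angle from the substrate normal after projecting onto the x-z plane.

    Equivalent to tan(theta_projected) = tan(theta) * cos(phi).
    """
    theta = math.radians(theta_deg)
    phi = math.radians(phi_deg)
    x = math.sin(theta) * math.cos(phi)  # component across the trench
    z = math.cos(theta)                  # component along the normal
    return math.degrees(math.atan2(x, z))


in_plane = projected_angle_deg(30.0, 0.0)      # already in the x-z plane
along_trench = projected_angle_deg(30.0, 90.0)  # travels along the trench
oblique = projected_angle_deg(45.0, 60.0)
print(in_plane, along_trench, oblique)
```

Flux arriving along the trench direction projects to normal incidence, which is why a long trench sees a narrower effective angular distribution than a via does.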

2.  Quasi-three-dimensional  simulations 

The simulation of a circular via or contact positioned under
the center of a cylindrically symmetrical target can be performed
by exploiting cylindrical symmetry (quasi-3D simulation).
As described above, positions and trajectories in 3D
coordinates $(\rho,\phi,z)$ can be mapped back to the 2D representation
through a simple rotational projection: $(\rho,z)\rightarrow(x,z)$.
Consequently, the cross section of any slice through the contact
center will be essentially identical. This is depicted in
Fig. 2(b). An additional consideration is mass conservation
during radial motion. For example, the effect of any particle
impinging at radius $\rho$ must be divided over the area it represents,
$\rho\,d\rho\,d\phi$. This leads to a $1/\rho$ weighting factor, which,
for a discrete model like SIMBAD, is implemented by adding
disks with a probability proportional to $1/\rho$. The singularity
at $\rho=0$ is handled by truncating the probability to 1 at
some small $\rho=\rho_0$. This leaves a small notch of width $2\rho_0$ in
the center of quasi-3D simulations which can be minimized
at the expense of simulation efficiency.
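
A minimal sketch of the radial weighting described above is given here, assuming the weighting is applied as an accept/reject test on each candidate disk. The function names are hypothetical and the actual SIMBAD implementation may differ.

```python
import random


def acceptance_probability(rho: float, rho0: float) -> float:
    """Probability of adding a disk at radius rho.

    Proportional to 1/rho, truncated to 1 for |rho| <= rho0.
    """
    if rho0 <= 0.0:
        raise ValueError("rho0 must be positive")
    r = abs(rho)
    return 1.0 if r <= rho0 else rho0 / r


def accept_disk(rho: float, rho0: float, rng: random.Random) -> bool:
    """Accept or reject a candidate disk according to the radial weighting."""
    return rng.random() < acceptance_probability(rho, rho0)


rng = random.Random(1)
trials = 20000
accepted = sum(accept_disk(0.4, 0.1, rng) for _ in range(trials))
print(accepted / trials)  # should be close to 0.25
```

Reducing `rho0` narrows the central notch but lowers the acceptance probability at every radius outside it, so more candidate disks are discarded. This is the efficiency cost mentioned in the text.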

Another limitation of the algorithm is that surface diffusion
proceeds in 2D due to the $1/\rho$ weighting limitations.
However, for the relatively small diffusion characteristic of
refractory  materials,  this  is  not  a  serious  deficiency.  The 
main  limitation  of  quasi-3D  is,  of  course,  that  often  the  cy¬ 
lindrical  symmetry  is  broken  by,  for  example,  deposition 
near  the  wafer  edge  where  the  peak  of  the  flux  distribution 
would  be  shifted  toward  the  wafer  center.  In  this  case,  the 
more  general  interpolated-3D  algorithm  would  be  needed. 

3. Interpolated-three-dimensional simulations

As  discussed  above,  a  full  three-dimensional  simulation 
of  the  microstructure  of  a  sputter  deposited  film  would  be 
computationally  taxing  because  of  the  need  to  store  a  great 
deal  of  internal  information.  However,  a  model  which  only 
represents  detailed  internal  information  in  selected  critical 
locations  and  just  stores  the  gross  surface  information  every¬ 
where  else  would  require  much  less  computation  and 
memory.  The  SIMBAD  interpolated-3D  model  takes  this  latter 
approach  and  computes  the  gross  3D  surface  features  by 
smoothly  interpolating  between  a  number  of  high  resolution, 
highly  detailed  2D  slices.  Each  of  these  slices  is  vertical  and 
passes  through  the  center  of  the  simulation.  The  basic  simu¬ 
lation  geometry  for  two  slices  is  shown  in  Fig.  3,  although 
any  number  could  be  used  to  increase  the  interpolation  accu¬ 
racy.  The  interpolated  3D  surface  is  then  used  to  determine 
the  film  shadowing  effects  and  the  3D  profile  of  the  depos¬ 
ited  film.  The  2D  slices  provide  highly  detailed  microstruc- 
tural  depictions  through  representative  sections  of  the  film. 

Fig. 3. A schematic depiction of two 2D slices used for an interpolated 3D
simulation.

Figure 4 shows a simplified cross section of two slices
through the center of the contact, displaced by 90° in angle.
The surface profiles in each slice are divided into an equal
number of parts. As the profiles of the film along these two
slices are not symmetrical, the lengths of these divisions
will not be equal. The interpolated surface between
these two orthogonally displaced slices is shown by curved
lines. Figure 4 shows the linear (in cylindrical coordinates)
interpolation between two corresponding points, one on each
of the two slices. The aim is to achieve a smooth transition
from point $P_1$ to $P_2$. This was done by dividing the angle
between these two points (which is 90° in this case) in the
x-y plane into a number of equal angles, each denoted by
$\phi$, as shown in the figure. The distance $\rho$ (as shown in the
figure) is then calculated by using the relation

$$\rho = y_1 - y_1\,\frac{\phi}{90^\circ} + x_2\,\frac{\phi}{90^\circ}. \tag{1}$$

The height of the surface at the position r is then calculated
using

$$z = z_1 - z_1\,\frac{\phi}{90^\circ} + z_2\,\frac{\phi}{90^\circ}. \tag{2}$$
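
The linear interpolation in cylindrical coordinates described above can be sketched as follows. The function name and the `(rho, z)` tuple layout are assumptions made for illustration; `p1` and `p2` stand for the corresponding points on the two slices, each given by its radial distance from the central axis and its height.

```python
def interpolate_between_slices(p1, p2, phi_deg, span_deg=90.0):
    """Linearly interpolate (rho, z) between corresponding points on two slices.

    phi_deg is the angle from the first slice; span_deg is the angle
    between the two slices (90 degrees for orthogonal slices).
    """
    if not 0.0 <= phi_deg <= span_deg:
        raise ValueError("phi_deg must lie between the two slices")
    rho1, z1 = p1
    rho2, z2 = p2
    t = phi_deg / span_deg
    rho = rho1 - rho1 * t + rho2 * t
    z = z1 - z1 * t + z2 * t
    return rho, z


start = interpolate_between_slices((1.0, 0.5), (2.0, 1.5), 0.0)
middle = interpolate_between_slices((1.0, 0.5), (2.0, 1.5), 45.0)
end = interpolate_between_slices((1.0, 0.5), (2.0, 1.5), 90.0)
print(start, middle, end)  # prints: (1.0, 0.5) (1.5, 1.0) (2.0, 1.5)
```

Evaluating this at a series of equally spaced angles for every pair of corresponding points yields the grid of quadrilaterals that is subsequently split into triangles.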

Likewise, all the line segments between points $P_1$ and $P_2$ are
interpolated and joined. The interpolation was repeated for
all the points on the two slices to give the interpolated profile
of the deposited film. Each of the quadrilaterals formed was
then divided into triangles as shown. Similar interpolations
were carried out between all the slices to give the cylindrical
profile of the contact as shown in Fig. 8. The interpolated
surface is then used to determine the film shadowing for
subsequent deposition.

Fig. 4. (a) Cross section of the two orthogonal slices through
the center of the contact and (b) 3D interpolation between slices.

Fig. 5. Disk trajectories, shadowing effect and transformation of 3D to 2D
trajectory for 2D SIMBAD deposition.

J. Vac. Sci. Technol. B, Vol. 14, No. 4, Jul/Aug 1996
Sheergar et al.: Simulation of 3D metal step coverage

Fig. 9. SEM photographs of W films deposited over contacts off-axis by (a) x = 2.2 cm (θ = 15°), (b) x = 5.0 cm (θ = 30°), and (c) x = 8.1 cm (θ = 45°).
Corresponding SIMBAD simulations (d)-(f) are shown to the right of each SEM.

The simulation of the growth of the surface involves the
generation of an initial 3D surface from the 2D slices of the
initial topography. A number of disks are then deposited onto
the 3D surface using a 3D flux distribution. The trajectories
of these disks are followed until they collide with the surface.
If the trajectory strikes the 3D surface at a 2D slice, the
disk is deposited into that slice. Figure 5 is the depiction of
the trajectories of disks launched from points above the substrate.
$T_1$ and $T_2$ are two such trajectories. Because $T_1$ falls
outside the boundary of the slice (5% of the total
width), it is discarded. On the other hand, $T_2$ falls
within the boundary of the slice and is therefore accepted.
The trajectory $T_2$, which is a 3D trajectory, is then projected
onto the 2D slice in the y-z plane as $T_{20}$ and deposited using the
simple 2D SIMBAD model. Using this procedure a number of
disks are deposited into the 2D slices, and then a new, updated
3D surface is generated. In order to maintain accuracy
the 3D surface must be updated frequently with respect to
the film growth.
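
The accept-and-project step for a single disk trajectory might look like the sketch below. Several details are assumptions made for illustration only: the slice is taken to lie in the y-z plane (x = 0), the "5% of the total width" is interpreted as the full thickness of the acceptance slab around that plane, and the function name is hypothetical.

```python
import math


def accept_and_project(hit_point, direction, width, fraction=0.05):
    """Return the 2D (y, z) unit direction if the impact lies within the slice.

    hit_point and direction are (x, y, z) tuples. Returns None when the
    impact falls outside the slab of thickness fraction * width centered
    on the slice plane x = 0, or when the direction has no in-plane part.
    """
    x, _, _ = hit_point
    if abs(x) > 0.5 * fraction * width:
        return None  # trajectory misses the slice and is discarded
    _, dy, dz = direction
    norm = math.hypot(dy, dz)
    if norm == 0.0:
        return None  # direction is purely along x, nothing to project
    return (dy / norm, dz / norm)


rejected = accept_and_project((0.30, 0.0, 0.0), (0.0, 0.0, -1.0), width=1.0)
projected = accept_and_project((0.01, 0.2, 0.0), (0.6, 0.0, -0.8), width=1.0)
print(rejected, projected)
```

Accepted disks would then be handed to the ordinary 2D deposition step of the slice, and the interpolated 3D surface regenerated after a batch of such deposits.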

The accuracy of the simulated profile also depends on the
number of slices considered. The higher the number of
slices, the smaller the region to be interpolated and the higher
the accuracy of the simulated film. The simulation
time is linearly proportional to the number of slices. It will,
however, still be much faster than a fully 3D model. A typical
film growth simulation on a Sun Sparc 20 runs in under
20 min.

Fig. 10. SEM (a) and corresponding simulation (b) of deposition on the y axis (at right angles to the off-axis flux) for the case of x = 5.0 cm (θ = 30°).

The model does have a number of limitations. It does not
incorporate diffusion in the tangential direction, so it can be
expected to be strictly correct only for small surface diffusion
lengths. The sticking coefficient is assumed to be 1.0, and
this will limit the type of processes and materials that can be
simulated. Lastly, the simple interpolation scheme for the 3D
surface limits the initial topography to cylindrical or elliptical
geometries.

III.  RESULTS 

To  determine  the  accuracy  of  the  simulations,  calculated 
results  were  compared  to  experimental  films.  High  purity 
tungsten  was  dc  sputtered  from  a  7.5-cm-diam  target  using  a 
planar  magnetron  source  at  100  W  of  constant  power  with 
nominally  -325  V  target  bias.  The  deposition  pressure  was 
11  mTorr  of  argon.  The  base  pressure  was  about  5X10“^ 
Torr.  The  substrate  profile  consisted  of  square  contacts  in 
sizes from 0.8 to 2.0 μm wide and with aspect ratios of up to
3  formed  in  thermal  oxide  on  silicon  wafers.  The  substrates 
were  placed  at  0,  2.2,  5.0,  and  8.1  cm  away  from  the  central 
axis  of  the  target.  This  corresponds  to  deposition  at  0°,  15°, 
30°,  and  45°  respectively,  for  a  target  to  substrate  plane  dis¬ 
tance  of  8.1  cm.  After  the  deposition,  the  wafers  were 
cleaved  through  the  center  of  the  contacts  in  two  orthogonal 
directions  for  SEM  cross-section  analyses. 
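
The relation between off-axis distance and deposition angle can be checked with a few lines of trigonometry, assuming the angle is measured at the target center between the central axis and the line to the substrate position. The function name is hypothetical.

```python
import math


def off_axis_angle_deg(offset_cm: float, target_distance_cm: float = 8.1) -> float:
    """Angle between the target axis and the line from target center to substrate."""
    return math.degrees(math.atan2(offset_cm, target_distance_cm))


angles = {offset: off_axis_angle_deg(offset) for offset in (0.0, 2.2, 5.0, 8.1)}
for offset, angle in angles.items():
    print(f"{offset:4.1f} cm -> {angle:5.1f} deg")
```

Under this assumption the offsets of 2.2 and 8.1 cm give roughly 15° and 45°, while 5.0 cm gives a little under 32°, so the quoted 30° is probably best read as a nominal value.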

For the simulations, 3D angular sputter flux distribution
statistics were collected using SIMSPUD for the situations described
above. This information was then input to the appropriate
3D version of SIMBAD to simulate the deposition of
films  over  a  contact.  Figure  6  depicts  two  of  the  3D  angular 
distributions  for  the  W  flux  from  a  magnetron  source  and 
collected  at  0°  and  45°  positions  using  SIMSPUD.  As  ex¬ 
pected,  the  flux  collected  at  the  point  directly  below  the  tar¬ 
get  is  radially  symmetric,  showing  a  ring  of  flux  at  constant 
axial  angle  determined  by  the  position  of  the  race  track  on 
the target. The off-axis position displays a significantly
asymmetric flux due to the small size of the target. This target
was  chosen  as  its  small  size  would  emphasize  the  effect  we 
are  trying  to  model. 

Figure  7(a)  depicts  the  SEM  cross-section  micrograph  of 
a  film  deposited  over  contacts  placed  directly  under  the  cen¬ 
ter  of  the  target.  The  poor  film  coverage  on  the  bottom  and 
side  walls  is  a  result  of  the  high  aspect  ratio.  The  film  cov¬ 
erage  is  symmetrical  (left  and  right)  for  both  the  bottom  and 
the  sidewalls,  as  expected.  Figures  7(b)  and  7(c),  respec¬ 
tively,  present  the  quasi-3D  and  interpolated  3D  profiles  of 
the  film  obtained  from  SIMBAD.  As  the  angular  distribution  of 
material  flux  is  radially  symmetrical  directly  under  the  center 
of  the  target,  the  quasi-3D  and  the  interpolated  3D  simula¬ 
tions  are  more  or  less  identical. 

Figure  8  shows  a  three-dimensional  view  of  the  surface  of 
the  films  deposited  at  an  angle  of  30°  over  contacts  for  two 
different  thicknesses.  These  two  figures  show  the  interpo¬ 
lated  3D  surface  used  in  the  simulation  for  the  flux  shadow¬ 
ing.  The  interpolated  surface  can  be  seen  to  be  smooth  and 
well  behaved. 

Figures 9(a)-9(c) are the SEM cross-section micrographs
of the off-axis films deposited on wafers placed at 2.2, 5.0,
and 8.1 cm from the central axis of the target, corresponding
to 15°, 30°, and 45°, respectively. The poor step coverage
observed in Fig. 7 is again apparent; however, the nonuniformity
of the bottom and sidewall coverage of the contacts is
also  clearly  visible.  The  farther  the  wafer  is  from  the  central 
axis  of  the  target,  the  higher  the  degradation  in  the  bottom 
and  sidewall  coverage  of  the  contacts.  The  nominal  film 
thickness  also  tapers  off  as  the  substrate  is  moved  farther 
away from the axis of the target. Figures 9(d)-9(f) are the
corresponding  interpolated  3D  computer  simulations.  The 
film  profiles  predicted  using  this  model  very  closely  re¬ 
semble  the  experimental  ones.  This  is  so  for  the  microstruc¬ 
ture  as  well  as  the  film  profiles.  (The  SEM  image  appears 
more  dense  along  the  sidewalls,  but  this  is  partly  an  artifact 
of  the  finite  depth  of  focus  of  the  SEM.  The  SIMBAD  images 
are  thin  slices.) 

Figure  10  shows  the  SEM  cross-section  micrograph  for 
the  perpendicular  slice  (along  the  y  axis)  of  the  film  depos¬ 
ited  on  wafers  placed  at  5.0  cm  from  the  target  axis  (30°)  and 
the corresponding 3D interpolated simulation. Here, the film
coverage at the bottom and the side walls is more uniform,
indicating  a  symmetrical  flux  distribution  in  this  plane.  Once 
again,  the  3D  interpolated  simulation  closely  predicts  the 
profile  and  microstructure  of  the  experimental  film. 

IV.  CONCLUSIONS 

In  this  article,  we  present  a  simulation  package  for  the 
efficient  prediction  of  3D  sputtered  film  growth  over  VLSI 
topography.  Simulation  of  the  3D  flux  transport  from  the 
target  to  the  substrate  is  performed  using  the  Monte  Carlo 
simulator,  SIMSPUD.  Two  models  of  3D  film  growth  are  in¬ 
vestigated.  The  first  assumes  cylindrical  symmetry  of  the 
adatom  flux  and  substrate  topography  and  is  useful  for  pre¬ 
dicting  film  coverage  and  microstructure  in  the  region  of  the 
wafer  close  to  the  central  axis  of  the  sputter  system.  The 
second  model  is  more  general  and  accommodates  an  asym¬ 
metrical  sputter  flux  in  order  to  predict  asymmetrical  film 
growth.  This  second  model  would  be  particularly  useful  for 
simulation  of  asymmetrical  film  coverage  over  vias  and  con¬ 
tacts  near  wafer  edges. 

Both film growth models are based on the 2D simulation
package, SIMBAD. As SIMBAD produces microstructural information
as well as film profile evolution, the two models produce
microstructural depictions of the simulated film through
specific 2D slices. In order to determine the usefulness and
accuracy of the two models, W films were deposited over
contact cuts in SiO2 and cross-sectional SEMs were compared
to simulation predictions. The models were found to
very  closely  predict  both  the  film  profile  and  the  internal 
microstructure  for  films  directly  under  the  target  and  also  for 
films  located  well  away  from  the  central  axis  of  the  sputter 
system.  These  results  establish  the  utility  of  these  simulation 
techniques  for  designing  and  optimizing  sputtering  systems 
for future, more demanding integrated circuit metallization.

^1 P. Singer, Semicond. Int. 17, 57 (1994).

^2 R. Burggraaf, Semicond. Int., 28 (1990).

^3 J. A. Thornton, J. Vac. Sci. Technol. A 4, 3059 (1986).

^4 A. R. Neureuther and E. W. Scheckler, IEEE Trans. Comput.-Aided Design
Integ. Circuits Sys. 13, 219 (1994).

^5 J. McVittie, J. Rey, L.-Y. Cheng, A. Bariya, S. Ravi, and K. Saraswat,
Extended Abstracts, TECHCON '90 (Semiconductor Research Corporation,
San Jose, 1990), p. 16.

^6 T. S. Cale and G. B. Raupp, J. Vac. Sci. Technol. B 8, 1242 (1990).

^7 D. Liu, S. K. Dew, M. J. Brett, T. Janacek, T. Smy, and W. Tsai, J. Appl.
Phys. 74, 1339 (1993).

^8 L. J. Friedrich, S. K. Dew, M. Brett, and T. Smy, Thin Solid Films 266, 83
(1995).

^9 S. K. Dew, T. Smy, R. N. Tait, and M. J. Brett, J. Vac. Sci. Technol. A 9,
519 (1991).

^10 S. K. Dew, T. Smy, and M. J. Brett, IEEE Trans. Electron Devices 39,
1599 (1992).

^11 S. K. Dew, T. Smy, and M. J. Brett, Jpn. J. Appl. Phys. 33, 1140 (1994).

^12 F. H. Baumann, R. Liu, C. B. Case, and W. Y.-C. Lai, in Proceedings of
the International Electron Devices Meeting '93 (IEEE, New York, 1993),
p. 861.

^13 F. H. Baumann and G. H. Gilmer, in Proceedings of the International
Electron Devices Meeting '95 (IEEE, New York, 1995), p. 89.

^14 D. S. Bang, J. P. McVittie, K. C. Saraswat, Z. Krivokapic, J. A. Iacoponi,
and J. Gray, in Proceedings of the International Electron Devices Meeting
'95 (IEEE, New York, 1995), p. 97.

^15 Hsiau, E. C. Kan, J. P. McVittie, and R. W. Dutton, in Proceedings of the
International Electron Devices Meeting '95 (IEEE, New York, 1995), p.
101.

^16 S. Tazawa, F. Leon, G. Anderson, T. Abe, K. Saito, A. Yoshii, and D.
Scharfetter, in Proceedings of the International Electron Devices Meeting
'92 (IEEE, New York, 1992), p. 173.

^17 Yamada, T. Shinmura, Y. Yamada, and T. Ohta, in Proceedings of the
International Electron Devices Meeting '94 (IEEE, New York, 1994), p.
553.

^18 R. N. Tait, S. K. Dew, T. Smy, and M. J. Brett, J. Appl. Phys. 70, 4295
(1991).

J. Vac. Sci. Technol. B, Vol. 14, No. 4, Jul/Aug 1996


Liner  conformality  in  ionized  magnetron  sputter  metal 
deposition  processes 

S. Hamaguchi and S. M. Rossnagel

IBM Thomas J. Watson Research Center, P.O. Box 218, Yorktown Heights, New York 10598
(Received  9  November  1995;  accepted  19  April  1996) 

The  conformality  of  thin  metal  films  (liners)  formed  on  high-aspect-ratio  trench  structures  in  ionized 
magnetron  sputter  deposition  processes  is  studied  numerically  and  experimentally.  The  numerical 
simulator (SHADE) used to predict the surface topography is based on the shock-tracking method for
surface  evolution.  The  simulation  results  are  in  good  agreement  with  experimentally  observed 
thin-film  topography.  It  is  shown  that  combination  of  direct  deposition  and  trench-bottom 
resputtering  results  in  good  conformality  of  step  coverages  and  the  amount  of  the  resputtering 
needed for the good conformality is almost independent of trench aspect ratios. © 1996 American
Vacuum Society.


I.  INTRODUCTION 

The  reliable  formation  of  densely  packed  interconnect 
metal  lines  with  low  resistance  is  a  challenging  task  in 
computer-chip  manufacturing.  The  most  advanced  technolo¬ 
gies  currently  under  development  require  filling  of  intercon¬ 
nect  lines  with  the  aspect  ratio  (i.e.,  ratio  of  depth  to  lateral 
dimension)  as  high  as  three.  Conventional  deposition  meth¬ 
ods  such  as  chemical  vapor  deposition  (CVD)*  and  colli¬ 
mated  magnetron  sputter  deposition^""^  may  not  provide  an 
effective  means  to  fill  such  high-aspect-ratio  trenches  and 
vias  in  manufacturing  environments. 

Prior  to  the  metal  filling  process  into  dielectric  trenches 
and  vias,  lining  (i.e.,  deposition  of  a  thin  metal  layer)  is  often 
employed  to  provide  adhesion  layers  or  diffusion  barriers  for 
materials  to  be  deposited  in  the  subsequent  processes.  Such  a 
thin  metal  film  (liner)  must  be  sufficiently  conformal,  i.e., 
uniform  in  thickness  on  sidewalls  and  bottoms  of  trenches 
(or  vias),  so  that  the  adhesion  or  diffusion-barrier  character¬ 
istics  are  also  uniform  on  the  surface. 

Depositing  conformal  liners  into  high-aspect-ratio 
trenches  and  vias  is  also  a  difficult  task  in  low-temperature 
processes.  For  metal  species  to  reach  the  bottom,  the  metal 
flux  incident  upon  the  wafer  surface  must  be  anisotropic.  An 
anisotropic  flux,  however,  does  not  deposit  the  material  onto 
vertical  sidewalls  directly.  Weaker  anisotropy  of  the  incom¬ 
ing  flux,  on  the  other  hand,  may  deposit  the  material  on  the 
sidewalls  but  tends  to  deposit  more  near  the  top  edges  of  the 
trench,  resulting  in  nonconformal  deposition.  This  effect  is 
routinely  observed  with  collimated  sputtering  of  Ti  and  TiN 
diffusion  barriers."^ 

The ionized magnetron sputter deposition process⁵,⁶ can
circumvent this dilemma by using its ability to resputter the
deposited metal from the bottom of the trench. The ionized
magnetron sputter deposition system is schematically depicted
in Fig. 1. It combines a conventional magnetron sputter system
with an inductively coupled plasma generator. The metal atoms
are sputtered from the magnetron cathode and then ionized near
the center of the chamber by the high-density Ar plasma
generated by the induction coil. The bias voltage on the wafer
then controls the directionality and bombardment energies of
the metal and Ar ion fluxes incident upon the wafer surface.

If the ion bombardment energy is sufficiently high, the
deposited metal on the bottom of the trench may be resputtered
and then redeposited on the sidewalls. By controlling
the bias voltage on the wafer, one can adjust the amount of
resputtering to form a conformal metal liner over the trench
structure.

II.  EXPERIMENTAL  SETUP  AND  SIMULATION 
MODEL 

As shown in Fig. 1, the experimental system essentially
consists of the magnetron and a multiple-turn radio frequency
induction (RFI) coil. The magnetron power may typically be
varied up to 10 kW and 30 kW on 200 mm and 300 mm
target cathodes, respectively. The RFI coil can be powered up
to 3-5 kW at 13.56 MHz. The dc-bias voltage applied to the
wafer holder may be varied up to −200 V with substrate currents
in the several-ampere range. In typical experimental conditions,
the Ar pressure is varied from 1 to 35 mTorr. When a large
metal ion flux is desirable, high-pressure (e.g., 35 mTorr)
operation is employed. When the pressure is high, the
sputtered-atom mean free paths decrease and thus the relative
ionization level increases. Since the collision frequency in
the sheath region (about 0.1 mm thick) above the wafer
holder is still small, the increased pressure does not affect the
directionality of ion fluxes.
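
As a rough plausibility check on the sheath argument above, the hard-sphere mean free path from kinetic theory, λ = k_B T / (√2 π d² p), can be compared with the quoted 0.1 mm sheath thickness. The sketch below is an order-of-magnitude estimate only: the gas temperature (300 K) and the effective Ar collision diameter (3.4 Å) are assumed values that do not come from the text, and the function name is hypothetical.

```python
import math

K_B = 1.380649e-23        # Boltzmann constant, J/K
PA_PER_MTORR = 0.133322   # 1 mTorr expressed in Pa

def mean_free_path_m(pressure_mtorr, temperature_k=300.0, diameter_m=3.4e-10):
    """Hard-sphere mean free path in metres.

    temperature_k and diameter_m are assumed values, not taken from the article.
    """
    pressure_pa = pressure_mtorr * PA_PER_MTORR
    return K_B * temperature_k / (
        math.sqrt(2.0) * math.pi * diameter_m ** 2 * pressure_pa
    )

SHEATH_THICKNESS_M = 1.0e-4  # about 0.1 mm, as quoted in the text

for p in (1.0, 35.0):        # pressure range quoted in the text, mTorr
    lam = mean_free_path_m(p)
    print(f"{p:5.1f} mTorr: mean free path ~ {lam * 1e3:.1f} mm "
          f"({lam / SHEATH_THICKNESS_M:.0f} sheath thicknesses)")
```

Under these assumptions the mean free path remains more than ten times the sheath thickness even at 35 mTorr, which is consistent with the claim that ions cross the sheath essentially without collisions.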

The trench structures used for the experiments are typically
less than 1 µm deep and have various aspect ratios. The
trenches are typically made of etched SiO2 on Si. Metals that
can be used for lining in ionized magnetron sputter metal
deposition processes include Ti, TiN, Ta, TaN, Cu, Cr, AlCu,
AlN, W, and WN.

Fig. 1. Schematic diagram of the experimental apparatus.

Fig. 2. Schematic diagram of particle transport.

The numerical simulator used to predict film topography
is called SHADE (shock-tracking algorithm for deposition and
etching). It uses the shock-tracking method⁷ for its surface
evolution algorithm and has been applied to various etch and
deposition processes. For the two-dimensional simulation,
the surface is represented by a sequence of nodes
(string) and the fluxes of Ar ions, metal ions, and neutral
atoms are calculated at each node. Since the nodes are
connected by straight line segments, there are at least two
surface normals at each node: surface normals associated with
left-side and right-side line segments. The shock-tracking
method allows multiple surface normals at each nodal point
in addition to these two surface normals. The multiple
surface normals may be regarded as an approximation of the
corner point (node) by an infinitesimal polygonal surface.
Since the etch rate (or sputtering yield) of the material
surface depends on the angle formed by the incident ion flux
and surface normal, the multiple surface-normal representation
provides an accurate description of surface evolution.
Details of the shock-tracking method may be found in Ref. 7.
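
The statement that each node of the string carries at least two surface normals can be illustrated with a minimal sketch. This is not the SHADE code and does not implement shock tracking; it only computes the left-side and right-side segment normals at each interior node of a polyline. The function names and the sample trench coordinates are hypothetical.

```python
import math

def segment_normal(p, q):
    """Unit normal of the segment p -> q, rotated 90 degrees counterclockwise
    from the tangent. For a surface traversed left to right with the solid
    below, this normal points into the gas phase."""
    tx, ty = q[0] - p[0], q[1] - p[1]
    length = math.hypot(tx, ty)
    return (-ty / length, tx / length)

def node_normals(nodes):
    """Return (left_normal, right_normal) for every interior node."""
    pairs = []
    for i in range(1, len(nodes) - 1):
        left = segment_normal(nodes[i - 1], nodes[i])
        right = segment_normal(nodes[i], nodes[i + 1])
        pairs.append((left, right))
    return pairs

# Hypothetical trench of unit width and unit depth, listed left to right.
trench = [(0.0, 1.0), (1.0, 1.0), (1.0, 0.0), (2.0, 0.0), (2.0, 1.0), (3.0, 1.0)]

for node, (left, right) in zip(trench[1:-1], node_normals(trench)):
    print(node, "left normal:", left, "right normal:", right)
```

At a trench corner the two normals differ by 90 degrees, which is the situation in which a single averaged normal would misrepresent the angle-dependent etch rate.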

Figure 2 schematically shows the transport of neutral and
ion species near the surface. Since the length scales of the
trench are much smaller than the mean free paths,
collisionless transport is a valid assumption for all the gas-phase
species. As indicated in Fig. 2, the metal and Ar ion fluxes
are assumed to be unidirectional and perpendicular to the
substrate top surface. Since the ion temperature measured
in electron volts is much smaller than the bias voltage,
the angular spread of the ions is insignificant. On the
other hand, the neutral metal flux has a large angular
distribution. For the sake of simplicity, we assume that the
incident neutral metal has a uniform angular distribution with
cutoff angle $\theta_c$. For low-pressure operation, where atoms
sputtered from the target cathode in the magnetron region
undergo virtually no collisions, this cutoff angle is
determined by the tool geometry, i.e., the ratio of the distance
between the target and wafer to the radius of the target.
Here we chose $\theta_c = 52°$ based on our tool geometry¹⁰ for the
simulation presented in this article.
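
One simple reading of this geometric statement, offered here only as an assumption, is that a point at the wafer center sees the target out to its edge, so that the tangent of the cutoff angle equals the target radius divided by the target-to-wafer distance. The sketch below uses that relation; the function names and the 150 mm radius are illustrative and are not taken from the article.

```python
import math

def cutoff_angle_deg(target_radius, target_to_wafer):
    """Cutoff angle in degrees, assuming tan(angle) = radius / distance."""
    return math.degrees(math.atan2(target_radius, target_to_wafer))

def spacing_for_cutoff(target_radius, cutoff_deg):
    """Target-to-wafer distance that gives the requested cutoff angle
    under the same assumption."""
    return target_radius / math.tan(math.radians(cutoff_deg))

radius_mm = 150.0  # illustrative radius only
spacing_mm = spacing_for_cutoff(radius_mm, 52.0)
print(f"spacing implied by a 52 degree cutoff: {spacing_mm:.1f} mm")
print(f"round trip: {cutoff_angle_deg(radius_mm, spacing_mm):.1f} degrees")
```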

When an energetic ion hits the surface, it sputters the
surface material with sputtering yield $Y$. The sputtering
yield, which is defined as the number of atoms sputtered from
the surface per incident ion, depends on the angle $\theta$ formed
by the incident flux and surface normal. Figure 3 shows the
angle dependence of the etch rate [which is proportional to
$Y(\theta)\cos\theta$] that is used for the numerical simulation. The
rate is normalized to one at $\theta = 0$. The film topographies
obtained from the numerical simulation with this etch rate
function are in good agreement with experimental observations,
as will be shown later.

Fig. 3. Normalized etch rate as a function of the slope angle
(i.e., angle of ion beam incidence, in degrees) used for the simulation.

Metal atoms sputtered by the incident ions are assumed to
have sufficiently low energies so that they do not resputter
the surface material when they reach other surface sites.
Instead, such atoms and the neutral metal atom flux directly
coming from the plasma source region are adsorbed on the
surface. The probability that an adsorbed atom stays on the
surface indefinitely is called the sticking coefficient $S$. For
Cu, $S = 1$ is known to be a good approximation. Those atoms
that are reemitted (i.e., desorbed or resputtered) from the
surface are assumed to have the cosine angular distribution
about the surface normal:

$$f_\nu = A(|\mathbf{v}|, \mathbf{x})\,\cos^{\nu+1}\theta, \qquad (1)$$

with $\nu = 0$, where $f_\nu$ is the velocity distribution function of
the reemitted atoms at position $\mathbf{x}$ on the boundary curve, and
$\theta$ is the angle between the velocity vector $\mathbf{v}$ and the surface
normal $\mathbf{n}$ [Fig. 4(a)]. If $\nu > 0$ or $-1 < \nu < 0$, the function above
represents over-cosine or under-cosine distributions,
respectively.

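The normal component of the incoming flux at position $\mathbf{X}$
on the surface is related to the velocity distribution function
$f(\mathbf{v}, \mathbf{X})$ as

$$\Gamma^{\rm in}(\mathbf{X}) = \int_{\mathbf{N}\cdot\mathbf{v}<0} \mathbf{N}\cdot\mathbf{v}\, f(\mathbf{v}, \mathbf{X})\, d\mathbf{v},$$

where $\mathbf{N}$ is the unit vector normal to the surface at $\mathbf{X}$, and the
integration is taken over the half velocity space ($\mathbf{N}\cdot\mathbf{v} < 0$).
The outgoing flux is defined in a similar manner with the
integration over the other half space $\mathbf{N}\cdot\mathbf{v} > 0$.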

The normal component of the outgoing flux from the
surface, i.e., the flux of reemitted metal atoms, is the sum of the
desorption flux and resputtering flux:

$$\Gamma^{\rm out}(\mathbf{X}) = (1 - S)\,\Gamma^{\rm in}(\mathbf{X}) + \Gamma^{\rm sp}(\mathbf{X}), \qquad (2)$$

where $\Gamma^{\rm in}(\mathbf{X})$ is the incoming flux of (low-energy) metal
atoms and therefore the first term $(1 - S)\,\Gamma^{\rm in}(\mathbf{X})$ represents the
total desorption flux. The second term $\Gamma^{\rm sp}(\mathbf{X})$ represents the
resputtering flux due to the ion bombardment (etching). The
incoming neutral metal flux may be split into the two
components

$$\Gamma^{\rm in}(\mathbf{X}) = \Gamma_r(\mathbf{X}) + \Gamma_s(\mathbf{X}), \qquad (3)$$

where the first term $\Gamma_r(\mathbf{X})$ represents the incoming neutral
metal flux that originates (is reemitted) from other sites on
the surface and the second term $\Gamma_s(\mathbf{X})$ represents the neutral
metal flux arriving directly from the plasma source. Using
the reemission distribution function Eq. (1), we may relate
the flux $\Gamma_r(\mathbf{X})$ to the outgoing metal atom flux $\Gamma^{\rm out}(\mathbf{x})$
at every position $\mathbf{x}$ on the surface as

f  (4) 

where 


Fig. 4. (a) The coordinate system. (b) The flux ratio $\Gamma_r/\Gamma_0$ (which is
proportional to the film thickness deposited by resputtering) as a function of the
normalized height $h/w$ of the trench wall.


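$$K_\nu(\mathbf{X}, \mathbf{x}) = g(\mathbf{X}, \mathbf{x})\,\frac{(\hat{\mathbf{R}}\cdot\mathbf{N})\,|\hat{\mathbf{R}}\cdot\mathbf{n}|^{\nu+1}}{R} \qquad (5)$$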


is the integration kernel, the coefficient $A_{\nu+1}$ is given by


V 

/2T  -  +  2 
2 


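and $\partial\Omega$ denotes the surface boundary. Note that for the cosine
distribution ($\nu = 0$), $A_1 = \pi/4$. In Eq. (5), the function $g$ is the
visibility factor (i.e., $g = 1$ if the points $\mathbf{X}$ and $\mathbf{x}$ are on the
line of sight, and $g = 0$ otherwise), $\mathbf{R} = \mathbf{x} - \mathbf{X}$, $R = |\mathbf{R}|$,
$\hat{\mathbf{R}} = \mathbf{R}/R$, and $\mathbf{n}$ is the unit vector normal to the surface at $\mathbf{x}$.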

Substituting Eqs. (2) and (3) into Eq. (4) results in an
integral equation for the flux $\Gamma_r(\mathbf{X})$ for given source terms,
namely the resputtering flux $\Gamma^{\rm sp}(\mathbf{X})$ and the direct source flux
$\Gamma_s(\mathbf{X})$. More details of the gas-phase transport, including
the derivation of Eq. (4) and actual expressions for the
two source terms, may be found in Ref. 10.
Fig. 5. Profiles of the deposited films obtained from numerical simulations.
The aspect ratio of each trench is 2.5. The ratio of the ion to neutral flux on
the top surface is 1:1. The sputtering yields at zero incident angle Y(0) are
(a) 0, (b) 0.4, and (c) 1.0.


III.  NUMERICAL  AND  EXPERIMENTAL  RESULTS 


Before proceeding to numerical simulations, we briefly
discuss resputtering on trench bottoms. To estimate the
amount of resputtering from a trench bottom, we first ignore
sidewall metal deposition due to direct neutral fluxes from
the plasma source. In this case, as shown in Fig. 4(a), the
flux $\Gamma_r$ received at height $h$ on the sidewall is related only to
the resputtered outgoing flux $\Gamma_0$ from the bottom through Eq.
(4). The resputtering flux $\Gamma_0$ is assumed to be uniform over
the entire trench bottom. Assuming that the reemission has
the cosine distribution [i.e., $\nu = 0$ in Eq. (1)], we obtain from
Eq. (4),

$$\Gamma_r = \frac{\Gamma_0}{2}\int_0^w \frac{(\hat{\mathbf{R}}\cdot\mathbf{N})\,|\hat{\mathbf{R}}\cdot\mathbf{n}|}{R}\,dx, \qquad (6)$$

where we take the $x$ axis along the bottom surface, and $w$
represents the width of the trench bottom. Figure 4(b) shows
the resputtered flux received on a vertical sidewall as a
function of the ratio $h/w$. Since the deposition rate is proportional
to the flux $\Gamma_r$, Fig. 4(b) also represents the relative thickness
of the deposited thin film on the vertical sidewall.

For an infinitely wide trench (i.e., $w = \infty$, aspect ratio is
0), the magnitude of the resputtered flux becomes independent
of the height $h$ and is given by one half of the flux $\Gamma_0$
leaving the bottom, $\Gamma_r = \Gamma_0/2$. This is not
surprising since the limit of $w \to \infty$ with fixed $h$ is equivalent
to the limit of $h \to 0$ with fixed $w$. (Note that $\Gamma_r/\Gamma_0$ depends only
on the parameter $h/w$.) As is evident in Fig. 4(b), the flux
variation is small for small $h/w$. This result indicates that the
smaller the aspect ratio is, the more uniformly the films are
deposited on the sidewalls.
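
The dependence of the sidewall flux on h/w can be checked numerically. The sketch below assumes the setup described above: a vertical sidewall at x = 0, a flat bottom extending over 0 < x < w that emits a uniform flux with a cosine distribution, and a two-dimensional cosine kernel normalized with a prefactor of 1/2. That prefactor and the closed form it is compared against are derived here under those assumptions rather than quoted from the article; results are in units of the bottom flux.

```python
import math

def sidewall_flux_numeric(h, w, n=20000):
    """Midpoint-rule integration of (1/2) (R^.N) |R^.n| / R over the bottom.

    The receiving point is (0, h) on the wall (normal N = +x); the emitting
    point is (x, 0) on the bottom (normal n = +y); R joins the two points.
    """
    dx = w / n
    total = 0.0
    for k in range(n):
        x = (k + 0.5) * dx
        r = math.hypot(x, h)
        cos_wall = x / r     # R^ . N
        cos_bottom = h / r   # |R^ . n|
        total += cos_wall * cos_bottom / r * dx
    return 0.5 * total

def sidewall_flux_closed(h, w):
    """Closed form of the same integral: (1/2) [1 - s / sqrt(1 + s^2)], s = h/w."""
    s = h / w
    return 0.5 * (1.0 - s / math.sqrt(1.0 + s * s))

for s in (0.1, 0.5, 1.0, 2.5):
    print(f"h/w = {s:4.1f}: numeric = {sidewall_flux_numeric(s, 1.0):.6f}, "
          f"closed form = {sidewall_flux_closed(s, 1.0):.6f}")
```

In this model the flux ratio tends to 1/2 as h/w goes to zero and decreases monotonically with height, in line with the qualitative behavior described for Fig. 4(b).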

In numerical simulations that we shall present momentarily,
the neutral metal flux from the source is also taken into
account. Since metal atoms sputtered from the target are not
generally 100% ionized by the RFI coil, the effect of the
neutral flux cannot be ignored. In typical operations, the ratio
of the metal neutral flux to the metal ion flux is about 1:1.¹¹
As mentioned earlier, the metal neutral flux is assumed to
have an isotropic distribution with a cutoff angle of 52°. The
ion flux is a delta-function beam, perpendicularly incident
upon the horizontal substrate surface.

Fig. 6. The ratio of film thickness at each position [a, b, c, and d indicated
in Fig. 5(a)] to the top film thickness w as a function of the sputtering yield
Y(0) under the conditions used in Fig. 5.

Figure 5 shows profiles of deposited thin metal films (liners)
under three different sputtering yields Y = Y(0) at angle
0 (horizontal surface): (a) Y = 0, (b) 0.4, and (c) 1.0. The
aspect ratio of the initial trench is 2.5 and the sticking
coefficient is assumed to be one. The value Y is actually the
effective sputtering yield, i.e., the number of sputtered atoms
per incident metal ion, regardless of whether the sputtering is
due to metal ions or Ar ions. If sputtering by the Ar ion flux
is negligible, then Y is the Cu-on-Cu sputtering yield.

Figure 6 shows the step coverage (i.e., ratio of the film
thickness at a particular position to the flat-surface film
thickness w) at four different sites indicated in Fig. 5(a) as
functions of the effective sputtering yield Y. The positions a,
b, c are at 1/4, 1/2, and 3/4, respectively, of the trench depth,
and d is at the center of the trench bottom. Since the ion to
neutral metal flux ratio is 1:1, the film is deposited on the
top surface only when Y < 2.0.
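
The threshold for growth on the top surface follows from a simple flux balance on a flat surface: with unit sticking, every arriving neutral and ion deposits one atom, and each ion removes Y atoms. The sketch below encodes that balance; the function name is hypothetical, and it assumes that atoms resputtered from the flat top do not return to it.

```python
def net_top_growth(neutral_flux, ion_flux, sputter_yield, sticking=1.0):
    """Net metal flux retained on a flat horizontal surface (arbitrary units).

    Assumes resputtered atoms leave the flat surface for good.
    """
    deposited = sticking * (neutral_flux + ion_flux)
    resputtered = sputter_yield * ion_flux
    return deposited - resputtered

# Ion-to-neutral flux ratio of 1:1, as in the simulations described above.
for y in (0.0, 0.4, 1.0, 2.0, 2.5):
    print(f"Y = {y:3.1f}: net growth = {net_top_growth(1.0, 1.0, y):+.2f}")
```

With equal ion and neutral fluxes the net growth is proportional to 2 − Y, so it vanishes at Y = 2 and becomes net etching beyond that value.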

As may be seen in Fig. 5(c), the sidewall film thickness is
not necessarily a monotonic function of the height, unlike the
case of Fig. 4(b). At position a, a significant portion of the
incoming metal flux arrives directly from the plasma source,
rather than from resputtering at the bottom, and thus the
film thickness is slightly larger than that at position b. When
the sputtering yield Y is high [e.g., Fig. 5(c)], a significant
portion of the incoming metal flux at position c is due to the
resputtering from the trench bottom, and thus the film
thickness is again slightly larger than that at position b. Therefore,
the film topography on the sidewall tends to be slightly
bowed (thicker near the top and bottom and thinner in the
middle) when Y is high. However, for sufficiently thin films,
such as those used in liner technology, this variation of the
sidewall-film thickness is not significant.

The critical sputtering yield $Y_c$, which we define as the Y
value at which the step coverages at b (middle of the sidewall)
and d (middle of the bottom) are equal, gives good
guidance for the film conformality. Under the conditions
used in Fig. 5, $Y_c \approx 1.0$.

Fig. 7. SEM pictures of Cu deposition under conditions similar to Fig. 5. The applied bias voltages are (a) 12, (b) 50, and (c) 100 V.

Experiments were done with Ti, TiN, AlCu, and Cu over
the range of 0.5-4 aspect ratios and good conformal step
coverages were obtained in all these cases. Figure 7 shows
typical results of such systems under conditions similar to
those used in Fig. 5. The bias voltages used in these
experiments were 12, 50, and 100 V for Figs. 7(a), 7(b), and 7(c),
respectively. Cu films are shown in Fig. 7 since the higher
brightness of Cu in scanning electron microscopy (SEM)
photographs most clearly delineates the film topography.

Figure 8 shows simulation results of liner deposition on
trenches of aspect ratio 1. The conditions used here are the
same as those used in Fig. 5. Figure 9 shows the step
coverage (i.e., normalized film thickness) at four different
positions indicated in Fig. 8(a) as functions of the sputtering
yield Y. The positions a, b, c are at 1/4, 1/2, and 3/4,
respectively, of the trench depth, and d is at the center of the trench
bottom, as before. It is seen in Fig. 9 that the critical
sputtering yield for conformality is again $Y_c \approx 1$. Evidently, $Y_c$
depends weakly on the aspect ratio A for the range
$1 \le A \le 2.5$ studied here. Thus the ionized magnetron sputter
metal deposition system with an appropriate sputtering yield
Y can form conformal metal step coverages over trenches
with a wide range of aspect ratios in a single process.

Fig. 8. Profiles of the deposited films obtained from numerical simulations.
The aspect ratio of each trench is 1. Other conditions are the same as those
used in Fig. 5. The sputtering yields at zero incident angle Y(0) are (a) 0,
(b) 0.4, and (c) 1.0.

IV.  CONCLUDING  REMARKS 

Numerical simulations together with experimental results
show that conformal deposition of metal liners can be
achieved by simultaneous deposition and resputtering in
ionized magnetron sputter metal deposition processes. The
ionized magnetron sputter deposition process has two significant
advantages over the conventional magnetron sputter
deposition process. One is the directionality (or anisotropy) of the
metal deposition flux due to the presence of highly ionized
gas-phase metal and the applied bias voltage on the substrate
surface. With high ionization of metal in the plasma region, a
large metal ion flux with high directionality may be extracted
from the source to the substrate without damaging the
substrate. The other advantage is its ability to control the ion
flux energy incident upon the substrate surface. The amount
of resputtering from the substrate surface can be easily
controlled by the bias voltage.

Fig. 9. The ratio of film thickness at each position [a, b, c, and d indicated
in Fig. 8(a)] to the top film thickness w as a function of the sputtering yield
Y(0) under the conditions used in Fig. 8.

It is also shown that, by selecting an appropriate bias
voltage to control surface resputtering, we are able to form
conformal step coverages over various trenches with different
aspect ratios in a single process. In this method, metal
atoms resputtered from the bottom of the trench are
effectively redeposited on trench sidewalls. As the bias voltage
(i.e., sputtering yield on the sample surface) is increased, the
film thickness on the sidewalls increases, whereas the film
thickness on the trench bottom decreases. The appropriate
bias voltage (i.e., critical sputtering yield) is the one that
gives similar film thicknesses on sidewalls and trench
bottom. Since the critical sputtering yield is almost independent
of trench aspect ratios, one can conformally line
trenches with a wide variety of aspect ratios in a single
process.

Electron beam deposition (EBD) provides an inexpensive way to fabricate nanostructures of various
materials in a scanning electron microscope (SEM). However, the purity of metals deposited from
an organometallic precursor gas is impaired by simultaneously deposited carbon coming both from
the organometallic molecule and the residual contamination gas in the SEM chamber. We discuss
carbon-contamination EBD in a standard high-vacuum SEM and compare it to EBD of Au in an
environmental SEM (ESEM). The ESEM allowed us to perform "environmental" EBD (E-EBD),
i.e., EBD in the presence of an environmental gas (1-10 Torr) in addition to the organometallic
precursor gas. We built a simple device that contains a reservoir for the organometallic precursor
and goes on the sample stage of the ESEM. With this device we were able to highlight the
advantages of E-EBD over conventional, high-vacuum EBD. We discuss the basic chemical
reactions underlying the E-EBD process. © 1996 American Vacuum Society.


I.  INTRODUCTION 

Conventional deposition techniques currently used in the
fabrication of micron or submicron structures (e.g.,
microelectronics devices) do not allow the deposition of materials
on a very small area of the substrate. Instead, materials are
deposited over the whole substrate and the pattern is defined
afterwards by removing material from the adjacent areas.
The creation of a given pattern involves several tedious steps
(such as masks, resists, photo- or electron-beam lithography,
lift-off, and/or selective etching) which result in costly
processes and generally require planar substrates. The cost is
usually justified by the large number of devices produced in
parallel. However, alternative techniques are sought for
many applications where a few devices suffice. Also, it is
particularly important for microsensor engineering to
develop patterning techniques that are compatible with
nonplanar substrates.

Electron beam deposition (EBD) is an inexpensive
technique suitable for the fabrication of structures of different
sizes, shapes, and materials on the submicron or nanometer
scale. In the presence of a surrounding gas, a finely focused
electron (e⁻) beam is observed to cause the deposition of
material from the gas only in the area irradiated by the beam.
In EBD, materials are patterned and deposited
simultaneously. This one-step process is comparatively inexpensive.
Since it is maskless and resistless, it has the advantage that it
allows patterning on nonplanar substrates. On the other hand,
since it is induced locally, it is time-consuming. Another
disadvantage concerns the purity of the deposits, as will be
discussed later.

The observation of EBD is not new. The formation of
carbonaceous films on surfaces upon electron bombardment
in vacuum haunted researchers for more than 30 years.¹
Ennos² identified specimen contamination in the electron
microscope as produced by the interaction between the
bombarding e⁻ and the organic molecules adsorbed on the
specimen surface. Broers et al.³ were the first to deposit
nanometer-scale carbon-contamination structures in a scanning
transmission electron microscope (STEM). We describe our
studies on carbon-contamination EBD in Sec. II. By allowing
a small pressure of an organometallic vapor in the chamber
of a scanning electron microscope (SEM), Matsui and
Mori⁴ deposited 0.15-µm-wide metallic lines. The principle
of EBD is sketched in Fig. 1. The dissociation of some or all
of the gas molecules that happen to adsorb on the irradiated
spot results in the formation of a deposit consisting of the
nonvolatile remains from that dissociation.^"^ The deposition
rate depends on the total e⁻-beam current density at the
beam spot, on the effective adsorption rate of the precursor
gas molecules onto the substrate, and on the probability for
an electron to cause EBD of a molecule. In addition, the
beam current density depends not only on the beam spot size
but also on the rate of emission of secondary and
backscattered electrons, which also contribute to the process.
Recently, EBD has been demonstrated with a scanning
tunneling microscope (STM) operated in field-emission mode.^ For
our work we used SEMs because of their flexibility and
availability. We developed an instrumentally simple device
to perform EBD without needing to modify the SEM
chamber for introduction of the precursor gas. We call this device
the "EBD cell" (Sec. III). We used it to demonstrate
"environmental" EBD (E-EBD). This approach has several
advantages over conventional EBD (Sec. IV). The growth process
is analyzed in Sec. V.

Fig. 1. Principle of EBD. A focused e⁻ beam induces a chemical change in
the precursor gas molecules that are adsorbed on the irradiated spot.

II.  CARBON-CONTAMINATION  EBD 

One may straightforwardly carry out an EBD experiment
by means of a standard SEM operated at low beam currents
(~10-50 pA) and high beam energies (~15-30 keV).
These parameters are generally not critical. A flat sample
with a thin film of sputtered gold to avoid charging will
generally suffice. A rough sample will make the observation
of the submicron-sized deposits difficult. A requirement is
that the e⁻ beam be well focused. Stopping the e⁻ beam on a
given spot for a few minutes will typically cause the growth
of a submicron-diameter column or tip. The growth rate
varies greatly from one SEM to another, probably due to
different residual gas pressures and/or composition. If the growth
rate is very slow, beam instabilities and/or drifts will prevent
the formation of a fine column.

We deposited columns and lines on a thin film of Au
(~1000 Å) on Si with an SEM operating at a base pressure
of 5 X 10 ^ Torr. A microfabricated trench in the Si
substrate served as a landmark for easy location of the deposits.
A set of tips is shown in Fig. 2(a). A nearby dust speck
helped to finely focus the e⁻ beam. The tips were grown for
periods of time varying from 30 s to 10 min, and with beam
currents and acceleration voltages ranging over 10-30 pA and
15-40 kV, respectively. The apparent conical shape can be
attributed to instabilities of the e⁻ beam that effectively
spread the e⁻ beam spot, such as noise in the deflection
electrodes, mechanical vibrations, and thermal drifts. The
capability to write lines is demonstrated in Fig. 2(b).
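
To put the line-writing conditions into perspective, the electron dose delivered per unit line length can be estimated from the beam current and the writing time per micron quoted in the caption of Fig. 2(b) (8 pA at 4 min/µm and at 24 min/µm). The sketch below is plain arithmetic on those figures; the function name is hypothetical, and the result says nothing about how many of those electrons actually dissociate an adsorbed molecule.

```python
E_CHARGE_C = 1.602176634e-19  # elementary charge in coulombs

def electrons_per_micron(beam_current_a, minutes_per_micron):
    """Number of primary electrons delivered per micron of written line."""
    charge_c = beam_current_a * minutes_per_micron * 60.0
    return charge_c / E_CHARGE_C

fast = electrons_per_micron(8e-12, 4.0)    # thin lines of the square
slow = electrons_per_micron(8e-12, 24.0)   # broader, taller lines
print(f"4 min/um:  {fast:.2e} electrons per micron")
print(f"24 min/um: {slow:.2e} electrons per micron")
print(f"dose ratio: {slow / fast:.1f}")
```

At fixed beam current the dose scales linearly with dwell time, so the slower lines receive six times the dose of the faster ones.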

An interesting observation is that these contaminants can
be present even at lower pressures, as in a scanning Auger
microanalysis (SAM) system operating at a base pressure of
10“^- 10”^® Torr. On an indium-tin-oxide (ITO) sample
previously cleaned by Ar⁺ sputtering (0.5 min, 20 µA/cm², 2
keV) we exposed a 75 µm × 50 µm area by raster scanning
the e⁻ beam of the SAM for 16 hours. After this period of
time, the exposed area appeared blackened in an electron
micrograph and its Auger analysis revealed the presence of
carbon only. The black appearance can be explained by the
particularly low secondary electron emission rate of carbon.
The thickness of the deposit has to be at least several
nanometers in order to screen the Auger emission of the indium, tin,
and oxygen in the underlying ITO substrate. Therefore, the
amount of carbon present on any given sample is
overestimated because the analyzing e⁻ beam simultaneously causes
deposition of carbon. This problem poses a challenge to the
characterization of very small deposits. Furthermore, the
composition of a spot or column deposited with a stationary
e⁻ beam cannot be analyzed with an e⁻ beam of similar
resolution, since the analysis is comparatively slow and is
severely affected by drifts. Usually, since the efficiency of
Auger electron excitation is very low, high beam currents
(~1 nA) are used in Auger analysis as compared to EBD
(~10 pA), which results in bigger beam spot sizes. A partial
solution to this problem is to analyze large deposits made by
rastering the e⁻ beam across a large area (~100 µm²).

Fig. 2. Illustrative examples of carbon-contamination EBD. The white scale
bar represents 1 µm. To deposit a tip, the beam is focused on the desired
spot for a certain amount of time. (a) A set of tips is grown near a dust
speck. The series of 5 tips in a row indicated by arrows was grown for
periods of time of (from left to right) 10, 5, 2.5, 1, and 5 min. Beam current
and energy were 30 pA and 40 keV, respectively. (b) The four 1-µm-long
lines forming the square were deposited in 4 min each with a single line
scan (4 min/µm) at 8 pA and 15 keV. The two broader, taller lines were
deposited under the same beam conditions but 6 times slower (24 min/µm).

III.  ORGANOMETALLIC  EBD:  THE  “EBD  CELL” 

The nature of the precursor gas determines the
composition of the deposits. Metals, for instance, can be deposited
from organometallic precursor gases."^”^ Unfortunately, the
purity of the deposits is affected by undesired deposition of
carbon coming not only from the residual contamination
gases, as discussed in Sec. II, but also from the
organometallic precursor compound. In addition, since high
vacuum is required for operation of the SEM, the precursor
gas must be allowed inside the SEM chamber only in small
quantities. This results in low growth rates. Generally, the gas is
introduced from an external source into a subchamber
(containing the sample) within the SEM chamber [see Fig.
3(a)]."^"^ Some authors used a gas nozzle directed onto the
sample.^ The SEM chamber must be modified to accommodate
a gas feedthrough, which may represent a serious
inconvenience.

Fig. 3. Schematic of (a) the usual subchamber setup for EBD, and (b) our
"EBD cell" setup. The environmental gas, not depicted for clarity, fills up
the whole ESEM chamber.

It is important to realize that high precursor gas pressures
are desirable not only to obtain high growth rates but also to
minimize carbon-contamination incorporation. At very low
precursor gas pressures carbon deposition dominates and
results in grossly impure deposits. Therefore, there is an
essential incompatibility between the high vacuum requirements
of conventional SEMs and the desired purity of the deposits.

To  avoid  hardware  modifications  to  the  SEM,  we  made  a 
small  device  that  acts  as  an  internal  vapor  source  and  goes 
on  the  SEM  sample  stage.  We  call  this  device  an  “EBD  cell” 
[see  Fig.  3(b)].  The  EBD  cell  is  a  3.5-cm-diameter  two- 
chamber  assembly:  one  subchamber  for  the  sample  and  a 
reservoir  for  the  precursor  material.  Three  screws  hold  the 
two chambers together, with a sealing Viton O-ring in between.
The vapors of the precursor material leave the reservoir
through a fixed small aperture (which we call the “reservoir
aperture”), enter the sample subchamber and escape
through another fixed aperture (the “sample aperture”) that
exposes the sample to the e⁻ beam. Thus some gas molecules
adsorb on the sample spot that is being irradiated by
the beam. The apertures were TEM diaphragms (with hole
diameters ranging from 5 to 500 µm) glued with silver paint
on 1-mm-diameter holes machined on top of each chamber.
In  practice,  the  choice  of  aperture  sizes  is  limited  by  (a)  the 
desired  growth  rate,  (b)  the  sample  area  accessible  by  the 
beam,  (c)  the  amount  of  precursor  material  given  its  vapor¬ 
ization  rate,  and  (d)  the  vacuum  requirements  of  the  SEM. 
Since  the  apertures  are  removable,  this  is  straightforwardly 
determined  by  trial  and  error.  Variable  apertures,  pressure 
gauges  and  an  electrode  gauge  could  be  implemented  for  a 
precise  control  and  in  situ  variation  of  the  growth  rates. 

To eliminate the vacuum constraints we used an environmental
SEM (ESEM). With the ESEM it is possible to image
in the presence of up to 20 Torr of certain gases. The
detection of secondary electrons for imaging is possible due
to the highly ionized state of the environmental gas above
the beam spot. We used Ar, H2O, and a mixture of 80%
Ar and 20% O2 (Ar/O2) as environmental gases.

The performance of the cell was first optimized by experimenting
with EBD of Fe from iron pentacarbonyl,
Fe(CO)5, a liquid with a vapor pressure of 3 Torr at 20 °C.
Then we deposited Au at room temperature from
Au(CH3)2(hexafluoroacetylacetonate), a liquid with a vapor
pressure of 700 mTorr at 20 °C (hereafter referred to as
DMGHFAC). We emphasize that both the organometallic
precursor, emanating from the reservoir, and the environmental
gas (Ar, H2O, or Ar/O2), filling up the whole ESEM
chamber, were present during deposition.

In all the experiments reported here, the EBD cell was
used in combination with the ESEM, but it has been used
successfully in high vacuum in a standard SEM as well. A
few droplets of DMGHFAC last 30 min in the reservoir at
environmental pressures of ~1-3 Torr if the reservoir aperture
and the sample aperture are 50 µm and 1 mm in diameter,
respectively. This yields growth rates of around 40
nm/s. A key feature of the EBD cell performance is that at
our environmental gas pressures (~1 Torr) the precursor gas
molecules leave the subchamber by diffusion, rather than
ballistically as in previous subchamber setups in vacuum.
This yields low evaporation rates of the precursor compound
while allowing a large sample area to be exposed to the e⁻ beam.

IV.  ENVIRONMENTAL  EBD  OF  HIGH-ASPECT- 
RATIO  NANOSTRUCTURES 

The concept of environmental EBD (E-EBD), i.e., EBD in
the presence of an environmental gas in addition to the precursor
gas, can be advantageous for several applications.
This process was recently applied to reduce carbon incorporation
in EBD of Au. The environmental gas was either Ar,
H2O, or Ar/O2. The presence of Ar/O2 or H2O reduced the C
content of the deposits (as measured with Auger electron
spectroscopy) whereas Ar did not. Moreover, the Au content
increased with increasing H2O or Ar/O2 pressure, whereas it
was not significantly affected by changes in Ar pressure. The
highest Au contents (50%) were achieved under 10 Torr of
Ar/O2. With H2O vapor, the overall contents were smaller
(up to 20% under 3 Torr). Very low Au contents (~2-3%)
were systematically observed in the absence of environmental
gas. This indicated a large amount of hydrocarbon residual
gases in the ESEM chamber and highlighted the accomplishment
of 50% Au composition by means of E-EBD.

Fig. 4. AFM image of EBD columns deposited on Si from DMGHFAC
vapor at 30 keV. The columns appear convoluted with the AFM tip, as
manifested by identical sidewall features on each column.

The  concept  of  E-EBD  can  also  be  applied  to  create  high- 
aspect-ratio  nanostructures.  Since  the  ESEM  is  able  to  oper¬ 
ate  at  high  environmental  gas  pressures  (0-20  Torr),  it  is 
inherently  compatible  with  higher  precursor  gas  pressures 
than  standard  high-vacuum  SEMs.  This  can  yield  very  high 
growth  rates  and  aspect  ratios.  We  achieved  line  widths  of 
~100  nm  despite  the  high  gas  pressures.  An  example  is 
shown  in  Fig.  4.  A  pattern  of  columns  reading  “UB”  was 
deposited in 5 s/column from DMGHFAC on Si under 2 Torr
of Ar/O2 with a 30 keV beam. The columns have an
average height of 180 nm, which yields a growth rate of 36
nm/s.  The  widths  (~100  nm)  were  measured  with  an  SEM 
since  in  the  atomic  force  microscope  (AFM)  image  the  col¬ 
umns  appear  wider  than  they  really  are  due  to  convolution 
with  the  AFM  tip.  Since  every  column  has  the  same  rough 
features  on  its  right  sidewall,  one  can  conclude  that  it  is  a 
feature  present  on  the  AFM  tip  itself.  This  suggests  that  ar¬ 
rays  of  EBD  columns  could  be  used  as  calibration  tests  to 
know  the  exact  shape  of  an  AFM  tip. 

The same procedures needed to create column structures
such as those in Fig. 4 are applicable when the substrate is
not flat. We were able to deposit >4-µm-high, 0.1-µm-thick
columns from DMGHFAC on top of commercial microfabricated
AFM tips. Indeed, the unique capability of EBD for
patterning nonplanar substrates caught the attention of STM
and AFM researchers very early because sharp, high-aspect-ratio
tips are critical in several applications. Two examples
are shown in Fig. 5. The deposition parameters were the
same as in Fig. 4. The tips are not perfectly straight because
the deposition of each tip was done with several short exposures
(~5 s) to allow re-focusing of the beam between exposures.
This was done to avoid broadening of the column.
Tip heights are (a) 4 µm and (b) 1.5 µm.

Fig. 5. EBD columns deposited on top of pyramidal AFM tips. The white
scale bar represents (a) 5 µm and (b) 1 µm.
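
The growth rate and aspect ratios implied by the numbers quoted above can be checked with a few lines of arithmetic. The sketch below uses only values stated in the text (180 nm columns grown in 5 s, ~100 nm widths, and 4-µm-high, 0.1-µm-thick columns on AFM tips); the variable names are ours.

```python
# Columns of Fig. 4 (values quoted in the text)
column_height_nm = 180.0
exposure_time_s = 5.0
column_width_nm = 100.0

growth_rate_nm_per_s = column_height_nm / exposure_time_s
flat_aspect_ratio = column_height_nm / column_width_nm

# Columns grown on AFM tips (values quoted in the text)
tip_column_height_nm = 4000.0
tip_column_width_nm = 100.0
tip_aspect_ratio = tip_column_height_nm / tip_column_width_nm

print(f"growth rate: {growth_rate_nm_per_s:.0f} nm/s")
print(f"aspect ratio on flat Si: {flat_aspect_ratio:.1f}")
print(f"aspect ratio on AFM tip: {tip_aspect_ratio:.0f}")
```

The 36 nm/s obtained for the columns of Fig. 4 is close to the ~40 nm/s quoted for the EBD cell, and the columns grown on AFM tips reach aspect ratios of about 40.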

Last  but  not  least,  with  E-EBD  it  is  possible  to  fabricate 
nanostructures  on  bulk  insulators,  such  as  the  silicon  nitride 
pyramid  in  Fig.  5(b),  because  in  the  ESEM  the  sample  is 
continuously  discharged  through  the  ionized  environmental 
gas. 

V.  DISCUSSION 

It is important to note that the ESEM itself is not essential
to the concept of E-EBD. Similarly, we believe that the addition
of a reactive gas to reduce carbon incorporation during
EBD may as well be implemented in high vacuum. One
could, for example, direct two nozzles to a sample in a high-vacuum
SEM, one transporting the organometallic compound
and another one transporting pure O2. The composition
of the deposits is probably determined by the ratio of the
flows or partial pressures of the two gases. Furthermore, we
expected electron scattering at our high environmental gas
pressures to broaden the EBD features. However, we observed
no dependence of the width of the EBD features,
which was routinely 100 nm or less (see Figs. 4 and 5), on
environmental gas pressures <5 Torr. This is in accordance
with the ESEM’s surprisingly high resolution.
Danilatos has suggested that the beam is self-focused by a
concentric plasma sheath, and its associated electrostatic
field, around the beam. As noted above, in any case, lower
pressures of similar mixtures of gases could be used in a
high-vacuum EBD setup to deposit features at a slower rate
and higher resolution. We stress that the concept of E-EBD is
applicable to STM-EBD as well.

The composition of the deposits is virtually independent
of the e⁻ beam energy for the values typically used in SEM
imaging (~10-40 keV). This observation rules out the possibility
that substrate heating by the impinging electrons
plays a significant role in the EBD mechanism, as the heating
effect increases with increasing energy. Essentially, EBD of a
gaseous molecule is the rupture of one or more of the chemical
bonds within the molecule by one or more high-energy
electrons (>1 keV), resulting in a solid deposit. Typical
bonds in an organic molecule such as C-C, C-H, or C-O have
energies of 3.25, 3.43, and 4.31 eV, respectively. Because the
electron energy exceeds each of these bond energies by more
than two orders of magnitude, a high-energy electron cannot
be used to selectively rupture the chemical bonds present in
an organic molecule. The dynamic role of the continuously
changing surface has not been investigated yet to our knowledge.
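
The energy mismatch behind the lack of bond selectivity can be illustrated by comparing the beam energies with the bond energies quoted above. This is a minimal sketch: the bond energies and the 1, 10, and 40 keV beam energies are taken from the text, while the function name is hypothetical and the simple ratio is only an order-of-magnitude indicator, not a model of the dissociation cross section.

```python
# Bond energies quoted in the text (eV)
BOND_ENERGIES_EV = {"C-C": 3.25, "C-H": 3.43, "C-O": 4.31}


def energy_ratio(beam_energy_ev, bond_energy_ev):
    """How many times the electron energy exceeds a given bond energy."""
    return beam_energy_ev / bond_energy_ev


# Smallest ratio at the lowest beam energy considered (1 keV, strongest bond)
smallest_ratio = min(
    energy_ratio(1.0e3, energy) for energy in BOND_ENERGIES_EV.values()
)

for beam_kev in (1, 10, 40):
    ratios = {
        bond: round(energy_ratio(beam_kev * 1.0e3, energy))
        for bond, energy in BOND_ENERGIES_EV.items()
    }
    print(f"{beam_kev:>2} keV: {ratios}")
```

Even for a 1 keV electron and the strongest of the three bonds, the electron carries more than 200 times the bond energy, so the differences of about 1 eV between the bonds are negligible on the scale of the beam energy.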

We  explain  our  results  on  E-EBD  by  noting  that  highly 
reactive  species  are  generated  by  the  dissociation  of  the  pre¬ 
cursor  gas  molecules  adsorbed  on  the  substrate  as  well  as  by 
the  ionization  of  environmental  gas  molecules  above  the  sur¬ 
face.  Thus,  simultaneously  with  deposition,  C  in  the  deposits 
can  be  desorbed  as  CO  or  CO2  molecules  if  an  environmen¬ 
tal  gas  supplies  the  oxygen  ions  or  molecules  necessary  for 
that reaction to occur. Ar/O2 is more effective than H2O at
doing so due to its richer presence of O at the compared
pressures. That is, a total pressure of 10 Torr of Ar/O2 (80%/20%)
represents a partial pressure of 2 Torr of pure O2. Thus
the density of O atoms in 10 Torr of Ar/O2 is twice as high as
that in 2 Torr of H2O, and accordingly it yields superior Au
contents. Pure O2, which could not be used for safety
reasons, should yield higher Au contents at similar environmental
gas pressures. Finally, in our opinion the inert nature
of  Ar  satisfactorily  explains  that  Ar  does  not  create  a  reactive 
environment. 
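
The factor of two in O-atom density follows directly from the partial pressures and the number of O atoms per molecule. The sketch below reproduces that arithmetic, assuming ideal-gas behavior at a common temperature so that number densities scale with partial pressure; the function name is ours.

```python
import math


def oxygen_atom_pressure_torr(total_torr, mole_fraction, o_atoms_per_molecule):
    """Partial pressure of the O-bearing gas times its O atoms per molecule.

    For ideal gases at the same temperature this quantity is proportional
    to the number density of O atoms.
    """
    return total_torr * mole_fraction * o_atoms_per_molecule


# 10 Torr of Ar/O2 (80%/20%): 2 Torr of O2, two O atoms per molecule
o_in_ar_o2 = oxygen_atom_pressure_torr(10.0, 0.20, 2)

# 2 Torr of pure H2O vapor: one O atom per molecule
o_in_h2o = oxygen_atom_pressure_torr(2.0, 1.0, 1)

density_ratio = o_in_ar_o2 / o_in_h2o
print(f"O-atom density ratio (Ar/O2 : H2O) = {density_ratio:.1f}")
```

By the same arithmetic, pure O2 at 10 Torr would carry five times as many O atoms as the 80%/20% mixture at the same total pressure, in line with the expectation stated above.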

A  key  observation  is  that  in  our  experiments  on  organo- 
metallic  EBD  with  DMGHFAC  there  is  no  trace  of  F  or  O  in 
the Auger analysis of the deposits for standard e⁻ beam current
and acceleration voltage deposition parameters (see Ref.
8). Since high-energy electrons break all bonds without
preference, the above observation suggests that the organometallic
molecule is fully dissociated into its atomic constituents by
the e⁻ beam. Rather than two or more inert molecules, the
outcome  of  the  dissociation  is  a  set  of  highly  reactive  ions, 
probably  monatomic,  the  recombination  of  which  produces 
the  observable  deposit  along  with  volatile  compounds.  The 
creation  of  F2  and  O2  molecules,  for  example,  satisfactorily 
explains  the  absence  of  F  and  O  in  the  Auger  analysis  of  the 
deposits.  In  general,  any  reducing  species  present  in  the  en¬ 
vironmental  plasma  (such  as  from  H2O)  will  also  con¬ 
tribute  to  their  desorption.  In  the  case  of  Ar,  it  is  probably  a 
small  partial  pressure  of  other  residual  gases  (such  as  H2O) 
that provides the necessary reducing agents. The Au:C:O:F:H
atomic ratios in the DMGHFAC molecule are 1:7:2:6:8.
Although H cannot be detected with Auger spectroscopy, the
above  arguments  should  also  hold  true  for  H.  For  this  reason, 
we  expect  the  deposit  to  be  composed  of  only  Au  and  C  in  a 
Au:C  ratio  of  1:7  (12.5%  Au)  at  least.  Therefore,  in  EBD  of 
Au  with  DMGHFAC,  Au  contents  smaller  than  12.5%  must 
be  attributed  to  simultaneous  carbon-contamination  deposi¬ 
tion  from  residual  hydrocarbon  gases  in  the  SEM  chamber. 

Probably, in high vacuum the organometallic compound
does not fully dissociate under certain low-electron-density
conditions. Indeed, assuming full dissociation the most likely
recombination favors the creation of two desorbing CO molecules,
other recombinations being far less likely. Accordingly,
the deposit should contain 16.7% Au (Au:C = 1:5) at most.
However, higher Au contents have been reported by
others. This suggests that in high vacuum the molecule is
not fully dissociated and that bigger species are desorbed.
We could not investigate ranges of pressure <0.1 Torr with
the ESEM because of its inaccuracy in pressure control. We
explain the higher degree of dissociation observed in E-EBD
as compared to conventional high-vacuum EBD by noting
that in E-EBD the precursor gas molecules adsorbed on the
e⁻ beam spot are bombarded by electrons as well as by
reactive ions.
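
The composition limits quoted in this discussion follow from simple atom counting. The sketch below reproduces the 12.5% and 16.7% figures from the Au:C ratios given in the text; the last step, which converts an observed Au content into the number of C atoms per Au atom, is our own illustrative addition and assumes that the deposit contains only Au and C.

```python
def au_atomic_percent(au_atoms, c_atoms):
    """Atomic percentage of Au in a deposit containing only Au and C."""
    return 100.0 * au_atoms / (au_atoms + c_atoms)


def carbon_atoms_per_au(au_percent):
    """C atoms per Au atom implied by a given Au atomic percentage."""
    return 100.0 / au_percent - 1.0


# All 7 C atoms of the molecule stay in the deposit (Au:C = 1:7)
au_all_carbon_retained = au_atomic_percent(1, 7)

# Two C atoms leave as CO molecules (Au:C = 1:5)
au_two_co_desorbed = au_atomic_percent(1, 7 - 2)

# Illustrative: a 2.5% Au deposit (within the ~2-3% range quoted in the text)
carbon_per_au_at_2p5 = carbon_atoms_per_au(2.5)

print(f"Au:C = 1:7 -> {au_all_carbon_retained:.1f}% Au")
print(f"Au:C = 1:5 -> {au_two_co_desorbed:.1f}% Au")
print(f"2.5% Au    -> {carbon_per_au_at_2p5:.0f} C atoms per Au atom")
```

A deposit with 2.5% Au would thus contain about 39 C atoms per Au atom, far more than the 7 supplied by the precursor molecule, which is consistent with the attribution of low Au contents to carbon contamination from residual gases.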

VI.  CONCLUSIONS 

We  have  described  an  innovative  setup  for  EBD  that  re¬ 
quires  no  modifications  to  the  SEM  chamber.  It  consists  of  a 
small  subchamber  with  a  built-in  reservoir  for  the  precursor 
compound,  thus  eliminating  the  need  for  a  feedthrough  for 
introduction  of  the  precursor  gas  from  outside  the  SEM 
chamber.  With  this  device  we  had  previously  introduced  the 
concept  of  environmental  EBD  (E-EBD).  The  composition 
of  Au  deposits  can  be  substantially  improved  by  a  judicious 
choice  of  the  environmental  gas  and  the  growth  rates  can  be 
much  higher  than  in  high-vacuum  EBD,  which  yields  high- 
aspect-ratio  nanostructures.  Here  we  discussed  its  advantages 
and  analyzed  the  beam-induced  reaction  mechanisms  of 
E-EBD as compared to high-vacuum EBD.

We use EVFLOW, a thin film thermal flow process simulator, to simulate the thermal flow of metal
thin films in axisymmetric contacts/vias on patterned wafers, where two dimensional surface
evolution is appropriate. The flow processes considered are (1) Al and Au thin film laser melting
processes, for which process temperatures are higher than the metal melting points, and (2)
conventional Al thermal anneal processes (400-550 °C). Surface tension induced mass transport in
the flowing films is calculated by solving the Navier-Stokes and continuity equations for
incompressible fluids. Our simulations of Au film profiles during laser melting processes predict the
experimentally observed trends in void formation and collapse with feature geometry and deposited
film thickness. In laser melting processes, the void shrinks and vanishes, resulting in fully filled
contacts and planarized metal surfaces. Al film evolution in the thermal anneal process is modeled
using a surface layer melting model. The thickness of the liquidlike flowing layer is assumed to be
0.02 µm in this work. The trends in the formation of voids with feature geometry and deposited film
thickness predicted by EVFLOW reflect experimental observations. Voids may form during the
thermal anneal processes. The surface layer melting model as used in this work will not predict the
collapse of voids. © 1996 American Vacuum Society.


I. INTRODUCTION

Fabrication of multilevel interconnect structures used in
modern integrated circuits is becoming increasingly challenging
as the device density and the number of interconnect
layers increase with decreasing device dimensions. Tungsten
chemical vapor deposition (CVD) plug processes remove the
deposited film conformality problems in contacts and vias
that are present in conventional sputter deposited (PVD)
aluminum, and are widely used in production. However,
tungsten CVD processes are more complex, more costly, and
tend to have more particulate issues than PVD aluminum
processes. In addition, it is preferable to form interconnects
and plugs simultaneously.

Metal reflow technologies, such as Al thermal flow after
low temperature sputter deposition, have been investigated as
promising methods for forming plugs in submicron contacts
and achieving planarized surfaces. Heating the entire wafer
above the melting point of Al (its alloys, in practice)
(~660 °C), either in a conventional furnace or by rapid thermal
annealing, is undesirable because of potential interactions
of Al with other materials. Among the methods being
studied  in  order  to  minimize  the  thermal  budget  for  the  re¬ 
flow  are  (1)  heating  with  pulsed  lasers  (—10“^  s)  and  (2) 
annealing  at  temperatures  between  400  and  550  °C.  Pulsed 
lasers  allow  flowing  metal  films  locally  and  have  been  ex¬ 
tensively  investigated. So  far,  excimer  lasers  such  as 
XeCl  excimer  lasers  have  been  used  in  commercial 
systems.  Laser  reflow  of  a  variety  of  A1  alloys  (Al-1%  Si, 
Al-1%  Si-0.5  %  Cu,  and  Al-1%  Si-0.1%  Ti)  using  an  ex¬ 
cimer  laser  showed  excellent  planarization  for  all  topologies, 
including submicron contacts. Mukai, Sasaki, and
Nakamo detected the presence of undesired voids in contacts
with aspect ratios greater than 1, beneath fully planarized
Al surfaces following excimer laser irradiation. Bernhardt
et al. and Marella, Tuckerman, and Pease also
found voids in Au coated trench structures following excimer
laser irradiation. Those voids could be eliminated using
higher fluence laser irradiation. Park et al. reported that submicron
contacts with high aspect ratio (>1) were completely
filled using conventional thermal annealing under high
vacuum conditions. Electrical tests with fully processed
complementary metal oxide semiconductor (CMOS) devices
showed no degradation of contact resistance or junction leakage
current and improved electromigration resistance. For
higher aspect ratio contacts, the final flowed films showed
planarized metal surfaces with voids inside.

This article reports on our computer simulations of laser
melting and conventional annealing of metal films in axisymmetric
contacts. The initial deposited metal film and substrate
profiles are taken either from experimental SEM micrographs or
from deposition simulations using EVOLVE, which has shown
quantitative agreement with experimental data in previously
reported work (for example, see Refs. 20 and 21). Metal films
are first cold sputter deposited (either experimentally or virtually)
in contacts and then the simulator EVFLOW is used
to simulate the flow process. We first discuss our simulation
results for Al film evolution during laser melting, focusing
on void deformation; the simulated as-deposited Al film has
a large void inside. The effect of contact aspect ratio on
metal film evolution during laser melting processes is demonstrated
by simulating Au film evolution in two contacts,
with aspect ratios of about 1 and about 0.5. Digitized
as-deposited Au film and substrate profiles are input to EVFLOW,
in addition to the required fluid properties. We then discuss
our simulation results for Al thermal annealing, which we
model using a “surface layer melting model” similar
to the one presented by Hirose, Kikuta, and Yoshida.
The influences of initial feature geometry and as-deposited
film thickness on void formation during thermal annealing of
uncollimated and collimated deposited Al films are then discussed.
A limitation of the surface layer melting model is
highlighted.

II.  SURFACE  DIFFUSION  AND  SURFACE  LAYER 
MELTING  MODELS 

The  key  to  understanding  thermal  annealing  under  the 
melting  point  of  a  metal  is  that  not  all  atoms  are  equivalent 
in  nature;  those  present  at  the  surface  are  energy  rich  due  to 
having  fewer  neighbors  than  those  in  the  bulk.  Annealing  of 
metals  has  been  described  using  two  kinds  of  models:  curva¬ 
ture  driven  surface  diffusion  models  and  surface  melting 
models.  In  the  surface  diffusion  models,  surface  atoms  dif¬ 
fuse  along  the  surface  due  to  gradients  in  surface  curvature. 
Cale et al. developed a method to estimate surface diffusivity
as a function of temperature using film profiles as deposited
in commercial sputter deposition equipment. The
EVOLVE predictions are in good agreement
with experimental profiles.


Fig. 1. The principle of the surface melting model for the Al thermal anneal
process simulations.

Surface melting models for Al anneal processes are based
on experimental and simulation studies of surface disordering
of crystal Al surfaces. Stoltze, Nørskov, and Landman
used molecular dynamics to simulate the behavior of Al(110)
and Al(111) surfaces as they approach the melting point.
They found a definite disordering of the outermost layers of
Al(110) up to 200 °C below the bulk melting point, and the
close-packed (111) surfaces showed much weaker disordering
than the (110) surfaces. Denier et al. experimentally
studied the thermal disordering of Al(110) and Al(111) surfaces.
Medium energy ion scattering (MEIS) measurements
indicated a melting induced disordering of the Al(110) surfaces,
but not of the (111) surfaces, at a temperature of about
423 °C. The number of disordered monolayers grows as the
melting point is approached. Molecular dynamics simulations,
scattering experiments, and transport measurements
have shown that the diffusion coefficients of these mobile
layers are very large and are comparable to those of a
bulk liquid at its melting point. Hirose, Kikuta, and
Yoshida proposed that the mass transport of Al occurs only
in a thin surface layer during conventional thermal anneal
processes. Mass transport inside the melting layer was calculated
using a fluid dynamics model. They estimated the
thicknesses of the flow layers by matching the anneal time at
each temperature; the thickness diverges as the melting point
is approached. Figure 1 illustrates the principle of the surface
melting models. The surface melting model is used in this
work to simulate Al film evolution in conventional thermal
anneal processes.

Fig. 2. (a) 2 µm as-deposited Al film in a 1×1 µm contact. (b)-(d) Al film profiles at selected times during a laser melting process.

Fig. 3. (a) Sputter deposited Au film in a contact with aspect ratio about 0.5. (b) Au film profile after irradiation with a 3.5 J/cm², 600 ns laser pulse.
(c) Simulated Au topography during the laser melting process.

III. EVFLOW

EVFLOW is described in detail by Liao and Cale; it consists
of EVOLVE, a physically based low pressure deposition/etch
process simulator, and FLOW, a thin film thermal flow
process simulation program. The conservation law based surface
movement routines in EVOLVE are used to update the
location of the free surface and void profiles according to the
local velocity vectors. The flow process is modeled as an
incompressible, surface tension driven, free-boundary problem,
and is an extension of the model presented by Bornside
et al. EVFLOW is capable of simulating thin film thermal
flow  processes  in  infinitely  long  trenches  with  arbitrary  ver¬ 
tical  cross  sections  and  cylindrical  contacts/vias;  i.e.,  situa¬ 
tions  for  which  feature  surfaces  can  be  represented  in  two 
dimensions.  The  full  dimensionality  of  the  transport  is  taken 
into  account. 

In EVFLOW, the flow process is divided into a number of
time steps, and the location of the free surface and void is
determined as a function of time; i.e., the pseudo-steady-state
approach is used. Within each simulation time step, EVFLOW
generates a mesh in the flowing film, and solves the steady
state version of the dimensionless Navier-Stokes equations
for incompressible fluids using a penalty function finite element
method. The penalty function method is usually considerably
more economical in terms of computer memory
and execution speed than the general mixed order interpolation
formulation. A difficulty with the use of this approach
when applied to the surface layer melting model is discussed
in Sec. V.

Fig. 4. (a) Sputter deposited Au film in a contact with aspect ratio about 1. (b) Au film profile after irradiation with a 3.8 J/cm², 600 ns laser pulse. (c) Digitized
substrate and as-deposited Au film profiles. (d) Simulated Au topography during the laser melting process.


IV.  EXPERIMENTS  AND  SIMULATIONS  OF  LASER 
MELTING 

In order to illustrate film evolution during laser melting
processes, simulations begin with a simulated sputter deposited
2 µm Al film in a 1×1 µm idealized contact. Figure 2(a)
shows the Al film predicted by EVOLVE assuming that the Al
sticking factor is 1 during the sputter deposition. There is a
large void inside the film, as would be expected. The values
of surface tension and viscosity of Al used in the flow simulation
are 914 dyne/cm and 0.013 P, respectively. The film
evolution calculated by EVFLOW is illustrated in Figs. 2(b),
2(c), and 2(d). The void shrinks and finally vanishes during
the laser melting process, which results in a totally filled contact
and planarized topography. In other words, the laser
melting process can lead to a 100% filled contact and planarized
topography if enough laser fluence is provided. The
experimental results of Bernhardt et al. and Marella, Tuckerman,
and Pease support this conclusion; i.e., no voids
were found in high aspect ratio samples if high enough laser
fluences (and hence temperatures) were used to reflow the
metal. The metal viscosity decreases exponentially with increasing
temperature and the surface tension decreases
linearly, which results in shorter characteristic film leveling
times at high temperatures. It should be noted that in this
simulation and those discussed in the following, the simulation
domain contains a large volume of metal above the vias.
As the via fills, the decrease in metal height above the flat
surface is small, which is supported by experimental results.

Marella, Tuckerman, and Pease studied via/contact hole
filling with Au, Cu, and Al-1% Cu films using a flashlamp-pumped
dye laser (600 ns pulse duration and λ = 490 nm)
and an excimer laser (35 ns and λ = 308 nm). Test structures
containing either metal-to-metal vias or metal-to-silicon contacts
were prepared, starting with silicon wafers as substrates.
All metal films were formed by sputter deposition,
while all SiO2 layers were deposited by a low pressure CVD
process. Vias and contacts were exposed and dry etched into
the SiO2. Thin (100-300 Å) TiW films were sputter deposited
to provide adhesion between the SiO2 and the metal
films. The thickness of the metal and insulator layers and the
diameter  of  the  vias/contacts  were  varied  in  order  to  produce 
a  number  of  different  as-deposited  metal  profiles.  Metal 
plugs  before  and  after  laser  irradiation  were  inspected  by 
SEM. 

The laser melting cases simulated were for Au films, for
which Marella, Tuckerman, and Pease provided the most
experimental information; e.g., film profiles before and after
melting. Experimentally, the smaller the aspect ratio of the
contact, the easier it is to fill. Figures 3(a) and 3(b) show
experimental Au film profiles in a contact with an aspect
ratio of about 0.5, before and after irradiation with a 3.5
J/cm², 600 ns laser pulse. The profile evolution predicted by
EVFLOW, using substrate and as-deposited Au profiles as
shown in Fig. 3(c) as input, is shown in Fig. 3(d). Because
EVFLOW is limited to axisymmetric features, the average of
the left and right hand sides of the Au film in the SEM is
used to represent the deposited film profile. The viscosity and
surface tension of Au at the melting point are used in the
simulations and are 0.05 P and 1140 dyne/cm, respectively. The
processing time in EVFLOW is set to 600 ns; i.e., the
time-temperature trajectory is assumed to be an ideal step up to
the melting point and a step down 600 ns later. The contact is
filled and the surface is planarized.

Fig. 5. (a) EVOLVE simulated 0.6 µm sputter deposited Al film in a 0.5×1
µm contact with 15° tapered sidewall. (b) EVFLOW prediction of the Al
profiles during the anneal process.

Figure 4(a) shows an as-deposited Au film in a tapered contact with an aspect ratio of about 1. The final film profile after irradiation with a 600 ns, 3.8 J/cm² laser pulse is shown in Fig. 4(b). The film and substrate profiles shown in Fig. 4(c) are input to EVFLOW. The values of Au viscosity and surface tension used are the same as in the above example. Figure 4(d) shows the EVFLOW calculated time evolution of the Au film topography, which illustrates that the void is formed at the beginning of the process and collapses afterward. The surface becomes more planarized with time.


Fig. 6. (a) EVOLVE simulated 0.7 μm sputter deposited Al film in a 0.5×1 μm contact with 15° tapered sidewall; (b) EVFLOW prediction of the Al profiles during the anneal process.


The  exposure  time  or  intensity  of  the  laser  pulse  is  not  large 
enough  to  eliminate  the  void.  By  increasing  the  flow  time 
somewhat,  the  void  will  disappear.  The  predicted  void  is 
smaller  than  that  of  the  experiment  and  does  not  have  the 
same shape. Nevertheless, we feel that the simulation results support the model used. In order to predict film profile evolution more accurately, better estimates of the time-temperature trajectories and fluid properties are required. At this time, it is satisfying to predict trends correctly.

JVST B - Microelectronics and Nanometer Structures

H. Liao and T. S. Cale: Metal thin film thermal flow processes

Fig. 7. (a) EVOLVE simulated 0.8 μm sputter deposited Al film in a 1×1 μm contact with 15° tapered sidewall; (b) EVFLOW prediction of the Al profiles during the anneal process.

The mechanism of void formation during the laser melting process can be explained using the concept of surface tension driven mass transport, which is proportional to the local curvature. For vias, consider the curvatures in the plane of the paper (as seen in cross sectional views) and the curvatures in the "horizontal" plane, which are given by the local hole radii of the opening (which are also seen in cross sectional views). For wider contacts (smaller aspect ratios), the three-dimensional (3D) curvatures at the top edges are smaller than those for narrower contacts, for a given film thickness. At the via mouth, the curvature in the plane of the paper drives the film into the via. The curvature in the horizontal plane drives the film to pinch off and form a void. Thus, the films are more likely to pinch off before filling the feature in higher aspect ratio features, because the curvature in the horizontal plane is larger (the films are closer to closure). Also, the amount of movement needed to close the mouth is lower. Once formed, the void collapses for the duration of the melt process.
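As a rough illustration of the curvature argument, the capillary pressure scale associated with each curvature can be estimated from the Young-Laplace relation, pressure = surface tension × curvature. The sketch below is not part of EVFLOW. The function name, the edge radius, and the two opening radii are hypothetical values chosen only to show the trend, and sign conventions are ignored (only magnitudes are compared).

```python
def capillary_pressures(gamma, r_edge_cm, r_open_cm):
    """Return (in-plane, horizontal) capillary pressure magnitudes in dyn/cm^2.

    gamma      surface tension in dyn/cm
    r_edge_cm  radius of curvature of the film edge in the cross-sectional plane
    r_open_cm  radius of the remaining opening at the feature mouth
    """
    return gamma / r_edge_cm, gamma / r_open_cm


GAMMA_AU = 1140.0  # dyn/cm, the value quoted in the text for molten Au
R_EDGE = 0.2e-4    # cm (0.2 um), hypothetical edge radius of curvature

# Hypothetical opening radii: a wide mouth and a nearly closed mouth.
wide = capillary_pressures(GAMMA_AU, R_EDGE, 0.5e-4)    # 0.5 um opening radius
narrow = capillary_pressures(GAMMA_AU, R_EDGE, 0.1e-4)  # 0.1 um opening radius

# Ratio of the pinch-off (horizontal) term to the fill (in-plane) term.
ratio_wide = wide[1] / wide[0]
ratio_narrow = narrow[1] / narrow[0]
print(ratio_wide, ratio_narrow)
```

With these assumed radii the horizontal term is the smaller of the two for the wide mouth and the larger of the two for the narrow mouth, which is the qualitative trend described in the text.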

V.  SIMULATIONS  OF  THERMAL  ANNEALING 

EVFLOW is used to predict the evolution of Al film profiles during thermal anneal processes, using cold sputter deposited Al film profiles predicted by EVOLVE as input. The thickness of the melted surface layers is assumed to be 0.02 μm. The actual thickness of the melting surface layer may be much thinner than 0.02 μm (Refs. 23-25); however, we chose 0.02 μm because of limitations of computer memory and execution speed. The values for surface tension and viscosity used in the simulations are 1077 dyn/cm and 420 P, as used by Hirose, Kikuta, and Yoshida.

Figure 5(a) shows a simulated sputter deposited 0.6 μm Al film in a 0.5×1 μm contact with a 15° tapered sidewall. The contact is filled without a void after a simulated thermal anneal process [Fig. 5(b)]. If the deposited film thickness is increased to 0.7 μm, however, the contact is found to have a void left in it, as shown in Figs. 6(a) and 6(b). This can be explained, as for the case of laser melting, by (1) the higher surface curvature and (2) the film being closer to closure for the 0.7 μm case relative to the 0.6 μm film. Once the void is formed, it will tend to become spherical. Because the surface melting model does not include transport in the bulk of the films, the void volume will stay the same while the void becomes spherical. In order to predict void shrinkage, the model would have to include transport through the bulk of the film (see Ref. 32 for one approach).

Fig. 8. (a) EVOLVE simulated 0.8 μm collimated sputter deposited Al film in a 0.5×1 μm contact with 15° tapered sidewall; (b) EVFLOW prediction of the Al profiles during the anneal process.

Based on the above analysis, the key to preventing void formation is to prevent closure of the feature mouth by keeping a large enough opening. This implies that contacts with smaller aspect ratios and thinner as-deposited films are easier to fill than those with larger aspect ratios and thicker films. As shown in Figs. 7(a) and 7(b), a simulated 0.8 μm Al film can completely fill a 1×1 μm contact after reflow. One way to decrease the likelihood of void formation for a given deposited film thickness and feature size is to use collimated sputter deposition. Figures 8(a) and 8(b) show that a simulated 0.8 μm collimated sputter deposited Al film fills a 0.5×1 μm contact after thermal annealing. The opening at the top of the feature mouth is larger than that of conventional sputter depositions at a given film thickness. Thus, a larger film thickness can be deposited without void formation during reflow.

These trends were obtained by Hirose, Kikuta, and Yoshida in analyzing film topography during Al thermal anneal processes; however, the processing times to closure are significantly different. By adjusting the melted layer thickness to match the experimental simulation times of 30-180 s, Hirose, Kikuta, and Yoshida estimated it to be about 0.005 μm. However, the processing time in the simulation
results  presented  above  are  of  the  order  of  10“^  s.  This  is  due 
to  the  larger  melted  layer  thickness  used  in  this  work. 

A limitation of the penalty function formulation became obvious during the simulations of thermal anneal processes; the penalty factor cannot be 0. The continuity equation modified for the penalty function formulation in axisymmetric contacts is

$$\epsilon p + \left[\frac{1}{r}\frac{\partial}{\partial r}(ru) + \frac{\partial v}{\partial z}\right] = 0, \qquad (1)$$

in which $p$ is pressure, $\epsilon$ is the penalty factor (typically a very small number), and $u$ and $v$ are velocities in the $r$ and $z$ directions, respectively. Consider a void inside a contact with
a thin melted surface layer. The steady state of the void is a sphere, and the volume inside the sphere should be the same as the initial void volume (mass balance). At the steady state, velocities (u,v) should be zero with the surface tension induced forces balanced by pressure. However, the modified continuity equation in the penalty formulation leads to zero pressure if u and v are identically zero, since the penalty factor cannot be 0. Using the penalty function formulation, the void will continue to shrink in size. Because of this limitation, EVFLOW is only used to simulate the evolution of the free surfaces during the Al thermal anneal processes after void formation. This situation does not exist in the laser melting processes, in which the films totally melt and the steady state is a fully planarized surface and a completely filled via.
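The conflict described above can be made concrete with a short numerical sketch. It assumes that the penalty form of continuity reads (penalty factor) × p + div(u) = 0 and that a spherical void of radius R requires a pressure jump of magnitude 2γ/R (Young-Laplace). The void radius and the penalty factor below are hypothetical numbers, not values taken from EVFLOW.

```python
def penalty_pressure(div_u, eps):
    """Pressure implied by the penalty form of continuity: eps*p + div(u) = 0."""
    if eps <= 0.0:
        raise ValueError("the penalty factor must be positive")
    return -div_u / eps


def laplace_pressure(gamma, radius_cm):
    """Magnitude of the pressure jump across a spherical surface: 2*gamma/R."""
    return 2.0 * gamma / radius_cm


GAMMA_AL = 1077.0  # dyn/cm, surface tension quoted in the text for Al
R_VOID = 1.0e-5    # cm (0.1 um), hypothetical void radius
EPS = 1.0e-9       # hypothetical penalty factor (arbitrary consistent units)

p_needed = laplace_pressure(GAMMA_AL, R_VOID)  # pressure needed to hold the void
p_at_rest = penalty_pressure(0.0, EPS)         # pressure when u = v = 0
div_needed = -EPS * p_needed                   # divergence that would supply p_needed

print(p_needed, p_at_rest, div_needed)
```

Under these assumptions a fluid at rest carries no pressure, so the surface tension force on the void cannot be balanced without a nonzero velocity divergence, which is why the simulated void keeps shrinking.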

VI.  CONCLUSIONS 

Trends in reflow behavior and void formation during laser melting and thermal anneal processes calculated by EVFLOW agree with experimental trends. Decreasing the curvatures of the deposited films at the feature mouths reduces the possibility of void formation during both the laser melting and thermal anneal processes. The methods of reducing curvature include reducing deposited film thickness and using collimated sputter deposition techniques. The voids formed in laser melting processes can finally disappear if enough laser intensity or pulse duration is provided. Due to the limitations of the surface melting model used to simulate thermal annealing, voids will not disappear. The model needs to be extended to include a route to void closure in order to match experimentally observed film and void profile evolution.
Ti/Si and TiSi2/Si diodes have been prepared by direct current (dc) sputtering and rapid thermal processing. Their I-V characteristics from 77 K up to 300 K have been analyzed using a nonlinear least squares method to fit the experimental data. All the experimental data have been fitted using two exponentially dependent currents and a series resistance. Two parameters are fitted for each current: a saturation current and an exponential parameter. As is discussed in the article, one current can be assigned to the thermionic current, since the exponential parameter fits well to the theoretical value q/kT and therefore a temperature independent Schottky barrier height can be calculated. The second current, which yields an exponential parameter independent of temperature, cannot be assigned to a mechanism of direct tunneling through the barrier. As a result, we have proposed a trap-enhanced tunneling mechanism to explain this current. © 1996 American Vacuum Society.


I.  INTRODUCTION 

The rectifying behavior of metal-semiconductor contacts has been known for over a century, and the first theories to explain this effect date back to the 1930s. This classical analysis led to the well known expression for solid state diodes, $I = I_s[\exp(qV/nkT) - 1]$, where $I_s$ is the thermionic saturation current, $V$ is the applied voltage, $n$ is the ideality coefficient, and the other constants have their usual meaning. This expression has been widely used in the analysis of Schottky diodes mainly because a plot of ln(I) vs V can be fitted using the linear least squares method, obtaining $I_s$ and $n$.
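The linear analysis mentioned above can be sketched in a few lines. This is a minimal illustration, assuming the forward-bias regime where exp(qV/nkT) is much larger than 1, so that ln(I) = ln(I_s) + qV/(nkT). The function name and the synthetic data values are hypothetical.

```python
import math

K_B_OVER_Q = 8.617333e-5  # Boltzmann constant over elementary charge, V/K


def fit_ideal_diode(voltages, currents, temperature):
    """Least squares line through ln(I) versus V; returns (I_s, n).

    Only meaningful where exp(qV/nkT) >> 1, so the "-1" term is negligible.
    """
    ys = [math.log(i) for i in currents]
    n_pts = len(voltages)
    mean_v = sum(voltages) / n_pts
    mean_y = sum(ys) / n_pts
    sxx = sum((v - mean_v) ** 2 for v in voltages)
    sxy = sum((v - mean_v) * (y - mean_y) for v, y in zip(voltages, ys))
    slope = sxy / sxx                     # equals q / (n k T)
    intercept = mean_y - slope * mean_v   # equals ln(I_s)
    ideality = 1.0 / (slope * K_B_OVER_Q * temperature)
    return math.exp(intercept), ideality


# Synthetic data generated from assumed (hypothetical) parameters.
T = 300.0
I_S_TRUE, N_TRUE = 1.0e-7, 1.2
volts = [0.15 + 0.01 * k for k in range(11)]
amps = [I_S_TRUE * math.exp(v / (N_TRUE * K_B_OVER_Q * T)) for v in volts]

i_s_fit, n_fit = fit_ideal_diode(volts, amps, T)
print(i_s_fit, n_fit)
```

Because the synthetic data contain no series resistance, the fit recovers the assumed ideality coefficient; a series resistance would bend the ln(I) versus V plot and raise the fitted n.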

Several attempts have been made to give $n$ a clear physical meaning. For example, it was soon realized that the presence of a series resistance ($R_s$) with the Schottky diode increases the ideality coefficient. This led to a series of related methods to calculate $R_s$. Considering the effect of $R_s$, a modified voltage given by $V - IR_s$ must be used in the diode equation, leading to values of $n$ closer to 1.

Although the linear analysis including the ideality coefficient is still used, the possibility of using a nonlinear least squares method provides a new way of performing the fitting of the I-V characteristics. In this article we apply such a method to Ti/Si and TiSi2/Si Schottky diodes prepared by direct current (dc) sputtering and rapid thermal processing.


II. NONLINEAR FITTING OF I-V CHARACTERISTICS

The aim of the nonlinear analysis is to resolve each current present in the Schottky diode. Since each current has to be related to a transport mechanism of electrons or holes through the barrier, this approach allows a deeper physical insight into the problem. The nonlinear fitting of I-V characteristics of Schottky diodes by Donoval et al. supposes four mechanisms which contribute to the I-V characteristics at low temperatures: thermionic emission, generation-recombination, tunneling, and leakage current.

In this work we adopt a nonlinear least squares method to fit the experimental I-V curves of Ti/Si and TiSi2/Si diodes measured at temperatures from 77 to 300 K. In our approach we make no assumption about the possible mechanisms for current transport, except for the presence of a thermionic emission current. The experimental data are fitted to the function

$$I = \sum_{i=1}^{N} I_i \left\{ \exp[\alpha_i (V - IR_s)] - 1 \right\}, \qquad (1)$$

where $I$ and $V$ are the measured intensity and voltage, and $R_s$ (series resistance), $I_i$ (saturation current $i$), and $\alpha_i$ (exponential factor $i$) are fitting parameters. The nonlinear least squares fitting was made by means of a Levenberg-Marquardt algorithm using the minimum number of exponential functions.
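Because both I and V are measured, the fitting function can be evaluated without solving an implicit equation: the measured current is simply inserted on the right-hand side. The sketch below assumes the fitting function has the form I = Σ I_i {exp[α_i (V − I R_s)] − 1}. It builds the residual vector that a Levenberg-Marquardt routine would minimize, but it does not implement the minimization itself. All names and parameter values are hypothetical.

```python
import math


def model_current(v, i_meas, r_s, terms):
    """Sum of exponential currents evaluated at the junction voltage V - I*R_s.

    terms is a list of (saturation current, exponential factor) pairs.
    """
    v_junction = v - i_meas * r_s
    return sum(i_sat * (math.exp(alpha * v_junction) - 1.0)
               for i_sat, alpha in terms)


def residuals(data, r_s, terms):
    """Measured current minus model current for each (V, I) point."""
    return [i - model_current(v, i, r_s, terms) for v, i in data]


def solve_current(v, r_s, terms, upper=1.0):
    """Bisection for I = model_current(V, I); used only to make synthetic data."""
    lower = 0.0
    for _ in range(200):
        mid = 0.5 * (lower + upper)
        if mid - model_current(v, mid, r_s, terms) > 0.0:
            upper = mid
        else:
            lower = mid
    return 0.5 * (lower + upper)


K_B_OVER_Q = 8.617333e-5  # V/K
T = 300.0
# Hypothetical parameters: a thermionic term with alpha = q/kT and a second term.
TERMS = [(1.0e-7, 1.0 / (K_B_OVER_Q * T)), (1.0e-6, 20.0)]
R_S = 50.0  # ohm, hypothetical

data = [(v, solve_current(v, R_S, TERMS)) for v in (0.1, 0.2, 0.3, 0.4, 0.5)]
worst_true = max(abs(r) for r in residuals(data, R_S, TERMS))
worst_no_rs = max(abs(r) for r in residuals(data, 0.0, TERMS))
print(worst_true, worst_no_rs)
```

The residuals vanish (to rounding) for the parameters used to generate the data and become large when the series resistance is ignored, which is the signal a least squares routine would use to adjust the five parameters.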

III.  EXPERIMENT 

Titanium thin films (~1000 Å thickness) were deposited on p-type boron-doped silicon (100) using a dc magnetron sputtering system. The concentration of dopants in the silicon was estimated from C-V measurements to be
A^~5X10^^  cm“^.  Prior  to  the  deposition  the  samples  were 
dipped in HF (5%) to eliminate the native oxide. The geometrical definition of the diodes was made using a lift-off technique. The area of the diodes was about 1 mm².

TiSi2/Si diodes were prepared by annealing the Ti/Si contacts in an N2 atmosphere at 700 °C for 20 s in a rapid thermal processing system. Aluminum back contacts were deposited on both the Ti/Si as deposited samples and the TiSi2/Si samples. To improve the electrical characteristics of the contact the samples were annealed at 400 °C for 5 min.

The samples were subsequently introduced in a cryostat and forward I-V characteristics were measured from 77 K up to 300 K in steps of 25 K, using a HP4140B pA meter.

Fig. 1. Experimental and calculated I-V characteristics of TiSi2/Si and Ti/Si diodes at 77 and 300 K.

IV.  RESULTS  AND  DISCUSSION 

The experimental data of Ti/Si and TiSi2/Si diodes could be fitted at all temperatures (77-300 K) using only two exponential functions, one of them corresponding to the thermionic emission current. The physical meaning of the other exponential term will be discussed below. This means that five parameters had to be fitted: two saturation currents ($I_1$ and $I_2$), two exponential factors ($\alpha_1$ and $\alpha_2$), and the series resistance. Figure 1 compares the experimental and theoretical characteristics of the TiSi2/Si diode at 77 and 300 K.

The quality of the fitting can be measured using the parameter $\chi^2$, which is equivalent to the $r^2$ parameter of the linear least squares fitting. In this case, however, $\chi^2$ is zero for an ideal fitting. The fittings of the TiSi2/Si diode had $\chi^2 < 1$ for all temperatures. The fittings of the Ti/Si diode had $\chi^2 < 2$ except for the measurements at 77 and 100 K, which yielded $\chi^2 = 16.9$ and $\chi^2 = 13$, respectively. No evident improvement was obtained by increasing the number of exponential functions of the fitting or even by including a linear function (simulating a leakage current), so the data concerning these temperatures should be taken cautiously.

The only previous physical assumption in this study is the presence of a thermionic emission current. This assumption allows a self-consistency test for the method, since the exponential factor $\alpha_1$, corresponding to the thermionic emission current, can be compared with its theoretical value $q/kT$.

Figure 2 compares the variation of the parameter $\alpha_1$ as a function of temperature with the theoretical values of $q/kT$. The excellent agreement found between both values indicates that this fitting is equivalent to taking $n = 1$. Thus, the saturation current $I_1$ can be assigned to the thermionic emission current. From the value of the saturation current, the Schottky barrier height $\varphi$ can be calculated as $\varphi = -(kT/q)\ln(I_1/A^{*}ST^{2})$, where $S$ is the area of the diode and $A^{*}$ is the Richardson constant, which is taken as $A^{*} = 30$ A/cm² K². The calculated Schottky barrier heights are shown in Fig. 3.
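The barrier height expression is easy to evaluate numerically. The sketch below assumes the relation φ = −(kT/q) ln[I_1/(A* S T²)] with A* = 30 A/cm² K² and a diode area of about 1 mm² (0.01 cm²), as stated in the text. The function names and the test barrier of 0.55 V are used only for a round-trip check and are not fitted values.

```python
import math

K_B_OVER_Q = 8.617333e-5  # V/K


def barrier_height(i_sat, temperature, area_cm2, richardson=30.0):
    """Barrier height in volts from the thermionic saturation current (A)."""
    return -K_B_OVER_Q * temperature * math.log(
        i_sat / (richardson * area_cm2 * temperature ** 2))


def saturation_current(phi, temperature, area_cm2, richardson=30.0):
    """Inverse relation: I_1 = A* S T^2 exp(-q*phi/kT)."""
    return richardson * area_cm2 * temperature ** 2 * math.exp(
        -phi / (K_B_OVER_Q * temperature))


AREA = 0.01  # cm^2, about 1 mm^2

# Round trip at 300 K: barrier -> saturation current -> barrier.
i_1 = saturation_current(0.55, 300.0, AREA)
phi_back = barrier_height(i_1, 300.0, AREA)
print(i_1, phi_back)
```

Since the barrier height depends only logarithmically on the saturation current, moderate errors in the fitted current or in the assumed Richardson constant shift the result by a small multiple of kT/q.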
Fig. 2. Comparison of the fitted parameter $\alpha_1$ with its theoretical value $q/kT$.


In a previous work, the Schottky barrier height was calculated at room temperature. A clear difference in the barrier height of the as deposited Ti/Si ($\varphi \approx 0.55$ V) and the treated TiSi2/Si ($\varphi \approx 0.52$ V) diodes was reported. The values now reported at 300 K in Fig. 3 coincide with those obtained in the previous work. At low temperatures the Schottky barrier height remains approximately constant, showing a slight increase in its value with decreasing temperature.

The evolution of the series resistance with temperature is shown in Fig. 4. The series resistance decreases from 300 K down to 150 K, where it shows a minimum, and then increases from this temperature down to 77 K. The behavior down to 150 K can be explained by the decrease of the scattering mechanisms dependent on temperature (i.e., phonons). Below 150 K the intrinsic conduction becomes steadily more important. The concentration of dopants has been calculated from the $1/C^2$ versus V plots. The values
obtained  were  =  6.9X10^^  cm“^  for  the  Ti/Si  diode  and 
=4.7X10^^  cm“^  for  the  TiSi2/Si  diode.  These  values  are 
in  good  agreement  with  the  difference  of  the  series  resistance 
shown  between  both  diodes. 


Fig. 3. Evolution of the calculated Schottky barrier height with temperature.

Fig. 4. Evolution of the calculated series resistance with temperature.


The exponential parameter and the saturation current of the second exponential term in Eq. (1) are shown in Figs. 5 and 6, respectively. The exponential parameter remains approximately constant with temperature; therefore this current cannot be assigned to the generation-recombination mechanism, since its exponential parameter depends on temperature as $q/2kT$. The obvious candidate seems to be a tunneling current. Two tunneling currents through a Schottky barrier have been proposed: the thermionic-field and the field current, which have been discussed in the classical papers by Padovani and Stratton and Crowell and Rideout. According to these works the thermionic-field current is typical of
Schottky  contacts  on  moderately  doped  (A<10^^  cm“^) 
semiconductors. This current presents an exponential behavior with voltage, with an exponential parameter given by Eq. (2).

Fig. 5. Evolution of the calculated exponential parameter $\alpha_2$ with temperature.

Fig. 6. Evolution of the calculated saturation current $I_2$ with temperature.


In this frame Eq. (2) was used to calculate the exponential parameter for the Ti/Si diode, taking $m^{*}$ as the mean effective mass for holes in silicon. Values of 56.9 and 150 were obtained for $\alpha_2$ at 200 and 77 K, respectively. Although these values are of the same order of magnitude as the values obtained from the fitting, no significant variation with temperature is observed in the values obtained from the fitting. Therefore the second experimental current cannot be assigned to the thermionic-field current.
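For orientation, the temperature dependence expected for thermionic-field emission can be sketched with the standard Padovani-Stratton form, in which the exponential parameter is q/E_0 with E_0 = E_00 coth(E_00/kT) and E_00 = (qħ/2)(N/m*ε_s)^(1/2). That this is the form used by the authors is an assumption. The doping levels and the relative effective mass below are hypothetical illustration values, not those of the measured diodes.

```python
import math

HBAR = 1.054571817e-34    # J s
M0 = 9.1093837015e-31     # kg, free electron mass
EPS0 = 8.8541878128e-12   # F/m
K_B_OVER_Q = 8.617333e-5  # V/K


def e00_volts(doping_cm3, m_eff_rel, eps_rel=11.7):
    """E_00/q = (hbar/2) * sqrt(N / (m* eps_s)), returned in volts."""
    n_m3 = doping_cm3 * 1.0e6  # cm^-3 -> m^-3
    return 0.5 * HBAR * math.sqrt(n_m3 / (m_eff_rel * M0 * eps_rel * EPS0))


def tfe_exponent(doping_cm3, m_eff_rel, temperature):
    """Exponential parameter q/E_0 in 1/V, with E_0 = E_00 * coth(E_00/kT)."""
    e00 = e00_volts(doping_cm3, m_eff_rel)
    kt = K_B_OVER_Q * temperature
    return math.tanh(e00 / kt) / e00  # 1/(E_00 * coth(x)) = tanh(x)/E_00


M_EFF = 0.5  # hypothetical relative effective mass

# Moderate doping (hypothetical 1e15 cm^-3): the parameter follows q/kT.
a_200 = tfe_exponent(1.0e15, M_EFF, 200.0)
a_77 = tfe_exponent(1.0e15, M_EFF, 77.0)

# Very high doping (hypothetical 1e20 cm^-3): nearly temperature independent.
b_300 = tfe_exponent(1.0e20, M_EFF, 300.0)
b_77 = tfe_exponent(1.0e20, M_EFF, 77.0)
print(a_200, a_77, b_300, b_77)
```

For comparison, q/kT is about 58.0 V⁻¹ at 200 K and about 150.7 V⁻¹ at 77 K, close to the values of 56.9 and 150 quoted in the text. In the moderate doping limit the thermionic-field parameter therefore varies strongly with temperature, unlike the fitted second exponential.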

On the other hand, the field current has been described for degenerate semiconductors with the Fermi level within the valence band (for a p-doped semiconductor). Although this is not the case here, the field current shows, in the case of degenerate semiconductors, an exponential behavior with voltage with an exponential parameter which is independent of temperature.

It is known that there is a different behavior of the exponential parameter of both tunneling currents with temperature. This is due to the fact that, whereas in the field current the maximum number of electrons tunneling through the barrier has an energy independent of temperature, the energy at which the maximum number of electrons tunnel in the thermionic-field current depends on a compromise between the statistical distribution of electrons with energy (increasing with decreasing energy) and the transmission coefficient through the barrier (decreasing with decreasing energy). Therefore, the temperature independent exponential parameter can be assigned to a tunneling process at a fixed energy above the Fermi level.

A possible mechanism to explain the behavior observed in our samples is trap enhanced tunneling through the Schottky barrier. According to this mechanism the electrons do not tunnel directly into the metal. The tunneling consists of a two-step process: first, the electrons tunnel from the conduction band into traps in the depletion layer; then from these traps the electrons finally tunnel into the metal. Since the traps are situated at fixed energies above the Fermi level, the electrons preferentially tunneling into the metal will have this energy independent of the temperature. The enhanced tunneling mechanism can be formally described using an exponential function similar to the thermionic-field emission, but the numerical value of the exponential parameter is smaller in the case of the enhanced tunneling mechanism. This is in agreement with the experimental data, since the values of $\alpha_2$ are smaller than the theoretical value 56.9 eV$^{-1}$ at 200 K.

Figure 6 shows the variation of the saturation current $I_2$ with temperature. At relatively high temperatures log($I_2$) versus 1/T can be fitted to a straight line. This is in agreement with the enhanced tunneling mechanism, since the amount of current will be dependent on the number of electrons which have the appropriate energy. The statistical distribution in energy of these electrons corresponds to a Boltzmann distribution for moderately doped semiconductors. It can also be observed in Fig. 6 that the saturation current for the TiSi2/Si diodes is greater than for the Ti/Si ones, in agreement with their higher density of traps.
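A straight line in a plot of log(I_2) versus 1/T corresponds to a Boltzmann factor exp(−E_a/kT), so the slope gives an apparent activation energy. The sketch below shows one way to extract it, assuming a simple Arrhenius form. The prefactor, the activation energy, and the temperature grid are hypothetical values used to generate test data, not results from the measurements.

```python
import math

K_B_EV = 8.617333e-5  # Boltzmann constant in eV/K


def activation_energy(temperatures, currents):
    """Apparent activation energy (eV) from the slope of ln(I) versus 1/T."""
    xs = [1.0 / t for t in temperatures]
    ys = [math.log(i) for i in currents]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    return -K_B_EV * sxy / sxx  # slope = -E_a / k


# Synthetic saturation currents following an assumed Arrhenius law.
temps = [150.0, 175.0, 200.0, 225.0, 250.0, 275.0, 300.0]
E_A_TRUE = 0.20  # eV, hypothetical
i_2 = [1.0e-3 * math.exp(-E_A_TRUE / (K_B_EV * t)) for t in temps]

e_a_fit = activation_energy(temps, i_2)
print(e_a_fit)
```

Applied to measured data, departures from the straight line at the lowest temperatures would show up as large residuals of this fit.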

At low temperatures (100-77 K) $I_2$ shows a different behavior, which can be due to the presence of a leakage current through the diode. Except for the Ti/Si diode no attempt has been made to resolve this additional current. This is caused by the low value of the fit-quality parameter obtained from the original analysis, which does not allow one to discriminate clearly whether an additional current is physically significant or not.

V.  SUMMARY  AND  CONCLUSIONS 

The forward I-V characteristics of two different Schottky diodes, Ti/Si and TiSi2/Si, have been analyzed by a nonlinear method. We have shown in this work that the I-V characteristics can be fitted with two exponentials, if a series resistance is considered. The first exponential is attributed to the thermionic emission current. This hypothesis is supported by the good agreement between the exponential parameter and its theoretical value ($q/kT$). The fact of fitting the exponential parameter is equivalent to using an ideality coefficient equal to one. In this case, however, the ideality coefficient can be considered an option of the fitting, instead of an integral part of it, since the fitting can be improved by including more functions, and so the use of the ideality coefficient is considered more a self-consistency test of the method than part of the fitting itself.

The value of the barrier height remains approximately constant at different temperatures. This implies that the difference between the barrier height of the Ti/Si and TiSi2/Si diodes is a characteristic of the materials and is not due to the appearance of new conduction mechanisms.

The assignment of the second current is more controversial. It can be assigned neither to the generation-recombination current nor to the direct tunneling current through the diode. The characteristics of this current are similar to the enhanced tunneling current, and therefore we have assigned the current to the enhanced tunneling mechanism.

The possibility of performing nonlinear fitting to I-V characteristics allows a more direct insight into the physics of the problem.
A Ti/TiN bilayer deposited under a Ti/AlSiCu/TiN interconnect is generally used at the contact level to stabilize the contact resistance of small diameter contacts as well as to prevent junction spiking during postdeposition anneals. An air break followed by a furnace or an RTP anneal is generally required after the deposition of the Ti/TiN bilayer to improve the barrier stability. This three step interconnect scheme results in a low net effective throughput process if the same metallization system is used twice or, alternatively, in a high cost-of-ownership process if two metallization systems are used, a first for the Ti/TiN barrier and a second for the Ti/AlSiCu/TiN interconnect. In this article, it is shown that collimation permits the deposition of very thick TiN barriers at the bottom of high aspect ratio contacts, which allows contact metallization integration in a single metallization system. In addition, it is found that the use of a Ti glue layer between the barrier and the AlSiCu contact interconnect to promote the filling of these high aspect ratio contacts with Al alloys results in a yield loss of n+ and p+ contact chains and therefore should be avoided. Finally, it is demonstrated that an integrated Ti/TiN/AlSiCu/TiN contact metallization with excellent electrical properties is possible with collimated TiN barriers. © 1996 American Vacuum Society.


I. INTRODUCTION

Reactively sputtered TiN is the most popular barrier for the contact metallization with aluminum alloys since it provides a chemically stable barrier with low defect density and controlled properties. This process nevertheless has a major drawback: the bottom coverage of reactively sputtered TiN drops to less than 10% when the aspect ratio of straight wall contact holes exceeds 1.2. This means that a 100 nm thick TiN only provides 10 nm of TiN at the bottom of 0.8 μm diam straight wall contacts etched through a 1.0 μm thick oxide.

Collimation of TiN was introduced a few years ago to push the limits of the contact metallization with aluminum alloys as well as to permit the contact metallization with chemical vapor deposited tungsten (CVD W) in the subhalf micron processes. Varian's patented 1.5:1 cds® Ti/TiN collimation process permits a 100 nm thick TiN to provide 60 nm of TiN at the bottom of the same 0.8 μm diam straight wall contacts etched in 1.0 μm thick oxide. This means a 500% improvement over reactively sputtered TiN without collimation.
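The figures quoted above follow from simple arithmetic, sketched here for reference. The function names are hypothetical. The bottom coverage fractions (10% without collimation, 60% with collimation) are inferred from the thicknesses stated in the text for a 100 nm film.

```python
def aspect_ratio(depth_um, diameter_um):
    """Depth of the contact hole divided by its diameter."""
    return depth_um / diameter_um


def bottom_thickness(field_thickness_nm, bottom_coverage):
    """Film thickness at the contact bottom for a given coverage fraction."""
    return field_thickness_nm * bottom_coverage


# 0.8 um diameter contact etched through 1.0 um of oxide.
ar = aspect_ratio(1.0, 0.8)

# 100 nm TiN film with and without collimation.
uncollimated_nm = bottom_thickness(100.0, 0.10)
collimated_nm = bottom_thickness(100.0, 0.60)

# Relative gain of collimated over uncollimated bottom thickness, in percent.
improvement_pct = 100.0 * (collimated_nm - uncollimated_nm) / uncollimated_nm
print(ar, uncollimated_nm, collimated_nm, improvement_pct)
```

The aspect ratio of this contact is just above the threshold of 1.2 mentioned in the text, and the sixfold increase in bottom thickness corresponds to the stated 500% improvement.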

The 1.5:1 cds® TiN films can be obtained with collimation in a nitrided target mode of operation, thus providing so-called TiN(B) films, or alternatively in a non-nitrided target mode of operation, thus providing so-called TiN(C) films of equally high quality: a bulk resistivity of 40 μΩ cm, a bulk density of more than 95%, a high compressive stress, a deep gold appearance, and a [N]/[Ti] stoichiometric ratio of 1.


The improved bottom coverage and film properties of these two types of processes certainly increase the performance of the TiN barrier and could permit an integrated contact metallization with aluminum alloys at small geometry without the use of an air break and of an ex situ furnace or RTP anneal. This article looks at such an integrated metallization process with AlSiCu alloy.

II.  EXPERIMENT 
A.  Contacts  shape 

A 1.5 μm mixed signal CMOS process involving two levels of polysilicon and two levels of aluminum alloy interconnects is used to verify the performance of a Ti/TiN/Al-1.0 wt %Si-0.5 wt %Cu/TiN metallization process in the M2000 cluster tool. An inorganic 2.0 wt %F SOG is used to planarize the upper capacitor polysilicon. This results in a local accumulation of 1.4 μm of oxide in the contact area of the RTP activated 0.2 μm deep n+/p− and p+/n− junctions where straight wall 1.0, 1.2, 1.5, and 1.8 μm diam contacts are etched. The junctions were prepared by implantation of As and B (using BF2 molecular ions) as N and P dopants, respectively. The implanted energies and doses were 80 keV
and  5.8  X 10^^  at/cm^  for  As  and  50  keV  and  2.1  X 10^^  at/cm^ 
for  BF2  (the  corresponding  energy  of  the  B  atoms  being  ~11 
keV).  After  the  implantation,  the  dopants  were  activated  at  a 
temperature  of  1065  °C  for  10  s. 


2627 J. Vac. Sci. Technol. B 14(4), Jul/Aug 1996 0734-211X/96/14(4)/2627/9/$10.00 ©1996 American Vacuum Society 2627




Ouellet et al.: Ti/TiN/Ti/AlSiCu/TiN contact metallization


Fig. 1. Layout of the different modules composing the Varian M2000 cluster
tool. The numbers drawn on each module correspond to the sequence
followed during the metallization: (1) degas module, (2) collimated TiN
module, (3) etch module used as a cooling module, (4) AlSiCu module, (5)
collimated TiN module, and (6) cooling module.

B.  Wafer  process  sequence 

After a 60 s etch in a 50:1 NH4F:HF buffered oxide etch
solution, the wafers are loaded into a Varian M2000
cluster tool configured as illustrated in Fig. 1. The wafer
movement in the system is described according to the
encircled numbers in this figure. The wafer is first degassed for
75 s at about 510 °C in the degas station (1). While hot, the
wafer is then transferred into the 1.5:1 cds® Ti/TiN module
(2) onto a vacuum-isolated heater operated at a setpoint of
25 °C. The wafer remains at a temperature of about 400 °C
during the 150 s necessary to make the Ti/TiN or Ti/TiN/Ti
deposition. After the deposition, the wafer is moved to an
etch module (3), where it is cooled for about
150 s by a chuck operating at a setpoint of 25 °C. The
cooling is made possible by a flow of backside argon between the
wafer and the heater. The wafer is subsequently transferred
into one of the two AlSiCu modules (4), operating at a heater
setpoint of 475 °C, where a 400 nm cold/400 nm hot
two-step AlSiCu planarization (TSP) film is deposited within
about 300 s. After the Al deposition, the wafer is finally
transferred to a cool station (6), where it is cooled within
about 75 s. Each module of this M2000 cluster tool is
isolated and cryopumped. After the
patterning of the interconnects, the remaining standard
back-end process steps up to wafer-level testing are carried out.

C. Optimization of the various film thicknesses

The timing in each module is such that an integrated
metallization is possible with a productivity of about 18 wafers
per hour (WPH). This timing forces the collimated Ti/TiN or
Ti/TiN/Ti barrier to be deposited within 150 s, even at the
end of life of the collimator, and limits the thickness of
the bottom Ti contact layer, of the inner TiN barrier layer,
and of the upper Ti glue layer if used.
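
The link between the module timings and the quoted productivity can be checked with a rough throughput estimate. The sketch below uses only the module times given in the process description and treats the two AlSiCu modules as working in parallel; interpreting the gap between the slowest module and the 18 WPH cycle time as wafer handling overhead is an assumption made here, not a statement from the original work.

```python
# Module times in seconds and number of parallel modules for each step,
# taken from the wafer process sequence described in the text.
steps = {
    "degas": (75, 1),
    "Ti/TiN deposition": (150, 1),
    "cool after Ti/TiN": (150, 1),
    "AlSiCu deposition": (300, 2),  # two AlSiCu modules share the load
    "final cool": (75, 1),
}


def bottleneck_seconds(step_table):
    """Effective cycle time of the slowest step, in seconds per wafer."""
    return max(time / count for time, count in step_table.values())


def wafers_per_hour(cycle_seconds):
    """Convert a cycle time per wafer into wafers per hour."""
    return 3600.0 / cycle_seconds


cycle = bottleneck_seconds(steps)
ideal_wph = wafers_per_hour(cycle)
takt_18_wph = 3600.0 / 18           # cycle time implied by 18 WPH
overhead = takt_18_wph - cycle      # assumed to be handling/transfer time

print(f"slowest step: {cycle:.0f} s per wafer, ideal {ideal_wph:.0f} WPH")
print(f"18 WPH implies {takt_18_wph:.0f} s per wafer, "
      f"leaving {overhead:.0f} s for overhead")
```

Under these assumptions, the 150 s barrier deposition is one of the steps that sets the pace of the tool, which is why it cannot be lengthened without reducing the productivity.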

Since a minimum thickness of the Ti contact layer is
required to ensure good contact resistance to n+, to p+, to gate
polysilicon, and to capacitor polysilicon, and since a Ti glue
layer is used in some of the investigated cases, an upper limit
exists for the thickness of the low deposition rate TiN(B) and of
the high deposition rate TiN(C).

Therefore, the thickness of the Ti contact layer under an
arbitrarily selected 60 nm thick TiN(B) barrier was varied in
order to determine this minimum Ti contact layer thickness.
No Ti glue layer was deposited, and the barrier was covered
by 800 nm of AlSiCu followed by 40 nm of TiN antireflective
coating. The thickness of the Ti contact layer was fixed
using the results of contact resistance measurements. A second
experiment was then performed using the optimum
thickness for the Ti contact layer to investigate the effect of
the Ti glue layer on a number of splits which all
maintain a productivity of 18 WPH at the end of life of the
collimator. Two reference wafers without barriers were also
included in this study. Table I gives the set of splits prepared
for this experiment.

D.  Interfacial  reaction  measurements 

Three  types  of  blank  substrates  were  also  used  to  investi¬ 
gate  the  robustness  of  the  tested  barriers.  These  substrates 


Table I. Metallic multilayers used to study the effect of the Ti glue layer on the electrical properties of devices.
The left to right ordering of the columns corresponds to the sequential order of the deposition of the various
layers. The last two rows of this table correspond to the two reference wafers included in the study.

Ti contact   TiN barrier   TiN barrier   Ti glue layer   AlSiCu      TiN ARC
layer        type          thickness     thickness       thickness   thickness
thickness                  (nm)          (nm)            (nm)        (nm)
(nm)
10           [C]           95            0               800         40
10           [C]           80            10              800         40
10           [B]           30            10              800         40
10           [B]           35            0               800         40
0            none          0             0               800         40
0            none          0             0               800         40


[Fig. 2 panels: (a) N+ contacts, (b) P+ contacts, (c) gate polysilicon contacts, (d) capacitor polysilicon contacts. Horizontal axis: Ti (nm). Legend: mean, standard deviation.]

Fig. 2. Contact resistance results of n+ (a), p+ (b), gate polysilicon (c), and capacitor polysilicon (d) contact chains as a function of the Ti contact layer
thickness. The black bar gives the average and the white bar the standard deviation.


were a thermal oxide, an n-type single-crystal silicon free
from native oxide, and an N-doped and RTP-activated
polycrystalline silicon. The latter substrate is known to dissolve
easily in aluminum.

The tested barriers were deposited over each type of
substrate and under a combination of a 100 nm AlSiCu layer and a
40 nm TiN film. A 49-point sheet resistance map of each
resulting wafer was measured before and after
a 60 min 450 °C anneal in order to detect any change in the
sheet resistance that would result from an interfacial
reaction with any of the three types of substrate or with the
AlSiCu alloy. In this part of the study, two wafers were
measured for each tested barrier/substrate combination.

E.  Junction  leakage  measurements 

The tested junctions have a surface of 160 000 μm² and a
perimeter of 1600 μm. There are 27 tested junctions per size
of contact per wafer. Two types of contacts were used to
connect the junctions: (1) 6400 contacts of 2.0 μm × 2.0 μm
and (2) 1600 contacts of 4.0 μm × 4.0 μm.

The junctions were tested at a reverse bias of 5 V in total
darkness and at room temperature. For use in low-power
CMOS devices, a leakage current density of about 5
nA/cm² of junction is desired, which corresponds to a total
leakage current of about 10 pA for this size of junction.
According to our specifications, this leakage is independent
of the number of connecting contacts, but if one contact
leaks, the complete junction leaks.
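
The correspondence between the target current density and the target total current can be verified with a short unit conversion. The sketch below uses the junction area and perimeter given above; the function name is illustrative, and the square geometry is inferred from the area and perimeter rather than stated in the text.

```python
def leakage_current_pa(density_na_per_cm2, area_um2):
    """Total leakage current in pA for a given current density and area."""
    area_cm2 = area_um2 * 1e-8                 # 1 um^2 = 1e-8 cm^2
    return density_na_per_cm2 * area_cm2 * 1e3  # nA -> pA


junction_area_um2 = 160_000
junction_perimeter_um = 1600

# A 400 um x 400 um square is consistent with both quoted figures.
side_um = junction_perimeter_um / 4
current_pa = leakage_current_pa(5.0, junction_area_um2)

print(f"square side: {side_um:.0f} um, area: {side_um ** 2:.0f} um^2")
print(f"5 nA/cm^2 over this junction: {current_pa:.1f} pA")
```

The computed value of 8 pA agrees with the target of about 10 pA quoted in the text.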

F.  Contact  chain  measurements 

n+ (p+) diffusion contact chains having contact sizes of
1.0, 1.2, 1.5, and 1.8 μm, respectively, were measured. Each
chain, located in a large p-well (n-well), has 200 contacts,
100 diffusion links, and 99 AlSiCu links.

Similar gate polysilicon and capacitor polysilicon contact
chains (no 1.2 μm capacitor polysilicon contact chains) were
also measured. Located in a large p-well, each chain has 200
contacts, 100 polysilicon links, and 99 AlSiCu links. As for
the junction leakage measurements, there are 27 chains per
type of contact per wafer.

III.  RESULTS 

A. Optimum Ti contact layer thickness

Figure 2 shows the average contact chain resistance and
the standard deviation of the tested 1.5 μm n+, p+, gate
polysilicon, and capacitor polysilicon contact chains. High
contact resistance with a large standard deviation is obtained
for the 3 nm thick Ti contact layer for all types of contact
structures. However, lower contact resistance with a small
standard deviation is measured for the 6 and 10 nm thick Ti
contact layers. Furthermore, the yield varies between 60% and
70% for all types of contact chains containing the 3 nm Ti
contact layer, while it is 100% for the other two thicknesses.

B.  Effect  of  a  Ti  glue  layer 

Figure 3 shows the percentage yield of the tested 1.0 and
1.8 μm n+, p+, gate polysilicon, and capacitor polysilicon
contact chains before and after annealing at 450 °C as a function
of the deposited barrier structure. This is a measure of the
number of good contacts on a wafer. The graphs in Fig. 3
clearly show that the yield is much less than 100% when a
10 nm thick Ti glue layer is present on the n+ and p+
contact chains. In fact, it is zero for the 1.0 μm contact chains in
both cases before and after annealing. For the gate polysilicon,
a slight decrease in the yield is measured for the 1.0 μm
contact chains having a TiN[C]/Ti glue layer before and after
annealing. No important yield loss is observed for the
capacitor polysilicon chains. It is very clear that the use of a Ti
glue layer in an integrated Ti/TiN/Ti/AlSiCu/TiN process
sequence on the n+ and p+ contact chains should be
avoided. A reaction of the Ti glue layer with the Al during
the deposition of the alloy may be responsible for such a
total yield loss.

[Fig. 3 panels: (a) N+ contact chains, (b) P+ contact chains, (c) gate polysilicon contact chains, (d) capacitor polysilicon contact chains. Legend: 1.0 μm and 1.8 μm contacts, before and after the 450 °C anneal.]

Fig. 3. Graphs of the percentage yield for n+ (a), p+ (b), gate polysilicon (c),
and capacitor polysilicon (d) contact chains as a function of the barrier type
deposited on the wafer for two different contact sizes before and after
annealing at 450 °C.

C.  Interfacial  reaction  measurements 

Figure 4 shows the average sheet resistance and the
standard deviation results of the four tested barriers deposited
over the three types of substrates. Table II gives the sheet
resistance for each of the compounds used in this part of the
study. If the metallic multilayers keep their integrity, it is
possible to estimate the expected sheet resistance value from
the known thickness and resistivity of each individual film
and substrate given in Table II. Since all the individual films
are connected in parallel, the net sheet resistance of each
wafer should be very close to the AlSiCu sheet resistance of
300 mΩ/□.
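
The parallel combination described above can be sketched numerically. The example below takes the thicknesses and resistivities listed in Table II for a Ti/TiN[C]/AlSiCu/TiN stack on oxide (so the substrate does not conduct) and assumes that the layers remain unreacted; the function names are illustrative only.

```python
def sheet_resistance(resistivity_uohm_cm, thickness_nm):
    """Sheet resistance in ohm/sq from resistivity (uohm cm) and thickness (nm)."""
    resistivity_ohm_cm = resistivity_uohm_cm * 1e-6
    thickness_cm = thickness_nm * 1e-7
    return resistivity_ohm_cm / thickness_cm


def parallel(resistances):
    """Net resistance of resistors connected in parallel."""
    return 1.0 / sum(1.0 / r for r in resistances)


# (layer, resistivity in uohm cm, thickness in nm), values from Table II.
stack = [
    ("TiN ARC", 40.0, 40.0),
    ("AlSiCu", 3.0, 100.0),
    ("TiN[C] barrier", 40.0, 95.0),
    ("Ti contact layer", 80.0, 10.0),
]

layer_rs = {name: sheet_resistance(rho, t) for name, rho, t in stack}
net_rs = parallel(layer_rs.values())

for name, rs in layer_rs.items():
    print(f"{name:18s} {rs:6.2f} ohm/sq")
print(f"net stack          {net_rs:6.3f} ohm/sq")
```

The net value of roughly 0.27 Ω/□ is dominated by the AlSiCu layer, which is why any measured sheet resistance well above 300 mΩ/□ points to consumption of the aluminum alloy.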

Before annealing, Fig. 4(a) clearly shows that both the
TiN[C] and TiN[B] barrier layers without a Ti glue layer
maintain an average sheet resistance around 300 mΩ/□.


[Fig. 4 panels: (a) and (b) as deposited; (c) and (d) after 60 min at 450 °C. Horizontal axis: barrier type, TiN(C), TiN(C)/Ti, TiN(B)/Ti, TiN(B).]

Fig. 4. Average sheet resistances [(a) and (c)] and standard deviations [(b)
and (d)] before and after annealing at 450 °C for barriers with and without a
Ti glue layer deposited on three different substrates: silicon (hatched bar),
oxide (black bar), and polysilicon (white bar).


Table II. Thickness, resistivity, and sheet resistance of all the layers and
substrates used in the electrical measurements.

Layer/substrate     Thickness   Resistivity   Sheet resistance
                    (nm)        (μΩ cm)       (Ω/□)
TiN ARC             40          40            10.0
AlSiCu              100         3.0           0.30
TiN[C] barrier      95          40            4.2
TiN[C] barrier      80          40            5.0
TiN[B] barrier      35          65            18
TiN[B] barrier      30          65            21
Ti contact layer    10          80            80
Ti glue layer       10          80            80
Si substrate                                  1.5
PolySi                                        30
SiO2                                          very high


[Fig. 5 panels: (a) and (b) as deposited; (c) and (d) after 60 min at 450 °C.]

Fig. 5. p+/n− junction leakage currents [(a) and (c)] and standard deviations
[(b) and (d)] before and after annealing at 450 °C for barriers with and
without a Ti glue layer and reference wafers measured on two different
types of contact: 2.0 × 2.0 μm (black bar) and 4.0 × 4.0 μm (white bar).


However, only the TiN[C] barrier shows a small standard
deviation of the sheet resistance, while the TiN[B] barrier shows
the largest standard deviation on most substrates. When
a Ti glue layer is present, the sheet resistance varies between
400 and 750 mΩ/□ on all substrate types. After the
hour-long 450 °C annealing, the sheet resistance worsens for all
barrier types, as can be seen in Fig. 4(c). However, the
increase in the sheet resistance is small for the TiN[C] and TiN[B]
barriers without a glue layer and large for the barriers with the
Ti glue layer. The standard deviation remains quite similar
before and after the annealing.

D.  Junction  leakage  measurements 

Figure 5 shows the average p+/n− junction leakage and
the standard deviation results of the four tested barriers and
of the two reference wafers without barrier. Before
annealing, Fig. 5(a) shows that the barriers without a Ti glue layer
have a leakage current similar to the leakage of the first
reference wafer. Furthermore, the standard deviations are
small. For the barriers covered with a glue layer, the leakage
is many times higher than the desired value of 10 pA, with a
standard deviation of the same order of magnitude as the
average value. The junction leakage of the wafers having a
Ti glue layer increases by more than one decade after a 60
min thermal stress at 450 °C, while a low average value is
obtained for the barriers without a Ti glue layer. However,
the standard deviations are approximately the same for all
barrier types, typically a few pA higher than the measured
standard deviation of the reference wafers.


E.  Contact  resistance  measurements 

Figure 6 shows the average and the standard deviation of
the contact resistance of (a) the 1.0 μm n+ chain, (b) the 1.2 μm n+
chain, (c) the 1.0 μm p+ chain, and (d) the 1.2 μm p+ chain before
and after a 450 °C, 60 min thermal stress. Only the barriers
without a Ti glue layer are analyzed along with the two
reference wafers, since all previous measurements had shown that
a Ti glue layer worsens the electrical properties of a device.

For the n+ chains, it is seen that before annealing the
average resistance of about 10 kΩ is the same for the barrier
and reference wafers, with a standard deviation around 1 kΩ.
After annealing, the barrier wafers have a slightly lower
resistance than the reference wafers for both contact sizes. The
standard deviations follow the same trend. In the case of the p+
chains, the average resistance for the barrier wafers is
already lower than the average for the reference wafers before
annealing. This difference is even more striking for the
standard deviations. After annealing, the lower resistance of the
barrier wafers compared with the reference wafers is still
observed, but not so strongly for the 1.0 μm contact size.
However, the standard deviations are very different between
wafers with and without barrier. The above results are also
observed for the 1.5 and 1.8 μm contact chains, but the
differences are less apparent.

In Fig. 7, the average and the standard deviation of the
contact resistance of (a) the 1.0 μm gate polysilicon chain, (b) the
1.5 μm gate polysilicon chain, (c) the 1.0 μm capacitor polysilicon
chain, and (d) the 1.5 μm capacitor polysilicon chain before
and after a 60 min thermal stress at 450 °C are presented.

As is the case for the n+ and p+ chains, the gate and
capacitor polysilicon chains have a better average resistance
when a barrier is present on the wafers, especially after the
annealing. However, the difference between a barrier and a
reference wafer is less apparent for the gate polysilicon chain
for both sizes. It can be seen that in the case of the capacitor
polysilicon chains, the difference in the average resistance
between the barrier and the reference wafers is smaller for
the 1.5 μm contact size. In all cases, there is no significant
difference between the two analyzed barriers after the
450 °C, 60 min thermal stress; the thin 35 nm TiN[B] barrier
gives results as good as the thicker 95 nm TiN[C] barrier.

[Fig. 6 panels: (a) 1.0 μm N+ contacts, (b) 1.2 μm N+ contacts, (c) 1.0 μm P+ contacts, (d) 1.2 μm P+ contacts. Horizontal axis: TiN(C), TiN(B), Ref 1, Ref 2. Legend: as deposited; 450 °C, 60 min.]

Fig. 6. Average contact chain resistances and standard deviations of 1.0 μm n+ (a), 1.2 μm n+ (b), 1.0 μm p+ (c), and 1.2 μm p+ (d) contact chains before
(black bar) and after (white bar) a 450 °C annealing.


IV.  DISCUSSION 

A.  Standard  deviation  vs  average  value 

Practically all figures
presented in this article show results in the form of average
and standard deviation values. There are target average
values which must be obtained for all the electrical
characteristics measured in this work. In industry, this target value
represents the normal operating characteristic of the device.
On the other hand, the standard deviation of the
measurements is a measure of the reproducibility and
uniformity of the deposition process of the metallization
multilayers. The average and the standard
deviation are calculated from 27 different measurements on
one wafer in the case of leakage and contact resistance, while
for the sheet resistance they are determined from 49
measured sites on each of the two wafers.

For industry purposes, it is as important to have small,
tightly distributed standard deviations as it is to attain the
desired target value. Tightly distributed deviations
indicate the excellent uniformity of a given layer throughout
the wafer as well as the good reproducibility of the deposition
process of the metallization. These brief considerations
help in understanding the conclusions that are reached in
the following discussion.

[Fig. 7 panels: (c) 1.0 μm capacitor polysilicon contacts, (d) 1.5 μm capacitor polysilicon contacts.]

Fig. 7. Average contact chain resistances and standard deviations of 1.0 μm gate polysilicon (a), 1.5 μm gate polysilicon (b), 1.0 μm capacitor polysilicon
(c), and 1.5 μm capacitor polysilicon (d) contact chains before (black bar) and after (white bar) a 450 °C annealing.


B.  Ti  contact  layer 

The results presented in Fig. 2 may be understood if a
reaction between the Ti contact layer and the underlying Si
substrate is occurring. The reaction product is either a TiSi or
a metastable C49-TiSi2 phase. This hypothesis is
supported by the facts that the deposition of the Ti contact layer
is done on a substrate which is at a temperature of 400 °C
and that the deposition of the Al alloy layer is carried out at
475 °C. Assuming that 1 nm of Ti reacts with 1 nm of Si to
form 1 nm of TiSi, this latter phase will have a thickness of
3, 6, and 10 nm for the 3, 6, and 10 nm thick Ti contact
layers. In the case of the C49-TiSi2 phase, the Murarka and
Fraser rule predicts a thickness of 7.5, 15, and 25 nm for the
3, 6, and 10 nm Ti contact layers. The decrease of the
contact resistance with thickness (Fig. 2) may then be
related to the silicide thickness that is being formed.
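
The silicide thicknesses quoted above amount to a fixed thickness ratio per phase. The sketch below tabulates them; the ratio of 1.0 for TiSi is the assumption stated in the text, while the ratio of 2.5 for C49-TiSi2 is simply inferred from the quoted 7.5, 15, and 25 nm values and is not taken directly from the Murarka and Fraser reference.

```python
# Silicide thickness formed per unit thickness of Ti consumed.
TISI_RATIO = 1.0        # assumption stated in the text
C49_TISI2_RATIO = 2.5   # inferred from the thicknesses quoted in the text


def silicide_thickness_nm(ti_thickness_nm, ratio):
    """Silicide thickness in nm if the whole Ti layer is converted."""
    return ti_thickness_nm * ratio


ti_layers_nm = (3, 6, 10)
tisi = [silicide_thickness_nm(t, TISI_RATIO) for t in ti_layers_nm]
tisi2 = [silicide_thickness_nm(t, C49_TISI2_RATIO) for t in ti_layers_nm]

for t, a, b in zip(ti_layers_nm, tisi, tisi2):
    print(f"Ti {t:2d} nm -> TiSi {a:4.1f} nm, C49-TiSi2 {b:4.1f} nm")
```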

A thickness of 3 nm for a silicide corresponds to only a
few atomic layers of a compound. In this case, it is very
likely that the reacted layer between the barrier and the
substrate is not a uniform silicide, which results
in a nonuniform contact and thereby increases the contact
resistance. On the other hand, at the maximal silicide
thickness, i.e., 25 nm, the layer is certainly uniform and, at the
same time, is sufficiently thin to avoid a detrimental effect on
the contact resistance due to dopant loss from the contacted
substrates. This is why it was decided to use a 10 nm Ti
contact layer throughout the rest of the study.

C.  Ti  glue  layer 

Figures 3-5 clearly indicate that the use of a Ti glue layer
in an integrated Ti/TiN/Ti/AlSiCu/TiN process sequence has
a negative effect on p+/n− junction leakage, on n+ and p+
contact chain resistance, as well as on sheet resistance. The
results shown in Fig. 4 may provide a possible explanation of
the detrimental effect of the Ti glue layer. It may be recalled
that an average sheet resistance of 300 mΩ/□ is expected for
all contacts if no reaction is taking place between any of the
deposited layers. Only the barriers having the Ti glue layer
show a resistance higher than this expected value before the
450 °C annealing. A chemical reaction between the Ti glue
layer and the AlSiCu layer for as-fabricated wafers is surely
occurring during the deposition of the AlSiCu metal layer at
475 °C. In fact, Al and Ti readily react to form the TiAl3
compound at temperatures as low as 400-450 °C.

Table II indicates that only 10 nm of Ti was deposited as
a glue layer. When the Ti film thickness is smaller than what
is necessary to form a stoichiometric TiAl3 phase with the
diffusing Al, the excess Al could interact with the underlying
layers. The average sheet resistance of 400-500 mΩ/□
before annealing indicates that about half of the Al alloy
thickness has been consumed through reaction. This indicates that
the TiN barrier layer as well as the Ti contact layer beneath
the Ti glue layer may have also reacted with the Al during
the deposition of the AlSiCu layer. In the case of the
TiN[B](30 nm)/Ti(10 nm) barrier, the measured sheet
resistance indicates an almost complete consumption of the
AlSiCu layer. Since the TiAl3 byproduct has a bulk resistivity
of about 50 μΩ cm, a 100 nm film would have a sheet
resistance of 500 mΩ/□, the value actually obtained before
annealing.

As Fig. 4 shows, the sheet resistance is worse after a
450 °C annealing in the case of barriers covered with the Ti
glue layer. Furthermore, Fig. 5 clearly shows that such a
multilayer also has very high leakage before and after this
annealing. Since the Al layer is almost fully converted into
TiAl3, this means that Al must have consumed the complete
Ti/TiN/Ti multilayer and has probably penetrated the doped
substrate. This, in turn, results in Si (and dopant) diffusion
inside the TiAl3 compound, which leads to spiking problems,
especially for the 1.0 μm n+ and p+ contact chains, which
do not show satisfactory contact resistance [see Figs. 3(a)
and 3(b)].

The  above  paragraphs  indicate  that  the  most  plausible 
cause  of  the  detrimental  effect  of  the  Ti  glue  layer  on  the 
electrical  performance  of  diodes  and  contact  chains  is  the 
reaction  of  the  full  Ti  glue  layer/TiN  barrier/Ti  contact  layer 
with  Al  during  the  deposition  of  the  AlSiCu  alloy  at  475  °C. 

D.  TiN  barrier 

When the metallization multilayer has no Ti glue layer,
the results of Figs. 2-7 show the excellent electrical
properties of the TiN barrier/Ti contact layer when compared with
reference wafers. This means that the TiN/Ti bilayer does not
react much with Al during the 475 °C deposition of the
AlSiCu alloy. This is understandable since it is known that TiN
is thermally stable up to 800 °C in contact with Si and
does not react much with Al up to 500 °C.

As Figs. 5-7 show, the presence of the TiN barrier/Ti
contact layer results in a leakage current slightly higher than
that of the reference wafers but in a much lower contact
chain resistance than the reference wafers (especially after
the 450 °C annealing). These results point out the necessity
of using a system of barrier and contact layer in order to
stabilize the contact resistance to n+ diffusion, to p+ diffusion,
to gate polysilicon (N-doped), and to capacitor polysilicon
(N-doped) during the thermal stress at 450 °C. It is generally
known and predicted by the Al-Si phase diagram that
aluminum-doped silicon (p-type) precipitates at the bottom
of these contacts. The associated rectifying effects explain
the increase of the contact resistance after thermal stresses.

The  above  paragraphs  justify  why  the  Ti  glue  layer  must 
be  avoided  in  the  metallization  multilayers  and  why  the  TiN 
barrier/Ti  contact  layers  are  considered  to  be  necessary  in  the 
same  metallization  and  have  such  good  electrical  properties. 

V.  CONCLUSION 

The integration of a Ti/TiN/Ti/AlSiCu/TiN metallization
sequence within a single piece of metallization equipment giving a
net productivity of 18 wafers per hour (WPH) is desirable for
cost-of-ownership reasons. This sequence integration is
possible with the high quality TiN barriers obtained with
collimation. The two analyzed barriers, 95 nm TiN(C) and 35 nm
TiN(B), give good results and the same productivity of 18
WPH. The better stability of the 95 nm TiN(C) barrier
(smaller standard deviation) in the sheet resistance tests
indicates a much better reaction uniformity.

The optimization of the thickness of the Ti contact layer
shows that a 10 nm thick Ti contact layer is enough to ensure
good contact resistance to the analyzed n+ diffusion, p+
diffusion, gate polysilicon, and capacitor polysilicon. The
analysis of the junction leakage results, of the contact chain
resistance results, and of the sheet resistance results indicates
that the Ti glue layer of an integrated Ti/TiN/Ti/AlSiCu/TiN
metallization sequence causes an important degradation of
the electrical properties of the devices before and after a
450 °C, 60 min thermal stress. It was discussed that this
degradation is due to a reaction between the Ti glue layer and
the AlSiCu layer during the deposition of the alloy, since the
wafer is heated during part of the deposition. It is
argued that the Al is in fact reacting with the underlying
Ti/TiN/Ti/substrate during this processing step, resulting in
the formation of the TiAl3 compound as well as spiking in
the n+ and p+ diffusion areas.

The stress and yield behavior of Al alloys, Al(Cu) and Al(SiCu), has been investigated using
bending  beam  and  x-ray  diffraction  techniques,  and  the  effect  of  passivation  on  the 
stress-temperature  behavior  of  alloys  is  examined.  A  stress  analysis  of  trilayer  (Si02/Al  alloy 
film/Si)  structure  was  formulated  in  order  to  deduce  the  stress  of  the  confined  metal  films  from 
bending  beam  measurements.  The  stress-temperature  behavior  of  Al  alloy  films  obtained  from 
bending  beam  techniques  was  in  agreement  with  that  determined  directly  by  x-ray  diffraction 
techniques. The grain structure and precipitate morphology of these alloy films were studied using
transmission electron microscopy. The presence of passivation leads to a change in the microstructure of Al
alloys,  resulting  in  hardening  for  the  Al(SiCu)  films  and  softening  for  the  Al(Cu)  films  when 
compared  with  the  unpassivated  films  at  room  temperature.  Our  results  reveal  that  oxide  thickness, 
Al  thickness,  and  microstructure  play  an  important  role  in  determining  the  plastic  deformation 
behavior  of  the  metal  films,  and  can  satisfactorily  explain  the  contrasting  behavior  observed  for 
Al(SiCu)  and  Al(Cu)  films.  ©  1996  American  Vacuum  Society. 


I.  INTRODUCTION 

Metal films deposited on rigid substrates are considerably
stronger than their bulk forms and show higher yield stresses
during thermal cycling. Proposed strengthening
mechanisms have focused on dislocation pinning by obstacles
associated with precipitates and the film/substrate interface. Oxide
films are commonly used to cover metal films and lines in
multilevel wiring for on-chip interconnects. The overlayer
passivation is expected to further strengthen the metal films
and lines due to the additional constraint imposed on
dislocation motion. Various models have been developed to account
for the yield behavior of passivated films. One specific
model proposed for Pb films considered the energetics of
dislocation motion, taking into account the elastic strain
energy of the film and the dislocation line energy at interfaces
and grain boundaries, where film thickness, grain size, and
overlayer thickness were parameters. This model was
subsequently extended to consider the dislocation behavior and
density observed by transmission electron microscopy
(TEM). Recently, similar approaches have been applied to
model the yielding behavior of Al alloy films used in
integrated circuits.

Passivation films are usually deposited at temperatures
about 100-200 °C higher than those of the metal films. The
elevated temperature often changes the microstructure of the
metal film, for example, by grain growth or precipitate
redistribution. A different microstructure can change the
stress behavior of the film, particularly its yield characteristics.
The passivation film and its effect on microstructure
have not been considered in previous studies.

a) Present address: Hyundai Electronics, San 136-1, Ami-ri, Budal-eub,
Ichon-Si, Kyoungki-do, 467-701, Korea; Electronic mail:
isyeo@ccwf.cc.utexas.edu

In  this  article,  we  first  report  the  results  of  stress  measure¬ 
ments  using  bending  beam  and  x-ray  diffraction  techniques 
of  Al(Cu)  films  with  and  without  passivation.  These  two 
techniques  are  expected  to  give  the  same  results,  and  the 
experiments  have  been  designed  to  include  the  stress  of  both 
metal  film  and  passivation.  In  applying  the  bending  beam 
technique  to  a  bilayer  film  structure,  the  stress  of  the  indi¬ 
vidual  films  has  to  be  extracted  from  the  combined  deforma¬ 
tion  of  the  metal  film  and  the  passivation  using  an  elastic, 
perfect  plastic  stress  model.  Such  a  stress  analysis  is  not 
required  for  x-ray  techniques  since  it  directly  measures  the 
strain  of  the  metal  film.  Therefore,  the  x-ray  results  are  use¬ 
ful  for  checking  the  bending  beam  results  and  for  validating 
the  stress  analysis.  After  validation,  one  can  deduce  the  stress 
of  the  passivated  metal  film  from  bending  beam  measure¬ 
ments. 

We investigate the effects of passivation on the stress and
yield behavior of two different alloy films, Al(Cu) and
Al(SiCu). These two systems are chosen for our study because
they are commonly used for on-chip interconnects. The results
show that the passivation increases the Al(SiCu) film
stresses but decreases the Al(Cu) film stresses at room
temperature. The contrasting behavior of the two Al alloy films
cannot be attributed solely to the constraint effect of the
passivation overlayer; the role of the microstructure must be
taken into account. The yield behavior of the two films is
analyzed using a film strength model formulated by
Chaudhari, which quantifies the effect of grain size and
passivation thickness on the yield strain of the film. To
provide input for this analysis, TEM is used to determine
changes in grain size and precipitate formation of the alloy
films before and after deposition of the passivation. This
model enables us to understand the behavior of the two Al
alloy films by assessing the relative importance of
microstructure versus passivation thickness in determining their
yield strength.


II. EXPERIMENTAL DETAILS

Silicon wafers (625 μm thick) were oxidized (200
nm thick) to prevent reaction between metals and the Si
substrate. For passivated Al(Cu) samples, a 0.6 μm thick
Al(1 wt % Cu) film was sputter deposited at about 175 °C
onto a Si substrate. The metal film was then encapsulated
with a 0.4 μm thick phosphorus-doped silicate glass (PSG)
followed by 0.7 μm plasma enhanced CVD (PECVD) SiNx
deposited at 400 °C. An Al(Cu) and a SiNx/PSG film were
also deposited on different wafers to supplement the study of
the bilayer films. For passivated Al(SiCu) samples, an
Al(1 wt % Si-0.5% Cu) film was deposited by sputtering at
250 °C to a thickness of 0.4 μm. The tetraethyl-orthosilicate
(TEOS)-based PECVD SiO2 film (2.0 μm thick) was
subsequently deposited at 400 °C. Single-layer films of
Al(SiCu) and TEOS SiO2 were deposited onto different wafers
for comparison. All samples were annealed above
400 °C to homogenize the composition of the Al alloy and to
provide a common point from which comparisons could be
made. The back side of the wafers was thinned to a thickness of
~525 μm in order to improve the sensitivity of the measurements
and polished to provide a mirror surface for curvature
measurements. This process step left the protective oxide
only on the front side of the samples. Afterward, these
samples were cut into 3 × 45 mm strips using a wafer dicing
machine.

Thin film stresses were measured as a function of temperature
by a bending beam technique, using a modified version
of a system described in detail elsewhere. After a
stress-temperature measurement was complete, the film(s)
were etched off the Si substrate to determine the curvature
corresponding to zero stress. The effect of the protective oxide
on the stress measurement was eliminated by measuring
the radii of curvature of the etched sample with the oxide
through the whole thermal cycle and subtracting the measured
radii from those measured with the passivation/Al films.
The stress variation versus temperature observed during the
first thermal cycle between 25 and 400 °C usually reflected
microstructure changes occurring in the film; after the first
cycle, results became reproducible for subsequent thermal
cycles. The heating and cooling rate employed for thermal
cycling was 5 °C/min. The stress vs temperature data shown
in this article have been measured up to three times, and
show comparable stress values.

X-ray measurements were carried out using a diffractometer
equipped with a high-intensity rotating anode source,
which was designed for fast measurements of triaxial stresses
in thin, patterned films as a function of temperature. High
intensity low index diffraction planes were used to reduce
acquisition time. The unpassivated and passivated Al(Cu)
films used for x-ray measurements were prepared under conditions
identical to those used for the bending beam samples.
Details of the x-ray measurements have been published
earlier.


III. RESULTS AND DISCUSSION


A. Al(Cu) films

Bending beam measurements were first carried out on the
individual SiNx/PSG and Al(Cu) films deposited on Si substrates.
For a single-layer film, the stress is related to the
measured radius of curvature of the beam by Stoney's
equation

$$\sigma_2 = \frac{E_1 t_1^2}{6(1-\nu_1)\, t_2 R}, \qquad (1)$$

where $E_1/(1-\nu_1)$ is the biaxial modulus of the substrate, $t_1$ and
$t_2$ are the substrate and film thicknesses, and $R$ is the radius of
curvature of the substrate measured from the film side. Stresses
in the SiNx/PSG (treated as a single-layer film) and Al(Cu)
film are evaluated using Eq. (1), and results are shown in
Figs. 1(a) and 1(b), respectively. The stress-temperature data
of SiNx/PSG in Fig. 1(a) show a linear dependence with
small stress hysteresis, and thus reveal an elastic behavior of
the film in the temperature range studied. In Fig. 1(b), the
stress in the unpassivated Al(Cu) film shows an elastic behavior
upon heating from room temperature to about 200 °C.
Above 200 °C, stress reduction via plastic flow dominates
until the end of the heating cycle at 400 °C. Upon cooling
from 400 °C, the film shows a brief elastic region before
undergoing plastic yield until about 150 °C. Below 150 °C,
the stress-temperature curve exhibits a distinct change in
slope due to film hardening. Similar results have been reported
by other investigators.
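As an illustration of how Eq. (1) converts a measured curvature into a film stress, the following minimal Python sketch evaluates Stoney's equation. The function name, the SI units, and the 50 m radius of curvature are hypothetical choices made only for this example; the substrate constants are the Si values of Table I and the thicknesses are those quoted for the thinned wafer and the Al(Cu) film.

```python
def stoney_stress(E_sub, nu_sub, t_sub, t_film, radius):
    """Film stress from Stoney's equation, Eq. (1).

    E_sub  : Young's modulus of the substrate (Pa)
    nu_sub : Poisson ratio of the substrate
    t_sub  : substrate thickness (m)
    t_film : film thickness (m)
    radius : radius of curvature of the substrate (m), measured from
             the film side after subtracting the zero-stress reference
    """
    biaxial_modulus = E_sub / (1.0 - nu_sub)
    return biaxial_modulus * t_sub**2 / (6.0 * t_film * radius)


# Hypothetical example: 0.6 um film on a 525 um Si strip, R = 50 m (assumed).
sigma_pa = stoney_stress(E_sub=130e9, nu_sub=0.28,
                         t_sub=525e-6, t_film=0.6e-6, radius=50.0)
print(f"film stress = {sigma_pa / 1e6:.1f} MPa")
```

Because the stress scales as 1/R, halving the radius of curvature doubles the inferred stress, which is why thinning the substrate improves the sensitivity of the measurement.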

Bending beam measurements were carried out on the
(SiNx/PSG)/Al(Cu) bilayer on Si, and the observed stress versus
temperature behavior is shown in Fig. 1(c). The stress
values shown here are the average stress $\langle\sigma\rangle$ for the composite
layer, determined according to the following extended
Stoney's equation:

$$\sigma_2 t_2 + \sigma_3 t_3 = \langle\sigma\rangle\,(t_2 + t_3) = \frac{E_1 t_1^2}{6(1-\nu_1)\, R}, \qquad (2)$$

where $\sigma_2$ and $\sigma_3$ are the stresses in the overlayer films of
thickness $t_2$ and $t_3$, $E_1/(1-\nu_1)$ is the biaxial modulus of the
substrate, and $R$ is the substrate radius of curvature measured from
the film side. The thickness of the substrate is much larger than those
of the films. Bending beam techniques measure the stress of the
composite layer; hence a detailed stress analysis for the composite
layer is required to deduce the individual layer stresses. This
analysis is presented in a later section.

Fig. 1. Stress vs temperature curves for (a) a 0.7 μm/0.4 μm SiNx/PSG film,
(b) a 0.6 μm Al(Cu) film, and (c) a 0.7 μm/0.4 μm (SiNx/PSG)/0.6 μm
Al(Cu) bilayer structure.

The stress vs temperature behavior of the unpassivated
Al(Cu) film was also measured by x-ray diffraction, and the
results are shown in Fig. 2(a), where stress data obtained
using (111) and (222) reflections are compared. Good agreement
between the corrected (111) data and the (222) data
validates the alignment correction procedures developed for
the x-ray measurements, because the (222) reflection gives a higher
reflection angle and a more accurate stress value at the expense
of measurement time. Figures 1(b) and 2(a), which are
compared in Fig. 2(b), show quantitative agreement between
the stresses determined by bending beam and x-ray
techniques, and thus validate the equivalence of these two
methods for stress measurements. The texture of the Al
films shows a strong (111) orientation.

Fig. 2. (a) Stress vs temperature curve for an unpassivated Al(Cu) film
obtained by x-ray diffraction techniques; stress data obtained using (111)
and (222) reflections are compared and good agreement is observed. (b)
Comparison of the stress versus temperature curve for the unpassivated
Al(Cu) film obtained by x-ray diffraction techniques and bending beam
techniques. Qualitative agreement as shown validates the bending beam
method used for stress measurements in this study.

Stress analyses for bilayers of elastic materials have been
developed by several authors, and extended to multilayer
films. Some of these studies have concluded that stresses
in multilayer films can be adequately described by a
weighted average of stresses in individual films. Since
these analyses were limited to elastic materials and may not
be applicable to passivated metal films where the metal film
deforms plastically, we have developed a stress model for the
trilayer (SiO2/Al/Si substrate) structure, in which the interlayer
film is assumed to deform in an elastic, perfectly plastic
manner. Calculations based on this model have been performed
for a 2 μm SiO2/0.4 μm Al/500 μm Si trilayer structure,
and the resulting stresses for the individual layers and
the composite bilayer stresses are shown in Fig. 3. The
parameters used in the calculation are given in Table I. The
critical temperature for the onset of plastic deformation is
assumed to be 200 °C. The intrinsic stress values are given at
400 °C for computational simplicity, and correspond to the
values needed to shift the Al and oxide curves up or
down to resemble the stresses in the elastic regime. As designed,
the stress vs temperature behavior of the Al exhibits
linearity between room temperature and 200 °C, whereupon
further increases in temperature induce almost no change in
the Al stress. (Note that the stress-temperature slope of the Al
film above 200 °C is not zero, since the presence of the oxide
does influence stress in the Al film slightly.)
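The elastic, perfectly plastic behavior assumed for the Al layer can be pictured with a much simpler single-film sketch than the trilayer model itself. The Python fragment below is only an illustration under stated assumptions: it uses the standard biaxial thermal-mismatch slope dσ/dT = E/(1-ν)·(α_sub - α_film), which is not written out in the text, takes the Al and Si constants from Table I (expansion coefficients assumed to be per °C), ignores the small influence of the oxide, and holds the stress constant above the assumed 200 °C onset of plastic flow. The function names and the zero starting stress are hypothetical.

```python
def thermoelastic_slope(E_film, nu_film, alpha_sub, alpha_film):
    """Elastic stress change per degree for a thin film on a thick substrate."""
    return E_film / (1.0 - nu_film) * (alpha_sub - alpha_film)


def heating_stress(T, T0, sigma0, slope, T_yield=200.0):
    """Film stress during heating for an elastic, perfectly plastic film.

    Below T_yield the stress follows the elastic line through (T0, sigma0);
    above T_yield it stays at the value reached at T_yield (no hardening).
    """
    T_eff = min(T, T_yield)
    return sigma0 + slope * (T_eff - T0)


# Table I values: E in MPa so that the slope is in MPa per degree C.
slope = thermoelastic_slope(E_film=69e3, nu_film=0.33,
                            alpha_sub=2.14e-6, alpha_film=23.4e-6)
for T in (25.0, 100.0, 200.0, 300.0, 400.0):
    print(T, round(heating_stress(T, T0=25.0, sigma0=0.0, slope=slope), 1))
```

In this simplified picture the slope above 200 °C is exactly zero; the trilayer calculation differs in that the oxide leaves a small residual slope in the Al stress.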

Fig. 3. Calculated stress-temperature curves for a SiO2/Al bilayer structure
on a Si substrate. The Al film is assumed to deform in an elastic, perfectly
plastic manner (no strain hardening) above 200 °C.

Significantly, the stress of the oxide is nearly unaffected
by the presence of the plastically deforming Al layer. (However,
it should be noted that plastic deformation of the Al
layer is affected by the presence of the overlayer oxide.) The
stress calculated for a 2 μm oxide deposited directly on Si is
nearly coincident with the stress in the oxide deposited on
Al, as shown in Fig. 3. Indeed, separate calculations for various
combinations of oxide and metal bilayer reveal that more
than an order of magnitude difference in the thickness ratio is
required before the stress in the oxide on the deforming Al is
significantly different from that calculated for an oxide
deposited directly on Si. Thus, stresses in multilayer films with
comparable respective thicknesses can still be described by a
weighted average of stresses in individual films even when
one of the films experiences plastic deformation. This stress
superposition principle is applicable to many stress-related
problems in microelectronics, and the stress of passivated
metal films can be deduced from bending beam measurements
by subtracting the stresses in the passivation layer.
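The subtraction described above follows directly from the thickness-weighted average of Eq. (2). A minimal Python sketch is given below; the function names and the numerical values are hypothetical, and the passivation stress is assumed to be the one measured on a separate single-layer sample at the same temperature, as the superposition argument permits.

```python
def composite_average(sigma_metal, t_metal, sigma_pass, t_pass):
    """Thickness-weighted average stress of a metal/passivation bilayer, Eq. (2)."""
    return (sigma_metal * t_metal + sigma_pass * t_pass) / (t_metal + t_pass)


def metal_stress_from_bilayer(avg_stress, t_metal, t_pass, pass_stress):
    """Stress in the passivated metal film deduced from the bilayer average.

    avg_stress  : average bilayer stress from the bending beam measurement
    pass_stress : stress of the passivation alone, measured on its own wafer
    Thicknesses may be in any common unit; stresses share one unit.
    """
    return (avg_stress * (t_metal + t_pass) - pass_stress * t_pass) / t_metal


# Hypothetical numbers: 0.4 um metal under 2.0 um oxide, stresses in MPa.
sigma_metal = metal_stress_from_bilayer(avg_stress=-50.0, t_metal=0.4,
                                        t_pass=2.0, pass_stress=-100.0)
print(round(sigma_metal, 1))
```

Because the oxide is several times thicker than the metal, a small error in the passivation stress is amplified by the ratio of thicknesses when it is subtracted, which is one reason an independent x-ray check is valuable.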

The  deduced  stress  behavior  of  the  passivated  Al(Cu)  film 
in  the  bilayer  is  shown  in  Fig.  4(a).  X-ray  diffraction  mea¬ 
surements  were  performed  to  directly  measure  the  stress  of 
the  passivated  Al(Cu)  film  and  the  results  are  compared  with 
the  bending  beam  result  shown  in  Fig.  4(a).  These  data  show 
a  good  agreement  between  the  results  from  both  measure¬ 
ments.  Since  x-ray  diffraction  directly  measures  the  strain 
(which  can  be  converted  to  the  stress)  of  the  passivated  metal 
layer,  the  observed  agreement  validates  the  stress  analysis  for 
the  passivated  metal  film. 

Table I. Parameters used in the trilayer calculations (Ref. 25).

Material          E (GPa)   ν      Thickness (μm)   α (×10⁻⁶)   Intrinsic stress (MPa)
Silicon           130       0.28   530              2.14        0.0
Aluminum          69        0.33   0.4              23.4        -550
Silicon dioxide   71.7      0.16   2.0              0.94        -10

Fig. 4. (a) Comparison of passivated Al(Cu) stresses obtained by bending
beam techniques and x-ray techniques. (b) Comparison of the stress-temperature
behavior for the passivated Al(Cu) film and for the unpassivated
Al(Cu) film. The differences can be explained by a model which calculates
the relative contributions of oxide thickness and grain size to the film yield
stress.

In Fig. 4(b), stresses of the unpassivated film are shown
along with those of the passivated Al(Cu) film. During heating,
both films exhibit an elastic regime, although the passivated
film starts exhibiting plastic flow at a lower temperature
(140 °C) than the unpassivated film (200 °C). Upon
cooling from 400 °C, the passivated film is elastic until about
350 °C, and it then deforms plastically down to room temperature.
Unlike the unpassivated sample, however, it does not show
any significant change in slope due to hardening upon cooling
below 150 °C. These results are discussed below in the
light of microstructural observations.


B.  Al(SiCu)  film 

The stress-temperature behavior of the individual 2 μm
SiO2 and 0.4 μm Al(SiCu) films deposited on Si is shown
in Figs. 5(a) and 5(b). The SiO2 film exhibits a smooth,
nearly linear stress response as a function of temperature.
With almost no hysteresis during thermal cycling up to
400 °C, our results differ from an earlier report for TEOS-based
SiO2 with a strong stress hysteresis. The Al(SiCu)
film behaves elastically upon heating between 25 and 220 °C
as shown in Fig. 5(b). Above 220 °C, the film starts yielding
in compression. Upon cooling from 400 °C, the film behaves
elastically until it reaches a stress of ~100 MPa at 330 °C,
and then it begins to yield in tension. As the film is cooled
below 180 °C, film strengthening occurs, and the Al(SiCu)
film behaves almost elastically. Similar results for Al(SiCu)
have been reported elsewhere by other groups. In Fig.
5(c), we present a stress-temperature curve for a
SiO2/Al(SiCu) bilayer film. Note that the stress-temperature
curve shifts toward compression because of the compressive
thick oxide film.

Fig. 5. Stress vs temperature curves for (a) a 2.0 μm TEOS SiO2 film, (b) a
0.4 μm Al(SiCu) film, and (c) a 2.0 μm SiO2/0.4 μm Al(SiCu) bilayer
structure.

We compare the stresses of two Al(SiCu) films, one passivated
by a 2 μm oxide and the other unpassivated, in Fig.
6. The results for the unpassivated film are reproduced from
Fig. 5(b). Those for the passivated Al(SiCu) have been
determined by the linear subtraction procedure given in the
Appendix. These two sets of data exhibit similar slopes between
25 and ~230 °C, where both oxide and metal films
behave elastically. However, above ~230 °C, the two sets of
data are quite different. Upon heating, the stress in the passivated
film flattens out above ~300 °C, while the compressive
stress in the unpassivated film begins to decrease at
~300 °C, continuing to 400 °C. This suggests that the presence
of the passivation layer reduces the rate of plastic flow
under compression in the Al(SiCu) film, and leads to a higher
yielding temperature for the confined film (~280 versus
~230 °C). When samples are cooled from 400 °C, both sets
of data exhibit a linear range to ~300 °C, a characteristic of
elastic behavior. However, between ~300 and 180 °C, the
two sets of data exhibit slopes distinct from one another
where Al(SiCu) is yielding in tension. Below ~180 °C, both
the unpassivated and the passivated Al(SiCu) films begin to
harden. Our data indicate that the passivated Al(SiCu) films
exhibit higher compressive stresses upon heating and higher
tensile stresses upon cooling than the unpassivated Al(SiCu)
films.

Fig. 6. Comparison of the stress-temperature behavior for a passivated 0.4
μm Al(SiCu) film and for an unpassivated 0.4 μm Al(SiCu) film. Stresses
determined for passivated and unpassivated Al(SiCu) films exhibit a trend
opposite from those shown in Fig. 4.


C.  TEM  examination  of  microstructure 

Compared with the stress measurements of their respective
unpassivated films, the passivated Al(SiCu) films show a
higher stress, but the passivated Al(Cu) films show a lower
stress at room temperature. The contrasting behavior may not
be attributed only to the effect of overlayer constraint. The data
suggest that microstructural parameters are also important.

Fig. 7. Microstructure of (a) the unpassivated Al(Cu) film and (b) the passivated
Al(Cu) film in plan view TEM. In (a), a dark arrow points out slip
traces, and a white arrow indicates an Al2Cu (θ phase) precipitate. In (b), a
dark arrow indicates a θ phase precipitate. Interestingly, no slip traces are
observed despite larger average grains.

Fig. 8. Microstructure of (a) the unpassivated Al(SiCu) film and (b) the
passivated Al(SiCu) film in plan view TEM. In (a), a dark arrow indicates an
Al2Cu (θ phase) precipitate, and an unfilled arrow indicates a Si precipitate.
In (b), slip traces are marked by dark arrows, and precipitates show similar
morphology to those in the unpassivated film.

The microstructure of Al(SiCu) films and Al(Cu) films
was investigated by plan view TEM after annealing at
400 °C for 30 min. For passivated samples, passivation films
were removed by reactive ion etching using CF4 gas.
Samples for TEM were prepared by thinning the wafer down
to 150 μm, cutting 3 mm diam disks, and dimpling to about
50 μm. The rest of the Si substrate in the dimpled region of the
sample was chemically etched off using a solution of 85%
HNO3 and 15% HF. The front side Al film was protected
with an etch-resistant lacquer at the disk edge and with the
protective oxide, which also works as an etch stop, in the
center region. The sample disk was finally ion milled until an
electron-transparent region was reached. A JEOL 1200 EX
STEM was used for the microscopy work.

Typical  microstructure  of  Al(Cu)  and  Al(SiCu)  films  is 
shown  in  Figs.  7  and  8,  respectively.  The  unpassivated 
Al(Cu)  film  [Fig.  7(a)]  exhibits  slip  traces  in  several  grains 
which  suggest  plastic  deformation.  Irregular  lines  inside 
some  grains  are  dislocations  generated  during  ion  milling. 
Figure  7(b)  shows  the  corresponding  microstructure  for  the 
passivated  Al(Cu)  film.  It  is  remarkable  that,  unlike  the  un¬ 
passivated  film,  no  slip  traces  were  visible  for  this  film  al¬ 
though  it  shows  much  larger  grains.  The  reason  for  this  is  not 
clear.  The  small  holes  seen  in  this  film  are  artifacts  of  sample 
preparation  procedure  involving  etching  and  ion  milling. 
Figures  8(a)  and  8(b)  show  the  microstructure  of  the  unpas¬ 
sivated  and  passivated  Al(SiCu)  film,  respectively.  The  mi¬ 
crostructures  of  the  films  are  quite  similar  to  each  other.  The 
observations  on  the  microstructure  of  these  films  are  summa¬ 
rized  in  Table  II. 

Table II. Summary of microscopic analysis for Al(Cu) and Al(SiCu).

Material                Average grain size   Microstructure
Passivated Al(Cu)       6.3 μm               θ precipitates, no slip traces
Unpassivated Al(Cu)     1.8 μm               θ precipitates, slip traces
Passivated Al(SiCu)     2.3 μm               Few θ precipitates, Si precipitate, few slip traces
Unpassivated Al(SiCu)   1.9 μm               Few θ precipitates, Si precipitate, few slip traces

Fig. 9. Grain size distributions of the unpassivated/passivated (a) Al(Cu)
film and (b) Al(SiCu) film. Passivation causes an increase in the average grain
size of the film.

The grain size of the unpassivated and passivated films
was determined from the TEM micrographs by taking the
mean of the smallest and the largest dimension of each grain.
Figures 9(a) and 9(b) show the grain size distribution of the
Al(Cu) film and Al(SiCu) film, respectively. Several observations
can be made from Figs. 9(a) and 9(b). First, passivated
films have larger grains than unpassivated films.
Second, unpassivated films exhibit a log-normal
grain size distribution. This suggests that the grains
of the unpassivated film grow normally. It also reveals
stagnation of normal grain growth, because the slopes of the
distributions are steeper at larger grain sizes. Third,
passivated films show a lower slope, which indicates a wider
spread in grain size. Furthermore, the passivated Al(Cu)
films apparently show a bimodal distribution. The two distinct
slopes seen in the distribution correspond to a subdistribution
of small, recrystallized grains which have grown normally,
and another subdistribution of large grains showing
abnormal growth of the pre-existing larger-than-average
grains. The observed average grain size is also summarized
in Table II. It should be noted that the passivated Al(SiCu)
film has an average grain size similar to that of the unpassivated
Al(SiCu) film, whereas the passivated Al(Cu) film has a
much larger grain size compared with the unpassivated
Al(Cu) film.
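The grain size measure described above (mean of the smallest and largest dimension of each grain) and a log-normal probability plot of the kind the slope discussion implies can be sketched in a few lines of Python. This is an illustration only: the function names, the plotting positions (i - 0.5)/n, and the sample dimensions are assumptions, and the exact way Fig. 9 was constructed is not stated in the text.

```python
import math
from statistics import NormalDist, mean


def grain_size(d_min, d_max):
    """Size of one grain: mean of its smallest and largest dimension."""
    return 0.5 * (d_min + d_max)


def average_grain_size(grains):
    """Average over (d_min, d_max) pairs measured on a micrograph."""
    return mean(grain_size(a, b) for a, b in grains)


def lognormal_plot_points(sizes):
    """(ln size, standard normal quantile) pairs for a probability plot.

    A log-normal population falls on a straight line; a change of slope
    or two distinct slopes signals a departure such as a bimodal mix.
    """
    ordered = sorted(sizes)
    n = len(ordered)
    std_normal = NormalDist()
    return [(math.log(s), std_normal.inv_cdf((i + 0.5) / n))
            for i, s in enumerate(ordered)]


# Hypothetical grain dimensions in micrometers.
grains = [(1.0, 2.0), (2.0, 4.0), (1.5, 1.5)]
sizes = [grain_size(a, b) for a, b in grains]
print(average_grain_size(grains))
print(lognormal_plot_points(sizes))
```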

The larger grain size in the passivated film may be attributed
to two factors. First, the high temperature oxide deposition
step promotes grain growth. Another contributing factor
for the larger grain size may be compressive stress-driven
grain growth when the Al alloy film is passivated.
Venkatraman et al. showed that grain growth of an Al(Cu) film
is influenced by the presence of oxide as well, i.e., grain size
is significantly larger in the region where the film is supported
by the layer of SiO2 (and hence is subjected to a compressive
stress during heating) than in the region of the free
standing film. Grain growth relaxes compressive stresses in
the film, so grain growth is likely to occur under compressive
stresses. Since an overlayer passivation can restrict hillock
growth, one of the primary mechanisms to relax compressive
stress during heating, the grain growth of the passivated
Al(Cu) is enhanced to relax the compressive stress.


D.  Yield  strengthening  model  for  metal  films 

In  a  previous  study  of  Pb  films,  a  thin  native  oxide  was 
found  to  enhance  the  film  yield  strength.^^  Since  a  native 
oxide  is  present  over  both  the  unpassivated  and  the  passi¬ 
vated  Al  alloy  films,  the  difference  observed  for  the  passi¬ 
vated  and  unpassivated  films  suggests  that  the  yielding  be¬ 
havior  depends  on  microstructure  as  well  as  passivation 
thickness.  Results  from  TEM  show  distinct  grain  structure 
for  the  two  Al  alloy  films.  Effects  of  grain  structure  and 
overlayer  constraint  are  taken  into  account  by  the  film 
strength  model  formulated  by  Chaudhari.^  Here,  we  apply 
this  model  to  analyze  the  yield  behavior  observed  for  the  Al 
alloy  films.  Since  the  plastic  deformation  of  Al  films  can  be 
explained  by  a  dislocation  glide  mechanism  through  the  ther¬ 
mal  cycle, Chaudhari’s  approach  for  the  stress  analysis  is 
considered  to  be  relevant. 

In Chaudhari's model, the energies per unit area for
strain $\epsilon$ in the film ($E_\epsilon$), dislocation lines at the film-substrate
($E_d^{s}$) and film-overlayer ($E_d^{o}$) interfaces, and dislocations lying
along grain boundaries ($E_b$) were added together to obtain
the total energy ($E_{tot}$). By setting $dE_{tot}/d\epsilon = 0$ (minimum
energy criterion), the critical strain for dislocation motion
(yielding) could be determined. For an isotropic film of
thickness $h$ with grains of diameter $d$, covered by an
overlayer oxide of thickness $h'$, the total energy is

$$E_{tot} = E_\epsilon + E_d^{s} + E_d^{o} + E_b \qquad (3)$$


or 


Table III. Material constants used in calculations (Refs. 25 and 26).

Material         E (GPa)   ν      μ (GPa)
Si               130       0.28   50.8
Al (at RT)       69        0.33   25.9
Al (at 316 °C)   49        0.33   18.42
SiO2             71.7      0.16   30.9
PSG              79        0.2    32.9
SiNx             150       0.25   60


Ei„f=Bhe^+{ef- e)2D\l3\ 

I 

+  {ef-e)2D'^\ 


2  1 
•+  T 


3(l-v)  3, 


2h\ 

In  —  + 1 
/ 


2  1 
+  - 


3(l-v)  3, 


/  2h'\ 

In  — ; —  +  1 

\  ^  / 


+  {€f- e) 


hbfif  ^/2 


+  1  + 


^  d  IT 

l\  1 


2  + 


Vl2 


d\ 

ln|  -  +1 


(4) 


where $\epsilon_f$ is the initial strain in the Al film before plastic
deformation occurs, $\nu$ is the Poisson ratio of Al, and $b = 2.86$ Å is
the length of the Burgers vector for dislocation motion along
⟨110⟩ directions on a {111} plane in Al. The constants $B$, $D$,
and $D'$ are given by


2(1 -V}’ 


D'  = 


(5) 


where the $\mu_i$ are the shear moduli. For the material constants
at room temperature and 316 °C given in Table III, the
critical strain for plastic deformation becomes


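$$\epsilon_{cr}(\mathrm{RT}) = \frac{0.70}{h}\left[\ln\frac{2h}{b}+1\right] + \frac{0.57}{h}\left[\ln\frac{2h'}{b}+1\right] + \frac{\ldots}{d}\left[\ln\frac{d}{b}+1\right],$$

$$\epsilon_{cr}(316\ ^{\circ}\mathrm{C}) = \frac{0.55}{h}\left[\ln\frac{2h}{b}+1\right] + \frac{0.57}{h}\left[\ln\frac{2h'}{b}+1\right] + \frac{1.07}{d}\left[\ln\frac{d}{b}+1\right], \qquad (6)$$
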

where $h$, $h'$, and $d$ are in units of angstroms. This model
assumes isotropic materials and does not consider variations
in grain orientation or alloy composition, which are known to
influence film strength. It is important to note that grain
size, film thickness, and passivation layer thickness all contribute
to $\epsilon_{cr}$. For constant $h$ and $d$, the presence of a thick
overlayer ($h'$) will increase $\epsilon_{cr}$, although the amount may be
small because of the logarithmic dependence. In contrast, if
the grain size $d$ increases due to the presence of the overlayer,
$\epsilon_{cr}$ may decrease. Since the overlayer causes an increase in
grain size, the two terms, overlayer thickness and grain size,
balance each other.

Table IV. Film and passivation thicknesses and grain size determined by TEM for the different samples. These
parameters predict a critical strain for yielding ε_cr at room temperature and at 316 °C. Δε_cr is obtained from
ε_cr(RT) - ε_cr(316 °C), and represents the extent of hardening during cooling.

Sample                  h (Å)   h' (Å)   d (Å)    ε_cr(RT)     ε_cr(316 °C)   Δε_cr
Unpassivated Al(SiCu)   4000    20       18 730   2.87×10⁻³    2.31×10⁻³      0.56×10⁻³
Passivated Al(SiCu)     4000    20 000   22 950   3.72×10⁻³    3.20×10⁻³      0.62×10⁻³
Unpassivated Al(Cu)     6000    20       18 220   2.31×10⁻³    1.85×10⁻³      0.46×10⁻³
Passivated Al(Cu)       6000    11 000   62 520   2.50×10⁻³    2.20×10⁻³      0.30×10⁻³
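To make the competition between overlayer thickness and grain size concrete, the sketch below evaluates the 316 °C form of Eq. (6) in Python. It rests on an assumed reading of that equation as a sum of three terms, (0.55/h)[ln(2h/b)+1] + (0.57/h)[ln(2h'/b)+1] + (1.07/d)[ln(d/b)+1], with all lengths in angstroms; this reading was checked here only against the two Al(SiCu) rows of Table IV. The function and variable names are hypothetical, and the room temperature expression is not included because its grain size coefficient could not be read from the source.

```python
import math

BURGERS_A = 2.86  # Burgers vector length in angstroms, from the text

# Coefficients (film, overlayer, grain) of the assumed 316 C form of Eq. (6).
COEFF_316C = (0.55, 0.57, 1.07)


def critical_strain(h, h_over, d, coeffs=COEFF_316C, b=BURGERS_A):
    """Critical strain for yielding from the assumed three-term form.

    h      : metal film thickness (angstroms)
    h_over : overlayer (oxide) thickness (angstroms)
    d      : grain diameter (angstroms)
    """
    c_film, c_over, c_grain = coeffs
    term_film = c_film / h * (math.log(2.0 * h / b) + 1.0)
    term_over = c_over / h * (math.log(2.0 * h_over / b) + 1.0)
    term_grain = c_grain / d * (math.log(d / b) + 1.0)
    return term_film + term_over + term_grain


# Al(SiCu) rows of Table IV (h, h', d in angstroms).
unpassivated = critical_strain(4000.0, 20.0, 18730.0)
passivated = critical_strain(4000.0, 20000.0, 22950.0)
print(f"{unpassivated:.2e}  {passivated:.2e}")
```

Raising h' from 20 to 20 000 Å increases the overlayer term only through the logarithm, while the modest growth in d lowers the grain term slightly, so for these inputs the overlayer term dominates the change.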

Several groups have recently discussed the interplay between pairs of the three terms in Eq. (6). For example, Doerner et al. and Venkatraman et al. have correlated changes in film strength with film thickness and grain size. Thompson has modeled this behavior and derived an equation that is similar in form to the sum of the first and third terms. For a given pair of passivated and unpassivated films, the present study keeps the first term in Eq. (6) constant, which allows us to examine the interplay between the effects of overlayer oxide thickness and grain size. The critical strains for plastic deformation predicted by Eq. (6) are given in Table IV. Unpassivated films are assumed to have a 20 Å native oxide, and the grain sizes listed were measured by TEM. The critical strains obtained for each of the Al films studied are reasonable, varying between 0.19% and 0.37%. We note a higher yield stress for the Al(SiCu) films than for the Al(Cu) films, and a higher yield stress at lower temperature. This is consistent with the measurements, although the critical strains obtained for Al(Cu) and Al(SiCu) cannot be compared quantitatively with each other because of the difference in alloy composition. The model also correctly predicts an increase in yield strain for passivated films at high temperature (316 °C), although it fails to predict a lower yield strain for the passivated Al(Cu) film than for the unpassivated Al(Cu) film at room temperature. When we compare Δε_cr for the different films in Table IV, the model predicts the hardening behavior correctly [larger hardening for passivated Al(SiCu) and less hardening for passivated Al(Cu) compared with their respective unpassivated films]. The disagreement between the measurement and the modeling for Al(Cu) films at room temperature could be attributed to the difference in reference points for the model. We conclude that Chaudhari's model successfully predicts the trend in the yield behavior observed for Al alloy films. In Al(SiCu), the grain size increase is negligible because the grain sizes of the unpassivated and the passivated films are similar. Thus, strengthening due to overlayer confinement is the more effective mechanism. In Al(Cu), by contrast, softening due to the grain size increase is greater than the strengthening due to overlayer confinement, and a decrease in room-temperature stress is observed in the passivated Al(Cu).


IV. CONCLUSION

In  this  article,  we  have  investigated  the  stress- 
temperature  behavior  of  unpassivated  and  passivated  Al  alloy 
films.  We  developed  a  method  to  deduce  the  stress  of  the 
confined  metal  films  from  bending  beam  measurements.  The 
bending  beam  results  were  compared  with  those  from  x-ray 
diffraction  and  a  good  agreement  was  observed.  The  Al  alloy 
films  show  distinct  stress-temperature  behaviors,  where  pas¬ 
sivation  causes  an  increase  in  yield  strength  for  the  Al(SiCu) 
film  but  a  decrease  for  the  Al(Cu)  film  at  room  temperature. 
The  contrasting  behavior  was  analyzed  by  a  film  strengthen¬ 
ing  model  which  takes  into  account  the  effect  of  passivation 
thickness  and  grain  structure  on  film  yielding.  This  model  is 
able  to  account  for  the  yield  behavior  of  the  two  alloy  films 
during thermal cycling, and reveals two competing mechanisms: hardening due to overlayer confinement, and softening due to grain growth.

Effects of insulator surface roughness on the overlying aluminum alloy film properties and electromigration performance in Al alloy/Ti insulator layered interconnects have been investigated. Insulator surface roughness changes the roughness and crystallographic orientation of Al/Ti layered films formed on the insulator. In particular, the effect is prominent in the case of high-temperature sputtering of Al alloy films. A rough insulator surface deteriorates the roughness and crystallographic orientation of the Al alloy film formed on the insulator, while a smooth insulator surface improves those properties. The insulator surface roughness does not change the roughness and crystallographic orientation of the Al alloy film without a Ti underlayer. Ti roughness and crystallographic orientation are improved on a very smooth surface insulator, resulting in improvement in the roughness and crystallographic orientation of the Al alloy film formed on the Ti layer. As a result, the electromigration performance in Al/Ti layered interconnects changes according to the underlayer insulator surface roughness. © 1996 American Vacuum Society.


I.  INTRODUCTION 

With the increasing packing density of large-scale integrated circuits (LSIs), two major concerns for the interconnects used in LSIs are how to obtain electrical conductance at contacts or via holes with high aspect ratios and how to obtain fine interconnects with high reliability. The former concern is partly solved by via filling with metals other than Al, for example, W. However, Al via-filling technologies such as high-temperature sputtering, the reflow process, and high-pressure technology have been investigated because of the low cost and simplicity of their fabrication processes. In particular, Al high-temperature sputtering is one of the most promising candidates because of its low cost and because it requires no special equipment. This technology, however, has a problem with Al surface roughness, which makes the subsequent photolithography process difficult.

As for the latter concern, many factors affect fine interconnect reliability. Those factors are the metal line dimensions that determine the allowable current density, Al grain size, Al crystallographic orientation, and impurities contained in the Al alloy. In addition, in layered interconnects, other factors complicate the electromigration performance, such as the kinds of refractory metals and their fabrication processes, their resistivity, their contact resistance to the Al alloy film, and their impact on Al grain size, Al crystallographic orientation, and impurities that will be incorporated into the Al alloy from the refractory metals. Those factors need to be optimized to obtain a highly reliable interconnect. With respect to crystallographic orientation, Vaidya et al. suggested that the median time to failure (MTTF) of an Al metal line is a function of the Al grain size, the log-normal grain-size standard deviation, and the Al preferred orientation, considering the electromigration (EM) performance and properties of aluminum-copper (Al-Cu) alloys. That is, the EM lifetime is longer in Al metal lines having a large grain size, a small grain-size standard deviation, and a strong (111) orientation. Kaneko et al. suggested that Al crystallographic orientation also affects stress-induced migration. These results are for single-layer Al alloy metal lines. Considerable effort has been made to enhance the Al texture by changing the underlying materials in layered interconnects. The authors have reported that a highly (111)-oriented Al metal line, obtained by controlling the preferred orientation of the underlying TiN, has excellent EM performance. Other researchers have also shown the crystallographic effects of under-metal planes on the Al(111) orientation in Al/Ti and Al/TiN/Ti structures by changing the pretreatment and Ti thickness. Those studies use the crystallographic relations with underlayer metals, while Toyoda et al. show a hyper-textured (111)-oriented Al alloy with high EM performance on an amorphous underlayer that has a high surface energy.

In  this  article,  effects  of  insulator  surface  roughness  on 
the  overlying  aluminum  alloy  layered  interconnects  have 
been  investigated,  in  order  to  solve  both  problems  of  Al  alloy 
surface  roughness  and  reliability  issues  in  Al  alloy  layered 
interconnects  formed  by  high-temperature  sputtering.  It  was 
found  that  insulator  surface  roughness  changes  the  Al  surface 
roughness  and  crystallographic  orientation,  in  particular,  in 
Al  alloy/Ti  layered  interconnects  formed  by  high- 
temperature  sputtering.  As  a  result,  Al  alloy/Ti  layered  inter¬ 
connects  formed  on  extremely  planarized  insulator  have  a 
highly  reliable  EM  performance. 

II.  EXPERIMENT 

In order to change the insulator surface roughness, thermal Si oxide film (500 nm), borophosphosilicate glass film (BPSG, 600 nm), and tetra-ethyl-ortho-silicate-O3 (TEOS-O3) based SiO2/plasma SiO2 double layered films were formed or deposited on 6 in. Si substrates. For the double layered films, TEOS-O3 SiO2 films of 800 and 1000 nm were deposited on 200 nm plasma SiO2 on 6 in. Si substrates. Before the 800 nm TEOS-O3 SiO2 film deposition, some of the samples were subjected to ethanol treatment. This treatment enables one to obtain a very planarized TEOS-O3 SiO2 film. The TEOS-O3 SiO2 films of 1000 nm were subjected to chemical mechanical polishing (CMP) to remove 200 nm of TEOS-O3 SiO2 and planarize the surface. The CMP was carried out with a fumed-silica-based slurry at a pressure of 7.5 psi. The polishing rate was 196 nm/min. The remaining TEOS-O3 SiO2 film thickness was 800 nm after CMP. Another sample with TEOS-O3 film (800 nm)/plasma SiO2 film was annealed at 850 °C for 30 min for degassing before the subsequent metal film deposition. On these substrates, Ti (50 nm) and Al-0.8 wt% Si-0.3 wt% Cu alloy (500 or 700 nm) films were sequentially sputter deposited in a multichamber dc-magnetron sputtering system that has a base pressure of 2×10^-[?] Torr. Prior to the deposition, samples were subjected to preannealing at 450 °C for 2 min in the same vacuum for degassing. The substrate temperatures during deposition were 100 °C for the Ti film and 220-485 °C for the Al alloy films. As references, samples on which Al alloy films were deposited without substrate heating and reflowed in the same vacuum in the temperature range of 420-485 °C were prepared.

Fig. 1. The typical bird's eye view of the surface roughness on (a) chemical mechanical polished TEOS-O3 SiO2/plasma SiO2 (CMP TEOS-O3 SiO2) and (b) nontreated TEOS-O3 SiO2/plasma SiO2 (TEOS-O3 SiO2), observed by AFM. The observed area is 2 µm × 2 µm.

The surface roughness of the insulators was evaluated by atomic force microscopy (AFM). The surface morphology and reflectance of the Al alloy surface were evaluated by scanning electron microscopy (SEM) and reflectance measurement using the i line (wavelength: 365 nm), respectively. The Ti and Al alloy crystallographic orientations and Al alloy/Ti reactions were evaluated with x-ray diffraction (XRD) analysis and transmission electron microscopy (TEM). The impurity concentration in the Al alloy film was analyzed by secondary ion mass spectroscopy (SIMS).

For EM evaluation, Al (500 nm)/Ti (50 nm) layered metal lines 2 µm wide and 22 mm long, with TiN (100 nm) as an antireflection material and a plasma Si3N4 (800 nm) film as a passivation layer, were fabricated on TEOS-O3 film (800 nm)/plasma SiO2 film with and without CMP treatment. The Al alloy film deposition temperature was 450 °C. The lines were subjected to a continuous dc current density of 3×10^[?] A/cm² in the temperature range of 240-260 °C. The metal line resistance change was monitored during the EM tests. Finally, after the EM tests, samples were removed from the package and exposed to CF4+O2 plasma in order to remove the passivation films and the capping TiN films, followed by optical microscope and scanning electron microscope observations.

III. RESULTS AND DISCUSSIONS

A.  Effects  of  insulator  surface  roughness  on  Al  film 
properties 

First,  in  this  section,  the  surface  roughness  of  various  in¬ 
sulators  that  were  formed  or  deposited  on  Si  substrates  will 
be  shown.  Their  effects  on  the  surface  roughness  and  crys¬ 
tallographic  orientations  of  the  overlying  Al  alloy  layered 
metal  film  will  be  presented  next. 

Figure 1 shows the typical bird's eye view of the surface roughness on (a) chemical mechanical polished TEOS-O3 SiO2/plasma SiO2 (CMP TEOS-O3 SiO2) and (b) nontreated TEOS-O3 SiO2/plasma SiO2 (TEOS-O3 SiO2) observed by AFM. The observed area is 2 µm × 2 µm. It is well known that CMP technology is usually used in the macroscopic planarization of interlayer dielectrics in LSI fabrication processes. In addition to the macroscopic planarization, it is clearly shown in Fig. 1 that CMP TEOS-O3 SiO2 has a very smooth surface down to the nm level compared to that of TEOS-O3 SiO2. In Table I, the root mean square (rms) roughness values of the various insulator surfaces, calculated from the AFM observations, are summarized. Table I also includes the properties of Al films formed on those insulators. TEOS-O3 SiO2 means the TEOS-O3 SiO2/plasma-TEOS SiO2 double layer film. CMP TEOS-O3 SiO2 has an rms value of 0.48 nm while TEOS-O3 SiO2 has one of 12.20 nm. TEOS-O3 SiO2 films with and without CMP treatment show a remarkable difference in their roughness. Thermal oxide has the smoothest surface and TEOS-O3 SiO2 has the roughest surface.
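The rms roughness values quoted from the AFM observations can be illustrated with a minimal sketch. The text does not say how the rms value was computed, so the definition used here (root-mean-square deviation of the heights about their mean) is an assumption, and the height values are hypothetical rather than measured data.

```python
import math


def rms_roughness(heights):
    """Root-mean-square deviation of surface heights about their mean.

    The result has the same units as the input (e.g. nm).
    """
    n = len(heights)
    mean = sum(heights) / n
    return math.sqrt(sum((z - mean) ** 2 for z in heights) / n)


# Hypothetical AFM line profile in nm (not data from the paper).
profile_nm = [0.4, -0.6, 0.5, -0.3, 0.0]
print(f"rms roughness = {rms_roughness(profile_nm):.2f} nm")
```

For a real AFM image the same calculation would run over every pixel of the 2 µm × 2 µm height map, usually after removing sample tilt.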

The usual reflectance from samples prepared by Al high-temperature sputtering is very low because of the rough surface of the deposited Al film, and this makes the alignment in the following lithography process difficult. Figure 2 shows (a) the i-line (365 nm) reflectance and (b) the Al(111) peak intensity obtained by XRD analysis from the Al-alloy (700 nm)/Ti (50 nm) structures formed on the insulators shown in Table I. The Al deposition temperature is 450 °C. The reflectance depends strongly on the underlying insulator surface roughness and increases drastically as the underlying surface roughness decreases [Fig. 2(a)]. The trend in Al(111) peak intensity [Fig. 2(b)] is almost the same as that of reflectance. The Al(111) preferred orientation depends strongly on the underlying insulator surface roughness and decreases drastically as the underlying surface roughness increases.

Table I. Al alloy film properties on underlying insulators that are used for the experiment.

                                              ------------ 450 °C deposition ------------   ------------ 220 °C deposition ------------
Underlayer                        Roughness   Al            Al grain  Al(111)    Al(111)    Al            Al grain  Al(111)    Al(111)
insulator                         (rms: nm)   reflectance   size      intensity  FWHM       reflectance   size      intensity  FWHM
                                              (%)           (µm)      (counts)   (degree)   (%)           (µm)      (counts)   (degree)
TEOS-O3 SiO2                      12.20       32.5          1.17      830        10.63^b    71.1          0.56^b    961        8.75^b
TEOS-O3 SiO2 (degassed)           13.56       43.9          1.39      390
TEOS-O3 SiO2 (CMP)                0.48        84.7          1.34      131 406    1.69^b     80.8          0.46^b    30 450     2.31^b
TEOS-O3 SiO2 (ethanol treatment)  1.49        84.2          1.32      88 953
Thermal oxide                     0.15        86.7          1.30      128 701
Borophosphosilicate glass (BPSG)  0.21        89.7          1.24      209 363
TEOS-O3 SiO2                      12.20       49.3^a        2.57^a    4996^a                75.1^a                  780^a
TEOS-O3 SiO2 (CMP)                0.48        51.3^a        9.59^a    6123^a                82.9^a                  1758^a

^a Values obtained without Ti (50 nm) underlayer.
^b Values obtained with Al thickness of 500 nm.
All other data were taken with Al thickness of 700 nm.
[Two further b-marked values, 1.69 and 1.40, follow the BPSG entries in the source; their column assignment is not recoverable.]

Fig. 2. (a) The i-line (365 nm) reflectance and (b) the Al(111) peak intensity measured by XRD from the Al-alloy (700 nm)/Ti (50 nm) structures formed on insulators with the various surface roughnesses shown in Table I. Horizontal axis: roughness (rms: nm).

Two kinds of insulators, TEOS-O3 SiO2/plasma SiO2 double layered films with and without CMP treatment, were chosen for further experiments because only the roughness of the insulator surface can be changed without changing the insulator structure and properties. Figures 3(a) and 3(b) show the XRD spectra from Al (700 nm)/Ti (50 nm)/insulator structures (a) with and (b) without CMP treatment of the underlying insulator. The values in parentheses after the Al(111) notation in Figs. 3(a) and 3(b) show the Al(111) peak intensities. The Al(111) peak intensity obtained on TEOS-O3 SiO2 with CMP treatment is more than two orders of magnitude higher than that on TEOS-O3 SiO2 without CMP treatment. The Al(200) peak is barely observable, and only in the spectrum from the sample without CMP treatment. Although differences in the unreacted Ti and reaction product Al3Ti(112) peak intensities can be seen in the spectra, these are due to the difference in the degree of preferred crystallographic orientation and not to a difference in the reaction. There is no large difference in the thickness of the reaction product between Al and Ti, as shown later in the cross-sectional TEM observations of the samples (Fig. 9).

Figure 4 shows the reflectance from the Al alloy (500 nm)/Ti (50 nm) formed on TEOS-O3 SiO2 film/plasma SiO2 as a function of Al alloy deposition temperature. In Fig. 4, the reflectance from the samples in which the Al alloy film was deposited without substrate heating and reflowed in the same vacuum at 420-485 °C is also shown. CMP treatment of the underlying insulator causes a large difference in the reflectance at elevated deposition temperatures. Without CMP treatment, the reflectance is about 75% for the sample deposited at a low temperature of 220 °C; it decreases drastically as the deposition temperature increases and shows a minimum of about 20% at around 420 °C. With CMP treatment, the reflectance is high at a low temperature of 220 °C (about 86%); it gradually increases as the deposition temperature increases and shows a maximum of 90%, also at around 420 °C. On the other hand, in the case of reflow samples, the reflectance is high independent of CMP treatment, and the insulator surface roughness does not change the roughness of the deposited Al alloy surface.

Fig. 3. XRD spectra from Al alloy/Ti/TEOS-O3 SiO2 structures (a) with and (b) without CMP treatment. Al alloy deposition temperature is 450 °C. The values in the parentheses after the Al(111) notation in (a) and (b) show Al(111) peak intensities.

Figure 5 shows the Al(111) peak intensities from the Al-alloy (500 nm)/Ti (50 nm) formed on TEOS-O3 SiO2/plasma SiO2 as a function of Al alloy deposition temperature. In Fig. 5 the Al(111) peak intensities from the samples in which the Al alloy film was deposited without substrate heating and reflowed at 420-485 °C are also shown. In contrast to the reflectance results shown in Fig. 4, the Al(111) peak intensity from the samples without CMP treatment of the insulator is two orders of magnitude lower than that from the CMP treated samples even at the low deposition temperature of 220 °C. As the deposition temperature increases, the peak intensity decreases further and shows a minimum at around 420-450 °C. In the samples with CMP treatment, the peak intensity increases as the deposition temperature increases and shows a maximum at around 420 °C. The maximum peak intensity in this case (Al thickness: 500 nm) is 20% higher than that of the 220 °C deposited sample. On the other hand, in the case of reflow samples with CMP treatment, the Al(111) peak shows almost the same or even higher intensity than the high-temperature sputtered samples; however, the peak intensity is one order of magnitude lower in the case of the reflow sample without CMP treatment. This result differs from the reflectance results shown in Fig. 4, where the reflectance of the reflow samples does not depend on the insulator surface roughness. The underlying insulator surface roughness thus affects the Al alloy crystallographic orientation without changing the reflectance.

Fig. 4. The reflectance from the Al-alloy (500 nm)/Ti (50 nm) formed on TEOS-O3 SiO2 film/plasma SiO2 as a function of Al alloy deposition temperature. The reflectance from the samples in which the Al alloy film was deposited without substrate heating and reflowed at 420-485 °C is also shown.

Fig. 5. The Al(111) peak intensities from the Al-alloy (500 nm)/Ti (50 nm) formed on TEOS-O3 SiO2/plasma SiO2 as a function of Al alloy deposition temperature (°C). CMP treatment of the insulators is chosen as the parameter. The Al(111) peak intensities from the samples in which the Al alloy film was deposited without substrate heating and reflowed at 420-485 °C are also shown.

The Al(111) peak intensity has been used so far as an index value for the degree of texture, because only the (111) peak is observed in many of the samples used in the experiment. This value, however, is susceptible to the measurement conditions. Finally in this section, the full width at half-maximum (FWHM) values of the (111) rocking curves for the Al films were obtained in order to confirm the validity of the Al(111) peak intensity as an index value for texture distribution. The rocking curves were measured by x-ray diffraction using the Cu Kα line, scanning the incident angle θ while the diffraction angle 2θ was fixed at the Al(111) peak. FWHM values were obtained as functions of Al alloy deposition temperature and reflow temperature and are shown in Fig. 6. The samples used in the measurement are the same as those used in Fig. 5. Almost the same trends as shown in Fig. 5 are obtained; that is, the FWHM value decreases as the deposition temperature increases in the samples with CMP treatment, although the change in the large FWHM values obtained from the samples without CMP treatment does not correlate well with the change in Al(111) peak intensity. It seems that the measurement in the large-FWHM region is less sensitive than the measurement of Al(111) peak intensity; nevertheless, the Al(111) peak intensity trends shown in Figs. 2(b) and 5 were confirmed to be the trends in preferred orientation.

Fig. 6. FWHM values obtained from the Al-alloy (500 nm)/Ti (50 nm) formed on TEOS-O3 SiO2/plasma SiO2 as a function of Al alloy deposition temperature. CMP treatment of the insulators is chosen as the parameter. The FWHM values from the samples in which the Al alloy film was deposited without substrate heating and reflowed at 420-485 °C are also shown.
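To make the FWHM index concrete, the following sketch extracts a full width at half-maximum from a sampled rocking curve by linear interpolation of the two half-maximum crossings. It is only an illustration: the paper does not describe its extraction procedure, no background subtraction is attempted, and the synthetic Gaussian curve (center 19.2°, standard deviation 1°) is hypothetical.

```python
import math


def _crossing(x0, y0, x1, y1, level):
    """Abscissa where the segment (x0, y0)-(x1, y1) crosses `level`."""
    return x0 + (level - y0) * (x1 - x0) / (y1 - y0)


def fwhm(angles, counts):
    """FWHM of a single-peaked curve, with no background subtraction."""
    peak = max(counts)
    half = peak / 2.0
    i_peak = counts.index(peak)

    # Walk outwards from the peak to the first samples at or below half maximum.
    lo = i_peak
    while lo > 0 and counts[lo] > half:
        lo -= 1
    hi = i_peak
    while hi < len(counts) - 1 and counts[hi] > half:
        hi += 1
    if counts[lo] > half or counts[hi] > half:
        raise ValueError("scan does not drop below half maximum on both sides")

    left = _crossing(angles[lo], counts[lo], angles[lo + 1], counts[lo + 1], half)
    right = _crossing(angles[hi - 1], counts[hi - 1], angles[hi], counts[hi], half)
    return right - left


# Hypothetical rocking curve: Gaussian in the incident angle theta (degrees).
sigma = 1.0
theta = [14.0 + 0.01 * k for k in range(1001)]
intensity = [1000.0 * math.exp(-0.5 * ((t - 19.2) / sigma) ** 2) for t in theta]
print(f"FWHM = {fwhm(theta, intensity):.3f} deg")
```

For a Gaussian the result should be close to 2·sqrt(2 ln 2)·σ, which gives a simple sanity check. A narrow rocking curve (small FWHM) corresponds to a sharp (111) texture, which is why the FWHM falls as the peak intensity rises in the CMP treated samples.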

B.  Improvement  mechanism  in  Al  planarization  and 
preferred  orientation 

In  the  previous  sections,  it  was  found  that  the  insulator 
surface  roughness  largely  affects  the  surface  roughness  and 
crystallographic  orientation  in  Al  alloy  film  deposited  at  el¬ 
evated  temperatures.  In  this  section,  the  improvement 
mechanism  of  planarization  and  preferred  orientation  in  Al 
alloy  film  due  to  the  underlying  insulator  surface  roughness 
will  be  discussed. 

First, the role of the Ti underlayer in the planarization and preferred orientation is examined. Figure 7 shows the XRD spectra from the Al alloy/TEOS-O3 SiO2 structures (a) with and (b) without CMP. The Al alloy deposition temperature is 450 °C. The values in parentheses after the Al(111) notation in Fig. 7 show the Al(111) peak intensities. When the Al alloy was deposited directly on the insulators without a Ti underlayer, no difference in Al(111) peak intensity due to the insulator roughness is observed, although the Al(200) peak can be observed only on the non-CMP treated sample. Likewise, as shown in Table I, the reflectance of the Al alloy film deposited directly on the TEOS-O3 SiO2 without a Ti underlayer at a deposition temperature of 450 °C is 49.3% on a non-CMP treated insulator, while that on a CMP treated insulator is 51.3%. The Ti layer is therefore essential for the planarization and preferential orientation of the overdeposited Al alloy film.

Figure 8 shows the XRD spectra from Ti (50 nm)/TEOS-O3 SiO2 structures (a) with and (b) without CMP. Ti has preferred orientations of (002) and (011), and these peak intensities are 20-30 times stronger on samples with CMP treatment than on those without CMP treatment. Since the Ti(002) and Ti(011) XRD peaks from Ti powder have relative intensities of 20:100, the Ti film in this experiment has a preferential orientation of Ti(002). This difference in Ti peak intensity and orientation seems to induce the difference in Al alloy orientation due to the underlayer insulator roughness.

We consider the reaction between the Al alloy film and the Ti film, since Al alloy films and Ti films react easily to form Al3Ti, as shown in Figs. 3(a) and 3(b). In order to obtain a visual image of the reaction, cross-sectional TEM observations of the Al/Ti layered structures deposited at 380, 420, and 450 °C were carried out and are shown in Fig. 9. At around these deposition temperatures, the Al(111) peak intensity shows a maximum, as shown in Fig. 5. CMP treated TEOS-O3 SiO2 film is used as the underlying insulator. The TEM image of a 450 °C deposited sample on an insulator without CMP treatment is also shown [Fig. 9(d)]. The reaction between the Al alloy and the Ti film does not proceed in the 380 °C deposited sample [Fig. 9(a)]. In the case of 420 °C deposition, at which the Al(111) peak intensity shows the maximum, a reaction product layer about 60 nm thick, considered to be Al3Ti, is formed [Fig. 9(b)]. The reaction proceeds further in 450 °C deposition, and a 150-200 nm reaction product layer is formed. There is a large difference between CMP treated samples and non-CMP treated samples in the roughness of the interface between the metal and the underlayer insulator [Figs. 9(c) and 9(d)]; however, there is no difference in the thickness of the reaction layer and the unreacted Ti layer. Regardless of the formation of the Al-Ti reaction product, the sample on the CMP treated insulator has a high Al(111) preferred orientation.

Fig. 7. XRD spectra from Al alloy/TEOS-O3 SiO2 structures (a) with and (b) without CMP. Al alloy deposition temperature is 450 °C. The values in the parentheses after the Al(111) notation in (a) and (b) show Al(111) peak intensities.

Fig. 8. XRD spectra from Ti (50 nm)/TEOS-O3 SiO2 structures (a) with and (b) without CMP.

Here  we  consider  the  crystal  orientation  and  lattice  mis¬ 
match  between  Al,  Ti,  and  a  reaction  product  Al3Ti.  Figure 
10  shows  the  schematic  figures  of  unit  cells  of  Al,  Ti,  and 
Al3Ti.  The  Al  crystal  structure  is  a  face  centered  cubic  (fee) 
structure with a lattice constant of 4.0494 Å [Fig. 10(a)]. The Ti crystal
structure is hexagonal close packed (hcp) with lattice constants of
a = 2.9503 Å and c = 4.686 Å [Fig. 10(b)]. The Al(111) plane is the
close-packed plane in the Al fcc structure, and the Ti(002) plane is the
close-packed basal plane in the Ti hcp structure. The misfit between the
close-packed planes is about 3%, considering their closest interatomic
distances d (Al: 2.8634 Å; Ti: 2.9503 Å). The misfit between the Al(111)
plane and the Ti(011) plane is not so simple; however, Shibata et al.
suggested from their experiment and analysis that the Al(111) preferred
orientation improves on the Ti(011) plane. The crystal structure of Al3Ti
is body centered tetragonal (bct) with lattice constants of a = 3.84 Å and
c = 8.61 Å. The unit cell of this structure can be regarded as two stacked
and deformed unit cells of the Al fcc structure, neglecting the difference
between Al and Ti atoms [Fig. 10(c)]. Therefore, the Al3Ti(112) plane can
be regarded as the close-packed (111) plane of an fcc structure to a
first-order approximation. The misfits between this close-packed plane and
Al(111) and Ti(002) are 1% and 4%, respectively [the closest and second
closest interatomic distances in the Al3Ti(112) plane are 2.7217 Å and
2.8874 Å, respectively].

Fig. 9. The cross-sectional TEM observations of the Al/Ti layered structures deposited at (a) 380, (b) 420, and (c) 450 °C. TEOS-O3 SiO2 film is used as the underlying insulator. The TEM image (d) of the 450 °C deposition on a CMP treated insulator is also shown.

Fig. 10. The schematic figures of unit cells of Al, Ti, and Al3Ti.
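The misfit figures quoted above can be checked with a few lines of arithmetic. The following is a minimal sketch, not the authors' procedure: the function names are hypothetical, the misfit is taken relative to the underlying spacing, and representing the Al3Ti(112) plane by a neighbor-weighted mean spacing (two neighbors at the closest distance, four at the second closest) is an assumption that happens to reproduce the quoted 1% and 4%.

```python
import math


def fcc_nearest_neighbor(a):
    """Closest interatomic distance in an fcc lattice with lattice constant a."""
    return a / math.sqrt(2)


def misfit_percent(d_film, d_substrate):
    """Mismatch of two in-plane spacings, in percent of the substrate spacing."""
    return abs(d_film - d_substrate) / d_substrate * 100.0


# Values quoted in the text, in angstroms.
d_al = fcc_nearest_neighbor(4.0494)  # Al(111) in-plane spacing
d_ti = 2.9503                        # Ti(002) in-plane spacing (hcp a)

# Assumption (not stated in the text): neighbor-weighted mean spacing
# for the Al3Ti(112) plane.
d_al3ti = (2 * 2.7217 + 4 * 2.8874) / 6

al_vs_ti = misfit_percent(d_al, d_ti)
al3ti_vs_al = misfit_percent(d_al3ti, d_al)
al3ti_vs_ti = misfit_percent(d_al3ti, d_ti)

print(f"d(Al)               = {d_al:.4f} A")
print(f"Al(111)/Ti(002)     = {al_vs_ti:.1f} %")
print(f"Al3Ti(112)/Al(111)  = {al3ti_vs_al:.1f} %")
print(f"Al3Ti(112)/Ti(002)  = {al3ti_vs_ti:.1f} %")
```

Choosing the film spacing instead of the substrate spacing as the reference changes each value by only a few hundredths to tenths of a percent, so the rounded figures are not sensitive to that choice.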

According to the above considerations, the three crystal structures have
the potential of having at least the following epitaxial relations:

Al(111)//Ti(002),

Al(111)//Al3Ti(112)//Ti(002).

The above discussion concentrates on the crystallographic orientation.
Here, an experimental result is introduced in order to discuss the
planarization of the Al film. Figure 11 shows the reflectance change as a
function of the Al alloy thickness (100-700 nm) deposited at 450 °C. The
Al alloy was deposited on Ti (50 nm) on TEOS-O3 SiO2, or directly on the
insulators, with and without CMP treatment. The reflectance of the sample
with a Ti underlayer and CMP treatment exceeds 80% and stays constant for
Al alloy thicknesses over 200 nm. In contrast, the reflectances of the
other samples are less than about 50% even at an Al thickness of 700 nm,
although the reflectance increases gradually as the Al thickness
increases. The Al surface morphologies observed by SEM are shown in
Fig. 12 for the cases with a Ti underlayer and Al thicknesses ranging from
20 to 200 nm. Without CMP treatment, the Al film shows a semispherical
surface, and a uniform film cannot be obtained even at a deposited Al
alloy thickness of 200 nm. On the other hand, with CMP treatment, the
deposited Al alloy film shows a smooth plateau surface, and a uniform film
can be obtained at Al thicknesses above around 200 nm. This planarized,
uniform film structure is considered to be the reason why the reflectance
of the sample with a Ti underlayer and insulator CMP treatment exceeds 80%
and stays constant for Al alloy thicknesses over 200 nm.

Fig. 11. The reflectance change as a function of Al alloy thickness (100-700 nm) deposited at 450 °C. The Al alloy was deposited on Ti (50 nm) on TEOS-O3 SiO2 or directly on the insulator with and without CMP treatment.

Fig. 12. SEM observation of the Al surface morphologies with a Ti underlayer. Insulator CMP treatment is chosen as a parameter. Al thickness ranges from 20 to 200 nm. (Scale bar: 3 µm.)

Here, on the basis of the above discussion, we summarize how the
underlying insulator surface roughness contributes to the improvement of
the surface roughness and preferred crystal orientation in the Al/Ti
layered structure. Ti has a preferred crystal orientation of (002) on the
smooth insulator. This is because Ti deposits preferentially with the
(002) orientation perpendicular to the underlying insulator surface, which
minimizes the surface energy. When the Al alloy film is deposited on the
Ti film, Al atoms settle at Ti crystal sites in the initial stage of
deposition. Thereafter, layer growth of the Al alloy film basically
proceeds epitaxially, and a smooth and uniform film forms at the initial
stage of Al deposition. Moreover, in the case of high-temperature
sputtering, the high migration of Al atoms on the substrate and the
inheritance of the underlying Ti crystal information produce a highly
preferred orientation in the Al alloy film, while forming the reaction
product Al3Ti, which also has an epitaxial relation with Al and Ti. On the
other hand, on the rough insulator, the deposited Ti also orients
perpendicular to the insulator surface; however, the insulator surface
roughness prevents coherence in the Ti crystal orientation. The initial Al
film on the underlying Ti also grows epitaxially, but the crystal
orientation of the film points in random directions. In the case of
high-temperature sputtering, through high Al atom migration, Al growth in
each direction occurs and enhances the roughness of the deposited Al film.

When the Al alloy film is deposited directly on the insulator without Ti,
layer growth occurs at lower deposition temperatures and the Al surface
planarizes gradually even on the rough insulator surface. This speculation
is supported by the relatively high reflectance of the Al alloy film
deposited on a rough insulator surface at low deposition temperature (see
Fig. 4). In the case of high-temperature deposition, however, island
growth occurs because the SiO2 surface energy is much smaller than the Ti
surface energy (surface energies of Al, SiO2, and Ti: 866, 663, and 1650
erg/cm², respectively). In addition, high Al atom migration prevents
coalescence from occurring until the Al deposit is thick. This enhances
the roughness of the deposited Al film surface, and the roughness of the
deposited Al film does not depend on the roughness of the underlying
insulator. In the case of reflow samples that are deposited at lower
temperature and reflowed at high temperature, the surface roughness of the
Al/Ti layered film is determined at the stage of Al deposition almost
regardless of the insulator surface roughness and stays the same even
after reflow annealing, while the preferred orientation of the film is
also determined at the stage of deposition and depends on the insulator
surface roughness (see Figs. 5 and 6).

C. Electromigration performance in Al/Ti layered interconnects

Among the samples in Table I, those of interest here for the
electromigration tests are the TEOS-O3 SiO2 films with and without CMP
treatment. The Al alloy films deposited on these insulators without a Ti
underlayer have a large difference in Al grain size (without CMP
treatment: 2.57 µm; with CMP treatment: 9.59 µm). There is no such
difference with a Ti underlayer (without CMP treatment: 1.17 µm; with CMP
treatment: 1.34 µm). On the other hand, without a Ti underlayer, the
Al(111) peak intensity from XRD analysis is only 1.5 times higher in the
CMP treated sample than in the non-CMP treated sample, whereas with a Ti
underlayer, an Al(111) peak intensity two orders of magnitude higher is
obtained in the CMP treated sample. These facts make it possible to change
the Al(111) peak intensity without changing the Al grain size when the Al
alloy film is deposited on Ti on insulators whose surface roughness is
controlled by CMP treatment. If the other factors that affect the
electromigration performance of the metal lines, such as the Al-Ti
reaction and the Ti content in the Al alloy film, are the same, the
electromigration performance can be evaluated as a function of the degree
of Al preferred orientation by controlling the insulator surface
roughness, assuming that the insulator surface roughness itself has no
effect on the electromigration performance. In that regard, the
cross-sectional TEM observation of Al/Ti layered structures with the
underlying insulator CMP treatment as a parameter, shown in Figs. 9(c) and
9(d), reveals no difference in the reaction product thickness or the
unreacted Ti thickness due to the CMP treatment. The Ti concentration
depth profile evaluated by SIMS shows that the Ti content in the Al alloy
film increases with increasing deposition temperature; however, it does
not depend on the CMP treatment. Figure 13 presents the impurity
concentration depth profiles measured by SIMS for (a) 220 °C Al
deposition, (b) 450 °C Al deposition, and (c) the control sample (Al-0.1%
Ti-0.3% Cu). On the assumption that the Ti concentration of the control
sample (c) is correct, the Ti concentration in the 450 °C Al deposition
sample is about 0.03%, and the 220 °C Al deposition sample has no Ti in
the Al alloy film.

Fig. 13. Impurity concentration depth profiles measured by SIMS: (a) 220 °C Al deposition, (b) 450 °C Al deposition, and (c) control sample (Al-0.1% Ti-0.3% Cu).

Fig. 14. One example of the electromigration test results. The samples are Al (500 nm)/Ti (50 nm) layered metal lines formed on TEOS-O3 SiO2/plasma-TEOS SiO2 double layer films with and without CMP treatment. The Al deposition temperature is 450 °C. The test condition is dc current density of 3X10^ A/cm^ at a temperature of 260 °C.

The electromigration tests were carried out with the CMP treatment of the
underlying insulator as a parameter. The samples are Al (500 nm)/Ti (50
nm) layered metal lines formed on TEOS-O3 SiO2/plasma-TEOS SiO2 double
layer films with and without CMP treatment. The Al deposition temperature
is 450 °C. Figure 14 shows one example of the electromigration test
results. The test condition is dc current density of 3X 10*^ A/cm^ at a
temperature of 260 °C. The MTTF of the sample with CMP treatment is about
seven times longer than that without CMP treatment (702 and 98 h) under
this test condition. The Arrhenius plot of the MTTF in the temperature
range of 240-260 °C is shown in Fig. 15. The activation energies of the
MTTFs for the samples with and without CMP treatment are 0.94 and 0.78 eV,
respectively.
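To make the quoted numbers concrete, the sketch below computes the MTTF ratio at 260 °C from the two reported lifetimes and, for each activation energy, the factor by which the MTTF would change between 240 and 260 °C. It assumes a pure Arrhenius dependence, MTTF proportional to exp(Ea/kT), at fixed current density; the function and variable names are hypothetical and not taken from the paper.

```python
import math

BOLTZMANN_EV_PER_K = 8.617333e-5  # Boltzmann constant in eV/K


def arrhenius_ratio(ea_ev, t_low_c, t_high_c):
    """Return MTTF(t_low) / MTTF(t_high) for MTTF proportional to exp(Ea / kT)."""
    t_low_k = t_low_c + 273.15
    t_high_k = t_high_c + 273.15
    exponent = ea_ev / BOLTZMANN_EV_PER_K * (1.0 / t_low_k - 1.0 / t_high_k)
    return math.exp(exponent)


# Reported lifetimes at 260 C (hours): with and without CMP treatment.
mttf_ratio_260c = 702 / 98

# Lifetime gain expected when cooling from 260 C to 240 C.
ratio_cmp = arrhenius_ratio(0.94, 240, 260)
ratio_no_cmp = arrhenius_ratio(0.78, 240, 260)

print(f"MTTF(CMP) / MTTF(non-CMP) at 260 C: {mttf_ratio_260c:.1f}")
print(f"MTTF(240 C) / MTTF(260 C), Ea = 0.94 eV: {ratio_cmp:.2f}")
print(f"MTTF(240 C) / MTTF(260 C), Ea = 0.78 eV: {ratio_no_cmp:.2f}")
```

Because the CMP treated sample has the larger activation energy, its lifetime advantage under this assumption grows as the temperature is lowered.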

In order to interpret the obtained electromigration results, the line
resistance change during the EM tests was monitored, and the
electromigration failure mode in the metal lines after the EM tests was
analyzed. The resistance change of the metal lines during the
electromigration test was monitored at the test condition of dc current
density of 3X10^ A/cm^ and temperature of 250 °C. The results are shown in
Fig. 16. Different resistance changes were obtained depending on the CMP
treatment. The samples without CMP treatment [Fig. 16(a)] show a monotonic
resistance increase and result in line opens at a resistance increase of
2%-4% from the initial values. On the other hand, the samples with CMP
treatment [Fig. 16(b)] show a different resistance change. They show a
resistance increase during the first 100 h; however, the resistance then
starts to decrease. The peak of the resistance change is not so high (less
than 0.6%) compared to the resistance increase of Fig. 16(a) just before
the lines open. The resistance continues to decrease below the initial
resistance, then starts to increase again (the second increase), and the
metal lines become open. Some of the metal lines become open without the
second resistance increase.

Fig. 15. The Arrhenius plot of the MTTFs in the temperature range of 240-260 °C. The current density is 3X10^ A/cm^.

Fig. 16. The resistance change of the metal lines during the electromigration test monitored at the test condition of 3X10^ A/cm^ and 250 °C, plotted against time (0-500 h). The Al deposition temperature is 450 °C. (a) Non-CMP treated sample and (b) CMP treated sample.

Fig. 17. Optical micrographs of the metal lines that show large parts of the serpentine metal line with dummy lines in-between after the electromigration test: (a) CMP treated sample and (b) non-CMP treated sample.

In Figs. 17 and 18, photographs of the metal lines after the
electromigration test are shown. Figures 17(a) and 17(b) are
low-magnification optical micrographs that show large parts of the
serpentine metal line with wider dummy lines in-between [(a) CMP treated
sample and (b) non-CMP treated sample]. Figures 18(a) and 18(b) show the
SEM photographs of the voids formed in a CMP treated sample and the open
part of the metal line in a non-CMP treated sample, respectively. In CMP
treated samples, a large number of voids such as those shown in Fig. 18(a)
exist in the metal lines, and the open parts are distributed randomly in
the metal lines [Fig. 17(a)]. It is speculated here that the Al thickness
variation in the non-CMP treated sample could have an effect on the void
growth. On the other hand, in non-CMP treated samples, the open parts of
the metal lines such as that shown in Fig. 18(b) always exist at the first
line from the negative pad, as shown in Fig. 17(b). The void distribution
in the non-CMP treated samples cannot be discerned from the optical
micrograph because of the random reflection from the rough Al surface
[Fig. 17(b)].

Fig. 18. SEM photographs of (a) the voids formed in a CMP treated sample and (b) the open part of the metal line in a non-CMP treated sample.

From the results shown in Figs. 16, 17, and 18, the following
interpretation can be proposed. The monotonic resistance increase during
the electromigration test until the line opens in the non-CMP treated
samples is considered to be mainly caused by void formation and growth in
the first line from the negative pad. Since the electromigration
resistance of the highly oriented Al/Ti layered interconnects is extremely
high, the electromigration tests in this experiment were carried out at
rather severe conditions, in particular at high current density, to reduce
the test time. In the first line from the negative pad, there seems to be
a temperature gradient because the pad acts as a heat sink. The
temperature difference between the ambient temperature and the metal line
temperature, estimated from the line resistance change, is around 100 °C.
The temperature gradient accelerates the void growth because of the
relatively high Al atom mobility and Al flux divergence. On the other
hand, the resistance increase during the first 100 h in the CMP treated
samples is also caused by void formation and growth; however, the voiding
sites are scattered all over the line, including the first line from the
negative pad. The voiding accelerated by the temperature gradient in the
first line from the negative pad could also occur in this case. However,
the metal line is able to maintain line continuity, and voiding is
suppressed to a limited size and number, because the Al film in CMP
treated samples has hyper-textured grains and low Al flux divergence, so
that the metal line is not susceptible to the influence of the temperature
gradient. The resistance decrease after the first resistance increase is
considered to be due to the recovery of Al crystalline quality by
long-time annealing. This recovery is usually observed during long storage
at relatively high temperature without current passage and is not specific
to EM testing. The second resistance increase is caused by void growth,
and the increase in void length in some places results in high current
passage through the reaction product layer, accelerating void growth and
line openings.

IV. CONCLUSION

Effects of insulator surface roughness on the overlying aluminum alloy
film surface roughness and crystallographic orientation in Al
alloy/Ti/insulator layered structures have been studied. The following
conclusions have been obtained.

The underlying insulator surface roughness has a large effect on the
surface roughness and crystallographic orientation of the Al alloy/Ti
layered film deposited on the insulators. In particular, the effect is
remarkable when the Al alloy film is deposited at high temperature. The Al
surface smoothness and the Al(111) preferred orientation in the Al/Ti
layered film improve when the underlying insulator surface is smooth, and
deteriorate when the underlying insulator surface is rough.

The underlying insulator smoothness first enhances the Ti preferred
orientation, and that in turn enhances the Al layer growth on the Ti,
which has a large surface energy, even when the reaction between Al and Ti
occurs. The lattice mismatch between Al, Ti, and the main reaction product
Al3Ti explains well the enhancement in Al preferred orientation. Without a
Ti underlayer, such a smooth surface and such a large change in the
crystallographic orientation of the Al alloy do not occur.

Al/Ti layered interconnects formed on insulators that have a very smooth
surface as a result of CMP treatment have an extremely high
electromigration performance because of the highly Al(111) oriented
texture.

Cross sectional scanning electron microscopy was used to study the
dependence of the gap fill capabilities of DuPont fluorinated polyimide
films in 0.25-2.0 µm wide Al-Cu metal gaps on solid content, adhesion
promoter, spin dwell time, and solvent spray. It was found that reduced
solid content, application of adhesion promoter, and solvent spray improve
gap fill capabilities. However, these improvements are insignificant. Spin
dwell time has little effect on gap fill. © 1996 American Vacuum Society.


I.  INTRODUCTION 

Improving the performance of ultralarge scale integration (ULSI) devices
with increased density requires insulating materials with a low dielectric
constant (low ε) as interlevel dielectrics (ILDs) (Ref. 1) so that device
RC delay, interconnect cross talk, and power consumption can be reduced.
DuPont fluorinated polyimide (FPI) A and B are promising candidates
because of their low dielectric constants (ε out-of-plane = 2.64 and 2.67,
respectively, at 1 MHz, room temperature, and 40% relative humidity), high
thermal stability (decrease in film thickness is 1.0% for FPI-A after 3 h
at 500 °C, and 0% and 2.7% for FPI-A and FPI-B, respectively, after 8 h at
425 °C in vacuum), and low moisture absorption (1.50 and 1.45 wt %,
respectively, after 1 h at 100% relative humidity and room temperature).

In the current standard multilevel interconnect process route using Al-Cu
lines/chemical vapor deposition (CVD) W plugs, Al-Cu metal lines are
patterned through reactive ion etching and then a dielectric film is
deposited over the lines, filling in the gaps between these lines
(subtractive method). These gaps could have widths in the sub-0.5 µm range
and aspect ratios greater than 4. Therefore, small gap fill capability is
required for dielectrics. In this study, the dependencies of the gap fill
capability of spin-on FPI-A and FPI-B films on solid content, adhesion
promoter, spin dwell time, and solvent spray are investigated in Al-Cu
metal gaps with widths ranging from 0.25 to 2.0 µm.

II.  EXPERIMENT 

DuPont experimental fluorinated polyimide copolymers FPI-A and FPI-B were
used for this study. They are solutions of polyamic acid precursors of an
equivalent total amount of two dianhydrides, pyromellitic dianhydride
(PMDA) and 4,4'-(hexafluoroisopropylidene) diphthalic anhydride (6FDA),
with an equivalent total amount of up to two diamines,
2,2'-bis(trifluoromethoxy)benzidine (TFMOB) and p-phenylene diamine (PPD),
dissolved in the solvent N-methyl-2-pyrrolidone (NMP). The molecular
structure of this FPI family can be found in Refs. 2 and 3. Molar ratios
of the dianhydrides and diamines in FPI-A and FPI-B are listed in Table I.

The patterned wafers used in this gap fill study had a series of metal
gaps ranging from 0.25 to 2.0 µm in width. The wafers were manufactured by
depositing a 600 nm Al-0.5 wt % Cu metal film on 200 mm unpatterned
substrates of 600 nm thermally grown SiO2 on Si(100), followed by metal
gap pattern definition through a lithographic and etch process. Therefore,
if no overetch into the Si/SiO2 substrates occurred, the most aggressive
aspect ratio for the gap (defined as the gap depth divided by the gap
width) would be 2.4. In this study, we chose primarily the gaps which form
a 90° angle relative to the radius. Therefore, the voids formed in
unfilled gaps were observed dominantly on the gap sidewall oriented away
from the wafer center.
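The aspect ratio quoted above follows directly from the metal thickness and the gap width. The short sketch below tabulates it for gap widths mentioned in this study, assuming (as the text does) that the gap depth equals the 600 nm metal thickness with no overetch; the function name is hypothetical.

```python
METAL_THICKNESS_NM = 600.0  # Al-Cu film thickness, taken as the gap depth


def aspect_ratio(depth_nm, width_um):
    """Gap depth divided by gap width, with the width converted to nm."""
    return depth_nm / (width_um * 1000.0)


# Gap widths (in micrometers) that appear in the text.
gap_widths_um = [0.25, 0.8, 1.0, 1.2, 1.5, 2.0]

ratios = {w: aspect_ratio(METAL_THICKNESS_NM, w) for w in gap_widths_um}

for width, ratio in ratios.items():
    print(f"{width:4.2f} um gap: aspect ratio {ratio:.2f}")
```

Under this assumption the 1.2 and 1.5 µm gaps examined later have aspect ratios of only 0.5 and 0.4, well below the most aggressive value of 2.4.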

The typical thickness for an ILD application is around 1 µm. The recipes
to achieve 1-µm-thick final cured films on blanket wafers, when
applicable, were used to spin FPI-A and FPI-B films on 200 mm patterned
wafers. In some cases, DuPont VM-651 adhesion promoter was applied before
the FPI film spin process. In addition, NMP solvent spray was used on one
patterned wafer after the application of the VM-651 adhesion promoter and
before the FPI film deposition. A two step spin process was used for
coating 200 mm patterned wafers with either adhesion promoter or FPI
films, where the first spin step spreads the solution to cover the whole
wafer surface and is followed by a second spin, which primarily determines
the final film thickness. In some cases, a spin dwell time of 120 s was
added between the two spins.

A Machine Technology Inc. FlexiFab was used for the spin and bake of
adhesion promoter and FPI films. Cure of the FPI films was conducted in a
Yield Engineering Systems Lamina 450PB polyimide bake oven. The process of
applying the adhesion promoter, if required by the experiment, was as
follows: (a) mix 0.05 wt % VM651 with 60 ml DI water well and add 40 ml
methanol; (b) without aging, dispense 6 ml VM651 solution on 200 mm
patterned wafers; (c) spin the wafers at 300 rpm for 15 s; (d) wait for 60
s while the wafers stay still; (e) spin the wafers at 5000 rpm for 60 s;
(f) bake the wafers at 110 °C for 60 s in air on the hot plate; (g) let
the wafers cool down to room temperature.

Table I. Molar ratios of the dianhydrides and diamines in the FPI-A and FPI-B copolymers.

             Molar ratio of dianhydrides    Molar ratio of diamines
Copolymer    PMDA      6FDA                 TFMOB     PPD
FPI-A        95        5                    95        5
FPI-B        75        25                   100       0

Table II. Summary of properties and the spin parameters of the FPI films used in this study.

         FPI solution properties                          Film preparation process
Wafer                Molecular    Solid content  Viscosity  VM651 adhesion  NMP solvent  First spin  Dwell time  Second spin
ID No.   Copolymer   weight (Mw)  (wt %)         (poise)    promoter        spray        (s/rpm)     (s)         (s/rpm)
1        FPI-B       350 k        3.0            0.5        Yes             No           10/500      0           15/1200
2        FPI-B       350 k        4.5            0.8        Yes             No           10/500      0           20/1300
3        FPI-B       350 k        6.0            4          Yes             No           15/500      0           30/1500
4        FPI-B       350 k        7.0            7          Yes             No           30/500      0           45/1750
5        FPI-B       350 k        8.0            12         Yes             No           30/500      0           60/2000
6        FPI-A       233 k        7.0            4.3        Yes             No           10/500      120         30/1500
7        FPI-A       233 k        7.0            4.3        Yes             No           10/500      0           30/1500
8        FPI-A       233 k        7.0            4.3        No              No           10/500      120         30/1500
9        FPI-A       233 k        7.0            4.3        No              No           10/500      0           30/1500
10       FPI-A       233 k        7.0            4.3        Yes             Yes          10/500      0           30/2000

One of the wafers received an NMP solvent spray at this stage. Six ml of
NMP was dispensed on this wafer and the wafer was spun at 100 rpm for 5 s.
Then all the wafers proceeded to the FPI film spin process. The spin time
and speed for FPI-A and FPI-B films varied between experiments. However,
the film soft bake and cure recipes were the same. The sequence of this
process was: (a) dispense various amounts (depending on the viscosities,
but generally from 6 to 10 ml) of FPI-A and FPI-B solutions on 200 mm
patterned wafers; (b) apply the first spin; (c) apply a spin dwell time of
120 s (wafer stays still) if required by the experiment; (d) apply the
second spin; (e) soft bake the films at 110 °C for 60 s in air on the hot
plate; (f) cure the wafer in the oven in a heated N2 atmosphere at a
pressure of 4X10"^ Pa through a sequence of annealing at 150 °C for 30
min, ramping up from 150 to 230 °C, annealing at 230 °C for 30 min,
ramping up from 230 to 300 °C, annealing at 300 °C for 30 min, ramping up
from 300 to 400 °C, annealing at 400 °C for 60 min, and cooling down from
400 to 150 °C. Both ramping and cooling rates are 2 °C/min. A detailed
summary of the FPI films prepared for this study and the spin parameters
is given in Table II.

Table III. Experimental results of gap fill dependence on the solid content.

Wafer      Thickness (µm) over    Gap fill capability
ID No.     large metal pads       (columns: 0.8 µm gap, 1.0 µm gap, 1.2 µm gap)
1          0.33                   medium void, filled
2          0.28                   medium void, filled
3          0.84                   small void, filled
4          0.90                   large void, filled
5          1.05                   large void, filled
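The length of the cure sequence described above can be estimated from the stated hold times and the 2 °C/min ramp and cooling rate. The sketch below is only an illustration: the step encoding and names are hypothetical, and it leaves out the initial heat-up to 150 °C and any cooling below 150 °C, since neither is specified in the text.

```python
RAMP_RATE_C_PER_MIN = 2.0  # stated for both ramping and cooling

# ("hold", temperature in C, minutes) or ("ramp", start in C, end in C)
cure_steps = [
    ("hold", 150, 30),
    ("ramp", 150, 230),
    ("hold", 230, 30),
    ("ramp", 230, 300),
    ("hold", 300, 30),
    ("ramp", 300, 400),
    ("hold", 400, 60),
    ("ramp", 400, 150),  # cool-down at the same rate
]


def step_minutes(step):
    """Duration of one cure step in minutes."""
    if step[0] == "hold":
        return float(step[2])
    _, start_c, end_c = step
    return abs(end_c - start_c) / RAMP_RATE_C_PER_MIN


hold_minutes = sum(step_minutes(s) for s in cure_steps if s[0] == "hold")
ramp_minutes = sum(step_minutes(s) for s in cure_steps if s[0] == "ramp")
total_minutes = hold_minutes + ramp_minutes

print(f"holds: {hold_minutes:.0f} min, ramps: {ramp_minutes:.0f} min")
print(f"total: {total_minutes:.0f} min ({total_minutes / 60:.1f} h)")
```

With these inputs the ramps and the cool-down take longer than the isothermal holds, so the ramp rate is the main lever on total cure time.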

The scanning electron microscope (SEM) used for all the cross sectional
analysis was an Amray 1910FE field emission microscope with an
accelerating voltage of 15 kV. Wafers coated with FPI films were first
capped with an Olin CONSI-4000™ conformal silane substitute (a Si-O-C
mixture). Then a focused ion beam (FEI800) was used to prepare clean and
smooth cross sectional samples in order to obtain clear and undistorted
gap fill images of FPI films.

III.  RESULTS  AND  DISCUSSION 
A.  Solid  content 

For the study of gap fill dependence on solid content, five different
FPI-B solutions were used (wafer Nos. 1-5 in Table II). All the solutions
had an identical molecular weight (Mw) of 350 k. However, because they
were diluted by different amounts of NMP solvent, the solid contents of
these solutions varied from 3 to 8 wt %. As a result, their viscosities
ranged from 0.5 to 12 poise. The VM651 adhesion promoter was used before
the FPI film spin process, but a spin dwell time was not employed. In
order to prevent a high nonuniformity in film thickness caused by an
excessively slow spin speed, 1200 rpm was set as the lower limit for the
second spin.

Table III summarizes the gap fill results (SEM micrographs are not shown).
Also listed in the table are the film thicknesses measured by SEM on top
of large metal pads. From Table III, it can be concluded that, at a fixed
molecular weight, the lower the solid content and viscosity of an FPI
solution, the smaller the gap it can fill. However, the improvement is
marginal.

Since the film thickness of FPI-B varies from 0.28 to 1.05 µm on top of
the large metal pads, a question may arise whether gap fill capability is
film thickness dependent. In this study, it was extremely difficult to
achieve an identical final cured film thickness from the five FPI-B
solutions because of their widely spread viscosities and the limitation on
how slow a second spin speed can be used. A separate experiment using the
same solution but a different second spin speed suggests that the
dependence of the gap fill capability on the film thickness is minimal. It
should be noted that the film on wafer No. 1 is thicker than that on wafer
No. 2 although the solid content and the viscosity of the former sample
are lower than those of the latter. This was caused by the reduced second
spin time/speed combination used for wafer No. 1 as compared to that of
wafer No. 2.

Fig. 1. Cross sectional SEM micrographs of cured FPI-A films in the 1.2 and 1.5 µm Al-Cu gaps on wafer Nos. 6, 7, 8, 9, and 10. The scales in these figures are different. In some micrographs, the top surface profile of the Olin CONSI-4000™ conformal coating is visible while the FPI film and CONSI glass interfaces are not. Also visible in some micrographs are the residues and artifacts from sample preparation by focused ion beam. [Panels: 1.5 µm gap and 1.2 µm gap micrographs; separate 1 µm scale bars are given for samples 6 to 9 and for sample 10.]

B.  Adhesion  promoter 

The effect of adhesion promoter on FPI-A gap fill capability can be
investigated by comparing wafer Nos. 7 and 9 (without dwell time) and
wafer Nos. 6 and 8 (with dwell time) in Table II. The NMP solvent spray
was not used in this case. For gaps of identical width (1.2 or 1.5 µm),
wafer Nos. 9 and 8 always have larger voids than wafer Nos. 7 and 6,
respectively (Fig. 1). For example, medium size voids exist in the 1.5 µm
gap on wafer No. 9 while the 1.5 µm gap on wafer No. 7 is completely
filled. For the 1.2 µm gaps, the void size is larger on wafer No. 8 than
on wafer No. 6. It can therefore be concluded that application of the
VM651 adhesion promoter slightly improves the gap fill capability of the
FPI-A film.

C.  Spin  dwell  time 

Further comparison between wafer Nos. 8 and 9 (without adhesion promoter)
and between wafer Nos. 6 and 7 (with adhesion promoter) in Table II
reveals that a spin dwell time of 120 s after the first slow spin and
before the second fast spin of FPI-A solution, which is designed to
provide additional time for the FPI solution to flow into small features,
shows no beneficial effect on gap fill (Fig. 1). Voids of similar size
form in the gaps of identical widths in these two pairs of samples.

D.  Solvent  spray 

An NMP solvent spray step was added after the application
of the VM651 adhesion promoter and before the spin process
of the FPI-A films to investigate the effect of prewetting by
solvent on the gap fill of the FPI-A films (wafer Nos. 7 and 10
in Table II). No spin dwell time was employed here.

A comparison of wafer Nos. 7 and 10 shows that the 1.5
μm gaps were filled while voids formed in the bottom corners
of the 1.2 μm gaps on both wafers (Fig. 1). The voids are
smaller on wafer No. 10 than on wafer No. 7. The NMP solvent
spray therefore has a beneficial effect on gap fill, but the
effect is small.

IV.  CONCLUSIONS 

Slight improvements were observed in the Al-Cu gap fill of
FPI-A and FPI-B films when the solid content was reduced
and when adhesion promoter and solvent spray were applied.
However, spin dwell time has little effect on gap fill.

This study was not aimed at filling the smallest possible gaps,
but rather at investigating how reduced solid content,
application of adhesion promoter and solvent spray, and use
of spin dwell time affect gap fill. For example, the FPIs
used in this study have unnecessarily high molecular
weights. In addition, optimization of spin/bake/cure recipes,
modification of the solvent system by adding cosolvent and
plasticizer, and use of a more advanced spin coater equipped
with a solvent saturation cup are just a few options that can
improve gap fill capability. We have recently achieved excellent
small gap fill with modified solutions containing the
same FPI materials. The results will be published at an
appropriate time.

The Si/SiO2 interface formed by remote plasma enhanced chemical vapor deposition (RPECVD) at
low temperature with SiH4/N2O or SiH4/N2O/Cl2 was studied and compared with that formed by
thermal oxidation. The interface of the CVD SiO2 without chlorine addition is rougher than that with
chlorine addition, but the surface roughness of the CVD SiO2 films increases with chlorine addition.
Thermal oxidation induces strong interface strains, and the strains generated by the CVD SiO2 without
chlorine addition are stronger and are distributed more nonuniformly than those generated by the
chlorinated SiO2. It is believed that chlorine addition during RPECVD affects the initial stages of
deposition, and that chlorine combines with Si dangling bonds at the Si/SiO2 interface through the
formation of Si-Clx bonds. It was also found that with chlorine addition during RPECVD, the
strained layer thickness, interface trap density, and suboxide density could be lowered significantly.
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I.  INTRODUCTION 

The Si/SiO2 interface plays a crucial role in microelectronic
devices and has been the subject of intense study for
over 40 years. During the past decade, remote plasma enhanced
chemical vapor deposition (RPECVD) has emerged
as a viable technique for depositing SiO2. Many applications
of SiO2 films require low temperature processing. For
example, in active matrix liquid crystal display (AMLCD)
applications, low temperature deposition of SiO2 films is
required in order to use low cost glass substrates.

The incorporation of halogen compounds during thermal
oxidation of Si results in a significant improvement in the
electronic properties of the oxide and of the interface formed
with the underlying Si. However, at high halogen concentration
and high temperature, a volatile phase begins to form
at the interface, which may lift the oxide from the silicon,
form bubbles, etch the silicon substrate, and increase the
interface roughness. Therefore, determining an appropriate
amount of halogen compounds is important for optimizing
the oxidation window.

The effect of adding halogen compounds on the Si/SiO2
interface and on the bulk properties of SiO2 films deposited by
PECVD at low temperatures (below 500 °C) is still, to a large
extent, unknown. Recently, Falcony and co-workers reported
on SiO2 films deposited using halogen chemistries, i.e.,
SiF4/H2/N2O, SiF4/SiH4/N2O, and SiCl4/O2, in order to reduce
the incorporation of Si-H and Si-OH bonds and improve the
breakdown field. They substantially reduced the density of
these bonds and raised the breakdown field to 4-6 MV/cm.

a) Present address: Department of Mechanical Engineering, Ohio State University,
206 West 18th Avenue, Columbus, Ohio 43210-1107; Electronic
mail: ypark@vision.postech.ac.kr

b) Author to whom all correspondence should be addressed; Electronic mail:
srhee@vision.postech.ac.kr

In the present article, the interface between the Si substrate
and the SiO2 film obtained by RPECVD at low temperature
with SiH4/N2O and SiH4/N2O/Cl2 was studied using
transmission electron microscopy (TEM), high frequency (1
MHz) capacitance-voltage (C-V) measurement, x-ray photoelectron
spectroscopy (XPS), and electron spin resonance
(ESR). The surface and interface roughness and morphology
of the deposited oxide films were observed by atomic force
microscopy (AFM). For comparison, the Si/SiO2 interface
obtained by thermal oxidation was also studied.

II.  EXPERIMENTAL  PROCEDURE 
A.  Film  preparation 

The deposition of SiO2 on Si was performed in an
RPECVD reactor, as shown in Fig. 1. N2O (99.999% pure)
was introduced into a quartz tube, excited inductively
with 13.56 MHz rf power to produce activated oxygen, and
flowed into the deposition zone. SiH4 (99.99% pure) mixed
with Ar (99.9999% pure) diluent was introduced through the
gas dispersal ring into the region 10 cm downstream of the
plasma. It then mixed with the excited oxygen species
and flowed toward the substrate, 20 cm downstream of
the plasma. Cl2 (99.99% pure) in Ar diluent was also introduced,
12 cm downstream of the plasma.
Commercially supplied 10-20 Ω cm p-type Si wafers of
(111) orientation were used as substrates. A modified RCA
method was used for the predeposition cleaning: (i) dipping in
H2SO4:H2O2 = 3:1 solution for 10 min followed by a deionized
(DI) water rinse, and (ii) dipping in HF:H2O = 1:7 solution
for 30 s followed by a DI water rinse. After the substrate was
loaded in the reaction chamber, it was baked for 30 min to
eliminate moisture from the wet cleaning. Thermal oxidation
was carried out at 1000 °C in a furnace using ultrahigh-purity
oxygen. Deposition was performed at 250-300 °C. The amount
of Cl2 addition ranged from 0 to 10 vol % of the total gas flow.
The total gas flow rate including Ar was 100 sccm, the N2O
flow rate was 12 sccm, and the SiH4 flow rate was 3 sccm.
The reactor pressure was 400 mTorr and the plasma power was 50 W.

Fig. 1. Schematic diagram of the RPECVD system.
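
The flow settings quoted in this subsection can be turned into per-gas flow rates for each chlorine level. The following is a minimal sketch, not the authors' procedure: it assumes that the Cl2 percentage is taken relative to the 100 sccm total (as the text states) and that Ar makes up whatever flow remains, which the text does not state explicitly. The function name `gas_flows` is hypothetical.

```python
def gas_flows(cl2_vol_percent, total_sccm=100.0, n2o_sccm=12.0, sih4_sccm=3.0):
    """Return per-gas flow rates in sccm for one deposition recipe.

    Assumption (not stated in the text): Ar is the balance gas, i.e. the
    Ar flow is the total flow minus the N2O, SiH4, and Cl2 flows.
    """
    cl2_sccm = total_sccm * cl2_vol_percent / 100.0
    ar_sccm = total_sccm - n2o_sccm - sih4_sccm - cl2_sccm
    if ar_sccm < 0:
        raise ValueError("reactive gas flows exceed the total flow")
    return {"N2O": n2o_sccm, "SiH4": sih4_sccm, "Cl2": cl2_sccm, "Ar": ar_sccm}


# Chlorine levels that appear in the figure captions of this article.
for percent in (0, 2, 6, 10):
    print(percent, gas_flows(percent))
```

Under these assumptions the Ar flow falls as the Cl2 flow rises, so the total flow and hence the residence time at fixed pressure stay constant across the chlorine series.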

B.  Film  characterization 

The oxide thickness and refractive index were measured
with an ellipsometer (NIIC EL-101A). AFM observation of the
Si/SiO2 interface requires optimized oxide stripping
conditions. We used the HF-dipping method and imaged the
interfacial morphology in three-dimensional real space. The
oxide film was etched chemically in a buffered HF solution (49%
HF:H2O:C2H5OH = 1:10:110) at room temperature. A
replica of the Si/SiO2 interface can be preserved by this
treatment because bare Si is not attacked by the dilute
HF-based solution. After the HF dipping, the substrates were
rinsed in DI water for 4 min and dried by blowing with dry
N2 gas. It is thought that the Si dangling bonds on the surface
are terminated by hydrogen, which prevents reoxidation of the
surface in air and damage to the original Si/SiO2 interface
morphology.

Cross-sectional specimens for TEM were first glued face
to face, cut, then ground, dimpled, and thinned by ion milling.
TEM observations were conducted using a JEOL JEM-4000FX
transmission electron microscope operating at 400
kV. The surface roughness of the SiO2 film was measured by
AFM using a Park Scientific Instruments (PSI) Autoprobe-CP.
XPS measurements were made using a Perkin Elmer PHI
5400 with Al Kα radiation. The photoelectrons were
collected by a hemispherical analyzer at a 45° take-off angle.
Sputtering for the depth profile was performed at 3.5 kV with
Ar ions. Si 2p core levels were measured and analyzed using
a least-squares curve-fitting method. The paramagnetic defects
were examined by electron spin resonance using a
Bruker ER 200D X-band (microwave frequency: 9.45 GHz)
spectrometer at room temperature.

Fig. 2. TEM images of the Si/SiO2 interface formed by (a) thermal oxidation, (b) CVD without chlorine, and (c) CVD with chlorine
addition (6 vol %). PECVD oxides were deposited at a temperature of 300 °C with SiH4:N2O = 3:12 sccm. The pressure was 0.4 Torr.

Fig. 3. High resolution TEM lattice images of the Si/SiO2 interface formed by (a) thermal oxidation, (b) CVD without chlorine, and (c) CVD with chlorine
addition (6 vol %). PECVD oxides were deposited at a temperature of 300 °C with SiH4:N2O = 3:12 sccm. The pressure was 0.4 Torr.

For electrical characterization, metal-oxide-semiconductor
(MOS) capacitors were fabricated with the deposited
oxides. Aluminum dot electrodes, 0.05 cm in diameter,
were deposited by thermal evaporation through a
metallic mask. The C-V characteristics were analyzed at
high frequency (1 MHz) using a Hewlett-Packard 4275
multifrequency LCR meter with a sweep voltage range of +10 to
-10 V. The flat band voltage and the concentration of oxide
fixed charges were obtained from the high frequency C-V
data. The interface trap density at the Si/SiO2 interface was
also obtained from the C-V data using the Terman
method.
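
As an illustration of how a fixed charge density follows from a measured flat band voltage, the sketch below uses the textbook relation V_FB = φ_ms - Q_f/C_ox for an ideal MOS capacitor with all fixed charge located at the interface. This is an assumption about the analysis, not a description of the authors' procedure. The dot diameter (0.05 cm) comes from the text; the relative permittivity of SiO2 (3.9) is a standard textbook value, and the work function difference `phi_ms` and the flat band voltage passed in the usage lines are hypothetical inputs.

```python
import math

EPS0_F_PER_CM = 8.854e-14  # vacuum permittivity (F/cm)
K_SIO2 = 3.9               # relative permittivity of SiO2 (textbook value, assumed)
Q_E = 1.602e-19            # elementary charge (C)


def oxide_capacitance(thickness_nm, dot_diameter_cm=0.05):
    """Return (capacitance per unit area in F/cm^2, total capacitance in F)."""
    area_cm2 = math.pi * (dot_diameter_cm / 2.0) ** 2
    c_per_area = K_SIO2 * EPS0_F_PER_CM / (thickness_nm * 1e-7)
    return c_per_area, c_per_area * area_cm2


def fixed_charge_density(v_fb, phi_ms, c_per_area):
    """Fixed charge density (cm^-2) from V_FB = phi_ms - Q_f / C_ox."""
    return c_per_area * (phi_ms - v_fb) / Q_E


# Usage with a 100 nm oxide; v_fb and phi_ms are hypothetical example values.
c_area, c_total = oxide_capacitance(100.0)
n_f = fixed_charge_density(v_fb=-1.5, phi_ms=-0.9, c_per_area=c_area)
print(f"C_ox = {c_total * 1e12:.1f} pF, N_f = {n_f:.2e} cm^-2")
```

A flat band voltage more negative than φ_ms corresponds to net positive fixed charge in this convention.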

III.  RESULTS  AND  DISCUSSION 

Figure 2 shows the TEM images of the Si/SiO2 interface
formed by thermal oxidation and by chemical vapor deposition
(CVD) from SiH4/N2O and SiH4/N2O/Cl2. Sections with
homogeneous oxide thickness were obtained for these three
samples. The thicknesses of the SiO2 films were found to be
about 96, 140, and 100 nm, respectively. A pattern of
thickness fringes was observed in the Si substrate below the
oxide in all three samples, indicating oxide-induced
strains in the lattice. For the thermally oxidized sample,
three straight fringes with clear contrast exist between
the surface of the Si substrate and a depth of about 12
nm, whereas only one fringe with poor contrast was observed
for the SiO2 samples deposited by CVD with and without
chlorine. This indicates that thermal oxidation induces strong
interface strains. The fringe in the SiO2 sample without chlorine
addition is thicker and more nonuniform than that in the
sample with chlorine addition, indicating that the strains
generated by the chlorine-free CVD SiO2 are stronger and
are distributed more nonuniformly than those generated by the
chlorine-added CVD SiO2.

Fig. 4. Two-dimensional AFM line scans of the Si surface after stripping
the SiO2: (a) bare Si, (b) thermal oxidation, (c) CVD without chlorine, and
(d) CVD with chlorine addition (6 vol %). PECVD oxides were deposited at
a temperature of 300 °C with SiH4:N2O = 3:12 sccm. The pressure was 0.4
Torr.

Fig. 5. Three-dimensional AFM images of the SiO2 surfaces formed by (a) thermal oxidation, (b) CVD without chlorine, (c) CVD with chlorine addition (6
vol %), and (d) CVD with chlorine addition (10 vol %). PECVD oxides were deposited at a temperature of 250 °C with SiH4:N2O = 3:12 sccm. The pressure
was 0.4 Torr.

Figure 3 shows the high resolution TEM lattice images of
the Si/SiO2 interface formed by thermal oxidation and by
chemical vapor deposition from SiH4/N2O and
SiH4/N2O/Cl2. Taking the largest protrusion or step height
observed in a sample as a measure of the Si/SiO2 interface
roughness, one sees that the interface roughnesses of the
thermally oxidized, chlorine-free, and chlorine-added CVD
SiO2 samples are about 1, 0.67, and 0.33 nm, respectively.
For thermal oxidation, it has been reported that the interface
roughness results from protrusions of Si into the SiO2
rather than from oxidized valleys in the silicon surface. This
suggests locally retarded oxidation rather than a local
acceleration of the reaction rate. The interface protrusions
lead directly to a variation in oxide thickness, and this local
thickness variation indicates nonuniform oxidation.

Compared with thermal oxidation, RPECVD affects the
underlying substrate much less. Cl2 addition clearly
decreases the interface roughness [Fig. 3(c)]. Lattice strain
fields in Si near the interface were observed in all three
samples. The lattice strain field in the thermally oxidized
sample is more irregular than those in the CVD SiO2
samples. The strain field in the RPECVD sample with chlorine
addition is weaker than that in the chlorine-free CVD
SiO2 sample. It seems that Cl2 addition can relax the
interface strain by breaking the strained SiO2 ring structure.
Beyond the Si/SiO2 interface roughness, there were no apparent
structural defects or irregularities associated with the
thin oxide films. We did not observe any evidence of Si
inclusions in the SiO2, amorphous oxide regions in the
near-interface Si, crystalline patches of oxide, metallic or other
precipitates, or oxide pinholes. In thermal oxidation, Si is
consumed to generate the oxide film, so that the Si/SiO2
interface is located in the bulk of the Si, below the original Si
surface. Similarly, the Si/SiO2 interface produced by remote
PECVD is also formed by a subcutaneous oxidation of the Si
substrate that occurs at the initial stage of the film
deposition. This generates an unwanted interfacial oxide
0.5-1 nm thick. At deposition temperatures above
300 °C and deposited oxide thicknesses above 25 nm, the
amount of subcutaneous oxidation increased significantly,
and the electronic quality of these interfaces rapidly degraded.
The mechanism for defect generation may be related to a
buildup of stress in the subcutaneous layer. This stress derives
from a molar volume mismatch between Si and SiO2
and cannot be relieved through viscoelastic relaxation
at the low temperatures of this RPECVD process. However,
when chlorine is added to the RPECVD process gases,
it can adsorb strongly on the initial Si surface and prevent
further subcutaneous oxidation by the active oxygen species
generated by the remote plasma. It is believed that chlorine
addition improves the surface states by reducing surface
roughness and interface trap density.

Fig. 6. XPS depth profiles of SiO2 samples: (a) CVD without
chlorine and (b) CVD with chlorine addition (6 vol %). PECVD oxides were
deposited at a temperature of 300 °C with SiH4:N2O = 3:12 sccm. The
pressure was 0.4 Torr.

Figure 4 shows two-dimensional AFM line scans of the
Si/SiO2 interface after the oxide film was removed.
Compared with bare Si after HF dipping (rms = 0.17 nm),
the rms values of surface roughness are 0.35 nm for
thermal oxidation and 0.51 and 0.26 nm for CVD without
and with chlorine addition, respectively. The AFM result is
a better guide to the Si/SiO2 interface roughness because
high resolution TEM examines only a very small
spot. CVD with chlorine addition gives the lowest interfacial
roughness of the three oxides. Roughness at the interface
reduces the mobility of charge carriers in MOS-based devices,
and local field enhancement across the oxide lowers the oxide
breakdown field strength. Interface states and also protrusions
or hillocks can act as electron emitters, thus reducing the
positive charge centers near the interface.

Fig. 7. XPS curve fitting for the Si 2p core level: (a) CVD without chlorine
and (b) CVD with chlorine addition (2 vol %). PECVD oxides were deposited
at a temperature of 300 °C with SiH4:N2O = 3:12 sccm. The pressure
was 0.4 Torr.
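
The rms roughness values quoted for the AFM line scans of Fig. 4 are the root-mean-square deviation of the height profile about its mean. The following is a minimal sketch of that calculation, assuming a list of height samples in nanometers; the sample profile is hypothetical illustration data, not measured data, and AFM software typically also removes tilt or bow before this step, which the sketch omits.

```python
import math


def rms_roughness(heights_nm):
    """Root-mean-square deviation of a height profile about its mean (nm)."""
    mean_height = sum(heights_nm) / len(heights_nm)
    squared_dev = sum((z - mean_height) ** 2 for z in heights_nm)
    return math.sqrt(squared_dev / len(heights_nm))


# Hypothetical line scan: heights sampled along the trace direction.
profile_nm = [0.10, -0.25, 0.30, -0.05, 0.20, -0.30, 0.15, -0.15]
print(f"rms roughness = {rms_roughness(profile_nm):.3f} nm")
```

Because the mean is subtracted first, a constant height offset between scans does not change the result.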


Figure 5 shows three-dimensional (3D) AFM images of the
CVD oxide surfaces with chlorine addition. The oxide surface
roughness depends on the amount of chlorine added and on
the deposition temperature. The rms values of surface
roughness for the thermal oxide and for CVD SiO2
without chlorine addition were 0.2 and 1.6 nm, respectively.
With chlorine addition, the rms value increased
from 3.1 nm (2 vol % addition) to 4.2 nm (6 vol % addition)
at a deposition temperature of 300 °C, and it was
15.2 nm with 10 vol % addition. Surface roughening was
mainly due to the surface desorption of halide groups such as
Si-Clx and O-Cl.

Figure 6 shows the XPS depth profiles obtained with Ar ion
sputtering. With chlorine addition, about 5 at. % of chlorine
piles up at the Si/SiO2 interface, irrespective of the amount
of chlorine added. It is believed that chlorine addition
affects the initial stages of deposition, i.e., it neutralizes the Si
dangling bonds through the formation of Si-Cl bonds. The
substitution of the stronger Si-Cl bonds for the weak Si-O
bonds can relax the strains at the Si/SiO2 interface. The film
stoichiometry, defined by the O/Si ratio, was still lower than
2, and there was no detectable chlorine in the bulk oxide. It
seems that chlorine in the bulk oxide etches the SiO2
ring structure and desorbs.

Fig. 8. The density of various intermediate Si states as a function of chlorine
addition. The closed symbols represent the thermal oxide. PECVD oxides
were deposited at a temperature of 300 °C with SiH4:N2O = 3:12 sccm. The
pressure was 0.4 Torr.

Figure 7 shows Si 2p spectra at the interface for CVD
SiO2 without and with chlorine addition. The Si 2p peaks
show the chemical binding state of Si at the
Si/SiO2 interface under the various deposition conditions. As
mentioned in the TEM results, the transition (strained and
mismatched) layer consists of nonbonded silicon in a
nonstoichiometric oxide and chloride with intermediate oxidation
states, i.e., Si1+, Si2+, and Si3+. The Si 2p peak was
deconvoluted into intermediate oxidation states, which
determine microscopic oxide interface properties such as the
interface trap density. The silicon-chlorine bonds are located at
100.15 eV for Si-Cl, 101.22 eV for Si-Cl2, and 102.34 eV
for Si-Cl3, but no attempt was made to separate the
silicon suboxide and subchloride contributions.

Figure 8 shows the intermediate Si bond density obtained
by XPS, based on the well established method of calculating
the suboxide density,

$$N_{\mathrm{SiO}_x} = N_{\mathrm{Si}}\,\lambda_{\mathrm{Si}}\,\frac{I_{\mathrm{SiO}_x}}{I_{\mathrm{Si}}}\,\frac{\sigma_{\mathrm{Si}}}{\sigma_{\mathrm{SiO}_x}}\,\sin\theta, \qquad (1)$$

where $N_{\mathrm{Si}}$ (5.0×10^22 cm^-3) is the atomic density of silicon,
$\lambda_{\mathrm{Si}}$ (0.33 nm) is the photoelectron mean free path in silicon,
$I$ is the integrated peak intensity of the suboxide or the bulk
silicon, $\theta$ (45°) is the photoelectron take-off angle, and $\sigma$ is the
photoionization cross section ($\sigma_{\mathrm{SiO}_x}/\sigma_{\mathrm{Si}}$ ≈ 1 [...]
for Si1+, Si2+, and Si3+, respectively). The calculated values
for the intermediate densities may not be absolute but are
useful in showing relative changes as a function of deposition
parameters. For Si/SiO2-based devices, the interface
properties will certainly be affected by the microscopic
structure at the Si/SiO2 interface. With increasing chlorine
addition, the intermediate silicon bond density decreases
compared with the thermal oxide and the CVD oxide without
chlorine addition. This decrease means that chlorine at the
interface reduces the mismatched suboxide density. At high
chlorine partial pressure, however, the intermediate silicon
bond density increases because etching becomes dominant in
its competition with deposition, as shown in the AFM image
[Fig. 5(d)], where the deposited SiO2 film surface is
also roughened by the desorption of halide groups. Too much
chlorine in the SiO2 deposition process may therefore
degrade the bulk properties as well as the interface properties.

Fig. 9. Interface trap density distributions in the Si energy band gap: (solid line)
thermal oxidation, (dotted line) CVD without chlorine, and (dashed line) CVD with chlorine
addition (2 vol %). PECVD oxides were deposited at a temperature of
300 °C with SiH4:N2O = 3:12 sccm. The pressure was 0.4 Torr.
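
To make the use of Eq. (1) concrete, the sketch below evaluates a suboxide areal density from an intensity ratio. It assumes that Eq. (1) has the standard thin-overlayer form N_SiOx = N_Si λ_Si (I_SiOx/I_Si)(σ_Si/σ_SiOx) sin θ, with the constants quoted in the text; the intensity ratio and cross-section ratio in the usage line are hypothetical inputs, not values from this work.

```python
import math

N_SI_CM3 = 5.0e22        # atomic density of silicon (cm^-3), from the text
LAMBDA_SI_CM = 0.33e-7   # photoelectron mean free path in Si (0.33 nm), from the text
TAKEOFF_DEG = 45.0       # photoelectron take-off angle, from the text


def suboxide_density(i_suboxide, i_si, sigma_ratio):
    """Areal density (cm^-2) of one intermediate oxidation state.

    i_suboxide, i_si: integrated Si 2p peak intensities (same arbitrary units).
    sigma_ratio: sigma_SiOx / sigma_Si for that oxidation state.
    """
    return (N_SI_CM3 * LAMBDA_SI_CM
            * (i_suboxide / i_si) / sigma_ratio
            * math.sin(math.radians(TAKEOFF_DEG)))


# Hypothetical example: suboxide peak at 10% of the bulk Si peak, ratio 1.0.
print(f"{suboxide_density(0.1, 1.0, 1.0):.3e} cm^-2")
```

Because only the intensity ratio enters, the result is insensitive to the absolute count rate, which is consistent with the remark that the values are most useful for relative comparisons.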

Figure 9 shows the interface trap density distributions of
the interfaces formed by thermal oxidation and by CVD without
and with chlorine addition. Peaks appear mainly at 0.3-0.35 eV
and 0.7-0.75 eV above the valence band edge (E_v); they
have been attributed to the trivalent Si dangling bond
(•Si≡Si3, where • denotes an unpaired electron and ≡ denotes
three back bonds to the central atom's nearest neighbors)
near the Si/SiO2 interface. These Si dangling bonds
are called P_b centers. The P_b center is a major source of
interface traps in the 0.15-0.9 eV range of the Si energy band
gap. The interface trap densities at E_v + 0.3-0.35 eV for
thermal oxidation and for CVD without and with chlorine
addition are 1.2X10^^, 8X10^^, and 3.5X10^^ eV^-1 cm^-2,
respectively. The CVD SiO2 without chlorine addition has the
highest interface trap density. Cl2 addition during RPECVD
clearly decreases the interface trap density
greatly. Chlorine incorporation could reduce the interface
trap density by passivating positively charged Si dangling
bonds through the formation of Si-Clx bonds, and could
inhibit the incorporation of hydrogen and water through the
formation of H-Cl or HO-Cl bonds.

Poindexter et al. have extensively studied the P_b center
defect using ESR and found a very strong correlation between
the P_b center spin density and the interface trap density
in oxides on both Si(111) and Si(100) surfaces. Figure 10
shows ESR spectra for three differently prepared CVD
samples compared with the thermal oxide; the peak positions
are around g = 2.0056.

Fig. 10. ESR spectra of SiO2 formed by (a) CVD without chlorine, (b) and (c) CVD with
chlorine addition (2 and 4 vol %), and (d) thermal oxidation. PECVD oxides were deposited
at a temperature of 250 °C with SiH4:N2O = 3:12 sccm and the pressure was 0.4 Torr.
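
For orientation, the magnetic field at which a line with g = 2.0056 appears in a 9.45 GHz X-band spectrometer follows from the resonance condition hν = g μ_B B. The following is a short sketch of that arithmetic using CODATA values for the constants; the function name is hypothetical.

```python
H_PLANCK = 6.62607015e-34      # Planck constant (J s)
MU_BOHR = 9.2740100783e-24     # Bohr magneton (J/T)


def resonance_field_tesla(freq_hz, g_factor):
    """Magnetic field (T) satisfying h * nu = g * mu_B * B."""
    return H_PLANCK * freq_hz / (g_factor * MU_BOHR)


# Microwave frequency and g value quoted in the text.
field = resonance_field_tesla(9.45e9, 2.0056)
print(f"resonance field = {field * 1e3:.2f} mT")
```

The same relation can be inverted to read a g value from a measured resonance field once the microwave frequency is known.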

The correlation between the interface trap density at
E_v + 0.35 and E_v + 0.75 eV and the peak amplitude of the
P_b center at g = 2.0056 in ESR measurements is shown in
Fig. 11. We can conclude that interfacial states resulting from
lattice mismatch, suboxide states, interface roughness, and
nonbonded defects are passivated by the addition of small
amounts of chlorine during the low temperature CVD process.

Fig. 11. Correlation between the interface trap density at E_v + 0.35 and 0.75
eV and ESR amplitude at g = 2.0056 for samples shown in Fig. 10. The
interface trap density at E_v + 0.35 eV (△) and +0.75 eV (○), at midgap (□),
and at the peak amplitude of ESR (▽). The closed symbols represent the
thermal oxide (▲, ●, ■, and ▼).


IV.  CONCLUSIONS 

We have studied and compared the Si/SiO2 interfaces
formed by thermal oxidation and by RPECVD with and without
chlorine addition. The thermal oxide has a rough interface,
and the interface of the CVD SiO2 without chlorine addition
is rougher than that with chlorine addition. Thermal oxidation
induces strong interface strains. The strains generated
by the CVD SiO2 without chlorine addition are stronger and
are distributed more nonuniformly than those with chlorine
addition. It was also found that with chlorine addition during
RPECVD, the occurrence of strained layers, the interface
trap density, and the suboxide density could be lowered
significantly. It is suggested that chlorine addition during
RPECVD affects the initial stages of deposition, i.e., it
neutralizes the Si dangling bonds through the formation of Si-Cl
bonds. Although a large amount of chlorine roughened
the deposited SiO2 surface, an appropriate amount of
chlorine addition yields device quality low temperature
SiO2 films.

Silicon dioxide films were grown using an oxygen plasma generated by an electron cyclotron
resonance (ECR) source at several low temperatures. The plasma oxidation rate was investigated by
varying the growth parameters. The oxide thickness increases parabolically with microwave power
but decreases with increasing pressure or flow rate. A complementary model of the Deal-Grove
oxidation theory is suggested for the plasma oxidation, and kinetic parameters are compared with
those for other plasma and thermal oxidation cases. The diffusion rate constant in O2 plasma oxidation at
room temperature is enhanced up to the level of the diffusion rate in thermal oxidation, and the
reaction rate constant is much larger than in the thermal oxidation case. This may imply that, because
oxygen is dissociated into atoms by the ECR plasma, plasma oxidation is governed by atomic diffusion
through the oxide layer and atomic chemical reaction at the Si-SiO2 interface. The high quality of
the ultrathin oxide film is shown by a breakdown field of 14.8 MV/cm and an interfacial
state density of 1.2X10^^ eV^-1 cm^-2. © 1996 American Vacuum Society.


I.  INTRODUCTION 

In silicon technology, the quality of ultrathin oxide films
has become a crucial factor for ultralarge scale integration
(ULSI) devices because of the decreasing size of integrated
circuits. Conventional thermal oxidation of silicon raises
questions about its applicability to ULSI technology. At
high temperatures, potential problems of thermal oxidation
such as dopant diffusion, impurity redistribution, defect
formation, and thermally induced stress can seriously degrade
device performance. A low-temperature oxidation process is
therefore highly desirable for high
quality ultrathin dielectric films. In order to reduce the
processing temperature and to increase the growth rate, many
studies have been undertaken on sputtered gate oxide films and
on films deposited by plasma enhanced chemical vapor deposition
(PECVD). However, the deposited films
show interfacial instability, with a large density of gap states
generated by hydrogen atoms incorporated during the deposition
process, so their electrical properties are not comparable
with those of thermal oxides.

High quality films can be grown by low-temperature
plasma oxidation. As one kind of plasma oxidation process,
plasma anodization has improved on the low oxidation rate of
plasma oxidation. Anodization with two electrodes, which
applies an external positive bias to the sample, enhances the
diffusion of oxygen ions into the oxide layer. Unfortunately,
contamination caused by electrode sputtering is
unavoidable. O2 plasma oxidation without an external bias
shows an even larger oxidation rate than thermal oxidation.
The importance of plasma oxidation without external bias
increases as the device geometry of ULSI decreases. Recently, an
electron cyclotron resonance (ECR) plasma source has been
developed that has several advantages, such as a low plasma
potential, a high ionization rate (10%), dissociation of oxygen
at low pressure (mTorr), and a long plasma lifetime.
ECR plasma oxidation yields films whose dielectric properties
are comparable to those of thermal oxides.

Usually,  the  driving  force  for  both  the  chemical  reaction 
and  the  diffusion  process  is  the  thermal  activation  energy  in 
the  thermal  oxidation  of  silicon.  Instead  of  using  the  thermal 
energy,  plasma  oxidation  utilizes  a  highly  activated  oxygen 
plasma  including  excited  neutral  oxygen  atoms  and  reactive 
radicals of oxygen ions. In an ECR plasma, optical emission
spectra are observed for neutral oxygen atoms (O I) and
molecular oxygen ions (O₂⁺)."^’^ Without the ECR condition,
the oxygen plasma emits spectra not only from O I and O₂⁺,
but also from [...]. However, Kimura et al.⁴ suggested the
migration of O⁻ ions into the oxide. Carl, Hess, and
Lieberman⁵ also suggested that the strong electric field
induced by the plasma self-bias on the oxide layer can
conduct O⁻ ions through the oxide. Vinckier et al.⁸ showed
that charged species generated from the microwave discharge
in afterglows have no effect on the oxidation rate due to
poor ionization efficiency. Joseph, Hu, and Irene^
demonstrated the effect of applied bias, suggesting O⁻ as
the main oxidizing species. These experimental findings
contradict each other and provide little explanation for the
mechanism of plasma oxidation without an external bias. The
plasma oxidation kinetics parameters
derived from the Deal-Grove model are still controversial.
The reaction rates have been reported with a positive value
by Carl et al. but with a negative value by Kimura et al.
In this work, we systematically analyze the kinetic
parameters from the complete form of the oxidation model.
Furthermore, the ECR plasma oxidation experiment without an
applied bias on samples was carried out to confirm the
plasma oxidation kinetics, and the oxidation rate was studied
in terms of processing pressure, microwave power, substrate
temperature, and oxidation time. The experimental results for
plasma oxidation were fitted by the present formalism of
oxidation theory. Furthermore, the electrical properties,
namely the breakdown field and the interfacial state density,
were characterized from metal-oxide-semiconductor (MOS)
capacitor structures.

Fig. 1. Schematic of the divergent magnetic field-type ECR source.

II.  EXPERIMENTAL  METHODS 

A  cross-sectional  schematic  view  of  the  ECR  system  of 
Astex,  Inc.  is  shown  in  Fig.  1.  A  microwave  generated  by  a 
magnetron  was  guided  through  the  waveguide  to  the  ECR 
cavity.  The  waveguide  was  separated  from  the  ECR  cavity 
by  a  quartz  disk.  The  ECR  condition  was  adjusted  by  the 
three-stub  tuner  and  accordingly  the  electromagnetic  mode 
could  be  changed.  In  order  to  reproduce  the  process,  it  was 
important  to  choose  a  distinct  electromagnetic  mode  that 
could  provide  a  stable  plasma.  The  ECR  source  was  excited 
by  a  microwave  power  supply  at  2.45  GHz  with  the  range 
50-1000  W.  The  ECR  source  was  surrounded  by  magnetic 
solenoids to obtain the resonance condition. Sufficient
electron kinetic energy could be obtained by introducing a
steady magnetic field $B = 875$ G, for which the electron
cyclotron frequency $\omega_c = eB/mc$ in the resonance
chamber resonates with the applied microwave frequency
$\omega = 2.45$ GHz.
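The resonance condition quoted above can be checked numerically. The following is a minimal sketch, assuming the SI form of the cyclotron relation, f_c = eB/(2πm) (the text writes the Gaussian form eB/mc), and CODATA values for the constants; the function name is illustrative only.

```python
import math

ELEMENTARY_CHARGE_C = 1.602176634e-19   # exact SI value
ELECTRON_MASS_KG = 9.1093837015e-31     # CODATA 2018 value

def cyclotron_frequency_hz(b_gauss: float) -> float:
    """Electron cyclotron frequency f_c = e*B / (2*pi*m) for a field in gauss."""
    b_tesla = b_gauss * 1.0e-4          # 1 G = 1e-4 T
    return ELEMENTARY_CHARGE_C * b_tesla / (2.0 * math.pi * ELECTRON_MASS_KG)

f_c = cyclotron_frequency_hz(875.0)
print(f"f_c at 875 G: {f_c / 1e9:.3f} GHz")
```

With B = 875 G the result lies within about 0.1% of the 2.45 GHz microwave frequency, which is why this field value defines the ECR zone.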

Pure  oxygen  gas  (99.99%)  entered  the  ECR  source 
through  a  shower  ring.  The  distribution  of  magnetic  flux  was 
of  mirror  type  in  the  resonator  and  of  divergent  type  in  the 
discharge  chamber.  This  divergent  magnetic  field  could  ac¬ 
celerate  the  activated  oxygen  radical  stream  to  the  sample  by 
a  magnetic  longitudinal  force  regardless  of  the  species 
charge. The samples were negatively self-biased at 12 V by
the induced sheath potential of the ECR plasma, without any
applied bias. The distance from the shower ring to the
sample stage was adjustable and was set to 7.5 cm to optimize
the uniformity of the plasma density. The sample stage was
made of stainless steel, and the temperature was measured
using a resistance temperature detector placed directly
underneath the stage. The temperature of the substrate holder
could be controlled from room temperature up to 400 °C.
The  reflected  power  was  kept  below  10%  of  the  forward 
microwave  power.  To  maintain  the  low  pressure,  the  system 
was evacuated by a 1000 l/s turbomolecular pump. The
chamber  was  pumped  out  to  a  base  pressure  of  2X 10“^  Torr 
before each run. Plasma oxidation was carried out in the
pressure range of about (1-2)×10⁻⁴ Torr. The processing
pressure is directly related to the oxygen flow rate.

The samples used for oxidation were boron-doped Si(100)
wafers with a resistivity of 3-10 Ω cm and a diameter of
7.5 cm. These samples were cleaned with a standard RCA-type
wet chemical clean. Organic material on the surface was
dissolved in a boiling 4:1 mixture of H₂SO₄ and H₂O₂ at
110 °C for 10 min. The samples were then dipped in a 20:1
diluted solution of H₂O:HF for 10 s to etch the native oxide
layer. The wafers were rinsed in deionized H₂O at room
temperature after each of these steps. However, in spite of
the wet chemical cleaning, a native oxide with a thickness of
10 Å still existed. The oxide thickness was measured by a
Rudolph ellipsometer II. A nominal index of refraction of
1.462 was used for oxide thickness measurements. The
breakdown field and interface charge trap density were
obtained from I-V and C-V measurements on MOS capacitors.
These MOS capacitor structures were fabricated by depositing
1-μm-thick Al on both sides of the wafers. The dot size
of  the  capacitors  was  7.85  X 10”^  cm^  in  area. 

III.  GROWTH  PROPERTIES  IN  PLASMA  OXIDATION 

Instead  of  using  thermal  activation  in  conventional  high- 
temperature  oxidation,  the  ECR  plasma  enhances  oxidation 
by  increasing  the  activated  oxygen  species,  and  so  plasma 
oxidation  is  useful  for  a  low-temperature  process  in  semicon¬ 
ductor  technology.  When  a  low-pressure  gas  is  introduced, 
the  gas  breaks  down  and  a  discharge  forms  inside  the  cham¬ 
ber.  The  oxygen  plasma  diffuses  along  the  expanding  mag¬ 
netic  field  lines  into  a  process  chamber  toward  a  wafer 
holder.  The  oxidation  rate  asymptotically  decreases  with  in¬ 
creasing  pressure  or  O2  flow  rate  as  shown  in  Fig.  2.  This  is 
due  to  the  higher  plasma  density  at  lower  pressure  as  men¬ 
tioned  by  other  groups.^’^^  However,  at  very  low  pressure, 
the  oxidation  rate  decreases  with  increasing  intensity  of  the 
ion  peak.^ 

The optical emission spectroscopy (OES) shows several
peaks of excited neutral atomic oxygen (O I) and positively
charged molecular oxygen (O₂⁺). There are several spectral
lines associated with O I in the plasma at 4368, 6158, 5329,
3947, 7775, and 8446 Å. There are also a number of
relatively broad bands associated with ionized molecular
oxygen (O₂⁺), the most prominent of which are at 5251, 5597,
5973, and 6351 Å. In particular, Kimura et al.⁴ demonstrated
the oxygen pressure dependence of the emission intensity from
O I (436.8 nm) and O₂⁺ (525.1 nm). The maximum intensities
of the two species are located at 0.1-0.2 mTorr, consistent
with our pressure dependence of the oxide thickness as shown in
Fig. 2. They showed that the dissociation of O₂ molecules
into O I, with an energy of 5 eV, likely occurs in the
high-pressure region due to the low electron cyclotron
energy, while the electron energy is high enough to give rise
preferentially to ionization (13 eV) in the low-pressure
region.

Fig. 2. Oxygen pressure dependence of oxide thicknesses with 1000 W at
room temperature for 30 min. The thickness is increased by lowering the
oxygen pressure, which is the inverse of the thermal oxidation case,
and the pressure for the maximum oxidation rate is consistent with the
maximum intensities of the oxidizing plasma species (Ref. 4).

The effect of microwave power on the oxide thickness is
shown in Fig. 3. The oxide films were grown for 30 min at a
flow rate of 10 sccm, a pressure of 0.1 mTorr, and a distance
of 7.5 cm from the source. A parabolic dependence of the
oxidation rate on the incident microwave power $P$ is
observed and fitted with the curve

$$x - x_0 = a_0 P^{\alpha}, \tag{1}$$

where $x$, $x_0$, and $a_0$ are the oxide thickness grown for
30 min, the native oxide thickness, and the proportionality
constant, respectively. For the present work, Sung and
Pang,⁶ and Vinckier et al.,⁸ the native oxide thicknesses
are 10, 15, and 18.7 Å, respectively. The exponent $\alpha$
is fitted to 0.5 for both the present work and the work of
Sung and Pang and to 0.86 for the work of Vinckier et al.

Fig. 3. Oxide thickness as a function of the microwave power. The
experimental conditions are as follows: pressure = 1 mTorr and oxidation
time = 100 min for Sung and Pang (Ref. 6); pressure = 1.05 Torr and
oxidation time = 4 h for Vinckier et al. (Ref. 8); pressure = 0.1 mTorr and
oxidation time = 30 min for the present work. As the power increases, the
oxide thickness increases parabolically, implying a direct contribution of
the momentum of the O atoms.

The dependence of the O-atom concentration on the applied
microwave power was measured by Vinckier et al. They showed
that the atomic oxygen concentration increases linearly with
microwave power and that the oxide thickness scales with the
O-atom concentration $C_{\mathrm{O}}$ according to the relation

$$x^{1.16} - x_0^{1.16} = A\,C_{\mathrm{O}}, \tag{2}$$

where $A$ is a constant. The relationship between oxide
thickness and power may support the belief that the O atom
diffuses to the interface as the oxidant. The oxidation rate
may be directly related to the mass transport or the
momentum of the O atom. Vinckier et al.⁸ also showed that the
voltage  applied  on  samples  has  no  effect  on  the  oxidation 
rate.  This  implies  that  charged  species  generated  from  the 
oxygen  plasma  do  not  play  a  role  in  the  oxidation  process  of 
silicon  and  that  the  major  oxidizing  species  is  the  O  atom. 
On  the  other  hand,  Joseph,  Hu,  and  Irene^^  demonstrated  the 
effect  of  applied  bias  on  the  oxidation  rate.  They  suggested 
that a positive bias on the samples provides significant
enhancement of the oxidation process and that the most
probable oxidizing species is O⁻. The oxidation kinetics by
charged species in the limit of a low electric field is
compatible with the Cabrera-Mott theory, which holds that
oxidation occurs through the movement of both Si outward as
a cation in the interlayer and the oxidant species O⁻ inward
as an anion. However, there is still no evidence in the
optical emission spectra for O⁻ ions in the ECR oxygen plasma
without an external bias.
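The power-law dependence of oxide thickness on microwave power discussed in this section, Eq. (1), can be fitted by linear regression in log-log space. The sketch below assumes the form x − x₀ = a₀·P^α, with the exponent written here as `alpha`; the data are synthetic, generated from a hypothetical prefactor `TRUE_A0` and the exponent 0.5 quoted in the text, and are not measurements from this work.

```python
import math

def fit_power_law(powers_w, thicknesses, x0):
    """Least-squares fit of log(x - x0) = log(a0) + alpha * log(P)."""
    logs_p = [math.log(p) for p in powers_w]
    logs_x = [math.log(x - x0) for x in thicknesses]
    n = len(logs_p)
    mean_p = sum(logs_p) / n
    mean_x = sum(logs_x) / n
    s_pp = sum((lp - mean_p) ** 2 for lp in logs_p)
    s_px = sum((lp - mean_p) * (lx - mean_x) for lp, lx in zip(logs_p, logs_x))
    alpha = s_px / s_pp                       # slope = exponent
    a0 = math.exp(mean_x - alpha * mean_p)    # intercept = log(a0)
    return a0, alpha

X0_ANGSTROM = 10.0                  # native oxide thickness quoted for the present work
TRUE_A0, TRUE_ALPHA = 2.0, 0.5      # hypothetical prefactor; exponent from the text

# Synthetic thickness data (Å) over the 50-1000 W range of the source.
powers = [100.0, 200.0, 400.0, 600.0, 800.0, 1000.0]
thick = [X0_ANGSTROM + TRUE_A0 * p ** TRUE_ALPHA for p in powers]

a0_fit, alpha_fit = fit_power_law(powers, thick, X0_ANGSTROM)
print(f"a0 = {a0_fit:.3f}, alpha = {alpha_fit:.3f}")
```

An exponent near 0.5 corresponds to the parabolic behavior described above, while a value near 0.86 would correspond to the fit quoted for Vinckier et al.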

Our concern is the ECR system without an applied bias on
the samples, in which a sheath potential of −12 V is
self-biased between the plasma and the sample. This implies
that the O⁻ ions are repelled by the negative potential on
the silicon surface. Therefore, without bias on the samples,
the primary oxidizing species is concluded to be the neutral
excited oxygen atoms generated from the dissociation of O₂
molecules by the ECR source. Some dissociated O atoms can
recombine with electrons on the silicon surface to reduce the
large electronegativity of the oxygen atom. Although the
growth parameters differ among the available experimental
data, the oxidation rate as a function of substrate
temperature is shown in Fig. 4. Kimura et al.⁴ and Carl,
Hess, and Lieberman⁵ demonstrated that the oxidation rate
increases with increasing temperature above 350 °C. Sung and
Pang⁶ reported a low rate at a room temperature of 16 °C, as
did Lee et al.¹¹ at 400 °C. Our measurement of the rates
below 350 °C shows that the rate decreases slightly with
increasing temperature. This may imply that the dissociated
neutral oxygen atom is simultaneously diffused into the
oxide layer below 350 °C and then thermalized above 350 °C,
indicating the inclusion of the thermal diffusion effect on
plasma oxidation at high temperatures.

Fig. 4. Temperature dependence of oxidation rates for an oxidation time of
30 min. Despite the different experimental growth conditions, there is a
clear trend in the growth rate. The rate gradually decreases below
350 °C, while it increases above that temperature.

IV. PLASMA OXIDATION KINETICS

Many authors introduced the Deal-Grove model in order
to understand plasma oxidation kinetics.^"^’^^ Deal and
Grove proposed that the oxidation rate was determined by a
combination of two processes. One is the actual chemical
reaction of oxygen with silicon at the interface, and the other
is the diffusion of oxygen through the previously formed
oxide film. The combination of these processes resulted in
the linear-parabolic relationship between the oxide thickness
$x$ and the oxidation time $t$,


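$$x = \frac{A}{2}\left[\left(1 + \frac{4B(t+\tau)}{A^{2}}\right)^{1/2} - 1\right], \tag{3}$$

where

$$A = \frac{2D}{k}, \tag{4}$$

$$B = \frac{2DC_0}{C_{ox}}, \tag{5}$$

and

$$\tau = \frac{x_0^{2} + A x_0}{B}, \tag{6}$$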


where $k$ is the first-order reaction rate constant for the
oxidation, $C_0$ the concentration of oxidant at the oxide
surface, $C_{ox}$ the concentration of oxygen molecules
incorporated in the oxide layer, and $D$ the effective
diffusion coefficient of oxygen in the silicon dioxide.
$x_0$ is the native oxide thickness and $\tau$ is the
corresponding oxidation time. In Eq. (5), $B$ is the
parabolic rate constant, while the ratio $B/A$ is the linear
rate constant. The rate constants obtained by several groups
are summarized in Table I. Since the experimental conditions
of the references given in Table I differ from each other,
a comparison of the rate constants in terms of the substrate
temperature is open to question. However, the negative value
of the reaction rate $B/A$ is strange even for plasma
oxidation, because a negative rate is unphysical.
Alternatively, the negative reaction rate is perhaps a
unique characteristic of plasma oxidation, which may differ
from thermal oxidation. This may imply that the Deal-Grove
theory is not valid for plasma oxidation kinetics, which
involves the atomic diffusion of O atoms.
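The linear-parabolic law of Eq. (3) is straightforward to evaluate. The following is a minimal sketch, assuming the dry-oxygen thermal rate constants at 920 °C as read from Table I (B = 4.90×10⁻³ μm²/h, B/A = 2.08×10⁻² μm/h, τ = 1.40 h); the function name is illustrative. It also verifies that the result satisfies the underlying quadratic relation x² + Ax = B(t + τ).

```python
import math

def deal_grove_thickness(t_h, a_um, b_um2_per_h, tau_h):
    """Oxide thickness (um) from the Deal-Grove relation, Eq. (3)."""
    ratio = 4.0 * b_um2_per_h * (t_h + tau_h) / a_um ** 2
    return 0.5 * a_um * (math.sqrt(1.0 + ratio) - 1.0)

# Thermal oxidation in dry oxygen at 920 C (values as read from Table I).
B = 4.90e-3            # parabolic rate constant, um^2/h
B_OVER_A = 2.08e-2     # linear rate constant, um/h
TAU = 1.40             # time shift, h
A = B / B_OVER_A       # um

x_1h = deal_grove_thickness(1.0, A, B, TAU)
residual = x_1h ** 2 + A * x_1h - B * (1.0 + TAU)
print(f"x(1 h) = {x_1h * 1e3:.1f} nm, quadratic residual = {residual:.2e}")
```

For small t + τ the expression reduces to the linear law x ≈ (B/A)(t + τ), and for large t + τ to the parabolic law x² ≈ B(t + τ), which is why B/A and B are called the linear and parabolic rate constants.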

According to a modified Deal-Grove model proposed by
Tiller, $B/A$ can be negative if the migration of oxygen ions
is considered. In Tiller’s model, the effect of field-aided
ion migration is added into the $B/A$, $B$, and $\tau$ terms.
However, Kimura et al. argued that the index of the power
law relation exceeds 2, in contrast to Tiller’s results for
the Deal-Grove model. Note that the exponent in the
Deal-Grove model varies from 1 to 2 as the mechanism changes
from the linear region in a thin oxide to the parabolic
region in a thick oxide. Vinckier et al.⁸ used an exponent
index of 2.17 for their model calculations of the silicon
dioxide growth rate from the following relation:

$$x^{2.17} - x_0^{2.17} = bt, \tag{7}$$

where $x_0$ is the initial thickness of the native oxide layer
and $b$ is a constant independent of the oxidation time $t$.
This indicates that, before analyzing the plasma oxidation
kinetics, the validity of the Deal-Grove model has to be
examined fundamentally. We therefore reconsider the analysis
of the silicon oxidation kinetics of the Deal-Grove model in
order to apply it to the plasma oxidation mechanism.

Table I. Comparison of rate constants for several plasma oxides and thermal oxides. The experimental data
were fitted by the Deal-Grove model for 16, 400, and 640 °C (Refs. 4, 6, and 11) and by the power law for
350 °C (Ref. 5). The present experimental result at 27 °C was fitted by the present theory.

Oxidation   T (°C)   B (μm²/h)     B/A (μm/h)     τ (h)         References
Plasma      27       1.94×10^-3    0.388          3.16×10^-2    Present work
            16       8.01×10^-?    -4.47×10^-2    -1.57         6
            350      1.26×10^-2    0.21           0.03          5
            400      9.80×10^-?    -0.14          -0.34         11
            640      4.92×10^-?    -1.23×10^-2    -1.44         4
Thermal     920      4.90×10^-3    2.08×10^-2     1.40          10
            1000     1.20×10^-2    0.07           0.37          10

Fig. 5. Schematic diagram for the oxidation of silicon in the O₂ plasma
phase. The native oxide is denoted as $x_0$ and the interface as $x_i$. In steady
state, the whole diffused flux at the Si/SiO₂ interface participates in the
chemical reaction.

For the diffusion process of O atoms in the oxide layer,
Fick’s first law is written as

$$F_D = -D\,\frac{dC(x')}{dx'}, \tag{8}$$

where $D$ is the effective diffusivity of the O atoms in the
oxide layer and $C(x')$ is the oxidant concentration at any
position $x'$ of the diffused species. It is worth noting
that the above diffusion equation is fundamentally valid for
a steady state process.^^’^^ The variable of integration $x'$
is any arbitrary distance from the gas/oxide interface in the
position [...]. In this work, however, our concern is limited
only to the position of the interface because oxidation is
assumed to occur at the interface. As shown in Fig. 5, the
diffused flux at the interface can be represented by the
gradient of the concentration of oxygen for an infinitesimal
element of the interfacial position $x_i$,


$$F_D = -D \lim_{\Delta x_i \to 0} \frac{C_i(x_i + \Delta x_i) - C_i(x_i)}{(x_i + \Delta x_i) - x_i} \tag{9}$$

$$\phantom{F_D} = -D\,\frac{dC_i(x_i)}{dx_i}, \tag{10}$$


where $x_i$ denotes the position of the interface and $C_i$
is the concentration of oxygen species at the interface. We
note again that this time-invariant flux at the interface is
in a steady state. Deal and Grove approximated Fick’s law by
the linear form of the concentration gradient, based on the
assumption of steady state oxidation. This led to the
difference form of the diffusion flux at any point of the
interface with the surface concentration $C_0$,


$$F_D = -D\,\frac{C_i(x_i) - C_0}{x_i}. \tag{11}$$


The flux involved in the oxidation reaction at the interface
is related to the interfacial oxidant concentration through
the linear reaction coefficient $k$,

$$F_R = k\,C_i(x_i). \tag{12}$$

Under steady state oxidation by O-atom diffusion, the
equation of continuity,

$$\frac{\partial C_i}{\partial t} = -\nabla F = 0, \tag{13}$$

implies that the invariant diffusion flux enters the bulk
oxide and finally reaches the Si/SiO₂ interface. Therefore,
the continuity equation is valid at the interface as well as
in the bulk oxide. All of the diffused flux is consumed by
the chemical reaction that results in growth of the oxide
layer at the interface, implying the relationship
$F_D = F_R = \mathrm{const}$. By equating the diffusion flux
at the interface in Eq. (10) to the flux required for the
chemical reaction at the interface in Eq. (12), the
differential equation provides the oxidant concentration and
the flux at the interface from the initial boundary condition
of the surface oxidant concentration, $C_0$,


$$-D\,\frac{dC_i(x_i)}{dx_i} = k\,C_i(x_i), \tag{14}$$

$$C_i(x_i) = C_0 \exp\left(-\frac{x_i}{L_D}\right), \tag{15}$$

$$F_D(x_i) = k\,C_0 \exp\left(-\frac{x_i}{L_D}\right), \tag{16}$$

where $L_D = D/k$ is the characteristic diffusion length of
the exponentially decaying incident flux at the interface.

In addition, the exponential term of the flux equation can
be approximated by the form $(1 + x_i/L_D)^{-1}$ in both
limits of thin and thick oxide films $x_i$. This
approximation corresponds to the flux form derived by Deal
and Grove,

$$F_D = \frac{k\,C_0}{1 + k x_i/D}. \tag{17}$$
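The relation between the exponential flux of Eq. (16) and the Deal-Grove form of Eq. (17) can be illustrated numerically. This is a sketch under stated assumptions: the flux is normalized so that kC₀ = 1, and the diffusion length is set to the 2.5 nm value reported later for the present work; the function names are illustrative.

```python
import math

def flux_exponential(x_nm, l_d_nm, k_c0=1.0):
    """Interface flux of Eq. (16): k*C0*exp(-x/L_D)."""
    return k_c0 * math.exp(-x_nm / l_d_nm)

def flux_deal_grove(x_nm, l_d_nm, k_c0=1.0):
    """Interface flux of Eq. (17): k*C0/(1 + k*x/D), using k/D = 1/L_D."""
    return k_c0 / (1.0 + x_nm / l_d_nm)

L_D_NM = 2.5   # diffusion length quoted in the text for the present fit

for x in (0.0, 0.1, 1.0, 2.5, 10.0, 25.0):
    f_exp = flux_exponential(x, L_D_NM)
    f_dg = flux_deal_grove(x, L_D_NM)
    print(f"x = {x:5.1f} nm  exp form = {f_exp:.4e}  Deal-Grove form = {f_dg:.4e}")
```

The two forms agree to first order in x/L_D for thin oxides and both tend to zero for thick oxides, but the exponential form decays much faster once x exceeds L_D, which is the origin of the logarithmic rather than parabolic growth limit.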


The steady state oxidation theory requires constant flux at
the interface for both processes of diffusion and reaction at
any instant. When the oxygen concentration at each point in
the oxide no longer changes with time, the quantity of oxygen
passing through the oxide per unit time is a constant.
However, since the flux is the flow per unit area, it is a
function of the oxide thickness and cannot simply be
approximated by a linear form. The difference between the
present theory and the Deal-Grove theory lies in the
interpretation of the steady state diffusion flux at the
interface. Deal and Grove considered that Eq. (8) is not a
steady state form and needed revision due to the relationship
$-\nabla F_D = 0$. However, this claim may be challenged
because the steady state condition applies only to the time
variable, and the continuity equation provides information on
the time-independent flux $F_D = -D\,dC/dx' = \mathrm{const}$,
indicating that Eq. (8) is in a steady state form.^"^’^^ From
now on, let us denote the position of the interface by the
oxide thickness $x$.

The oxidant flux results in oxidation with the growth rate
of the oxide layer $x$ through the relationship

$$F_D = C_{ox}\,\frac{dx}{dt}, \tag{18}$$


where $C_{ox}$ is the concentration of the oxygen incorporated
in the oxide. By equating Eqs. (16) and (18), the following
differential rate equation and the relationship between the
oxide thickness and the oxidation time can be obtained by
taking into account the initial oxidation conditions,

$$\frac{dx}{dt} = \frac{k\,C_0}{C_{ox}} \exp\left(-\frac{x}{L_D}\right), \tag{19}$$

$$x = \frac{A}{2} \ln\left[\frac{2B(t+\tau)}{A^{2}}\right], \tag{20}$$

with

$$\tau = \frac{A^{2}}{2B} \exp\left(\frac{2x_0}{A}\right), \tag{21}$$

where $A$ and $B$ are defined as in the Deal-Grove case in
Eqs. (4) and (5). The present oxidation kinetics shows that
the oxide layer grows logarithmically with increasing
oxidation time. Therefore, this logarithmic limit shows
faster saturation of the diffusion-limited process compared
with the parabolic limit of the Deal-Grove model.

Fig. 6. Time dependence of the oxide growth at room temperature. The
oxide films were grown under the following experimental conditions: oxygen
flow rate = 10 sccm, pressure = 0.2 mTorr, microwave power = 1000 W,
and source to sample distance = 7.5 cm. The agreement of the experimental
fit for the ultrathin film from the complete form of the silicon oxidation
model is remarkable and gives positive kinetic parameters.
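The logarithmic growth law of Eqs. (19)-(21) can be evaluated and checked against its own rate equation. The sketch below assumes the present-work constants B = 1.94×10⁻³ μm²/h and B/A = 0.388 μm/h as read from Table I, and a native oxide of 10 Å; here τ is computed from Eq. (21) with that assumed x₀ rather than taken from the fit, so it need not coincide with the tabulated value. Function names are illustrative.

```python
import math

def tau_log(x0_um, a_um, b_um2_per_h):
    """Time shift of Eq. (21): tau = A^2/(2B) * exp(2*x0/A)."""
    return a_um ** 2 / (2.0 * b_um2_per_h) * math.exp(2.0 * x0_um / a_um)

def thickness_log(t_h, a_um, b_um2_per_h, tau_h):
    """Oxide thickness of Eq. (20): x = (A/2) * ln[2B(t + tau)/A^2]."""
    return 0.5 * a_um * math.log(2.0 * b_um2_per_h * (t_h + tau_h) / a_um ** 2)

B = 1.94e-3           # um^2/h, present work (Table I)
B_OVER_A = 0.388      # um/h, present work (Table I)
A = B / B_OVER_A      # um; A/2 = L_D = 2.5 nm
X0 = 1.0e-3           # um, assumed 10 A native oxide

tau = tau_log(X0, A, B)
x_start = thickness_log(0.0, A, B, tau)     # should reproduce X0
x_30min = thickness_log(0.5, A, B, tau)

# Check Eq. (19), dx/dt = (B/A) * exp(-2x/A), by a central difference.
h = 1.0e-6
rate_numeric = (thickness_log(0.5 + h, A, B, tau) - thickness_log(0.5 - h, A, B, tau)) / (2.0 * h)
rate_model = B_OVER_A * math.exp(-2.0 * x_30min / A)
print(f"L_D = {A / 2 * 1e3:.2f} nm, x(30 min) = {x_30min * 1e4:.0f} A")
```

Because kC₀/C_ox = B/A and L_D = A/2, Eq. (19) can be written entirely in terms of the same A and B used in the Deal-Grove model, which is what allows the two sets of rate constants to be compared in Table I.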

The experimental fit from this oxidation theory for the
ultrathin oxide film is shown in Fig. 6. The agreement with
the experimental data is remarkable. The oxidation kinetic
parameters are compared with others in Table I. The diffusion
rate constant ($B$) of the O-atom species was found to be
1.94×10⁻³ μm²/h, and the reaction rate constant ($B/A$) at the
Si/SiO₂ interface was found to be 0.388 μm/h. The time shift
constant $\tau$, which is related to the presence of the
initial oxide, was found to be 3.16×10⁻² h. The diffusion
length is characterized as 2.5 nm, which means that during a
plasma oxidation of 30 min the oxidation is well into the
region where the diffusion-limited process of O atoms into
the oxide layer dominates. In comparison, the rate constants
for the thermal oxidation of Si in dry oxygen at 920 and
1000 °C were 4.9×10⁻³ and 1.2×10⁻² μm²/h for $B$, 2.08×10⁻²
and 0.07 μm/h for $B/A$, and 1.4 and 0.37 h for $\tau$,
respectively. The results of Carl, Hess, and Lieberman are
similar to the thermal oxidation case at 1000 °C for $B$, but
much smaller for $B/A$. Our result for the diffusion rate $B$
shows a value similar to that in the thermal oxidation case
at 920 °C. However, the reaction rate $B/A$ is much larger
than in the other thermal and plasma oxidations.

The Deal-Grove model could not resolve the negative
reaction rate $B/A$ (Refs. 4, 6, and 11) in the plasma
oxidation kinetics. Although a spontaneous reaction can
occur with a negative reaction energy, a negative reaction
rate is difficult to reconcile with the oxidation phenomenon,
because the reaction rate would have to increase without an
increase in the concentration of the oxidant species at the
interface, which would indicate an additional process. It is
also worth noting that Deal-Grove estimates fail to predict
even thermal oxidation rates in dry oxygen within the thin
film regime (<20 nm). For the ultrathin oxide film, however,
our theory resolves the problem of the negative reaction rate
and provides kinetic parameters comparable to those for
thermal oxidation, implying plasma enhancement of the
low-temperature oxidation.

V.  ELECTRICAL  PROPERTIES 

The samples used in this study were p-type Si(100) with a
boron doping concentration of 2.17×10¹⁵ cm⁻³. Silicon
wafers were treated with the standard cleaning process before
plasma  oxidation.  After  oxidation,  aluminum  was  deposited 
as  the  gate  metal.  The  area  of  contact  was  7.85X10“^  cm^. 
Postoxidation  and  premetallization  annealing  was  performed 
in  a  rapid  thermal  process  in  vacuum  at  400  °C  for  2  min. 
The MOS capacitor structure with a thin oxide layer of 77 Å
was characterized using I-V and C-V measurements at
room temperature.

The breakdown field is defined as the electric field that
induces a leakage current of 1 μA across the MOS capacitor.
The average breakdown field of the present plasma oxide
films was found to be 14.8 MV/cm. This is higher than the 7.8
and 12.39 MV/cm reported by Salbert, Reinhard, and Amussen^^
and Sung and Pang,⁶ respectively. The measured 1 MHz
capacitance-voltage curve with (solid line) and without
(dashed line) the low-temperature anneal at 400 °C for 2 min
is  shown  in  Fig.  7.  The  annealing  effect  lowers  the  oxide 
charge  from  1.87X10"^  to  2.54X10“^  C/cm^  and  shifts  the 
flat band voltage from −2.67 to −0.38 V. The work function
difference for the annealed case is −0.638 eV. The Si/SiO₂
interface state density $D_{it}$ is determined from the slope
of the C-V curve by deep level transient spectroscopy (DLTS)
measurements. The trap density $D_{it}$ is found to have a
minimum  of  1.2X10^^  eV“^  cm“^  as  shown  in  Fig.  8.  This 
is  much  lower  than  10^^  eV“^  cm“^  from  Sung  and  Pang^ 
and  5X10^^  eV“^  cm“^  from  Salbert,  Reinhard,  and 
Amussen.^^  The  low  density  of  interface  trapping  states  is 
comparable  to  the  thermal  oxide  MOS  capacitors.  This  indi¬ 
cates  that  the  plasma  damage  on  the  oxide  layer  is  not  sig¬ 
nificant  and  the  atomic  diffusion  of  neutral  O  atoms  into  the 
Si/SiO₂ interface leads to the layer-by-layer oxidation.

Fig. 7. MOS C-V characteristics at 1 MHz for a 77-Å-thick oxide with
(solid line) and without (dashed line) a low-temperature anneal at 400 °C for
2 min. The gate oxide was grown with ECR oxygen plasma under the
following conditions: substrate temperature = 300 °C, O₂ flow rate = 20
sccm, pressure = 0.2 mTorr, microwave power = 400 W, source to sample
distance = 7.5 cm, and oxidation time = 30 min.

Additionally, the large reduction in interface states is attributed to
the removal of traps or defects after annealing. We strongly
argue that in ECR plasma oxidation, the boron dopant can
diffuse out to the interface from the silicon bulk due to the
plasma oxidation environment. The sheath potential attracts
the effectively positively charged boron in silicon. The
interfacial boron segregation results in the reduction of Si
dangling bonds and is reflected in the asymmetric
distribution of the density of interfacial trapping states
just below the conduction band edge, as shown in Fig. 8.
More study of the dopant behavior at the interface is needed.
Note that the ultrathin gate oxide for submicron devices
(0.25 μm channel length) is required to be 70 Å thick in
order to reduce the fringe effect. The device quality of the
electrical property of


Ev  Energy  E  (meV)  Ec 

Fig.  8.  Si/Si02  interface  state  density  as  determined  from  the  slope  of 
the  C-  V  curve  by  the  DLTS  measurement.  The  high  quality  of  the  interface 
is  comparable  to  the  case  of  thermal  oxide  films  of  Djt=10*^  eV“^  cm“^. 


the  ultrathin  gate  oxide  grown  by  the  ECR  oxygen  plasma 
seems  to  be  applicable  for  ULSI  semiconductor  technology. 
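The quantities quoted in this section can be related by two elementary parallel-plate expressions. The following is a rough sketch, assuming a relative permittivity of 3.9 for SiO₂ (a textbook value not stated in the text) and using the 77 Å oxide thickness and the 14.8 MV/cm breakdown field reported above; the function names are illustrative.

```python
EPS0_F_PER_CM = 8.8541878128e-14   # vacuum permittivity in F/cm
K_SIO2 = 3.9                       # assumed relative permittivity of SiO2

def oxide_capacitance_per_area(t_ox_cm):
    """Parallel-plate oxide capacitance per unit area, C_ox = k*eps0/t_ox (F/cm^2)."""
    return K_SIO2 * EPS0_F_PER_CM / t_ox_cm

def breakdown_voltage(field_mv_per_cm, t_ox_cm):
    """Voltage across the oxide at the breakdown field, V = E * t_ox (V)."""
    return field_mv_per_cm * 1.0e6 * t_ox_cm

T_OX_CM = 77.0e-8                  # 77 Angstrom expressed in cm

c_ox = oxide_capacitance_per_area(T_OX_CM)
v_bd = breakdown_voltage(14.8, T_OX_CM)
print(f"C_ox = {c_ox:.3e} F/cm^2, V_bd = {v_bd:.2f} V")
```

Under these assumptions the breakdown field corresponds to roughly 11 V across the 77 Å film, which gives a sense of the voltage range that the I-V measurement has to cover.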

VI.  SUMMARY 

Plasma oxidation of silicon has been performed by using
O2 gas generated by an ECR source at low temperatures. The
oxidation rate was studied in terms of processing pressure,
microwave power, substrate temperature, and oxidation time.
The oxide thickness was found to increase parabolically with
microwave power. The pressure dependence of the oxidation
rate is consistent with the maximum intensities of the
optical emission spectra of the O atom and O2 species. Below
a substrate temperature of 350 °C the rate slightly decreases,
and it rapidly increases above that temperature. For
ultrathin film growth, the plasma oxidation kinetics provides
a diffusion rate and chemical reaction rate comparable to the
thermal oxidation cases. The proposed model shows a more
general formulation of the oxidation kinetics than the
Deal-Grove model, especially for plasma oxidation, and gives
remarkable agreement with the experimental fit. In conclusion,
in spite of the uncertainty of the contribution of charged
atomic species, plasma oxidation is associated with the
atomic diffusion process rather than the molecular species
related process. The high quality of the MOS capacitor using
plasma oxidation films suggests that these films may be
suitable for gate dielectrics in ULSI devices.

We report the properties of silicon nitride films deposited by the electron cyclotron resonance
remote plasma enhanced chemical vapor deposition method on Si substrates using SiH4 and N2. The
effects of nitrogen/silane gas ratio (R=N2/SiH4), electron cyclotron resonance power, substrate
temperature, and H on growth, refractive index, chemical composition, and etch rate were
investigated. Nominally stoichiometric Si3N4 films were obtained with a refractive index of 1.9-2.0
at a wavelength of 632.8 nm. The etch rate of the films in a buffered HF solution (7:1) was low
(~0.7 nm/min) and increased with increasing H2 gas flow rate and decreasing substrate temperature
during deposition. Fourier transform infrared spectroscopy and high temperature thermal evolution
experiments showed very small amounts of H in the films. A leakage current less than 100 pA/cm^2
at a field of 2 MV/cm, a resistivity of >4×10^^ Ω cm, and breakdown strengths of 6-11 MV/cm
at a current density of 1 μA/cm^2 were observed. These properties are comparable to those of Si3N4
prepared by conventional high temperature (700 °C) chemical vapor deposition. The performance of
GaAs-based field-effect transistors in switching and power applications can be enhanced
substantially by employing a metal-insulator-semiconductor structure. By taking advantage of an in
situ process approach, insulator-GaAs structures were successfully gated with excellent interfacial
properties. © 1996 American Vacuum Society.


I.  INTRODUCTION 

The nearly ideal SiO2/Si interface has made the integration
of large numbers of silicon transistors on a single chip
possible. Analogous devices in compound semiconductor
structures (which have more favorable electrical characteristics
than Si) have not come to fruition because of inferior
dielectrics and dielectric/semiconductor interfaces. Since O
induces a large density of trap states, the adsorption of less
than a monolayer of O on GaAs causes Fermi level pinning.
Deposited insulator/compound-semiconductor interfaces
have been reported to exhibit quality superior to those
formed exclusively with native oxides. Si3N4 appears to be a
favorable gate dielectric for III-V compound semiconductor-based
metal-insulator-semiconductor (MIS) devices.

Silicon nitride (SixN) layers have been extensively used
as gate dielectrics in thin film transistors (TFTs), as interlayer
dielectrics in metal-nitride-oxide-Si (MNOS)
devices, and as final passivation layers in integrated
circuits. Attributes of high quality SixN films used for gate
dielectrics are high dielectric constant, high breakdown field,
low leakage current, and low interface trap density. Compared
to SiO2, Si3N4 is known to be a good impurity barrier
against sodium ions and water. Nominally stoichiometric
Si3N4 has a dielectric constant about 75% larger than that of
SiO2. Such a large dielectric constant results in a larger
transconductance, a larger gate insulator capacitance, and
hence a smaller threshold voltage shift for a given defect
charge concentration. In contrast, nonstoichiometric
films have both a lower dielectric breakdown field and a higher
leakage current than Si3N4 films.
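
The link between dielectric constant and threshold voltage shift can be illustrated with a parallel-plate estimate, in which a sheet of fixed charge Q at the interface shifts the voltage by Q/C. The sketch below is only an illustration: the SiO2 dielectric constant (3.9), the 150 Å thickness, and the defect charge density are assumed values chosen for the example, not data from this work.

```python
# Illustrative comparison of SiO2 and Si3N4 gate insulators of equal thickness.
EPS0_F_PER_CM = 8.854e-14   # vacuum permittivity, F/cm
Q_COULOMB = 1.602e-19       # elementary charge, C


def capacitance_per_area(eps_r, thickness_cm):
    """Parallel-plate insulator capacitance per unit area, in F/cm^2."""
    return EPS0_F_PER_CM * eps_r / thickness_cm


def voltage_shift(charge_per_cm2, eps_r, thickness_cm):
    """Voltage shift (V) caused by a sheet of fixed charge at the interface."""
    return Q_COULOMB * charge_per_cm2 / capacitance_per_area(eps_r, thickness_cm)


EPS_SIO2 = 3.9                # assumed textbook value, not taken from the article
EPS_SI3N4 = 1.75 * EPS_SIO2   # "about 75% larger", as stated in the text
THICKNESS_CM = 150e-8         # 150 Angstrom, illustrative
DEFECT_CHARGE = 1e11          # charges per cm^2, hypothetical

shift_oxide = voltage_shift(DEFECT_CHARGE, EPS_SIO2, THICKNESS_CM)
shift_nitride = voltage_shift(DEFECT_CHARGE, EPS_SI3N4, THICKNESS_CM)
shift_ratio = shift_nitride / shift_oxide

print(f"SiO2 shift:   {shift_oxide * 1e3:.2f} mV")
print(f"Si3N4 shift:  {shift_nitride * 1e3:.2f} mV")
print(f"ratio:        {shift_ratio:.3f}")
```

For equal thickness and equal charge, the shift scales inversely with the dielectric constant, so the ratio depends only on the 75% figure and not on the assumed thickness or charge density.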

Si3N4 films are typically prepared by either plasma enhanced
chemical vapor deposition (PECVD) or low-pressure
chemical vapor deposition (LPCVD). In the current
work, remote plasma enhanced chemical vapor deposition
(RPECVD) was used. In RPECVD the plasma is separated
from the surface, which reduces damage caused by the plasma
field and energetic particles. PECVD SixN films are
nonstoichiometric and contain a large concentration of H,
which begins to desorb at ~350 °C. This may lead to
instabilities in MIS device performance. To minimize the H
incorporation during growth, N2 has been used in place of
NH3. However, PECVD has the advantage of being successful
at a substrate temperature around 300 °C, which is much
lower than the roughly 700 °C used in thermal chemical vapor
deposition (CVD). Such a low temperature prevents evaporation
of the volatile group V species from the III-V compound
semiconductor surface.

Recent strides for the realization of amenable
GaAs metal-insulator-semiconductor field-effect transistors
(MISFETs) utilize epitaxial Si or Ge interlayers grown on
GaAs in order to avoid the formation of an intervening native
oxide prior to the insulator deposition and the outdiffusion
of the volatile group V element. A pseudomorphic
silicon cap layer deposited onto GaAs was reported to unpin
the surface Fermi energy and reduce the density of
tunneling-related traps. Given that Si on GaAs can be
grown epitaxially and that high quality dielectrics can be
deposited on Si, a practical approach to achieving high quality
GaAs MIS structures would be to introduce a pseudomorphic
Si layer between GaAs and a suitable dielectric. However,
because of the thinness of the Si and the weak nature of the
III-V compound semiconductor surface involved, the dielectric
deposition must be performed with no exposure to atmospheric
conditions at any time during the entire process. To
ensure this, we constructed a deposition apparatus which allows
this very approach and utilized it to attain GaAs-based
MIS structures with properties enhanced to a point at which
meaningful analyses of the interface and field effect transistor
(FET) can be undertaken.

In this article we discuss the effect of RPECVD deposition
parameters on the physical and electrical characteristics
of the Si3N4 thin films. We implemented these Si3N4 films,
prepared at low temperature (<300 °C), as gate dielectrics
for the GaAs-based MIS devices. It is noted that the properties
of these Si3N4 films are comparable to those of thermal
CVD Si3N4 films, and hence these films are promising candidates
as gate dielectrics for III-V MIS devices.

II.  EXPERIMENT 

The ultrahigh vacuum (UHV) RPECVD system used for
this study, described in detail previously, consists of an
electron cyclotron resonance (ECR) source coupled to an ion
and turbomolecular pumped cold-wall stainless-steel chamber
with a base pressure of 1×10“^^ Torr. He and N2 are
introduced at the top of the ECR chamber where the plasma
is generated. SiH4 is introduced into the CVD chamber
through a "showerhead" located halfway between the aperture
of the ECR chamber and the substrate.

The Si substrates were P-doped n-type (100), had a diameter
of 76 mm, and were misoriented 4° toward (110). The
substrates were etched in 1:10 HF:H2O solution followed by
3:1:1 HCl:H2O2:H2O solution to remove the native oxide
and to oxidize the Si surface, respectively. This procedure
was repeated three times to ensure the removal of
contaminants on the Si surface as well as to form a high quality
native oxide. The substrates were then etched in the same HF
solution for 1 min and loaded into the vacuum system. Without
further treatment they showed a (1×1) reflection
high-energy electron diffraction (RHEED) pattern, indicating
dihydride surfaces. When the Si substrate temperature was
increased to about 550 °C, the surface structure changed
from (1×1) to the (2×2) (two orthogonal 2×1's) structure
due to hydrogen desorption. It has been reported that hydrogen
desorbs from the Si surface in a temperature range of
450-510 °C. ECR PECVD SixN was deposited at substrate
temperatures ranging between 100 and 650 °C under the
application of an ECR power (P) of 100-250 W at 2.45
GHz. The film composition varied as a function of the SiH4,
N2, H2, and He flow rates. During the growth the N2/SiH4
ratio ranged from 1 to 50, as determined from relative partial
pressures at a fixed N2 flow rate of 30 sccm. H2 and He were
added to the gas phase at a rate up to about 10 sccm.

An ellipsometry apparatus was used to determine the film
thickness (t) and refractive index (n). The L116 Gaertner
ellipsometer uses a rotating analyzer photometric design and
operates at a wavelength of 632.8 nm. Values of n and t of
the films obtained by this ellipsometer were in excellent
agreement (Δn ≈ ±0.02, Δt ≈ ±0.2 nm) with the corresponding
data obtained from a spectroscopic ellipsometer. A
Perkin-Elmer 5300 x-ray photoelectron spectroscopy (XPS)
facility permitted in situ analysis of the Si/N ratio in the films
and testing for incorrect Si bonding. Mg Kα (1253.6 eV)
radiation was used in all studies reported here. Near-surface
depth profile data were obtained by varying the photoelectron
take-off angle from 20° to 70°. Mixed Gaussian-Lorentzian
curves were fitted to the spectra, with the
fraction of Gaussian character as a fitting parameter. A
quantitative analysis of the H content and bonding species of the
films was performed by Fourier transform infrared (FTIR)
spectroscopy using Si substrates polished on both sides.
Thermal H evolution was also used. Thermal stability of
the film was estimated by measuring the film thickness
change after high temperature annealing at 850 °C in N2 for
30 min. Etch rates of the layers in 300 K NH4F buffered-HF
(BHF) solutions (NH4F:HF=7:1) were measured as a function
of the substrate growth temperature and the various flow
rates of H2. For comparison, the etch rates of Si3N4 grown
by thermal CVD at 800 °C and by radio frequency (rf)
PECVD at 400 °C from SiH4-NH3 were also evaluated.
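
A mixed Gaussian-Lorentzian line shape of the kind used for the XPS fits can be sketched as follows. This is a minimal illustration that assumes the sum (pseudo-Voigt) form with a common full width at half maximum; the product form is also common in XPS work, and the text does not say which was used. The component positions are the Si-N and Si-O binding energies quoted in Sec. III A, while the widths, amplitudes, and Gaussian fraction are hypothetical.

```python
import math


def pseudo_voigt(x, center, fwhm, amplitude, gauss_fraction):
    """Sum-form mixed Gaussian-Lorentzian peak; both parts share one FWHM."""
    u = (x - center) / fwhm
    gauss = math.exp(-4.0 * math.log(2.0) * u * u)
    lorentz = 1.0 / (1.0 + 4.0 * u * u)
    return amplitude * (gauss_fraction * gauss + (1.0 - gauss_fraction) * lorentz)


def si2p_model(x, params):
    """Two-component Si 2p model: Si-N near 101.8 eV plus Si-O near 103.6 eV."""
    return (pseudo_voigt(x, 101.8, params["fwhm"], params["a_sin"], params["m"])
            + pseudo_voigt(x, 103.6, params["fwhm"], params["a_sio"], params["m"]))


# Hypothetical parameter set; "m" is the Gaussian fraction used as a fit parameter.
example = {"fwhm": 1.6, "a_sin": 1000.0, "a_sio": 80.0, "m": 0.7}

energies = [99.0 + 0.1 * i for i in range(71)]   # 99.0 to 106.0 eV
spectrum = [si2p_model(e, example) for e in energies]
peak_index = max(range(len(spectrum)), key=lambda i: spectrum[i])
print(f"model maximum near {energies[peak_index]:.1f} eV")
```

In a real fit the parameters in `example` would be adjusted by a least-squares routine against the measured counts; only the line-shape function is shown here.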

The deposition of GaAs capacitors was initiated by the
growth of a 0.5 μm, 10^^ cm^-3 Si-doped GaAs buffer layer
by molecular beam epitaxy (MBE) on a (100) GaAs:Si
substrate. The sample was then transferred via a UHV
transfer tube (low 10“^ Torr) to an ultrahigh vacuum chemical
vapor deposition (UHVCVD) chamber where Si and
Si3N4 were deposited sequentially on the pristine, as-grown
GaAs surface. After deposition of 10 Å of Si on the as-grown
GaAs surface at 300 °C the substrate was rapidly heated to
600-700 °C with a ramp rate of 50 °C/min and kept at that
temperature for a few minutes to crystallize the as-deposited
Si. The substrate was then cooled to 300 °C and the deposition
of a nominally 150-Å-thick Si3N4 layer ensued.
Depletion-mode transistors were grown in a similar fashion
with the exception that the GaAs was grown on a
semi-insulating substrate and consisted of 0.5 μm of undoped
GaAs and active regions. The GaAs channel region was capped
by a 1000-Å-thick, 2×10^^ cm^-3 Si-doped GaAs layer.

Electrical properties were obtained from the current-voltage
(I-V) characteristics of Al/SixN/n+-Si MIS capacitors
and from capacitance-voltage (C-V) curves of
Al/SixN/epi-Si/n+-Si and Al/Si3N4/Si/n-GaAs MIS capacitors.
Nominally undoped Si epitaxial layers (carrier concentrations
<5×10^^ cm^-3) were deposited using 60 W of microwave
power. This was followed by deposition of the SixN
without air exposure. After removal from the growth chamber,
the samples were heat treated by rapid thermal annealing
(RTA) at 450-500 °C for 30 s. An Al layer, about 1000 Å
thick, was then evaporated on the top and patterned
photolithographically into 300-μm dots to serve as the gate
electrodes of MIS capacitors. The depletion-mode transistors
were fabricated using a self-aligned technique in which Al
was first evaporated over the entire wafer. The step was
followed by a mesa etch to define the individual devices. Next,
the Al was etched away from the source/drain regions,
followed by etching of the Si3N4 using a reactive ion etching
step which employs CF4 as the etchant gas. Finally,
AuGe/Ni/Au was evaporated to form the ohmic source/drain
contacts.

Fig. 1. (a) Refractive index, (b) chemical composition (Si/N ratio), and (c)
deposition rate of silicon nitride deposited by ECR RPECVD as a function
of gas ratio, R=N2/SiH4, at constant helium flow rate of 6 sccm.

In order to probe the C-V properties of the MIS junctions,
small-signal admittance measurements at frequencies
ranging from 20 Hz to 1 MHz were carried out using an
HP4284A LCR meter. I-V measurements were made by using
an HP4140B picoammeter/voltage source. The leakage
current was determined at a field of 2 MV/cm and the resistivity
was measured from the slope of the I-V curves. We
estimated the breakdown strength as the breakdown field at a
current density of 1 μA/cm^2. The dielectric constant of the
SixN layer was measured by the C-V technique at various
frequencies. The interface characterization of the GaAs MIS
structure was undertaken by C-V and conductance measurement.
Direct current transfer characteristics measurements
for the MISFETs were performed using an HP4145.
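
To give a sense of the magnitudes involved in these measurements, the following sketch converts the 2 MV/cm measurement field into a gate voltage for a nominally 150-Å-thick film and converts a leakage current density of 100 pA/cm^2 (the value quoted in the abstract) into the current through one gate dot. It assumes, for illustration only, that the 300-μm dots are circular with a 300 μm diameter and that the field is simply voltage divided by film thickness.

```python
import math


def field_to_voltage(field_mv_per_cm, thickness_angstrom):
    """Voltage (V) across a film for a uniform field, E = V / t."""
    return field_mv_per_cm * 1e6 * thickness_angstrom * 1e-8


def dot_area_cm2(diameter_um):
    """Area (cm^2) of a circular gate dot; circular shape is an assumption."""
    radius_cm = 0.5 * diameter_um * 1e-4
    return math.pi * radius_cm ** 2


gate_voltage = field_to_voltage(2.0, 150.0)    # 2 MV/cm across 150 Angstrom
area = dot_area_cm2(300.0)                     # 300 um dot
leakage_current = 100e-12 * area               # 100 pA/cm^2 times area, in A

print(f"gate voltage at 2 MV/cm: {gate_voltage:.2f} V")
print(f"dot area:                {area:.3e} cm^2")
print(f"leakage current:         {leakage_current:.3e} A")
```

Under these assumptions the current to be resolved is well below a picoampere, which is consistent with the use of a picoammeter for the I-V measurements.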

III.  RESULTS  AND  DISCUSSION 
A.  Growth  and  related  physical  properties 

Figures 1(a)-1(c) show the refractive index, the Si/N ratio,
and the deposition rate as functions of gas flow ratio. For
all measurements of these data the He flow rate was kept
constant at 6 sccm. From Fig. 1(a) it may be noted that, for
R<2, the layers were silicon-rich with n≈2.16. However,
for 300 °C < T_s < 650 °C, ECR power of 150-250 W, and
R>10, stoichiometric Si3N4 was obtained with n = 1.93-1.99.
Si has a much higher refractive index than SixN. Because
of this, a higher refractive index corresponds to a Si-rich
sample, which is possible when the N2 flux into the
deposition chamber is small relative to the SiH4 flux.
Unlike a Si-rich sample, the stoichiometric film is made
of comparable amounts of Si and N. A higher N content
thus causes the refractive index of the sample to be lower.
Our experiments indicate that, for R>10, the refractive index
of the sample becomes independent of both the flow ratio
and the ECR power. This occurs because, although a higher
flux of N2 leads to a higher generation of reactive nitrogen
species, most of them do not find reactive Si species with which
to react and form silicon nitride. Eventually these nitrogen
species recombine into N2 molecules and leave the system.

We note that the n values in Fig. 1(a), measured at a
constant photon energy of ~1.96 eV and a constant extinction
coefficient k ≈ 0.020, were normally 0.02-0.04 lower than
those measured in representative films by spectroscopic
ellipsometry. The extinction coefficients measured for absorption
were an order of magnitude lower than ~0.020, suggesting
that the actual n values for our nominally stoichiometric
samples are 1.95-2.02. That the silicon nitride formed in the
regime 10 ≤ R ≤ 50 is stoichiometric silicon nitride is apparent
from Figs. 1(b) and 1(c). From these two figures one can
see that the Si/N ratio and the film deposition rate in this
regime are about 0.75 and 10 Å/min, respectively. This
indicates that the composition of the film is Si3N4, which is
formed due to steady state incorporation of Si and N into the
deposited layer. From these two figures one can also see that
in the regime 0 ≤ R ≤ 10, the Si/N ratio and the film deposition
rate decrease rapidly from their maximum values, 0.95 and
52 Å/min, respectively, with increasing R. This is a Si-rich
regime with the deposition rate determined primarily by the SiH4
flow rate, in agreement with previous results, and with the Si
content of the silicon nitride films decreasing with increasing
N2 flux. The thickness uniformity of Si3N4 is within 4% (2σ).
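
Two of the numbers in this paragraph follow from simple arithmetic, sketched below as a check: the photon energy corresponding to the 632.8 nm ellipsometer wavelength, and the Si/N atomic ratio expected for ideal Si3N4. The value of hc in eV nm is a standard constant and is not taken from this work.

```python
HC_EV_NM = 1239.841984   # Planck constant times speed of light, in eV nm


def photon_energy_ev(wavelength_nm):
    """Photon energy (eV) for a given vacuum wavelength (nm), E = hc / lambda."""
    return HC_EV_NM / wavelength_nm


def si_to_n_ratio(n_si, n_n):
    """Atomic Si/N ratio for a compound with n_si Si atoms per n_n N atoms."""
    return n_si / n_n


energy = photon_energy_ev(632.8)   # ellipsometer wavelength from the text
ratio = si_to_n_ratio(3, 4)        # ideal Si3N4

print(f"photon energy at 632.8 nm: {energy:.3f} eV")
print(f"Si/N ratio for Si3N4:      {ratio:.2f}")
```

Both results agree with the values quoted in the text (about 1.96 eV and a Si/N ratio of about 0.75 in the stoichiometric regime).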

Figures 2(a)-2(c) show Si 2p core level spectra as a function
of electron take-off angle. These spectra have been
shifted as appropriate to account for sample charging. From
these figures one may see that the peak at 101.8±0.2 eV
corresponds to Si-N bonds, while the peak at
103.6±0.2 eV corresponds to Si-O bonds. No Si peak from
Si-Si bonds (~99.15 eV) was discernible, suggesting
that excess Si was indeed insignificant. Similar results on the
absence of excess Si from Si LVV Auger electron spectra
of Si3N4 also classified the layer as "quasi-nominally
stoichiometric SixN." The constancy of the Si 2p component
peak intensities for various take-off angles indicates no
noticeable depth dependence. The film investigated here was
found to be rather uniform, since the relative magnitude of
the Si peaks for Si-O and Si-N bonds does not change with
take-off angle. A possible explanation for the presence of
SiO2 is that a uniform but minute amount of SiO2 was
present through the depth of the film due to chamber
contamination arising from the frequent deposition of
SiO2 within the same system. We have observed almost negligible
oxygen contamination on the Si interlayer grown on the
GaAs buffer layer prior to the silicon nitride deposition.

Fig. 2. Angle-resolved XPS spectra of nominally stoichiometric silicon nitride
films (horizontal axis: binding energy in eV). Scan data for N 2s and Si 2p
peaks and their chemical shifts were collected throughout by varying the
photoelectron take-off angle from 20° to 70°.

Among the gas ratios employed, a setting of R = 21 led to
stoichiometric films with optimal film properties. The dependence
of the deposition rate at the substrate temperature of
300 °C on ECR power at this value of R and with 6 sccm He
flow rate is shown in Fig. 3. It was observed that the deposition
rate increased with increasing ECR power, and that the
increase was about 60% for P between 150 and 250 W.
This suggests that the plasma power influences the amount of
N2 and SiH4 dissociation and enhances the formation of reactive
Si and N species. The increase of deposition rate,
however, gradually slowed down as the ECR plasma power
approached 300 W. A very similar trend of variation of
deposition rate with the ECR power was observed earlier by
Matsuo and Kiuchi, who used 10 sccm of SiH4 and 10
sccm of N2 to deposit Si3N4 films, as confirmed by the Si-N
bond peak at the wave number of 845 cm^-1. As indicated
earlier [see Figs. 1(a)-1(c)], film composition and n of Si3N4
were, however, unaffected by power under these conditions.

Fig. 3. Deposition rate of silicon nitride at the substrate temperature of
300 °C deposited by ECR RPECVD as a function of ECR power at R=27
and He flow rate of 6 sccm.

Fig. 5. Refractive index and etch rate as a function of flow rate of H2 gas at
R=21 and He flow rate of 6 sccm.

Figure 4 shows the growth and etch rates of stoichiometric
silicon nitride as functions of ECR power and the
substrate temperature at R=21. It was noted that, for
300 °C < T_s < 500 °C the refractive index n is ~1.93-1.95,
and at T_s = 650 °C, n ≈ 1.99; both were observed to be consistent
with nominally stoichiometric Si3N4. The deposition
rate was independent of T_s at P = 150 W, but increased with
decreasing substrate temperature at P = 250 W. This behavior
is uncommon, although Piccirillo et al. observed similar
trends using an rf plasma source and SiH4+NH3 gas. At high
power levels, there occurs an efficient dissociation of SiH4
and a generation of more excited nitrogen atoms and/or radicals,
which lead to an increase in the deposition rate of the
films. Increasing substrate temperature decreases the amount
of excited nitrogen by allowing surface reaction of nitrogen
with hydrogen to form volatile NH_x species. This results in
a decrease in the growth rate of the films. The deposition rate
was found to increase by about 30% as the flow rate of He
increased from 6 to 10 sccm. Excited He is known to supply
energy to surface species, thus enhancing surface diffusion,
generating reactive radicals, and increasing the deposition
rate.

Fig. 4. Deposition rate and etch rate as functions of substrate temperature
(horizontal axis, °C) and plasma power. Etch rate was measured in NH4F
buffered-HF (BHF) solution (NH4F:HF~7:1) at room temperature.

The etch rate in BHF is generally employed to gauge
hydrogen and hydroxyl ion concentration in nitride and oxide
films. Slower etch rates indicate smaller H and OH contents.
The etch rate in BHF solution decreased as the substrate
temperature and the plasma power increased. It may be
noted from Fig. 4 that, at T_s = 650 °C and P = 250 W, the
etch rate was ~0.7 nm/min. This is comparable to that of
thermal CVD Si3N4 deposited at 800 °C. At lower substrate
temperatures, for example, at T_s = 300 °C and P = 250 W,
the etch rate was five times higher than that for layers prepared
at T_s = 650 °C and P = 250 W, but five times lower
than that for rf plasma assisted SixN deposited at 400 °C.

The effects of H2 flow rate on the refractive index and
etch rate of hydrogenated Si3N4 are shown in Fig. 5. Deliberate
introduction of H2 at a rate up to 10 sccm, while keeping
R = 27 and He flow rate constant at 6 sccm, did not
change the deposition rate significantly at T_s = 300 °C and
P = 250 W. However, n decreased from 1.97 to 1.75 for H2
flow rate increasing from 2 to 10 sccm. Decreasing n with
increasing H incorporation has also been reported in amorphous
silicon. Hydrogen replaces Si-Si and Si-N bonds
presumably with Si-H and N-H bonds, which has a strong
effect on n. Bonds to H can very well be nonbridging (e.g.,
non-network forming). Because of this, a greater number of
N-H bonds than Si-H bonds can make the material nitrogen
rich. The bond lengths for both Si-H and N-H bonds are
smaller than those of the Si-Si and Si-N bonds, which causes an
increase in the number density of atoms in the medium. The
angle of refraction increases with increase in density of the
medium. If the angle of incidence is unchanged, the refractive
index n consequently decreases with increase in the H2
flow rate into the chamber.

Fig. 6. FTIR absorption spectrum of an optimized nominally stoichiometric
Si3N4 layer (n~1.975) prepared at T_s=300 °C, P=250 W, and R=21.

Fig. 7. Current density (J) vs electric field (E) characteristic for the
nominally stoichiometric, Si-rich, and hydrogenated SixN prepared by ECR
RPECVD.

Unlike the refractive index, the etch rate increased almost
exponentially with H2 flow rate, reaching 25 nm/min at
10 sccm. This value is similar to etch rates for rf plasma
enhanced SixN. The etch rate of the SixN is reported to
be strongly affected by the H content in the film. The low
etch rate of the ECR-deposited Si3N4 films confirms the low
H content of the films. A constant n value and
insignificant film shrinkage (<2%) were observed after furnace
annealing at 850 °C for 30 min for the SixN layer deposited
at 300 °C and under 250 W. This shows that the
silicon nitride films were thermally stable as well.

Figure 6 depicts the FTIR absorption spectrum of an optimized
nominally stoichiometric Si3N4 layer (n ≈ 1.975) prepared
at T_s = 300 °C and R ≈ 27 under the influence of
P = 250 W. From Fig. 6 it is apparent that the dominant
signal of the FTIR spectrum is the Si-N stretching band
occurring at 870 cm^-1. No Si-H stretching mode at 2200
cm^-1 was observable and the N-H stretching at 3360 cm^-1
was negligible. Both stretching bands were considered to be
the main peaks of the H-related bonds in SixN layers. Very
small Si-H wagging and N-H bending bands were detected
at 550 and 1150 cm^-1, respectively.

The H concentration ([H]) determined by thermal
evolution during heating to 1200 °C was ~3×10^^ cm^-3, or
3 at. %. This value is similar to the H content observed in
thermal LPCVD Si3N4. On the other hand, [H] for films deposited at
100 °C increased to 1.1×10^^ H atom/cm^3, suggesting that
SiH4 cannot be easily decomposed into SiH_x and that H
desorption from the reacting surface may not be possible at
this temperature. The hydrogenated SixN layer (grown with a
10-sccm H2 flow rate) was found to contain about 1.5×10^^
H atom/cm^3, consistent with the higher etch rate. It is interesting
to note that the H concentration reported in the present
investigation, which uses N2 as a nitrogen source gas, is
much lower than that reported for films deposited with NH3
as a nitrogen source.

B. I-V and C-V characteristics

Figure 7 shows the current density (J) versus electric
field (E) plots of the nominally stoichiometric Si3N4, Si-rich,
and hydrogenated films. From this figure it may be seen
that nominally stoichiometric Si3N4 exhibited a leakage current
of ~100 pA cm^-2 at an electric field of 2 MV/cm. The
J vs E curves for the Si-rich SixN and hydrogenated SixN
films showed orders of magnitude higher leakage currents
and premature failures, respectively. This occurs probably
because Si-rich nitrides contain relatively high concentrations
of Si dangling bonds and distorted excess Si-Si bonds
resulting in midgap defect states or bulk traps. These traps
create trap states or potential wells in the conduction and
valence bands, inducing electrical conduction by Frenkel-Poole
emission. In the case of the stoichiometric sample, there
are essentially three regimes in the J versus E curve. The
regime at lower electric field corresponds to electrical conduction
primarily by hopping. The regime at moderately high
electric field corresponds, on the other hand, to Frenkel-Poole
emission. Fowler-Nordheim tunneling takes place almost
always at higher electric fields due to reduction of the
potential barrier to the mobile carriers. Threshold voltage drift
is also thought to be due to the loss of H when the device is
operated at elevated temperatures. A resistivity ρ > 4×10^^
Ω cm at a field lower than 1 MV/cm was measured for several
optimal samples. Typical values of ρ for nominally
stoichiometric Si3N4 were found to be 10^^-10^^ Ω cm.

As determined from our J versus E plots, the breakdown
field in the case of the Si-rich sample is quite low, about 2-4
MV/cm. The breakdown field for nominally stoichiometric
Si3N4 samples is much higher, about 6-11 MV/cm. This is
comparable to that of Si3N4 prepared by thermal CVD. I-V
measurements revealed that, for the stoichiometric Si3N4
films at high electric field (5.6 MV/cm), the current is temperature
independent. This indicates that the electrical transport
in stoichiometric Si3N4 films at high electric field is
indeed a thermally nonactivated process, which is Fowler-Nordheim
tunneling. Based on this observation it may be
safely concluded that the breakdown was caused by the
Fowler-Nordheim tunneling of electrons through the conduction
band potential barrier, and not by other breakdown
mechanisms. This tunneling is identified to be the main current
conduction mechanism of thermally grown silicon
dioxide. Fowler-Nordheim tunneling in Si3N4, studied in a
number of other reports, attests to the fact that the films
prepared by ECR RPECVD are indeed of high quality.

Hydrogenation is a common problem with deposited silicon nitride samples.^^"^^ To determine precisely the nature of conduction in the Si-rich and hydrogenated samples we plotted log(J/E) vs E^1/2 for E up to its breakdown limit. In both cases the resulting curves were straight lines and dependent on temperature, although the slope of the curves for the Si-rich sample was much higher than that for the hydrogenated sample. For these two samples, with film thickness of about 150 Å, the leakage current was very high at relatively low electric fields. This is probably because in both cases the conduction was dictated by Frenkel-Poole emission, the difference between the two plausibly resulting from different dielectric constants and trap densities. The log(J/E^2) versus E^-1 plot (not shown here) for the stoichiometric film, corresponding to Fowler-Nordheim tunneling, was also a straight line. However, it was observed only at very high field, E>3 MV/cm. For the Si-rich and hydrogenated samples, breakdown occurred at E>3 MV/cm.
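
The two linearizations discussed above can be sketched numerically. The fragment below is only an illustration: it assumes the textbook Frenkel-Poole form, J ∝ E exp[-(qφ - (q^3 E/πε)^1/2)/kT], and the textbook Fowler-Nordheim form, J ∝ E^2 exp(-B/E). The function names, the dielectric constant, and the synthetic data are hypothetical and are not taken from the measurements.

```python
import math

Q = 1.602176634e-19       # elementary charge (C)
K_B = 1.380649e-23        # Boltzmann constant (J/K)
EPS0 = 8.8541878128e-12   # vacuum permittivity (F/m)


def frenkel_poole_coords(fields, currents):
    """Map (E, J) to (sqrt(E), ln(J/E)); Frenkel-Poole data become a line."""
    xs = [math.sqrt(e) for e in fields]
    ys = [math.log(j / e) for e, j in zip(fields, currents)]
    return xs, ys


def fowler_nordheim_coords(fields, currents):
    """Map (E, J) to (1/E, ln(J/E^2)); Fowler-Nordheim data become a line."""
    xs = [1.0 / e for e in fields]
    ys = [math.log(j / e ** 2) for e, j in zip(fields, currents)]
    return xs, ys


def ideal_fp_slope(eps_r, temperature):
    """Textbook slope of ln(J/E) versus sqrt(E), with E in V/m."""
    return math.sqrt(Q ** 3 / (math.pi * EPS0 * eps_r)) / (K_B * temperature)


def fit_line(xs, ys):
    """Ordinary least-squares fit; returns (slope, intercept)."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, my - slope * mx


# Synthetic Frenkel-Poole data (assumed values, E in V/m, arbitrary J scale).
fields = [1.0e8 + 2.0e7 * i for i in range(11)]
slope_in = ideal_fp_slope(eps_r=4.0, temperature=300.0)
fp_currents = [e * math.exp(slope_in * math.sqrt(e) - 40.0) for e in fields]
slope_out, _ = fit_line(*frenkel_poole_coords(fields, fp_currents))

# Synthetic Fowler-Nordheim data with an assumed constant B = 2e10 V/m.
fn_currents = [e ** 2 * math.exp(-2.0e10 / e) for e in fields]
fn_slope, _ = fit_line(*fowler_nordheim_coords(fields, fn_currents))
```

In this sketch a temperature-dependent slope in the first set of coordinates points to Frenkel-Poole emission, whereas a straight, temperature-independent line in the second set is the signature expected for Fowler-Nordheim tunneling.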

The static dielectric constant was determined from the capacitance of MIS capacitors in strong accumulation. The film thickness used for this was measured with the ellipsometer. Si3N4 with n=1.93-1.99 had relative dielectric constants of 6.8-7.1. For the Si-rich silicon nitride (n~2.2) the dielectric constant was ~7.5, in agreement with values for rf PECVD materials.^*’
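
As a minimal sketch of this extraction, the relative dielectric constant follows from the parallel-plate relation ε_r = C t/(ε_0 A), assuming that the accumulation capacitance equals the insulator capacitance. The capacitor area and thickness used below are hypothetical and serve only to show the arithmetic.

```python
EPS0_F_PER_CM = 8.8541878128e-14  # vacuum permittivity (F/cm)


def relative_permittivity(c_acc, thickness_cm, area_cm2):
    """Parallel-plate estimate: eps_r = C * t / (eps0 * A)."""
    return c_acc * thickness_cm / (EPS0_F_PER_CM * area_cm2)


# Hypothetical capacitor: 1e-3 cm^2 gate area, 150 Angstrom nitride film.
area = 1.0e-3            # cm^2 (assumed)
thickness = 150.0e-8     # cm (assumed, 150 Angstrom)
c_acc = 7.0 * EPS0_F_PER_CM * area / thickness   # capacitance for eps_r = 7
eps_r = relative_permittivity(c_acc, thickness, area)
```

Because ε_r scales linearly with the thickness, any error in the ellipsometric thickness carries over directly to the extracted dielectric constant.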

The typical C-V characteristics of the Si3N4/Si/GaAs capacitors are shown in Fig. 8(a). The thickness of the Si interlayer in Si3N4/Si/GaAs capacitors with static in situ annealing of Si was 10 Å, at which the capacitor exhibits the best C-V characteristics.^^ The interface trap densities (D_it) were determined using the conductance loss [Fig. 8(b)] due to the interface traps from the measured admittance of the device
when  biased  in  depletion.  The  series  resistance  and  insulator 
capacitance  effects  were  removed  from  the  measured  con- 




Fig. 8. (a) Measured 1 kHz and 1 MHz HF C-V curves and QSCV curve of a Si3N4/Si/n-GaAs MIS capacitor. (b) Loss-angular frequency curves of the Si3N4/Si/n-GaAs capacitors. The applied gate voltages are shown.

ductance  peaks  to  avoid  artificially  low  interface  trap 
densities.^"^ Assuming that the defects obey the single-time-constant model,^"^ the interface trap densities were found to be around 8×10^^ eV^-1 cm^-2. Previous C-V measurements of the Si3N4/Si MIS structure indicated^ ^ that D_it is about 2×10^^ eV^-1 cm^-2. The discrepancy between the two results suggests that the main contribution to D_it in Si3N4/Si/GaAs MIS structures comes from Si/GaAs interface states.
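
The single-time-constant extraction can be illustrated as follows. The sketch assumes the standard textbook expression for the interface-trap loss per unit area, G_p/ω = q D_it ωτ/[1+(ωτ)^2], whose maximum q D_it/2 occurs at ωτ=1. The trap density and time constant below are hypothetical inputs, not the measured values.

```python
Q = 1.602176634e-19  # elementary charge (C)


def gp_over_omega(omega, tau, d_it):
    """Single-time-constant loss per unit area (F/cm^2) for D_it in eV^-1 cm^-2."""
    wt = omega * tau
    return Q * d_it * wt / (1.0 + wt ** 2)


def dit_from_peak(peak_loss):
    """Invert the peak value: (Gp/omega)_max = q * D_it / 2."""
    return 2.0 * peak_loss / Q


# Hypothetical trap parameters, used only to exercise the formulas.
tau = 1.0e-5          # s (assumed)
d_it_in = 1.0e11      # eV^-1 cm^-2 (assumed)

# Logarithmic sweep of angular frequency from 1e3 to 1e7 rad/s.
omegas = [10.0 ** (3.0 + 0.01 * k) for k in range(401)]
losses = [gp_over_omega(w, tau, d_it_in) for w in omegas]
peak = max(losses)
omega_peak = omegas[losses.index(peak)]
d_it_out = dit_from_peak(peak)
```

In practice the measured conductance must first be corrected for series resistance and insulator capacitance, as noted in the text, before the peak of G_p/ω is read off.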

Figures 8(a) and 8(b) depict the C-V and loss-angular frequency curves for a specific Si3N4/Si/GaAs capacitor. As is apparent from Fig. 8(a), the flatband voltage shift was about -10 mV, which demonstrates a very small amount of fixed charge in the interfacial region. A typical hysteresis of about 50 mV was obtained from retraced capacitance measurements at 1 kHz after subjecting the MIS capacitor to an electric field of 1.3 MV/cm (2.0 V). The hysteresis for a field swing of +4 to -4 MV/cm was about 150 mV. The frequency dispersion, taken from the deviation between the 1 MHz and 1 kHz C-V curves, was around 100 mV. Both hysteresis and frequency dispersion degraded as we went deeper into the GaAs bandgap. As shown in Fig.


Fig. 9. Direct current transconductance and gate leakage current of a nominally 4.5-μm-gate-length Si3N4/Si/n-GaAs depletion mode MISFET as a function of gate voltage V_gs.


8(b), the minimum conductance peak was obtained at a gate bias of -0.3 V, corresponding to about 0.25 eV above midgap. Beyond this bias the conductance peak occurred at frequencies below 1 kHz. An increase in trap density in the upper half of the bandgap was also observed in Fig. 8(b).
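
The pairing of a 1.3 MV/cm field with a 2.0 V bias quoted above fixes an effective insulator thickness if one assumes a uniform field equal to the gate voltage divided by the thickness, neglecting any voltage drop across the Si interlayer and the GaAs. That assumption is ours; the short sketch below only carries out the arithmetic.

```python
def insulator_thickness_angstrom(voltage_v, field_mv_per_cm):
    """Effective thickness t = V / E, converted from cm to Angstrom."""
    thickness_cm = voltage_v / (field_mv_per_cm * 1.0e6)
    return thickness_cm * 1.0e8


def voltage_for_field(field_mv_per_cm, thickness_angstrom):
    """Gate voltage needed to reach a given field across the same film."""
    return field_mv_per_cm * 1.0e6 * thickness_angstrom * 1.0e-8


t_eff = insulator_thickness_angstrom(2.0, 1.3)   # roughly 150 Angstrom
v_swing = voltage_for_field(4.0, t_eff)          # bias for a 4 MV/cm field
```

Under this assumption the ±4 MV/cm swing used for the larger hysteresis measurement corresponds to a gate bias of about ±6 V.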

Superimposed on the high frequency curves in Fig. 8(a) is the quasistatic curve. This curve demonstrates that the excellent insulator quality of the nitride has also allowed the realization of quasistatic characteristics. It is worth noting that a quasistatic C-V (QSCV) curve with a deep dip can only be obtained when the leakage current of our capacitor is lower than 1 pA under an applied field of ~2 MV/cm, as determined from the I-V characteristics. Notably, the trap density determined by the low frequency (LF)/high frequency (HF) method using QSCV and a 1-MHz signal frequency was slightly higher, D_it~3×10^^ eV^-1 cm^-2. Since the interface trap capacitance (C_it) must be extracted from the measured capacitance, which consists of the insulator capacitance, the depletion layer capacitance, and the interface trap capacitance, larger inaccuracies are likely to arise in extracting D_it from the measured LF/HF capacitance than from the conductance measurement.^^ In other studies the C-V characteristics of GaAs MIS capacitors with similar device structures suffered from large hysteresis and frequency dispersion, and quasistatic C-V curves were characteristically absent.

C.  GaAs-based  MISFET 

Recent interest in GaAs-based MISFETs^^’^^ stems from the promising features of GaAs-based electronic circuits for high-speed, low-power applications. To assess the practical applicability of our silicon nitride films, we measured the transfer characteristics of GaAs-based depletion-mode MISFETs. Typical dc transfer characteristics for MISFETs with channel length L=4.5 μm and channel width W=120 μm are shown in Fig. 9. These transfer characteristics show the variation of the transconductance (g_m) and gate leakage current (I_g) as a function of gate-source bias V_gs. As is evident from this figure, the dc transconductance of a MISFET with channel length L=4.5 μm peaks at a gate-source bias of about 1 V, and the maximum transconductance varies between 90 and 110 mS/mm. Although the leakage current of this MISFET at a voltage of -3 V is of the order of 10”^ A, it falls rapidly, being reduced to 10“^^ A at a voltage of 3 V. It is useful to analyze the as-measured results in terms of the dc transconductance of the depletion mode MISFET. This transconductance may be given by^^

8m,dc=  Q+  C’inv+  Qt)»  (1) 

where C_i is the gate insulator capacitance (6.5×10“^ F/cm^2), C_d is the depletion capacitance, C_inv is the inversion layer capacitance, C_it is the static capacitance of interface traps (C_it=qN_it, where N_it is the interface trap density), μ is the carrier mobility, and W is the channel width.

Let us consider the region where the gate-source voltage is higher than the flatband voltage, viz., V_gs>V_FB (~0.1 V). As is apparent from Fig. 9, this region corresponds to the active channel in accumulation, with the largest channel electron concentration, and exhibits the maximum transconductance and a saturated drain current. The flatband condition occurs at about 0.7 V, before the transconductance (g_m,dc) reaches its maximum value, resulting in
g^”110  mS/mm  at  ygs-0.8  V  and  V.  However, 

sweeping V_gs toward depletion causes the channel to be partly depleted. The rate of change of channel conductance with V_gs thus becomes smaller, resulting in a decrease of the transconductance g_m. During the transition from depletion to weak inversion, C_d decreases progressively, leading to a reduction of g_m,dc. At the onset of strong inversion, the maximum depletion is reached and C_d saturates. On the other hand, C_inv increases exponentially, leading to a rapid fall of g_m,dc.

It should be noted that the decrease of g_m,dc in strong accumulation is due to the reduction of the surface mobility with V_gs. Indeed, surface conduction becomes dominant in strong accumulation, and g_m is obtained by replacing C_d with C_acc and μ with μ_s (electron mobility at the surface). Because of strong carrier scattering by dangling bonds at the interface, the surface electron mobility μ_s is quite low, which reduces the transconductance.

Using this transconductance and other relevant parameters, namely, the gate dielectric capacitance C_0 per unit area, the drain voltage V_ds, the channel length L, and the channel width Z, the field-effect mobility μ_FE^^ may be calculated from

μ_FE = L g_m,dc/(Z C_0 V_ds). (2)

The field-effect electron mobility determined in this way is found to be around 1000 cm^2/V s for the Si3N4/Si/GaAs structure at room temperature. For bulk GaAs doped with Si to 2×10^^ cm^-3, it should be about 4000 cm^2/V s. Such low
values  of  electron  mobility  may  be  ascribed  to  (1)  the  inter- 
Table I. The peak values of the extrinsic transconductances and gate leakage of GaAs-based MISFETs as a function of channel length. The gate leakage was measured at V_dg=5 V.

Channel length    Extrinsic transconductance    Gate leakage
(μm)              (mS/mm)                       (nA)

3                 100-140                       <0.01
4.5               90-110                        0.1
10                60-70                         1
20                35-40                         1

face  trap  density  at  the  Si3N4/Si/GaAs  interfaces  which  can 
act  as  recombination  or  trap  centers,  (2)  surface  roughness 
scattering  of  the  GaAs  active  layer,  and  (3)  Coulomb  scatter¬ 
ing  from  fixed  charges  and  interface  states.^^  That  it  is  domi¬ 
nantly  affected  by  the  surface  trap  density  is  apparent  also 
from  an  earlier  investigation  by  Reed^^  for  Al/Si3N4/Si/GaAs 
MISFETs  in  which  transconductance  is  found  to  decrease 
with  an  increase  in  interface  trap  density,  when  the  latter 
exceeds a level of about 10^^ eV^-1 cm^-2.
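
The mobility estimate of Eq. (2) reduces to a one-line calculation once consistent units are chosen. The sketch below uses the channel dimensions quoted in the text, but the insulator capacitance per unit area and the drain voltage are assumed values picked for illustration, so the result is not meant to reproduce the reported mobility.

```python
def field_effect_mobility(g_m, length_cm, width_cm, c0, v_ds):
    """Eq. (2): mu_FE = L * g_m / (Z * C_0 * V_ds), in cm^2/V s.

    g_m is the total transconductance in S, c0 is in F/cm^2, v_ds in V.
    """
    return length_cm * g_m / (width_cm * c0 * v_ds)


# Channel geometry from the text: L = 4.5 um, width = 120 um.
width_mm = 0.120
g_m_total = 110.0e-3 * width_mm       # 110 mS/mm converted to S

mu_fe = field_effect_mobility(
    g_m_total,
    length_cm=4.5e-4,
    width_cm=1.2e-2,
    c0=6.5e-7,       # F/cm^2 (assumed value)
    v_ds=1.0,        # V (assumed value)
)
```

With these assumed inputs the estimate comes out at several hundred cm^2/V s. The point of the sketch is the unit handling: the width-normalized transconductance must be multiplied by the gate width before it enters Eq. (2).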

Table I summarizes the peak values of the extrinsic transconductance and gate leakage of GaAs-based MISFETs as a function of channel length. The transconductances shown in Table I are extrinsic, that is, they include the effect of parasitic resistances (e.g., the source contact resistance and the ungated channel resistance). When the effect of the parasitic resistances is removed, the intrinsic transconductance thus obtained for Si3N4/Si/GaAs devices is about 165 mS/mm for L=4.5 μm, 88 mS/mm for L=9.5 μm, and 45 mS/mm for L=19.5 μm. These results are superior to the reported GaAs metal-oxide-semiconductor field-effect-transistor (MOSFET) result of 60 mS/mm for an 8-μm gate length device. This superiority may be ascribed to the improved interface quality provided by the Si3N4/Si/GaAs structure and to the larger dielectric constant of silicon nitride.
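
The relation between extrinsic and intrinsic transconductance can be made concrete with the usual first-order expression g_ext = g_int/(1 + g_int R_s), where R_s is the parasitic source resistance. This expression is a common textbook approximation and is assumed here; the text does not state which correction was actually applied. The example pairs the upper extrinsic value for the 4.5 μm device (110 mS/mm) with the quoted intrinsic value (165 mS/mm).

```python
def source_resistance(g_ext, g_int):
    """Parasitic resistance (ohm mm) implied by g_ext = g_int / (1 + g_int * R_s).

    Both transconductances are width-normalized, in S/mm.
    """
    return 1.0 / g_ext - 1.0 / g_int


def extrinsic_gm(g_int, r_s):
    """Forward relation: extrinsic transconductance in S/mm."""
    return g_int / (1.0 + g_int * r_s)


r_s = source_resistance(0.110, 0.165)   # about 3 ohm mm
g_back = extrinsic_gm(0.165, r_s)       # recovers 0.110 S/mm
```

Under this assumption a parasitic resistance of roughly 3 Ω mm would account for the difference between the two values.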

An important issue that deserves attention is the improved gate leakage current of GaAs MISFETs with respect to GaAs metal-semiconductor field-effect transistors (MESFETs). As mentioned earlier, one of the motivations driving the development of GaAs MISFETs is to realize a MIS device with low gate leakage current. At the same gate bias (-6.0 V), the gate leakage of the MESFET is several microamperes,"^® whereas the gate leakage of the MISFET varies between several tens of picoamperes and several nanoamperes. This improvement of more than three orders of magnitude in the gate leakage of the MISFET over that of the MESFET is quite remarkable and is beneficial to the development of high-speed, low-power GaAs integrated circuits (ICs) for wireless communication systems.

IV.  CONCLUSION 

We have described the deposition characteristics and related properties of nominally stoichiometric Si3N4 prepared by the ECR RPECVD method. This description can provide valuable direction for the optimization of high quality silicon nitride films for practical applications. Our study demonstrates that films deposited from SiH4 and N2 can be relatively free of hydrogen, which endows them with low etch rates and excellent dielectric strength. The properties of these stoichiometric silicon nitride films are comparable to those of thermal CVD Si3N4, and the films are promising candidates as gate dielectrics for III-V MIS devices. Nominally stoichiometric Si3N4 films exhibit electrical characteristics superior to those of Si-rich and hydrogenated silicon nitride, suggesting that MIS structures should preferably use gate insulators of stoichiometric silicon nitride. Memory devices, on the other hand, should be made from Si-rich silicon nitrides. High/low frequency C-V measurements, conductance versus frequency measurements, and MISFETs were used for a comprehensive characterization of the n-type MIS structures. From the stringent conductance measurements, interface state densities in the high 10^® eV^-1 cm^-2 have been obtained. Self-aligned gate depletion mode MISFETs, which used stoichiometric silicon nitride as the gate dielectric and had gate lengths of 3 μm, exhibited transconductances of about 100-140 mS/mm for GaAs channels. Considering that GaAs-based MISFET technology is still at an embryonic stage, this result is quite remarkable.

A boron-accumulated Si(100) surface with boron coverage up to 0.1 monolayer has been prepared by high-temperature annealing of B-doped Si samples. Scanning tunneling microscopy has been used to monitor the transformation in surface morphology and surface atomic structure induced by boron surface accumulation. The specific boron-induced surface features have been elucidated and a model of their atomic structure has been proposed. On the boron-accumulated surface, epitaxial Si films with a thickness of 0.1-3.0 monolayers have been grown using solid phase epitaxy and molecular beam epitaxy. The results show that under appropriate growth conditions B surface segregation can be suppressed even on the atomic scale. The effect of boron-induced features on the growth process is discussed. © 1996 American Vacuum Society.


I.  INTRODUCTION 

The formation of delta-function-like spikes of dopants in Si (Si delta doping) has recently attracted considerable interest both from a scientific viewpoint and because of its technological applications. Such delta-doped layers have been produced successfully in Si using molecular beam epitaxy (MBE) and MBE-related solid phase epitaxy (SPE).^"^ A new challenge for the delta-doping technique is the fabrication of delta-doped layers with ordered two-dimensional structures of the dopants (ordered delta doping).^''^ Dopant ordering offers two prospective advantages for device applications: (i) a lowering of the statistical limit for the device size in deep submicron VLSI and (ii) the possibility of reducing the ionized impurity scattering, which one can anticipate from an ordered array of dopants.^ The most appropriate approach to ordered delta doping requires the formation of an ordered submonolayer surface phase of adsorbate on an atomically clean Si surface^® followed by low-temperature epitaxial Si overgrowth by means of MBE or SPE. Achieving this goal requires sufficient stability of the ordered surface phase with respect to Si overgrowth. Unfortunately, most dopant surface phases lose their ordered structure already upon room-temperature (RT) deposition of Si submonolayers. Of the possible dopants, only boron shows promise for preservation of the ordered reconstruction at the buried interface. The Si(111)√3×√3-B surface phase is known to persist after RT deposition of an amorphous Si cap.^ However, at the relatively high temperatures required for epitaxial Si(111) overgrowth (400 °C for MBE and 600 °C for SPE), ordered delta doping cannot be achieved. The buried Si(111)√3×√3-B surface phase suffers from both boron disordering^^ and outsmearing of the boron profile towards the surface.


^^Corresponding  author;  Electronic  mail:  e91bzeus@rz.unibw-muenchen.de 


Compared to the Si(111) surface, the temperature required for epitaxial Si growth on the Si(100) surface is much lower. As a result, preservation of the 2×1 ordered boron submonolayer beneath a crystalline Si cap grown at about 200 °C has been confirmed both by x-ray diffraction^ and transmission electron microscopy.^ Electrical characterization of these structures showed that the embedded ordered boron yields the highest electrically active concentration of boron in silicon ever reported.^’^ However, the expected large mobility enhancements have not been observed. To optimize the ordered boron delta-doped structures, detailed information on the atomic structure of the B-adsorbed Si(100) surface and on the main regularities of Si epitaxial overgrowth is highly desirable. Despite the widespread use of boron as a p-type dopant, investigation of the interaction of B with the Si(100) surface has started only recently. Until now there have been only a few publications devoted to the atomic structure of the B/Si(100) surface, and there is no atomic-scale information about Si growth on the B/Si(100) surface.

In the present work, the MBE and SPE overgrowth of a few Si monolayers on B/Si(100) is studied by scanning tunneling microscopy (STM). The B/Si(100) surface is prepared by high-temperature annealing of B-doped Si(100) samples, and the atomic structure of the surface formed is discussed.

II.  EXPERIMENT 

The samples used in this study were Si bars, 14×4.5×0.3 mm^3 in size, cut from 0.01 Ω cm B-doped Si(100) wafers (Wacker-Chemitronic). Before loading into the UHV-STM system the samples were cleaned ultrasonically with acetone and methyl alcohol. They were then fixed by molybdenum fasteners and transferred into the ultrahigh vacuum (UHV) chamber. After degassing for about 5 h at 550 °C the samples were heated by direct current to 1230 °C to induce boron accumulation at the sample surface. The pressure during annealing did not exceed 5×10“^^ Torr.

Si cap layers were deposited from a resistively heated Si stripe at a rate of about 0.3 monolayer (ML)/min [1 ML=6.8×10^14 cm^-2, the site density of the Si(100) plane]. During deposition the pressure was below 4×10“^^ Torr. The deposition rate was determined in calibration experiments from the shape, size, height, and number density of Si islands displayed by STM after deposition for a fixed time. Sample temperatures above 350 °C were monitored by an infrared pyrometer with an accuracy of ±15 °C. Temperatures from 20 to 350 °C were estimated by extrapolating the “heating current-temperature” dependence to lower currents. The accuracy in the latter case was about ±50 °C.
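
For orientation, the monolayer units used here convert to deposition times and areal densities as sketched below. The calculation assumes a constant deposition rate equal to the quoted value of about 0.3 ML/min; the helper names are ours.

```python
ML_SITE_DENSITY = 6.8e14   # atoms per cm^2 in one Si(100) monolayer
RATE_ML_PER_MIN = 0.3      # quoted deposition rate, assumed constant


def deposition_time_min(coverage_ml, rate=RATE_ML_PER_MIN):
    """Time in minutes needed to deposit a given coverage in ML."""
    return coverage_ml / rate


def areal_density(coverage_ml):
    """Convert a coverage in ML to atoms per cm^2."""
    return coverage_ml * ML_SITE_DENSITY


t_3ml = deposition_time_min(3.0)   # time for a 3 ML Si cap
n_boron = areal_density(0.1)       # atoms/cm^2 at 0.1 ML boron coverage
```

At the quoted rate a 3 ML cap takes about 10 min to deposit, and a boron coverage of 0.1 ML corresponds to about 6.8×10^13 atoms/cm^2.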

The STM observations were performed using an Omicron UHV STM apparatus driven by custom-made electronics.^"^ All the STM images presented here were obtained at a constant current of 0.2 nA and at sample biases in the range from -3.0 to +3.0 V. Images are shown with a conventional gray scale keyed to the surface height.

III.  RESULTS  AND  DISCUSSION 
A.  B/Si(100)  surface 

High-temperature annealing of B-doped Si samples is known to result in boron accumulation at the sample surface.^^"^^ The saturating surface density of B atoms achieved upon Si sublimation at a given temperature is the result of a competition between (i) boron enrichment at the surface, due to the much lower evaporation rate of B atoms from the Si surface compared to that of Si atoms, and (ii) boron diffusion into the bulk, driven by the B concentration gradient forming in the near-surface region. Auger electron spectroscopy measurements showed that the maximum boron coverage achieved under the present experimental conditions, namely, after annealing of 0.01 Ω cm Si(100):B samples at about 1230 °C for 2 h, was about 0.1 ML.

STM observations have revealed that B accumulation causes transformations in both surface morphology and surface atomic structure. The typical large-scale transformations are illustrated in Fig. 1. After moderate annealing (1215 °C for 10 min) the boron surface concentration is low (<0.03 ML) and the surface displays an STM image [Fig. 1(a)] much like the one known for an atomically clean Si(100) surface. One can see the regular distribution of monolayer-high steps reflecting the sample miscut. The step-down direction is from the upper right to the lower left. The downward steps running parallel to the dimer rows (S_A steps, according to Chadi’s notation^^) are straight, whereas those running perpendicular to the dimer rows (S_B steps) are rather rough. B accumulation changes the surface morphology to the one shown in Fig. 1(b). This surface corresponds to annealing for about 1 h at 1230 °C and a B coverage of 0.06-0.08 ML. Its most apparent attribute is the enhanced roughness of the S_B steps. Such morphology implies the formation of certain surface features that pin some dimer rows during sublimation.




Fig. 1. Large-scale transformations induced by boron accumulation on the Si(100) surface. STM images of a 0.01 Ω cm Si(100):B sample surface after annealing in UHV (a) at 1215 °C for 10 min, (b) at 1230 °C for 1 h. Scale: 1870×1560 Å^2.


The  STM  images  with  atomic  resolution  show  specific 
features  which  are  believed  to  be  associated  with  boron  dop¬ 
ant  atoms.^^  As  one  can  see  in  Fig.  2,  these  are  nearly  round 
protrusions  which  occur  mostly  in  pairs  and  rarely  as  indi¬ 
viduals  or  as  chains  containing  three  and  more  elements.  In 


Fig. 2. STM image of the B/Si(100) surface. Scale: 230×230 Å^2.

Fig. 3. STM images of the boron-induced double-protrusion feature (marked by a dashed frame). (a) Empty states (sample bias is +2.0 V), (b) filled states (sample bias is -3.0 V). The axes are drawn for convenience in determining the local symmetry and the position of the feature with respect to the Si(100)2×1 dimer rows. Scale: 28×28 Å^2.


particular, one can see that many of them are located adjacent to the edges of S_B steps, terminating dimer rows. Thus, it can be assumed that the boron-induced features pin the dimer rows during sublimation, leading to the specific form of the S_B-step edges seen in the large-scale STM images [Fig. 1(b)]. The reason for such pinning might be related to changes of the local stress around a boron-induced feature, since the Si-B bond is shorter than the Si-Si one.

To elucidate the atomic structure of the boron-induced features, high-resolution STM images have been obtained. As an example, Fig. 3 shows a small fragment of the surface with a double-protrusion feature recorded at positive [Fig. 3(a)] and negative [Fig. 3(b)] bias polarities. One can see that the feature replaces two dimers in a central dimer row. The double-protrusion feature shares the symmetry axis of the central dimer row and lies on the (110) line drawn between two neighboring dimers, as shown in Figs. 3(a) and 3(b).

Figure  4  shows  a  structural  model  of  the  double¬ 
protrusion  feature  proposed  on  the  basis  of  the  atomic- 
resolution  STM  measurements,  on  the  data  available  in  the 
literature  concerning  the  B/Si(100)  surface,  namely,  that  this 
surface  is  not  a  dangling  bond-free  surface^  ^  and  on  the  pre¬ 


[Fig. 4 legend: top-layer Si dimers; 1st bulk-layer Si atoms; "denuded" Si atoms (see text); 2nd bulk-layer Si atoms; 2nd bulk-layer B atoms.]


Fig.  4.  Proposed  model  for  the  boron-induced  double-protrusion  feature,  (a) 
top  view,  (b)  side  view,  (c)  schematic  diagram  showing  the  spatial  location 
of  the  boron-induced  protrusions  within  the  area  of  the  feature  (hatched 
region)  in  empty-  and  filled-state  STM  images. 

sumption  that  boron  atoms  occupy  substitutional  sites  in  the 
Si lattice.^ In the model, two boron atoms (shown in Fig. 4 as small closed circles) occupy substitutional sites in the second bulklike layer. These boron atoms are separated by 2a, where a is the unit length of the Si(100) plane. Because the Si-B bond is shorter than the Si-Si bond, the B-bonded Si atoms of the first bulklike layer are shifted from their normal positions towards the B atoms. The stresses originating from these shifts are believed to be the reason for the bond disruption and the elimination of two Si surface dimers. Thus, two Si atoms of the first bulklike layer (shown in Fig. 4 as shaded circles) become “denuded.” Each of these atoms has two dangling bonds, and we assume that the associated electron densities are responsible for the double protrusions observed in the STM images. The images are evidently affected by the boron atoms in the substitutional positions, which act as negatively charged ions, simultaneously repelling the electron density and causing a partial depletion of the electron states of the neighboring Si atoms. As a result, the protrusions appear to be more closely spaced in the filled-state image [Fig. 3(b)] than in the empty-state image [Fig. 3(a)]. Basically similar structural models are applicable to features containing three or more protrusions. The periodic occupation of 2a-spaced substitutional sites in the second bulklike layer by B atoms, in the manner described above, produces a linear chain with the corresponding number of protrusions in the STM image. A more detailed analysis of these structures is presented elsewhere.^^

Comparison of the present results with the recent STM data of Wang et al. for the B/Si(100) surface prepared by decomposition of boron hydrides implies that the surface structure formed might be noticeably different for the different preparation procedures. Wang et al. reported the B/Si(100) surface to be a mixture of regions of several reconstructions consisting of silicon dimers and dimer vacancies. In addition to the double-protrusion features reported here, which are labeled “A features” in Refs. 23 and 24, regions of c(4×4) reconstruction were detected in Refs. 23 and 24 but have never been observed on our samples. The STM images of c(4×4) presented by Wang et al. are identical in both polarities to the ones reported for the metastable Si(100)c(4×4) structure formed under hydrogen exposure followed by hydrogen-desorbing sample annealing.^^ This identity makes us suspect that the reconstruction observed in Refs. 23 and 24 might be induced by the interaction of the Si(100) surface with the hydrogen released during decomposition of the B2H6 and B10H14 precursors rather than by the presence of boron at the surface. The procedure used in the present work (i.e., boron extraction from the bulk) is free from these side effects, and c(4×4) does not form. Unfortunately, our study is restricted to a maximum B coverage of about 0.1 ML, and thus the question of what the completed (stoichiometric) B/Si(100) surface looks like still remains open. Additional studies [e.g., STM observation of the B/Si(100) surface prepared by deposition of elemental B] are required to clarify the intrinsic structure of the “ideal” Si(100)-B surface phase.


B. Si overgrowth on B/Si(100) surface

Using  the  B/Si(100)  surface  as  a  substrate,  Si  overgrowth 
has  been  carried  out  both  in  SPE  and  MBE  regimes,  i.e.,  by 
RT  deposition  of  amorphous  Si  followed  by  annealing  and 
by  Si  deposition  onto  the  heated  substrate,  respectively.  We 
should  admit  that  the  experimental  conditions  of  the  present 
study  do  not  coincide  completely  with  the  ones  required  for 
the  delta-doping  technique  as  (i)  the  surface  is  not  a  com¬ 
pleted  Si(100):B  surface  phase  and  (ii)  boron  is  accumulated 
in  a  rather  thick  near-surface  layer  and  not  only  at  the  sur¬ 
face.  However,  the  main  regularities  of  Si  overgrowth  on 
such  “nonideal”  B/Si(100)  substrate  are  believed  to  remain 
valid  for  the  fabrication  of  Si:B  delta-doped  layers. 

1. Si-SPE growth

Figure 5(a) shows an STM image of a 3-ML-thick Si film
deposited  onto  a  B/Si(100)  surface  held  at  RT.  One  can  see 
that  the  film  is  basically  amorphous,  although  the  presence  of 
a  few  short  pieces  of  dimer  rows  can  be  noticed.  Annealing 
at  temperatures  of  about  350  °C  induces  the  SPE  crystalliza¬ 
tion  as  evidenced  by  the  appearance  of  well-defined  dimer 
rows  [Fig.  5(b)].  The  observation  of  the  SPE  growth  at  such 
low  temperatures  indicates  a  facilitation  of  epitaxial  crystal¬ 
lization  in  Si  films  of  a  few  monolayers  thickness,  as  com¬ 
pared  to  Si  films  of  the  more  conventional  thickness  range  of 


Fig. 5. Si-SPE growth on B/Si(100) surface: (a) 3 ML Si film as deposited
at room temperature, (b) annealed at 350 °C for 5 min, (c) annealed at
500 °C for 7 min. Scale: 265 × 220 Å².


10-100  nm.  The  extrapolation  to  350  °C  from  the  known 
data  for  “thick”  Si  films^^  yields  the  negligible  SPE  crystal¬ 
lization  rate  of  about  10“^  nm/min. 

A  detailed  STM  inspection  of  the  annealed  film  has  not 
revealed the presence of B-induced features on the surface.
Selected protrusions initially suspected to be boron-induced
features appear, at a greater magnification, to be isolated Si
dimers or intrinsic Si(100) surface defects. The crystallized
film  is  rough  and  up  to  four  different  atomic  layers  can  be 
distinguished  in  Fig.  5(b).  After  annealing  at  higher  tempera¬ 
tures  [e.g.,  at  500  °C  as  illustrated  by  Fig.  5(c)]  the  film 
surface  becomes  perfect  and  flat  and  only  very  few 
B-induced  features  can  be  found  on  it.  Thus,  one  can  see  that 
under  appropriate  heat  treatment  epitaxial  Si  overgrowth  can 
be  achieved  by  SPE  without  noticeable  boron  segregation, 
even  on  the  atomic  scale.  It  should  be  noted,  however,  that 
sufficient  thickness  (at  least  3  ML)  of  the  deposited  Si  film 

Fig. 6. Surface after deposition of 0.2 ML Si at about 200 °C: (a) B/Si(100)
surface and (b) clean Si(100). Scale: 230 × 230 Å².

appears  to  be  a  necessary  requirement  for  the  suppression  of 
boron  segregation  from  the  doped  substrate 
(ideally  from  the  delta-doped  layer)  during  Si-SPE  regrowth. 
In  fact,  upon  crystallization  of  amorphous  1-2  ML  thick  Si 
films,  the  B-induced  features  already  reappear  at  the  film 
surface at 350 °C. The existence of such a critical thickness
needed  to  avoid  surface  segregation  seems  to  be  an  inherent 
feature  of  Si-SPE  and  has  also  been  reported  for  other  dop¬ 
ants  (e.g.,  for  Sb^^). 

2.  Si-MBE  growth 

Epitaxial  growth  of  a  Si  cap  layer  on  the  B/Si(100)  sur¬ 
face  starts  at  a  temperature  of  about  200  °C.  The  initial  stage 
of  epitaxial  growth  is  illustrated  by  Fig.  6(a)  which  shows  a 
B/Si(100)  surface  after  deposition  of  about  0.2  ML  Si.  For 
comparison, Fig. 6(b) shows a clean Si(100)2×1 surface after
deposition of the same amount of Si. One can see that in
both  cases  the  formation  of  long  islands  occurs  with  a  pref¬ 
erential  growth  direction  perpendicular  to  the  dimer  rows  of 
the  original  substrate.  However,  in  the  case  of  the  B/Si(100) 
substrate  this  growth  anisotropy  is  less  pronounced.  This 


Fig. 7. Top Si dimer trapped between two boron-induced features.

specific  feature  of  Si  epitaxy  on  B/Si(100)  surface  seems  to 
be  caused  by  the  enhanced  topographical  roughness  of  the 
original  surface  which  does  not  provide  enough  area  for  con¬ 
tinuous  growth  of  long  dimer  strings  [see  Fig.  1(b)]  and  by 
the  presence  of  B-induced  features.  The  effect  of  these  fea¬ 
tures  may  be  twofold.  First,  a  boron-induced  feature  may 
serve  as  a  preferential  site  for  island  nucleation  and,  second, 
the  propagation  of  the  growing  dimer  string  can  be  blocked 
on  reaching  a  boron-induced  feature.  Analysis  of  many  STM 
images  shows  that  boron-induced  protrusions  are  located 
mainly  at  the  ends  of  newly  grown  dimer  strings.  The  oppo¬ 
site  arrangement  is  rare,  i.e.,  there  are  only  very  few  dimer 
strings  or  islands  that  have  a  boron-induced  feature  in  the 
center  while  keeping  their  edges  free.  This  may  be  an  indi¬ 
cation  that  boron-induced  features  are  potential  barriers  for 
adatom diffusion. Another argument in favor of this assumption
is the sporadic presence on the annealed B/Si(100) surface
of features like the one shown in Fig. 7. It shows a
remnant silicon dimer of a completely sublimated upper
layer that has been trapped between two boron-induced
features.

Epitaxial  growth  of  thicker  films  is  presented  in  Fig.  8. 
Figure  8(a)  shows  a  surface  after  deposition  of  3  ML  Si  at 
about  200  °C.  The  film  exhibits  a  well-defined  epitaxial 
structure  though  it  is  rough  on  an  atomic  scale,  and  up  to  5 
different  atomic  layers  are  seen  much  like  the  case  of  SPE 
growth  at  350  °C  [Fig.  5(b)].  No  apparent  indication  of  any 
boron  segregation  can  be  found  at  this  growth  temperature. 
However,  an  increase  in  temperature  leads  to  pronounced 
surface  segregation  of  boron.  As  an  example,  Fig.  8(b)  shows 
the surface of the film grown by MBE at about 500 °C. One
can  see  the  presence  of  numerous  boron-induced  double¬ 
protrusion  features  in  each  atomic  layer.  The  threshold  tem¬ 
perature  for  the  onset  of  boron  segregation  could  not  be  de¬ 
termined  accurately  in  the  present  study  due  to  limited 
accuracy  of  the  temperature  measurements.  Our  estimation 
suggests  that  for  a  growth  rate  of  about  0.3  ML/min  it  should 
be  situated  between  230  and  300  °C. 

The existence of a temperature range in which Si(100)
epitaxial overgrowth takes place, but in which boron segregation
is still suppressed even on the atomic scale, is very
promising for Si:B ordered delta doping. In contrast, such a
temperature interval does not exist for Si growth on the
B/Si(111) surface, where B surface segregation already occurs
at the minimal epitaxial temperature of about 400 °C.

Fig. 8. Si-MBE growth on B/Si(100) surface: (a) 3 ML Si deposited at about
200 °C, (b) 1 ML Si deposited at about 500 °C. Scale: (a) 175 × 175 and
(b) 350 × 350 Å².

IV.  CONCLUSION 

Scanning  tunneling  microscopy  has  been  used  to  study  (i) 
the  structure  of  the  B/Si(100)  surface  prepared  by  annealing 
of B-doped Si(100) samples and (ii) regularities of epitaxial
Si  overgrowth  on  this  surface.  The  appearance  of  specific 
boron-induced  features  has  been  detected  and  a  model  of 
their  atomic  structure  has  been  proposed  on  the  basis  of  STM 
observations  and  known  literature  data.  In  our  model  B  at¬ 
oms  are  suggested  to  reside  in  the  second  bulklike  layer, 
occupying  substitutional  sites  spaced  by  two  periods  of 
Si(100) plane, while Si top dimers are eliminated, exposing
the underlying atoms. Epitaxial growth of a few monolayers
has  been  monitored  by  STM  for  both  MBE  and  SPE  regimes. 
The  growth  conditions  under  which  the  suppression  of  B 
segregation occurs have been determined: for MBE,
deposition temperatures should be below 230 °C, while for
SPE, annealing temperatures up to 500 °C can be used provided
the Si film thickness is at least 3 ML.
In situ phosphorus doped Si epitaxial layers have been grown at 600 °C in a very low pressure
chemical  vapor  deposition  system,  using  SiH4  and  PH3  diluted  in  H2.  The  presence  of  H2  was  found 
to reduce the epitaxial growth rate. Secondary-ion mass spectrometry measurements show that
a constant phosphorus concentration with depth was achieved for a steady flow of PH3; that is,
the dopant concentration in the film is a function of the gas-phase dopant concentration. A chemical concentration as
high  as  2.5X10^^  P/cm^  was  obtained  in  Si  epitaxial  layers  though  with  very  low  growth  rate. 

Epilayers  with  constant  doping  levels  from  1.5X10^^  P/cm^  to  1.2X10^®  can  readily  be  grown. 

Despite  the  relatively  low  growth  rate,  there  is  no  evidence  of  a  time-dependent  accumulation  of 
phosphorus  on  the  growth  surface  or  the  reactor  wall.  The  reasons  accounting  for  this  phenomenon 
are  discussed.  ©  1996  American  Vacuum  Society. 


I.  INTRODUCTION 

Remarkable  progress  in  silicon-based  device  technology 
has  been  made  by  taking  advantage  of  the  low-temperature 
preparation  of  epitaxial  films  in  the  Si/Ge  materials  system. 
A  variety  of  unique  devices  with  high  performance  have 
been  fabricated.  Lately,  the  world’s  first  highly  integrated 
SiGe  IC,  a  1.0  GHz  12-bit  digital-to-analog  converter,  was 
developed  by  IBM  and  Analog  Devices."^  Low-temperature 
growth of n-type films will be necessary in future integrated
circuit fabrication. However, n-type doping to high levels in
low-temperature (450-750 °C) Si deposition processes has
been  difficult  due  to  the  lack  of  control  of  phosphorus  con¬ 
tent  and  the  heavy  reduction  of  the  growth  rate.^’^  In  addi¬ 
tion,  when  attempting  to  incorporate  phosphorus  concentra¬ 
tions  in  excess  of  1X10^^  P/cm^,  a  steady  flow  of  PH3 
produces a monotonically increasing or unstable level of
phosphorus  doping.*^’^  In  this  case,  it  is  believed  that  phos¬ 
phorus  accumulates  on  and  eventually  passivates  the  growth 
surface. To achieve high levels of n-type doping at low temperature,
a study using SiCl2H2 as the Si gaseous source has been
carried out under atmospheric pressure.^ Study of high level
(>5XI0^^  cm“^)  n-type  doping  using  SiH4  as  gaseous 
source  has  not  been  reported  in  the  literature.  In  this  article, 
we  report  an  investigation  of  in  situ  phosphorus  doping  of  Si 
epilayers  at  high  levels  using  SiH4  and  1800  ppm  PH3  di¬ 
luted in H2.

II.  EXPERIMENT 

Phosphorus-doped layers in this study were prepared in a
rapid thermal process/very low pressure chemical vapor
deposition (RTP/VLP-CVD) reactor, which has been described
elsewhere.^ The main feature of RTP/VLP-CVD
is the use of tungsten-halogen lamps around the quartz
tube reactor as the energy source to activate the deposition reaction.
In this article, all samples were grown at 600 °C. Epitaxial
layers were deposited on (100)-oriented Si substrates. Overall
initial substrate preparation involved a standard RCA
cleaning process followed by a 10:1 de-ionized (DI)
water/HF bath for 15 s. This preclean process produces an
oxide-free, hydrogen-passivated silicon surface. Before
deposition, the wafers were prebaked in a 14 sccm H2 flow at
600 °C for 15 min,^^ so that the major residual species in the
reactor was hydrogen. The main growth gaseous source was
SiH4. The gaseous source employed for doping
was 1800 ppm PH3 in H2. The partial pressure of SiH4
was maintained at 10 mTorr during deposition. The ratio of the
diluted PH3 flow rate to the SiH4 flow rate varied from 0.1 to
1.4. The total growth pressure varied correspondingly from
10 mTorr to 25 mTorr. Growth duration was between 45 and
150 min, producing 80-350-nm-thick epitaxial layers. Epilayer
thickness and phosphorus incorporation were determined
from spreading resistance profiling (SRP) and secondary-ion
mass spectroscopy (SIMS) profiling.
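
The gas-handling figures quoted above can be cross-checked with a short calculation. The following is only a sketch: it assumes that partial pressures scale with flow rates (ideal gases pumped alike), which the text does not state, and the function name `gas_phase` is introduced here for illustration.

```python
# Sketch: relate the diluted-PH3/SiH4 flow ratio to gas-phase composition
# and to an estimated total pressure.
PH3_FRACTION_IN_H2 = 1800e-6  # 1800 ppm PH3 in H2 (from the text)
P_SIH4_MTORR = 10.0           # SiH4 partial pressure, 10 mTorr (from the text)


def gas_phase(flow_ratio):
    """Return (PH3/SiH4 mole ratio, estimated total pressure in mTorr).

    Assumption: partial pressure is proportional to flow rate, so the
    diluted dopant gas adds flow_ratio * P_SiH4 to the total pressure.
    """
    ph3_to_sih4 = flow_ratio * PH3_FRACTION_IN_H2
    total_pressure = P_SIH4_MTORR * (1.0 + flow_ratio)
    return ph3_to_sih4, total_pressure


for r in (0.1, 1.4):
    ratio, p_total = gas_phase(r)
    print(f"flow ratio {r}: PH3/SiH4 = {ratio:.2e}, total ~ {p_total:.0f} mTorr")
```

Under this assumption the estimated total pressure runs from about 11 to about 24 mTorr, close to the quoted 10-25 mTorr range.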

III.  RESULTS  AND  DISCUSSION 

Figure  1  shows  a  typical  SIMS  profile  of  phosphorus 
doped  Si  epilayer  in  which  chemical  stability  and  abrupt 
dopant  transition  have  been  achieved.  The  flat  depth  profile 
shows  that  the  epitaxial  layer  has  nearly  constant  phosphorus 
level.  The  incorporation  efficiency  of  phosphorus  from  the 
gas  phase  is  stable.  A  steady  flow  of  PH3  produces  a  constant 
level  of  incorporated  phosphorus  rather  than  a  monotonically 
increasing  one.^  Phosphorus  content  of  epitaxial  layer  is  de¬ 
termined  by  the  phosphine  atomic  fraction  present  in  the 
growth  source.  In  our  case,  a  surface  poisoning  effect^ ^  is  not 
observed.  There  is  no  dopant  buildup  on  the  growth  surface 
or the reactor wall. Figure 1 also shows that the dopant concentration
turn-on slope is about 18 nm/decade. In addition,
Fig. 2 shows the phosphorus doping concentration as a function
of the ratio of diluted PH3 flow rate to SiH4 flow rate.
The  data  suggest  that  the  phosphorus  concentration  is  pro¬ 
portional  to  diluted  PH3  flow  rate  and  phosphorus  does  not 

Fig. 1. A typical SIMS profile of phosphorus in epitaxial Si deposited at
600 °C from SiH4 and 1800 ppm PH3 in H2.

block  surface  sites.  Many  phosphorus  doped  samples  show 
that  the  reproducible  range  of  phosphorus  concentration  is 
from  1.5X  10^^  to  1.2X  10^^  P/cm^.  We  obtained  growth  rates 
from  a  series  of  SIMS  profiles  related  to  samples  grown  un¬ 
der  different  PH3  flow  rates,  as  plotted  in  Fig.  3.  The  growth 
rate  decreases  as  the  ratio  of  diluted  PH3  flow  rate  to  SiH4 
flow rate increases. When the flow rate ratio varies from 0.1
to  1.4,  the  growth  rate  decreases  from  3  to  0.4  nm/min. 



Fig.  2.  Phosphorus  incorporation  in  Si  as  a  function  of  the  ratio  of  diluted 
PH3  flow  rate  to  SiH4  flow  rate. 


Fig.  3.  Phosphorus  doped  epitaxial  Si  growth  rate  dependence  on  the  ratio 
of  diluted  PH3  flow  rate  to  SiH4  flow  rate. 

The constant level of incorporated phosphorus
is attributed to the introduction of a large
amount of dilution gas H2 and to the features of RTP/VLP-CVD.
As reported in previous literature, the study of the interaction
of SiH4 and PH3 with the Si(100) surface demonstrates that
a  stable  layer  of  adsorbed  phosphorus  will  form  during  the 
phosphorus-doping  deposition  process.  This  high  phosphorus 
coverage  leads  to  accumulation  of  phosphorus  on  Si  surface, 
which  accounts  for  the  rapid  reduction  of  the  Si  deposition 
rate and the lack of control of phosphorus content. In the presence
of a large amount of dilution gas H2, the desorption of the
reaction by-product H2 from the surface becomes less efficient,
so the surface reactive sites are mostly occupied by the surface
species H, SiH, SiH2, and SiH3 rather than by phosphorus. The
formation of the overlayer of adsorbed phosphorus is hindered
by the presence of the dilution gas H2. In this instance, there
is  no  accumulation  of  phosphorus  on  Si  surface.  This  results 
in  a  constant  level  of  incorporated  phosphorus.  On  the  other 
hand,  the  dilute  gas  H2  will  decrease  the  growth  rates,  since 
the desorption rate of hydrogen, which is the rate-limiting
process, will decrease as a function of H2 overpressure. Although
the incorporation of phosphorus can cause growth
rate reduction, we believe the reduction is mostly a result of
the introduction of dilute gas H2 at lower incorporation
of  phosphorus  from  1.5X10^^  P/cm^  to  1.2  XIO^^.  When  the 
content  of  phosphorus  is  larger  than  1.2X10^^  say  2.5  XIO^^ 
both the introduction of dilute gas H2 and the incorporation of phosphorus
contribute about equally to the reduction of the growth rate.

IV.  SUMMARY 

By using 1800 ppm PH3 diluted in H2 as dopant in RTP/VLP-CVD,
we obtained samples in which phosphorus doping
concentrations are constant. Chemical concentration as
high  as  2.5X10^^  P/cm^  was  achieved.  No  evidence  shows 
time-dependent  accumulation  of  phosphorus  on  the  growth 
surface  or  the  reactor  wall.  In  addition,  the  growth  rates  de¬ 
crease  as  the  flow  rate  of  diluted  phosphine  increases.  We 
believe  that  this  reduction  of  growth  rate  is  mostly  a  result  of 
the introduction of dilute gas H2 at lower content of
phosphorus, and that the dilute gas H2 has an important effect
on the phosphorus doping process by reducing the desorption
rate  of  hydrogen.  As  a  result,  the  surface  reactive  sites  are 
mostly  occupied  by  surface  species,  H,  SiH,  SiH2,  SiH3, 
rather  than  phosphorus.  This  leads  to  the  phosphorus  doping 
process  in  which  there  is  no  accumulation  of  phosphorus  and 
the  growth  rates  are  relatively  low. 
Characterization by x-ray photoelectron spectroscopy of the chemical
structure  of  semi-insulating  polycrystalline  silicon  thin  films 
The x-ray photoelectron spectroscopy (XPS) technique has been used to investigate the composition
of  semi-insulating  polycrystalline  silicon  (SIPOS)  films  having  oxygen  contents  of  10  and  35  at.  % 
prepared  by  low  pressure  chemical  vapor  deposition.  XPS  analysis  has  demonstrated  that  the  film 
compositions can be qualitatively described by means of the five Si-Si_xO_{4-x} tetrahedra (with
0 ≤ x ≤ 4 and x an integer) predicted by the statistical random bonding model (RBM). However, the
quantitative  analysis  of  the  XPS  spectra  has  demonstrated  that  the  concentrations  of  the  various 
tetrahedra  found  in  the  SIPOS  films  are  remarkably  different  from  those  predicted  by  a  statistical 
approach,  i.e.,  by  assuming  that  each  Si  atom  forms  with  equal  probability  bonds  with  either  Si  or 
O.  We  have  also  found  that  the  composition  of  high  temperature  (up  to  1000  °C)  annealed  films 
further  departs  from  that  predicted  by  the  RBM  model;  indeed,  the  anneal  promotes  the 
decomposition of partially oxidized Si-Si_xO_{4-x} tetrahedra into Si-Si4 tetrahedra (that form Si
nanocrystals) and Si-O4 tetrahedra (that enrich the oxygen content of the amorphous phase).

©  1996  American  Vacuum  Society. 


I. INTRODUCTION

Semi-insulating  polycrystalline  silicon  (SIPOS)  is  a  ge¬ 
neric  acronym  used  to  indicate  non-stoichiometric  silicon  ox¬ 
ide  films,  generally  prepared  by  low  pressure  chemical  vapor 
deposition  (LPCVD)  or  by  atmospheric  pressure  chemical 
vapor  deposition  (APCVD).  SIPOS  composition  is  usually 
expressed as SiO_x, with x that can be varied from 0 (polycrystalline
or amorphous silicon) to 2 (stoichiometric silicon
dioxide);  the  various  compositions  are  obtained  by  changing 
the  flow  ratio  between  the  silicon-bearing  gas  (generally 
SiH4)  and  the  oxygen-bearing  one  (generally  N2O). 

SIPOS  thin  films  are  widely  used  as  passivation  layers 
and  as  resistive  field  shield  in  high  voltage  Si  devices, as 
emitters  in  heterojunction  bipolar  transistors^’"^  and  for  solar 
cell  applications.^’^  More  recently,  Er-doped  SIPOS  films 
have  been  proposed  for  optoelectronic  application.^  Because 
the  physical  properties  of  SIPOS  films  are  strictly  related  to 
their  oxygen  content,  a  large  number  of  papers  have  been 
published  in  the  last  few  years  dealing  with  the  chemical  and 
structural  characterization  of  these  films.  It  is  generally  rec¬ 
ognized  that  as-deposited  SIPOS  films  are  amorphous  mix¬ 
tures  of  silicon  and  oxygen,  while  SIPOS  films  annealed  at 
temperatures  of  800-900  °C  or  higher  are  multiphasic  mate¬ 
rials,  in  which  silicon  nanocrystals  are  dispersed  in  an  amor¬ 
phous  matrix.^“^^ 

The  microstructure  of  the  amorphous  phase  can  be  de¬ 
scribed  by  means  of  two  different  models.  The  first  one  is  the 
random  bonding  model (RBM);  according  to  this  model 
SIPOS  is  formed  by  five  different  tetrahedral  units,  namely 
Si-Si4, Si-Si3O, Si-Si2O2, Si-SiO3 and Si-O4, corresponding
to  the  five  possible  oxidation  states  of  silicon  (from  0  to  +4). 
At each composition the tetrahedra distribution is determined
by statistical laws, without the influence of any chemical
driving force. So, for example, a SIPOS film having an oxygen
content of 50 at. % (a “silicon monoxide” film) should
be characterized by a tetrahedra distribution consisting of
6.25% of Si-Si4, 25% of Si-Si3O, 37.5% of Si-Si2O2, 25%
of Si-SiO3 and 6.25% of Si-O4. Obviously, this symmetric
distribution mirrors the fact that in a SiO_1.00 film each Si
atom has the same probability of being bonded to a Si or to
an O atom.
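
The percentages quoted for the 50 at. % case follow from a binomial distribution over the four bonds of each Si atom. The sketch below reproduces them; it assumes that each bond independently goes to O with probability x/2 for a SiO_x film (four bonds per Si, two per O), and the function name `rbm_distribution` is introduced here only for illustration.

```python
from math import comb


def rbm_distribution(oxygen_atomic_fraction):
    """Fractions of Si-Si(4-n)O(n) tetrahedra, n = 0..4, in the RBM.

    Assumption: each of the four Si bonds independently goes to an O atom
    with probability p = x/2, where x is the O/Si ratio of SiO_x.
    """
    x = oxygen_atomic_fraction / (1.0 - oxygen_atomic_fraction)
    p = x / 2.0
    return [comb(4, n) * p**n * (1.0 - p) ** (4 - n) for n in range(5)]


# 50 at. % oxygen, i.e. x = 1 and p = 0.5
for n, fraction in enumerate(rbm_distribution(0.50)):
    print(f"{n} O neighbours: {100 * fraction:.2f}%")
```

For 50 at. % oxygen this gives 6.25%, 25%, 37.5%, 25% and 6.25%, matching the distribution given in the text; the same function can be evaluated at 10 or 35 at. % to obtain the RBM prediction for the films studied here.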

The  second  model,  the  random  mixture  model  (RMM), 
neglects  the  existence  of  the  intermediate  Si  oxidation  states, 
suggesting that all the possible SIPOS compositions are simply
obtained by a suitable combination of Si-Si4 and
Si-O4 tetrahedra. According to this model, a SiO_1.00 film
should be composed half of Si-Si4 tetrahedra and half of
Si-O4 tetrahedra.
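
For comparison, the RMM prediction can be written down in the same way. This is a minimal sketch under the assumption that all oxygen resides in Si-O4 (SiO2-like) units, so that the fraction of Si atoms in such units is x/2; the function name `rmm_distribution` is hypothetical.

```python
def rmm_distribution(oxygen_atomic_fraction):
    """Fractions of Si-Si4 and Si-O4 tetrahedra in the random mixture model.

    Assumption: SiO_x is a mixture of Si and SiO2, so a fraction x/2 of the
    Si atoms sits in Si-O4 tetrahedra and the rest in Si-Si4 tetrahedra.
    """
    x = oxygen_atomic_fraction / (1.0 - oxygen_atomic_fraction)
    si_o4 = x / 2.0
    return {"Si-Si4": 1.0 - si_o4, "Si-O4": si_o4}


print(rmm_distribution(0.50))  # half Si-Si4, half Si-O4
```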

In  this  work  we  have  investigated  by  means  of  the  x-ray 
photoelectron  spectroscopy  (XPS)  technique  the  chemical 
structure  of  two  different  SIPOS  films  prepared  by  low  pres¬ 
sure  chemical  vapor  deposition,  having  nominal  oxygen  con¬ 
tents  of  10  and  35  at.  %.  These  chemical  structures  have 
been  compared  with  those  predicted  by  the  two  above  dis¬ 
cussed  models.  We  have  also  investigated  the  effect  of  high 
temperature annealing processes (up to 1000 °C) on the
chemical  structure  of  SIPOS. 

II.  EXPERIMENT 

SIPOS  films  were  deposited  on  (100)  silicon  substrates  by 
using  the  LPCVD  technique  at  a  temperature  of  620  °C.  The 
gaseous precursors were SiH4 and N2O; the N2O/SiH4 flow
ratio  was  set  to  values  of  0.40  and  0.12  in  order  to  obtain 
SIPOS  films  having  nominal  oxygen  contents  of  35  and  10 
at. %, respectively. The deposition rates were about 20
Å/min for the 35 at. % O SIPOS film and about 40 Å/min for
the 10 at. % O film. The 35 at. % O films were about 3700 Å
thick, while the 10 at. % O films were about 6000 Å thick.
The bulk composition and the thickness of the films were
determined by Rutherford backscattering spectrometry
(RBS) of 1.7 MeV ⁴He ions. Electron diffraction analysis of
the SIPOS films was made on plan view transmission electron
microscopy samples, prepared by mechanical thinning
and backside chemical etching. The analysis was made in a
JEOL 2010 transmission electron microscope (TEM), operating
at 200 kV accelerating voltage. SIPOS films were annealed
in a rapid thermal annealing apparatus, at temperatures
ranging from 700 to 1000 °C, in nitrogen atmosphere.
HF-etched samples were prepared by dipping small fragments
of the wafers for 15 s in a diluted HF solution (5% in
H2O) immediately before the introduction in the XPS
vacuum chamber.

XPS  analysis  was  performed  using  a  Kratos  ES  300  elec¬ 
tron  spectrometer,  operating  at  a  base  pressure  of  1X10~^ 
Torr. All spectra were obtained by using unmonochromatized
Al Kα radiation. Angle-resolved XPS (ARXPS) measurements
were performed at two different take-off angles
(measured with respect to the sample normal), i.e., 10°,
roughly corresponding to the maximum sampling depth, and
60°, corresponding to a sampling depth reduced by approximately
a factor of two.
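
The factor of two between the two geometries follows from the cosine dependence of the sampling depth on the take-off angle. The sketch below uses the common rule of thumb d ≈ 3λ cos θ (about 95% of the signal), which is an assumption introduced here rather than a formula given in the text; the inelastic mean free path value is likewise only illustrative.

```python
from math import cos, radians


def sampling_depth(imfp_angstrom, takeoff_deg):
    """Approximate XPS sampling depth, d = 3 * lambda * cos(theta).

    theta is measured from the sample normal; the factor 3 (about 95% of
    the detected signal) is a rule-of-thumb assumption.
    """
    return 3.0 * imfp_angstrom * cos(radians(takeoff_deg))


IMFP = 30.0  # Å, illustrative value only
ratio = sampling_depth(IMFP, 10.0) / sampling_depth(IMFP, 60.0)
print(f"depth(10°) / depth(60°) = {ratio:.2f}")
```

The ratio is independent of the mean free path chosen and comes out close to two, in line with the statement above.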

The energy scale of the XPS spectra was calibrated by setting
the binding energy (B.E.) of the C 1s
peak due to sample surface contamination to 285.0 eV; under this assumption,
the binding energy of the elemental Si component
in the Si 2p peaks was 99.5 ± 0.15 eV for all samples. The
Si 2p XPS peaks were analyzed by means of a computer
program for Gaussian peak fitting, using an inelastic Shirley-type
background. The results of the peak fitting procedure
were  used  to  evaluate  the  oxygen  content  of  the  films,  by 
considering  the  concentration  of  each  component  and  the 
related oxygen content. The oxygen content was also evaluated
from the XPS (Si 2p)/(O 1s) ratio, by introducing the
appropriate  corrections  for  the  photoionization  cross 
sections^^  and  the  experimentally  determined  transmission 
function  of  the  spectrometer.  The  latter  method  generally 
gave  higher  oxygen  contents  (+10-20%)  with  respect  to  the 
former,  indicating  the  presence  of  excess  oxygen  on  the 
SIPOS surface, probably as C-O bonds in contamination layers,
detectable also as a shoulder in the C 1s peaks. Consequently,
the oxygen contents reported in the article are in all
cases  those  obtained  by  the  fitting  procedure  of  the  Si  2p 
peaks. 
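
The conversion from fitted Si 2p component areas to an oxygen content can be sketched as follows. The sketch assumes that every component has the same photoemission sensitivity and that each O atom is shared between two Si atoms; the function name `oxygen_at_percent` is introduced here for illustration and is not the authors' program.

```python
def oxygen_at_percent(component_areas):
    """Oxygen content (at. %) from the five Si 2p component areas.

    component_areas[n] is the area of the component for Si bonded to n
    oxygen atoms (n = 0..4). Assumption: each O is shared by two Si atoms,
    so a Si with n oxygen neighbours contributes n/2 oxygen atoms.
    """
    total = sum(component_areas)
    x = sum(n * a for n, a in enumerate(component_areas)) / (2.0 * total)
    return 100.0 * x / (1.0 + x)


# RBM distribution for SiO_1.00 should give back 50 at. % oxygen
print(oxygen_at_percent([6.25, 25.0, 37.5, 25.0, 6.25]))
```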

III.  RESULTS  AND  DISCUSSION 
A. Characterization of the as-deposited SIPOS films

In  order  to  determine  the  average  oxygen  content  of  our 
films we have used the RBS technique. Figure 1 shows a
typical  RBS  spectrum  for  a  SIPOS  film  having  a  nominal 
oxygen  content  of  35  at.  %  (solid  line).  In  order  to  better 
distinguish  the  backscattering  yields  of  O  and  Si  in  SIPOS, 

Fig. 1. Experimental (solid line) and background subtracted (open circles)
RBS spectra for a SIPOS film having a nominal oxygen content of 35 at. %,
compared with the simulation of a RBS spectrum for a Si-O mixture containing
35 at. % of oxygen (dashed line).


the  sample  was  oriented  with  the  (100)  axis  of  the  Si  sub¬ 
strate  aligned  with  the  He  beam  (channelling  conditions), 
thus  reducing  the  backscattering  yield  due  to  the  substrate. 
The  figure  shows  also  the  spectrum  after  the  subtraction  of 
the  background  due  to  backscattering  from  the  substrate  in 
the  100-220  channel  region  (open  circles),  thus  leaving  only 
the signal due to the O of SIPOS. This background-subtracted
spectrum agrees quite well with a simulation for a
Si-O mixture containing 35 at. % of oxygen (dashed line),
even if a small O enrichment is observed in the surface region.
Similarly, the RBS analysis has demonstrated that the
SIPOS films having a nominal oxygen content of 10 at. %
do indeed contain this atomic concentration of oxygen.

According to the above described RBM model, the oxygen
atoms contained in SIPOS can exist in the form of
Si-Si3O, Si-Si2O2, Si-SiO3 and Si-O4 tetrahedra.^^ On the
other hand, the RMM model predicts only the existence of
Si-O4 tetrahedra.^^ In this work we have employed the XPS
technique  in  order  to  investigate  if  the  compositions  of  our 
SIPOS  films  agree  with  one  of  the  proposed  models. 

To  extract  detailed  chemical  information  from  the  XPS  Si 
2p  peaks  it  is  necessary  to  establish  a  suitable  peak  fitting 
procedure.  Our  procedure  was  based  on  the  use  of  five 
Gaussian-shaped peaks to simulate the five silicon oxidation
states; the full widths at half maximum (FWHM) used were
1.6  eV  for  the  Si-Si4  tetrahedra  and  1.8  eV  for  the  oxidized 
species.  We  have  found  that  for  all  our  SIPOS  samples  the 
binding  energies  of  the  elemental  Si  components  fall  in  the 
range  99.5  ±  0.15  eV,  in  good  agreement  with  literature^"^; 
the binding energies of the various Si-O species were obtained
from the empirical relationship ΔE = 2.2x (in eV), which correlates
the binding energy chemical shift (ΔE) of a SiO_x
species with respect to elemental Si with its oxygen content x.
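
The component positions used in the fit follow directly from this relationship. The sketch below assumes that a Si atom bonded to n oxygen atoms corresponds to x = n/2 (each O being shared by two Si atoms) and takes 99.5 eV for elemental Si; the function name is hypothetical.

```python
SI_ELEMENTAL_BE = 99.5  # eV, elemental Si 2p component (from the text)
SHIFT_PER_X = 2.2       # eV per unit x in SiO_x (from the text)


def component_binding_energy(n_oxygen):
    """Si 2p binding energy (eV) for Si bonded to n_oxygen O atoms.

    Assumption: a Si atom with n oxygen neighbours corresponds to
    x = n / 2 in SiO_x, since each O is shared between two Si atoms.
    """
    return SI_ELEMENTAL_BE + SHIFT_PER_X * (n_oxygen / 2.0)


for n in range(5):
    print(f"{n} O neighbours: {component_binding_energy(n):.1f} eV")
```

This gives 99.5, 100.6, 101.7, 102.8 and 103.9 eV for the five tetrahedra, from Si-Si4 to Si-O4.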

An example of this approach is shown in Fig. 2 for a
SIPOS  film  containing  35  at.  %  of  oxygen.  The  figure  shows 

Fig. 2. Background subtracted Si 2p XPS peak for a 35 at. % O SIPOS film,
illustrating the peak fitting procedure. The five components correspond to
the five Si oxidation states, i.e., to Si-Si4, Si-Si3O, Si-Si2O2, Si-SiO3 and
Si-O4 tetrahedra (in order of increasing binding energies).


that  all  the  five  different  Si  species  predicted  by  the  RBM 
model are present. The main species are Si-Si4 (99.5 eV
B.E.) and Si-O4 (103.9 eV B.E.), but relevant contributions
of Si-Si3O (100.6 eV B.E.) and Si-SiO3 (102.8 eV B.E.) are
present, while Si-Si2O2 (101.7 eV B.E.) is the only kind of
tetrahedron  having  an  almost  negligible  intensity.  The  quan¬ 
titative  analysis  of  the  peak  fitting  gives  an  oxygen  content 
of  about  57  at.  %,  a  value  much  greater  than  that  expected  on 
the basis of the RBS measurements (35 at. %). We attribute
this result to oxidation of the SIPOS surface
upon exposure to the atmosphere, similar to that
observed in crystalline silicon, where a thin (~10-15 Å)
SiO2 layer is formed. In order to characterize the oxidized
overlayer  of  SIPOS,  we  have  used  the  ARXPS  technique. 

Figure 3 shows the background subtracted XPS Si 2p
peaks obtained at two different take-off angles (10°, roughly
corresponding to the maximum sampling depth, which can be
estimated at about 90 Å on the basis of the values of the
inelastic mean free path in Si and SiO2, and 60°, corresponding
to a sampling depth reduced by about a factor of two) for both
the studied SIPOS compositions. In particular, Fig. 3a refers
to the 35 at. % O SIPOS film. The absence of any large angular
dependence indicates for this sample the presence of a partially
oxidized overlayer, whose thickness is comparable to the
maximum XPS sampling depth or larger.
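
The factor of two between the two geometries can be checked with a small sketch. It assumes (this is not stated explicitly in the text) that the take-off angle is measured from the surface normal and that the sampling depth is taken as 3λcosθ; the mean free path of 30 Å is a hypothetical value chosen only so that 3λ equals the quoted 90 Å.

```python
import math

# Sketch: XPS sampling depth versus take-off angle.
# Assumptions: the take-off angle is measured from the surface normal, and
# the sampling depth is 3 * lambda * cos(theta) (depth giving ~95% of signal).

IMFP_ANGSTROM = 30.0  # assumed inelastic mean free path, so that 3*lambda = 90 A

def sampling_depth(theta_deg, imfp=IMFP_ANGSTROM):
    """Sampling depth in angstrom for a take-off angle given in degrees."""
    return 3.0 * imfp * math.cos(math.radians(theta_deg))

depth_10 = sampling_depth(10.0)   # near-normal emission, largest depth
depth_60 = sampling_depth(60.0)   # grazing emission, surface sensitive
print(f"10 deg: {depth_10:.0f} A, 60 deg: {depth_60:.0f} A")
```

With these assumptions the 60° geometry probes about half the depth of the 10° geometry, consistent with the description above.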

A surface oxygen enrichment is also observed for the 10
at. % O SIPOS films, as shown in Fig. 3b. The peak fitting
procedure performed on the Si 2p peak obtained at a take-off
angle of 10° indicates a very high oxygen content (about 32
at. %) with respect to the value given by the RBS measurements
(10 at. %). Unlike the previous case, the ARXPS analysis
shows that the Si 2p peak obtained under conditions of
enhanced surface sensitivity (60°) exhibits a more intense
oxide component; the quantitative analysis of these ARXPS
data indicates the presence of a 15 Å thick overlayer, mainly
composed of Si-O4 and Si-SiO3 tetrahedra, on the 10 at. % O
SIPOS surface.

The  conclusion  is  therefore  that  both  the  SIPOS  surfaces 


Fig. 3. Background subtracted Si 2p XPS peaks for SIPOS films containing
(a) 35 at. % and (b) 10 at. % of oxygen, obtained at two different take-off
angles, 10° (maximum sampling depth) and 60° (enhanced surface sensitivity).


are not representative of the bulk of the films, due to
oxidation induced by exposure to the atmosphere. Furthermore,
these data indicate that the reactivity of SIPOS towards
atmospheric oxygen increases with the O concentration of the
film, contrary to expectation.

The surface oxidation of SIPOS is a general problem,
encountered by all researchers dealing with the XPS
analysis of this material. There are two possible ways
to remove the surface oxidized layer and expose a "true"
SIPOS surface to the XPS analysis: sputtering, and wet
chemical cleaning using HF solutions. We did not use
sputtering, because it is well known that a secondary effect
of ion bombardment is to induce chemical modifications
in the bombarded surface. It has been found that even the
very stable SiO2 surface can be affected by inert gas
bombardment, undergoing loss of oxygen. Therefore, we expect
greater effects for the less stable, partially oxidized
Si-SixO4-x tetrahedra contained in SIPOS, such as
rearrangements of the Si-O bonds and preferential sputtering of
oxygen. Other reported deleterious effects induced by
sputtering are reactions with carbonaceous overlayers, leading
to the formation of Si-C bonds, and oxygen uptake
due to the exposure of the very reactive sputtered surface to
residual H2O vapor during the pump-down subsequent to
the sputtering treatment.

Due to the problems discussed above, we have preferred
to use the HF wet etching method to remove the oxidized
layer from the surface of our films. This method is widely
used in the literature, even if some caution must be taken

Fig. 4. Background subtracted Si 2p XPS peaks for HF etched SIPOS films
containing (a) 35 at. % and (b) 10 at. % of oxygen, obtained at two different
take-off angles, 10° (maximum sampling depth) and 60° (enhanced surface
sensitivity).


with regard to the possibility of etching the SIPOS film
as well. To avoid this problem we have used very dilute HF
solutions (5% in H2O), characterized by very low etch rates,
and very short dipping times (15 s). Moreover, we do not
expect etching effects, since the O content of our
films is relatively low. Indeed, it has been reported that
SIPOS etching by concentrated HF solutions occurs only for
films having an oxygen content greater than 34 at. %. As a
further test, we have verified that, for both the studied SIPOS
compositions, the XPS spectra do not appreciably change
their shapes even if the HF dipping time is doubled (30 s).
The etched samples were introduced into the XPS analysis
chamber immediately after the HF dip (the total time
elapsed between the HF dip and introduction into the UHV
chamber is less than 1 min).

The background subtracted Si 2p XPS spectra obtained
after HF etching for both SIPOS samples are shown in Fig.
4. Figure 4a in particular shows the Si 2p XPS peaks for the
35 at. % O SIPOS film, obtained at take-off angles of 10°
and 60°. The spectrum obtained at a take-off angle of 10°
shows a marked decrease of the contribution of the oxidized
species with respect to the situation found for the unetched
film. The peak fitting analysis indicates an oxygen content of
37 at. %, in excellent agreement with the value obtained by
RBS (35 at. %). The absence of any significant dependence on
the take-off angle for the two peaks shown in the figure
demonstrates the complete elimination of the oxidized overlayer.

Figure  4b  shows  the  Si  2p  XPS  peaks  for  the  10  at.  %  O 
SIPOS  film,  obtained  at  take-off  angles  of  10°  and  60°,  after 


Table I. Experimental and calculated percent concentrations of the
Si-SixO4-x tetrahedra, for SIPOS films containing 10 and 37 at. % of
oxygen. The experimental data refer to the as-deposited, HF etched samples.

                        Percent concentrations
                    10 at. % O              37 at. % O
Tetrahedron     Expt.   RMM    RBM      Expt.   RMM    RBM
Si-Si4          87.7    94.5   79.7     52.0    70.7   24.9
Si-Si3O          6.7    ...    18.6     15.8    ...    41.4
Si-Si2O2         1.9    ...     1.6      4.1    ...    25.8
Si-SiO3          3.6    ...     0.1     20.5    ...     7.1
Si-O4            0.1     5.5   ...       7.6    29.3    0.8


the HF dip. The spectra show that in this case too the HF dip
produces a marked decrease of the oxidized species
concentration. The peak fitting procedure indicates an oxygen
content of 10 at. %, in agreement with the RBS data. In this
case too the Si 2p band shape does not depend on the take-off
angle, demonstrating the absence of oxidized overlayers.

The absence of angular dependence shown by the Si 2p
peaks, as well as the excellent agreement between the total
oxygen concentrations obtained by XPS and those obtained
by RBS, demonstrates that the HF etching method can
produce clean, homogeneous SIPOS surfaces,
representative of the bulk of the films. Therefore, in the rest
of the article the oxygen concentrations given by the XPS
analysis of the HF etched surfaces (10 and 37 at. %) will be
used to identify the two studied SIPOS films.
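
As a hedged illustration of how a total oxygen content follows from a fitted tetrahedra distribution, the sketch below converts the experimental percentages listed in Table I into an atomic percent of oxygen. It assumes that every O atom bridges two Si atoms, so that a tetrahedron with n oxygen neighbours carries n/2 O atoms per Si atom; this is a plausible reading of the quantification, not necessarily the exact procedure used for the fits, and the function name is hypothetical.

```python
# Sketch: total oxygen content from a tetrahedra distribution.
# Assumption: every O atom bridges two Si atoms, so a Si-Si(4-n)O(n)
# tetrahedron carries n/2 O atoms per Si atom.

def oxygen_atomic_percent(percent_by_n):
    """percent_by_n[n] = percent of tetrahedra with n O neighbours (n = 0..4)."""
    total = sum(percent_by_n)
    # O/Si atomic ratio x of the equivalent SiOx composition
    x = sum(n * p for n, p in enumerate(percent_by_n)) / (2.0 * total)
    return 100.0 * x / (1.0 + x)

# Experimental distributions of the HF etched films as listed in Table I
# (order: Si-Si4, Si-Si3O, Si-Si2O2, Si-SiO3, Si-O4).
film_low = [87.7, 6.7, 1.9, 3.6, 0.1]
film_high = [52.0, 15.8, 4.1, 20.5, 7.6]

o_low = oxygen_atomic_percent(film_low)
o_high = oxygen_atomic_percent(film_high)
print(f"{o_low:.1f} at. % O, {o_high:.1f} at. % O")
```

Under this assumption the two distributions give values close to the 10 and 37 at. % used to label the films.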

The oxygen distributions for the two HF etched SIPOS
films, as obtained by the peak fitting procedure of the respective
Si 2p peaks, are reported in columns 2 and 5 of Table I, in
terms of percent concentrations of the various Si-SixO4-x
tetrahedra. These data demonstrate that in as-deposited
SIPOS oxygen is present in the form of all the
Si-SixO4-x tetrahedra predicted by the RBM model; for both
films the main species is Si-Si4, but in the film containing
10 at. % O there is also a strong component due to the
Si-Si3O tetrahedra, whilst in the 37 at. % O film there is a large
contribution due to Si-SiO3. Therefore, the tetrahedra
distribution in SIPOS seems to depend on the total oxygen content
of the film. These differences in the tetrahedra concentrations
probably also determine the different reactivity toward
atmospheric oxygen shown by the XPS spectra of the two
unetched SIPOS films. Indeed, the high reactivity of SIPOS
toward oxygen is due to the presence of partially oxidized
tetrahedra; Table I shows that the 37 at. % O SIPOS contains
about 48% partially oxidized tetrahedra, and therefore is
more reactive than the 10 at. % O one, which contains only about
12% of this kind of tetrahedra.

B.  Comparison  with  the  RBM  and  RMM  models 

Our XPS data have demonstrated that SIPOS is composed
of all the tetrahedra predicted by the RBM model. This
means that the RMM model should be totally unsatisfactory
for describing the composition of our films because it does not
predict the existence of any partially oxidized tetrahedron.

Fig. 5. Comparison among the experimental Si 2p XPS peak, and those
built according to the RBM and RMM models, for SIPOS films containing
(a) 37 at. % and (b) 10 at. % of oxygen.

However, in order to compare our experimental data with
the two models more carefully, we have calculated
the distributions of the Si-SixO4-x tetrahedra predicted by
the two models for the two studied SIPOS compositions (10
and 37 at. % O). The results of the calculation are shown in
columns 3, 4, 6 and 7 of Table I. We have used these
theoretical distributions to build the corresponding Si 2p XPS
peaks, by applying the same B.E. and FWHM used to fit the
experimental spectra. Figure 5 shows the comparison between
these calculated spectra and the experimental ones.
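
The model distributions can be sketched as follows. The example assumes the usual textbook forms of the two models, which are not spelled out in the text: for the RBM, each of the four bonds of a Si atom is a Si-O bond with probability p = x/2, independently of the others (a binomial distribution); for the RMM, the film is a mixture of Si and SiO2 only. Function names are hypothetical.

```python
from math import comb

# Sketch of the two model distributions for a SiOx film.
# Assumptions (standard forms, not spelled out in the text):
#   RBM: each of the 4 bonds of a Si atom is Si-O with probability p = x/2,
#        independently, giving a binomial distribution over n = 0..4.
#   RMM: the film is a mixture of Si (Si-Si4) and SiO2 (Si-O4) only.

def o_to_si_ratio(oxygen_at_percent):
    """O/Si atomic ratio x for a given oxygen content in at. %."""
    c = oxygen_at_percent / 100.0
    return c / (1.0 - c)

def rbm_distribution(oxygen_at_percent):
    """Percent of Si atoms with n = 0..4 oxygen neighbours (random bonding)."""
    p = o_to_si_ratio(oxygen_at_percent) / 2.0
    return [100.0 * comb(4, n) * p**n * (1.0 - p)**(4 - n) for n in range(5)]

def rmm_distribution(oxygen_at_percent):
    """Percent of Si atoms in Si-Si4 and Si-O4 only (random mixture)."""
    p = o_to_si_ratio(oxygen_at_percent) / 2.0
    return [100.0 * (1.0 - p), 0.0, 0.0, 0.0, 100.0 * p]

for content in (10.0, 37.0):
    print(content,
          [round(v, 1) for v in rbm_distribution(content)],
          [round(v, 1) for v in rmm_distribution(content)])
```

With these assumptions the computed percentages should land within a few tenths of a percent of the RMM and RBM columns of Table I; small differences may come from rounding of the nominal oxygen contents.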

Figure 5a compares the two calculated Si 2p peaks with
the experimental one for the 37 at. % O SIPOS film. The
figure demonstrates that the experimental peak significantly
differs from those predicted by both models. In particular,
though the RMM model is clearly inadequate to describe the
37 at. % O SIPOS, which is characterized by a significant
content of intermediate Si-O species, the overall aspect of the
experimental spectrum is closer to this model than to the
RBM one. Indeed, the RBM spectrum is a wide, unresolved
band, peaked at the B.E. of the Si-Si3O
tetrahedra. On the other hand, the RMM spectrum exhibits a
double-peaked structure, like the experimental one, even if
the oxidized peak of the experimental spectrum has its
maximum at lower B.E. with respect to the calculated one.
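
The qualitative difference between the two model spectra can be reproduced with a minimal sketch that sums one Gaussian per tetrahedron. The Gaussian line shape and the common FWHM of 1.4 eV are assumptions made only for illustration (the fitted FWHM is not given in this section); the binding energies are the five values quoted earlier and the weights are the 37 at. % O model columns of Table I.

```python
import math

# Sketch: model Si 2p envelope as a sum of Gaussian components.
# Assumptions: Gaussian line shape, common FWHM of 1.4 eV (hypothetical).

BINDING_ENERGIES = [99.5, 100.6, 101.7, 102.8, 103.9]  # eV, from the text
FWHM = 1.4                                             # eV, assumed
SIGMA = FWHM / (2.0 * math.sqrt(2.0 * math.log(2.0)))

def envelope(energy, weights):
    """Intensity at a given binding energy for the given component weights."""
    return sum(w * math.exp(-0.5 * ((energy - be) / SIGMA) ** 2)
               for w, be in zip(weights, BINDING_ENERGIES))

def local_maxima(weights, start=97.0, stop=106.0, step=0.05):
    """Binding energies (eV) of the local maxima of the envelope on a grid."""
    n = int(round((stop - start) / step))
    grid = [start + i * step for i in range(n + 1)]
    values = [envelope(e, weights) for e in grid]
    return [round(grid[i], 2) for i in range(1, n)
            if values[i - 1] < values[i] >= values[i + 1]]

rbm_37 = [24.9, 41.4, 25.8, 7.1, 0.8]   # RBM column, 37 at. % O (Table I)
rmm_37 = [70.7, 0.0, 0.0, 0.0, 29.3]    # RMM column, 37 at. % O (Table I)

rbm_peaks = local_maxima(rbm_37)
rmm_peaks = local_maxima(rmm_37)
print("RBM maxima:", rbm_peaks, "RMM maxima:", rmm_peaks)
```

With this assumed width the RBM envelope has a single maximum near the Si-Si3O position, while the RMM envelope shows two separate maxima, in line with the description of Fig. 5a.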

Figure 5b compares the two calculated Si 2p peaks with
the experimental one for the 10 at. % O SIPOS film. In this
case too the figure shows marked differences between
experimental and calculated peaks. In particular, the
experimental peak clearly differs from the one predicted by the
RMM model, because the experimental data show a significant
content of partially oxidized Si-SixO4-x tetrahedra, whose
existence is neglected by this model; on the other hand it
also differs appreciably from the one predicted by the RBM
model, because, according to this model, almost all its
oxidized components should exist as Si-Si3O tetrahedra.

Therefore, the chemical structures of our SIPOS films do
not satisfactorily agree with those predicted by the RBM and
RMM models. The RMM model fails by neglecting the
existence of the intermediate Si-O species, which has been clearly
demonstrated by XPS, not only in SIPOS films, but also in
evaporated or sputter deposited substoichiometric SiOx
films, and at the interface between single crystal Si and
native SiO2. On the other hand, even if the approach of
the RBM model, which takes into account the existence of the
intermediate oxidation states of silicon, is more correct, the
deviation of the experimental spectra from this model is
large, demonstrating the failure of this purely statistical model.
This means that the formation of the various tetrahedra
during the film deposition cannot be treated as a purely statistical
problem, but chemical factors must be taken into account.
Even if, during deposition, the arrival of the O and Si atoms
on the SIPOS surface can be described in terms of statistical
laws, according to the RBM model, the subsequent step, i.e.,
the formation of the Si-O bonds, must clearly be influenced
by chemical factors. The relative weights of the chemistry
and the statistics in determining the film structure are
determined by the deposition rate coupled with the surface
diffusion rate of the reacting species. Indeed, it is known that the
Si-O bond, due to the greater electronegativity of O, has an
ionic character that determines a bond polarization which
can be described as Si(δ+)-O(δ-). For low deposition rates and
high surface diffusivities, an O atom arriving at the SIPOS
surface prefers to bind to a polarized Si atom, thus forming
Si-O4 tetrahedra, according to the RMM model. On the other
hand, for high deposition rates and low surface diffusivities,
the atoms have no time to be influenced by this factor, and
therefore form tetrahedra according to the statistical laws of
the RBM model, because they are quenched in their
positions by the fast deposition of new layers.

According to the above discussion, it is therefore not
surprising that our LPCVD SIPOS films contain fewer partially
oxidized Si-SixO4-x tetrahedra than predicted by the RBM
model. Indeed, in our case the deposition rate is sufficiently
low (20-40 Å/min) and the surface diffusion rate sufficiently
high (due to the high deposition temperature of 620 °C) to
allow a significant number of oxygen atoms to bond to already
polarized Si atoms, thus forming a higher number of
Si-SiO3 and Si-O4 tetrahedra than predicted by the RBM model.
In other cases reported in the literature, too, the shape of
the XPS spectra of LPCVD SIPOS films seems to confirm
this model, even if a detailed analysis of the Si 2p peaks in
terms of the five silicon oxidation states is generally missing.

The influence of the deposition rate in determining the
tetrahedra composition of SIPOS has already been
demonstrated on the basis of infrared data coupled with data about
stress and reactivity to HF. In that work, LPCVD SIPOS
films (typical deposition rates of 30 Å/min, roughly
corresponding to the deposition of a monolayer every 6 s) were
found to exhibit a marked SiO2 character with respect
to APCVD SIPOS films (typical deposition rates of 500
Å/min, corresponding to about 3 monolayers/s).
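
The monolayer timings quoted above can be reproduced with a short sketch. The monolayer thickness of 3 Å is an assumption, inferred from the statement that 30 Å/min corresponds to roughly one monolayer every 6 s; the function name is hypothetical.

```python
# Sketch: convert a deposition rate into monolayer timing.
# Assumption: monolayer thickness of 3 A, inferred from the statement that
# 30 A/min corresponds to roughly one monolayer every 6 s.

MONOLAYER_ANGSTROM = 3.0  # assumed

def monolayers_per_second(rate_angstrom_per_min):
    """Number of monolayers deposited per second at the given rate."""
    return rate_angstrom_per_min / 60.0 / MONOLAYER_ANGSTROM

lpcvd = monolayers_per_second(30.0)    # LPCVD rate quoted in the text
apcvd = monolayers_per_second(500.0)   # APCVD rate quoted in the text
seconds_per_monolayer_lpcvd = 1.0 / lpcvd

print(f"LPCVD: one monolayer every {seconds_per_monolayer_lpcvd:.0f} s")
print(f"APCVD: {apcvd:.1f} monolayers/s")
```

The more than tenfold difference in the time available per monolayer is what gives surface diffusion, and hence the chemical factor, room to act in the LPCVD case.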

This analysis can also be extended to the case of SiOx
films prepared by evaporation. It has been shown that the
chemical structure of these films is characterized by high
concentrations of partially oxidized Si-SixO4-x
tetrahedra. In this case the very high deposition rate
(about 1000 Å/min) and the low surface diffusion rate (due
to the low deposition temperature) minimize the influence of
the chemical factor, leading to the deposition of films having
compositions in good agreement with those predicted by the
RBM model.

C.  Effect  of  high  temperature  annealing 

The chemical characterization of high temperature
annealed SIPOS films must take into account the deep
structural rearrangements undergone by these films. Indeed, by
annealing amorphous SIPOS films at temperatures of about
800-900 °C, the nucleation of Si nanocrystals occurs.
We have analyzed by XPS the same SIPOS films containing
10 and 37 at. % of oxygen already characterized in the
as-deposited state, after annealing at temperatures ranging from
700 to 1000 °C. Because we have found that the surfaces
of the annealed films also contain an excess of oxygen,
in this case too we have used an HF dip to remove this oxidized
overlayer before the XPS analysis.

Figure 6 reports the composition, obtained by XPS, as a
function of the annealing temperature, for the 37 at. % O
SIPOS. In particular, Fig. 6a shows the atomic percent
composition as a function of the annealing temperature. Note that
the oxygen concentration is constant up to a temperature of
900 °C and decreases from 37 to 28 at. % at 1000 °C. Figure
6b reports the atomic percent concentrations of the different
Si species, derived from the peak fitting analysis of the Si 2p
XPS peaks, as a function of the annealing temperature. The
figure shows that the tetrahedra concentrations are strongly
dependent on temperature. In order to discuss this figure it is
useful to divide it into two parts: a first one, up to 900 °C,
corresponding to the temperature range in which the film
composition, according to Fig. 6a, is constant, and a second
part referring to the 1000 °C annealing. The figure shows
that up to 900 °C the concentrations of the Si-Si4 and Si-O4
tetrahedra increase, while all the other contributions
decrease. This means that the anneal promotes the
transformation of partially oxidized tetrahedra into those having extreme
compositions. As a result, the composition of SIPOS films
annealed up to a temperature of 900 °C departs further from
the RBM model, approaching the RMM model.

By extrapolating the SIPOS behavior at higher temperatures
from the data up to 900 °C, one should expect a further
increase of the Si-Si4 and Si-O4 concentrations, with a
corresponding decrease of the other species. Figure 6b instead
shows that at 1000 °C all the concentrations of the oxidized
tetrahedra decrease, while only the Si-Si4 contribution
largely increases.

Fig. 6. (a) Atomic percent composition, and (b) Si-SixO4-x tetrahedra
percent concentration as a function of the annealing temperature, for a SIPOS
film containing 37 at. % of oxygen.

This discontinuity must be related to the decrease of the
total oxygen concentration occurring at this temperature,
shown in Fig. 6a. In line with the low temperature trends,
we think that at 1000 °C the conversion of the Si-Si3O,
Si-Si2O2 and Si-SiO3 tetrahedra into Si-Si4 and Si-O4
becomes very efficient. This effect is not shown by the XPS
spectra, because, contrary to the as-deposited and low
temperature annealed SIPOS films, the films annealed at 1000
°C are not stable towards the HF dip used to remove the
surface oxidized layer, but undergo a selective etching of the
Si-O4 tetrahedra; this explains the decrease both in the total
amount of oxygen (see Fig. 6a) and in Si-O4. This
interpretation of the XPS data is also confirmed by RBS. Indeed, for
all the anneal temperatures, the SIPOS compositions
measured by RBS remain constant at the 35 at. % value,
demonstrating that the loss of oxygen occurring at 1000 °C is not a
bulk phenomenon due to the anneal, but an artifact due to the
HF dip. Therefore, the stability of SIPOS films with respect
to HF etching seems to depend not only on the total
oxygen content of the film, but also on the kind of
tetrahedra composing the film.

All these findings are also confirmed by the behavior of
the 10 at. % O SIPOS. Figure 7a shows the atomic percent
composition of these films, derived by the XPS analysis, as a
function of the annealing temperature. In this case too we
observe two distinct regions: a first one, up to 800 °C, in
which the SIPOS composition remains almost constant, and

Fig. 7. (a) Atomic percent composition, and (b) Si-SixO4-x tetrahedra
percent concentration as a function of the annealing temperature, for a SIPOS
film containing 10 at. % of oxygen.


a second one, at higher temperatures, in which the oxygen
content decreases (4 at. % at 900 °C, 3 at. % at 1000 °C).
Figure 7b reports the atomic percent concentrations of the
different tetrahedra, derived from the peak fitting analysis of
the Si 2p XPS peaks, as a function of the annealing
temperature. The figure shows that in this case too the various
contributions depend on the temperature. In particular, up to
800 °C a slow decrease of the main oxidized species
concentrations is observed, while at 900 °C all the contributions of
the oxidized tetrahedra drop to 0, or become only slightly
greater than 0. Correspondingly, the contribution of the
Si-Si4 tetrahedron increases up to a value of about 94%.

Therefore, even if the low oxygen content makes a clear
evaluation of the phenomena occurring in the low
temperature range (620-800 °C) difficult, by applying the same
considerations reported above for the 37 at. % O SIPOS, we can
conclude that for the 10 at. % O SIPOS films too the effect
of the annealing mainly consists of a conversion of the
partially oxidized tetrahedra into Si-Si4 and Si-O4; this effect
becomes significant at 900 °C, and at this temperature the high
concentration of Si-O4 tetrahedra formed by the annealing
leads to a material that is unstable towards the HF dip we
used to remove the surface oxidized layer, so that a
preferential etching of SiO2 occurs. In this case too these
conclusions are supported by RBS data, showing that the bulk
oxygen content of the film does not change due to the anneal.


Fig. 8. Electron diffraction patterns for a 10 at. % O SIPOS film after
annealing at (a) 800 and (b) 900 °C, and for a 37 at. % O SIPOS film after
annealing at (c) 900 and (d) 1000 °C.


The observation that the 10 at. % O SIPOS can also be
affected by the HF dip is particularly remarkable. Indeed,
according to literature data, the 37 at. % O SIPOS films are
in the boundary region between stability and instability
towards HF etching. Therefore, we would expect the 10
at. % O samples to be totally stable, due to the very low
oxygen content. The observed instability of these films
allows us to affirm that the main factor determining the
stability of SIPOS towards HF etching is not the total
oxygen content, but the way in which this oxygen is bonded
to the Si atoms.

A last point to be discussed is the different temperatures at
which the prominent formation of SiO2 starts (and
consequently also the preferential etching due to HF) for the two
studied SIPOS compositions. We have already shown that
the 37 at. % O SIPOS is stable up to 900 °C, while the 10
at. % O one is stable up to 800 °C. It is remarkable that the
temperatures at which the instability sets in (1000 and 900 °C,
respectively) are exactly those at which the crystallization
of these materials starts, as shown in Fig. 8. This figure
shows the transmission electron diffraction patterns obtained
for the 10 at. % O SIPOS after annealing at 800 and 900 °C
(a, b), and for the 37 at. % O after annealing at 900 and
1000 °C (c, d). Note that the 10 at. % O sample is
amorphous at 800 °C, as demonstrated by the presence of broad
and diffuse rings in the diffraction pattern; at 900 °C the
rings become well defined structures, indicating the presence
of the nanocrystalline Si phase. Similarly, for the 37 at. % O
sample, the electron diffraction patterns demonstrate the
presence of an amorphous phase up to 900 °C, while the
crystallization starts at 1000 °C.

Therefore, the XPS analysis has allowed us to
demonstrate a correlation between structural changes (formation of
Si nanocrystals) and chemical transformations (formation of
Si-Si4 and Si-O4 tetrahedra) induced by high temperature
annealing in SIPOS. In particular, the nucleation of silicon
nanocrystals occurring at high temperatures can be related to
the formation of Si-Si4 tetrahedra, due to the decomposition
of the partially oxidized Si tetrahedra shown in Figs. 6 and 7.
These nanocrystals are surrounded by an amorphous phase
that is more oxygen-rich than that present in the
as-deposited film, since the former also contains the Si-O4
tetrahedra formed by the temperature-induced
decomposition of the partially oxidized tetrahedra. This conversion of
the Si-Si3O, Si-Si2O2 and Si-SiO3 tetrahedra into Si-Si4 and
Si-O4 shifts the compositions of the annealed films towards
those predicted by the RMM model.

IV.  CONCLUSIONS 

We have described in this article the results of the XPS
analysis of LPCVD SIPOS films containing 10 and 37 at. %
of oxygen. We have found that the as-deposited samples are
mixtures of all the possible Si-SixO4-x tetrahedra predicted
by the RBM model and that each SIPOS composition
exhibits a characteristic tetrahedra distribution. However, these
distributions do not satisfactorily agree with those predicted by
this model, because the detected concentrations of partially
oxidized Si-SixO4-x tetrahedra are too low.

We have explained this deviation from the RBM model in
terms of deposition rate and surface diffusivity. In general,
a high deposition rate and a low surface diffusivity make the
statistical factor prevail, and therefore lead to the
formation of films characterized by high concentrations of
intermediate Si oxidation states (RBM model), while a low
deposition rate and a high surface diffusivity allow the deposition
of films with a greater SiO2 character (RMM model). Our
LPCVD films are characterized by a sufficiently low
deposition rate and by a sufficiently high surface diffusivity to have
a composition remarkably different from the one predicted
by the RBM model.

We have also observed that the SIPOS films containing 37
at. % of oxygen are more reactive towards atmospheric
oxygen than the films with 10 at. % of oxygen, since they
tend to oxidize to quite large depths
(larger than 90 Å). We have explained this effect by the
higher concentration of partially oxidized tetrahedra
observed in the 37 at. % O SIPOS. This oxidized layer can be
selectively removed from the surface of the as-deposited
samples by using dilute HF solutions without any
modification of the SIPOS film.

We have also demonstrated that the nucleation of Si
nanocrystals induced in SIPOS by a high temperature anneal
is related to changes in the tetrahedra distribution.
Indeed, the effect of the anneal mainly consists of the
decomposition of partially oxidized tetrahedra into Si-Si4 and
Si-O4 tetrahedra. The Si-Si4 tetrahedra form nanocrystalline
Si grains, while the Si-O4 tetrahedra enrich the oxygen
concentration of the amorphous phase. We have shown that a
high concentration of Si-O4 tetrahedra makes annealed
SIPOS films unstable towards the HF solution used to
prepare a clean surface before XPS measurements. The high
temperature induced formation of Si-Si4 and Si-O4 tetrahedra
makes the compositions of the two studied SIPOS films depart
still further from those predicted by the RBM model.

Design specifications for plasma immersion ion implantation systems are discussed. An analytical
model  of  the  plasma  sheath  evolution  is  used  to  deduce  restrictions  for  the  dimension  of  the  vacuum 
chamber  and  the  pressure  range.  The  avoidance  of  arcing  implies  an  upper  limit  for  the  maximum 
electric  field  in  the  sheath  and  hence  for  the  plasma  density.  The  total  fluence  per  pulse,  which 
determines  the  implantation  duration  and  the  power  requirement,  is  calculated.  Furthermore,  safety 
considerations,  especially  the  shielding  of  x-rays  generated  by  secondary  electrons,  are  discussed. 

I. INTRODUCTION

Ion implantation is a versatile conventional method for
injecting atoms into a substrate, thereby changing the
composition and the properties of the surface. It is widely used in
semiconductor device manufacturing. Additionally,
implantation into metals, ceramics and polymers is an emerging
technology by which new surface alloys are created to improve
the tribological properties. However, conventional ion
implantation has its limitations: low beam currents lead to long
implantation times; beam scanning and target manipulation
are necessary for getting uniform implantation profiles in
non-flat items; and ion sources are rather complex.

Plasma immersion ion implantation (PIII) has been
developed as a technique without these limitations to allow for
cost-effective implantation. By applying negative high
voltage pulses to the substrate, which is immersed in a plasma,
the positive ions are extracted from the plasma and
accelerated to the surface. PIII processes have been demonstrated
for metallurgical and semiconductor applications.
Recently, cost estimates for commercial PIII were published.

To reach widespread attention and industrial significance, design parameters
for PIII systems must be specified. In this article, specifications for the
vacuum system, different possibilities of plasma generation, high voltage pulse
generation, and safety aspects in regard to commercial applications are
discussed. In Sec. II, an analytical model is used for approximating the
evolution of the positively charged ion sheath and the ion flux during the high
voltage pulse. The range for different system parameters and subsequent design
restrictions is explored, and recommendations for PIII system specifications
are given in Sec. III. Finally, a summary is given in Sec. IV.

II.  ANALYTICAL  MODEL 

A high voltage pulse of amplitude $-V_p$ and duration $t_p$ is applied to a
target in a homogeneous plasma of density $n_0$. The square pulse is repeated
after an off time $t_\mathrm{off}$, corresponding to a repetition frequency
$f_r = 1/(t_p + t_\mathrm{off})$ and a duty cycle $R = t_p/(t_p + t_\mathrm{off})$.
For sufficiently low pressure the mean free path $\lambda$ is larger than the
sheath width. Hence collisions in the plasma sheath can be neglected. The rise
time of the pulse is assumed to be shorter than the inverse ion plasma
frequency $1/\omega_{pi}$ and long compared to the inverse electron plasma
frequency $1/\omega_{pe}$. Therefore the electrons can be considered
inertialess and neglected. An exact calculation should be performed in
spherical coordinates. However, for our present purposes a planar calculation
is sufficient.
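
The timing quantities and the window assumed for the pulse rise time can be checked numerically. The following is a minimal sketch rather than part of the original model: the function names are illustrative, the plasma frequency uses the standard expression $\omega_p = (n_0 e^2/\epsilon_0 m)^{1/2}$, and a singly charged N$_2^+$ ion (28 u) is assumed.

```python
import math

EPS0 = 8.854e-12        # free-space permittivity (F/m)
E_CHARGE = 1.602e-19    # elementary charge (C)
M_ELECTRON = 9.109e-31  # electron mass (kg)
AMU = 1.6605e-27        # atomic mass unit (kg)


def repetition_frequency(t_p, t_off):
    """Repetition frequency f_r = 1 / (t_p + t_off) in Hz."""
    return 1.0 / (t_p + t_off)


def duty_cycle(t_p, t_off):
    """Duty cycle R = t_p / (t_p + t_off)."""
    return t_p / (t_p + t_off)


def plasma_frequency(n0, mass_kg):
    """Angular plasma frequency (rad/s) of a singly charged species of given mass."""
    return math.sqrt(n0 * E_CHARGE ** 2 / (EPS0 * mass_kg))


if __name__ == "__main__":
    n0 = 1e16  # plasma density (m^-3), illustrative value
    w_pi = plasma_frequency(n0, 28 * AMU)
    w_pe = plasma_frequency(n0, M_ELECTRON)
    # The rise time should lie between 1/w_pe and 1/w_pi for the model to apply.
    print(f"1/w_pe = {1 / w_pe:.2e} s, 1/w_pi = {1 / w_pi:.2e} s")
    print(f"f_r = {repetition_frequency(100e-6, 900e-6):.0f} Hz, "
          f"R = {duty_cycle(100e-6, 900e-6):.2f}")
```

For a density of $10^{16}$ m$^{-3}$ this places the admissible rise time between a fraction of a nanosecond and a few tens of nanoseconds.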


A.  Sheath  evolution 

Initially, for $t < 1/\omega_{pi}$, a positively charged ion sheath
forms. Its width $x_0$ can be calculated from Poisson's equation
to be


(1) 


where $\epsilon_0$ is the free-space permittivity and $e$ is the ion charge.
After the formation of the initial sheath, a quasistatic Child-Langmuir sheath
is formed on a timescale of $1/\omega_{pi}$. The ions with the mass $M$ are
accelerated in the presheath to the Bohm (ion sound) speed
$u_B = (kT_e/M)^{1/2}$, where $T_e$ is the electron temperature and $k$ the
Boltzmann constant. For this acceleration, a potential drop of $kT_e/2e$ in the
presheath has to occur, corresponding to a decrease in the ion density to
$n_s = n_0 \exp(-1/2) \approx 0.6\,n_0$. The total ion current density $j_c$ is
supplied by the ion drift current and by the uncovering of ions due to the
moving sheath edge,

$$j_c = 0.6\,e n_0 \left( u_B + \frac{dx}{dt} \right), \qquad (2)$$

which across a sheath of the thickness $x$ is equal to the Child-Langmuir
current density

$$j_c = \frac{4}{9}\,\epsilon_0 \sqrt{\frac{2e}{M}}\,\frac{V_p^{3/2}}{x^2}. \qquad (3)$$

Therefore, the evolution of the sheath edge is characterized
by the following equation:

$$\frac{dx}{dt} = \frac{4\epsilon_0 V_p^{3/2}}{0.6 \times 9\,x^2 n_0} \sqrt{\frac{2}{eM}} - u_B. \qquad (4)$$

The solution of this differential equation yields the sheath evolution for a
square pulse and different plasma densities, which is shown in Fig. 1. The
sheath expands monotonically during the pulse, and the maximum sheath width
$x_\mathrm{max}$, corresponding to a stationary sheath edge at $t \to \infty$,
is

$$x_\mathrm{max} = \left( \frac{4\epsilon_0 V_p^{3/2}}{0.6 \times 9\,n_0 u_B} \sqrt{\frac{2}{eM}} \right)^{1/2}. \qquad (5)$$

The sheath width can reach several meters for low plasma densities of
$10^{14}$ m$^{-3}$ and voltages higher than 35 kV. However, for a density of
about $10^{16}$ m$^{-3}$ the sheath can be as small as 15 cm at 35 kV. It can
expand for 200 $\mu$s and longer in the former case, whereas the stationary
width is reached in less than 50 $\mu$s in the latter case.

Fig. 1. Sheath edge evolution in planar geometry for different values of
plasma parameters: N$_2$ plasma, $T_e = 2$ eV, $V_p = 10$, 35, 100 kV; panels
(a) and (b) correspond to two different plasma densities $n_0$.
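
The sheath equation can be explored numerically. The sketch below is illustrative only: it writes the equation of motion as $dx/dt = a/x^2 - u_B$ with $a = 4\epsilon_0 V_p^{3/2}(2/eM)^{1/2}/(0.6 \times 9\,n_0)$, integrates it with a classical fourth-order Runge-Kutta scheme, and assumes singly charged N$_2^+$ ions with $T_e = 2$ eV. The starting width is taken as the standard ion-matrix value $(2\epsilon_0 V_p/e n_0)^{1/2}$, which is an assumption about the initial condition, and all function names are hypothetical.

```python
import math

EPS0 = 8.854e-12      # free-space permittivity (F/m)
E_CHARGE = 1.602e-19  # elementary charge (C)
AMU = 1.6605e-27      # atomic mass unit (kg)


def bohm_speed(te_ev, mass_kg):
    """Bohm speed u_B = sqrt(k T_e / M); T_e is given in eV."""
    return math.sqrt(te_ev * E_CHARGE / mass_kg)


def child_langmuir_coeff(v_p, n0, mass_kg):
    """Coefficient a in dx/dt = a / x**2 - u_B (units m^3/s)."""
    return (4.0 * EPS0 * v_p ** 1.5 / (0.6 * 9.0 * n0)
            * math.sqrt(2.0 / (E_CHARGE * mass_kg)))


def max_sheath_width(v_p, n0, te_ev=2.0, mass_kg=28 * AMU):
    """Stationary sheath width, obtained from dx/dt = 0."""
    return math.sqrt(child_langmuir_coeff(v_p, n0, mass_kg)
                     / bohm_speed(te_ev, mass_kg))


def sheath_width(t_end, v_p, n0, te_ev=2.0, mass_kg=28 * AMU, steps=20000):
    """Sheath position x(t_end) from RK4 integration of the sheath equation."""
    a = child_langmuir_coeff(v_p, n0, mass_kg)
    u_b = bohm_speed(te_ev, mass_kg)
    rate = lambda x: a / x ** 2 - u_b
    # Assumed initial condition: standard ion-matrix sheath width.
    x = math.sqrt(2.0 * EPS0 * v_p / (E_CHARGE * n0))
    h = t_end / steps
    for _ in range(steps):
        k1 = rate(x)
        k2 = rate(x + 0.5 * h * k1)
        k3 = rate(x + 0.5 * h * k2)
        k4 = rate(x + h * k3)
        x += h * (k1 + 2.0 * k2 + 2.0 * k3 + k4) / 6.0
    return x


if __name__ == "__main__":
    for n0 in (1e14, 1e16):
        print(f"n0 = {n0:.0e} m^-3: x_max = {max_sheath_width(35e3, n0):.3f} m, "
              f"x(50 us) = {sheath_width(50e-6, 35e3, n0):.3f} m")
```

With these inputs the stationary width at 35 kV comes out at roughly 0.16 m for $10^{16}$ m$^{-3}$ and roughly 1.6 m for $10^{14}$ m$^{-3}$, in line with the values quoted in the text.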


B.  Ion  fluence 

The ion fluence density $F$ per pulse of length $t_p$ is composed of the ions
uncovered by the receding sheath edge and the ions crossing the sheath edge
with the Bohm speed:

$$F = 0.6\,n_0 \left[ x(t_p) + t_p \sqrt{\frac{kT_e}{M}} \right]. \qquad (6)$$

The ions impinging from the Child-Langmuir sheath are implanted with the
maximum energy $W_\mathrm{max} = eV_p$, whereas the ions from the initial
matrix sheath as given by Equation (1) arrive at the surface of the target with
an energy continuously distributed between 0 and $W_\mathrm{max}$.
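
The fluence per pulse can be estimated with a few lines of code. This is a hedged sketch, not the authors' implementation: it evaluates $F = 0.6\,n_0[x(t_p) + u_B t_p]$, obtains $x(t_p)$ by Runge-Kutta integration of $dx/dt = a/x^2 - u_B$, and assumes N$_2^+$ ions, $T_e = 2$ eV, and the standard ion-matrix width as the starting point. The function name and default arguments are illustrative.

```python
import math

EPS0 = 8.854e-12      # free-space permittivity (F/m)
E_CHARGE = 1.602e-19  # elementary charge (C)
AMU = 1.6605e-27      # atomic mass unit (kg)


def fluence_per_pulse(t_p, v_p, n0, te_ev=2.0, mass_kg=28 * AMU, steps=20000):
    """Ion fluence per pulse (ions/m^2): F = 0.6 n0 [x(t_p) + u_B t_p]."""
    u_b = math.sqrt(te_ev * E_CHARGE / mass_kg)            # Bohm speed (m/s)
    a = (4.0 * EPS0 * v_p ** 1.5 / (0.6 * 9.0 * n0)
         * math.sqrt(2.0 / (E_CHARGE * mass_kg)))          # Child-Langmuir term
    rate = lambda x: a / x ** 2 - u_b                      # dx/dt
    x = math.sqrt(2.0 * EPS0 * v_p / (E_CHARGE * n0))      # assumed initial width
    h = t_p / steps
    for _ in range(steps):                                 # classical RK4 steps
        k1 = rate(x)
        k2 = rate(x + 0.5 * h * k1)
        k3 = rate(x + 0.5 * h * k2)
        k4 = rate(x + h * k3)
        x += h * (k1 + 2.0 * k2 + 2.0 * k3 + k4) / 6.0
    return 0.6 * n0 * (x + u_b * t_p)


if __name__ == "__main__":
    for v_p in (10e3, 35e3, 100e3):
        f = fluence_per_pulse(100e-6, v_p, 1e14)
        print(f"V_p = {v_p / 1e3:.0f} kV: F = {f:.2e} ions/m^2 per 100 us pulse")
```

For a 100 $\mu$s pulse at 10 kV in a $10^{14}$ m$^{-3}$ plasma the sketch gives a fluence on the order of $5 \times 10^{13}$ ions per m$^2$.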

III.  DESIGN  CONSIDERATIONS 

For PIII applications, the control of the energy of the implanted ions is
desired, i.e., monoenergetic ions should reach the target during the
application of the pulse. Therefore the collision rate during the transit
through the sheath should be negligible, and the fraction of ions implanted
from the initial matrix sheath should be small compared to the total fluence.

A.  Vacuum  equipment  and  plasma  generation 

The pressure range that can be used by commercial PIII is limited by two
factors. The first concerns the base pressure: in batch processing, the time
needed for pumping down the system to the base pressure must be comparable to
the total implantation time, and the working pressure in the plasma source must
be higher than the base pressure by some orders of magnitude to reduce the
implantation of residual impurity ions during PIII.

The second factor is the requirement of a collisionless flight through the
sheath, which is only possible if the mean free path is larger than the sheath
width. It has been shown that collisions are important for sheath widths
greater than 0.5 times the mean free path $\lambda$, which is related to the
pressure $p$ by the expression $p\lambda = \mathrm{constant}$. The constant is
$6.1 \times 10^{-3}$ Pa m for nitrogen, and of the same order of magnitude for
other gases. For small targets, the planar approximation is no longer valid and
the sheath widths should be calculated in spherical coordinates, leading to a
decrease by a factor of 2-4 for pulses longer than some tens of microseconds.
In PIII applications, the lower limit of the mean free path for short pulses
and long pulses is the initial sheath width $x_0$ and the steady state sheath
width $x_\mathrm{max}$, respectively. This condition provides an upper limit
for the working pressure.
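
The resulting pressure limit is simple to evaluate. The sketch below is illustrative: it takes the nitrogen constant as $6.1 \times 10^{-3}$ Pa m, applies the criterion that the sheath width must stay below 0.5 times the mean free path, and uses a hypothetical function name.

```python
P_LAMBDA_N2 = 6.1e-3  # p * lambda for nitrogen (Pa m), value assumed here


def max_working_pressure(sheath_width_m, p_lambda=P_LAMBDA_N2, margin=2.0):
    """Upper pressure limit (Pa) so that lambda >= margin * sheath width."""
    required_mean_free_path = margin * sheath_width_m
    return p_lambda / required_mean_free_path


if __name__ == "__main__":
    for width in (0.02, 0.15, 1.0):  # sheath widths in m, illustrative values
        print(f"x = {width:5.2f} m -> p < {max_working_pressure(width):.1e} Pa")
```

A sheath width of 1 m then calls for a working pressure of about $3 \times 10^{-3}$ Pa or less, while a 15 cm sheath tolerates about $2 \times 10^{-2}$ Pa.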

In Fig. 2(a) the initial ($x_0$) and steady state ($x_\mathrm{max}$) sheath
widths calculated from Equations (1) and (5) are plotted as functions of ion
density for three different pulse voltages $V_p$. It can be seen that the
sheath widths increase for higher voltages and during the voltage pulse. A
lower limit for the sheath width, corresponding to an upper limit for the
plasma density, is given by the maximum electrical field in the initial sheath
for which no dielectric breakdown with subsequent arcing can occur. For
pressures lower than 1 Pa, this field strength is approximately 10 kV/mm,
depending, among other things, on the specific geometry, i.e., field
enhancement at edges or protrusions from the workpiece, and on surface
properties. This limiting field strength occurs at the beginning of the pulse
because the field decreases during the extension of the sheath. For an initial
matrix sheath the maximum field $E_\mathrm{max}$ is calculated to be


The widths and densities for an electric field of 10 kV/mm are indicated with
crosses for the different voltages $V_p$. This limitation can be partly
alleviated with longer rise times of the pulse or smooth workpieces. The
minimum sheath width, on the order of some millimeters to decimeters depending
on the voltage, is also a lower limit for the radius of holes and trenches
which can be homogeneously implanted. The implantation of a smaller opening is
accompanied by the displacement of the plasma out of it, leading to a
simultaneous decrease of the flux into it.

Fig. 2. (a) Sheath width depending on ion density in the initial and steady
state case [see Equations (1) and (5)] for three different pulse voltages (10,
35, and 100 kV). The crosses indicate the sheath widths where arcing can
occur [Equation (7)]. (b) Parameter range of ion density and gas pressure of
different plasma sources (dc: filament discharge; RF: radio-frequency
discharge; ECR: electron cyclotron resonance; and MW: microwave source).

At the same time, the dimension of the plasma chamber must be larger than the
maximum sheath width $x(t_p)$ for the applied pulse length $t_p$ and the size
of the workpiece taken together. Otherwise the implantation of ions would be
limited by the depletion and the subsequent extinction of the plasma. On the
other hand, a plasma chamber that is too large wastes electrical power in
plasma generation, so the vacuum chamber should be as small as possible. When
implanting several workpieces in one run, care must be taken that the distance
between the targets is clearly larger than twice the maximum sheath width. A
treatment chamber with a diameter of several meters implies a mean free path of
the same order of magnitude. This results in an operating pressure of the
plasma source of $10^{-3}$ Pa or less.

In Fig. 2(b) typical pressure and ion density ranges for different plasma
sources are shown. A suitable plasma source for a given application is found by
the following procedure. Proceeding from the required magnitude of the high
voltage, the possible ranges for the sheath extension and plasma density are
determined in Fig. 2(a). Then a plasma source can be selected from Fig. 2(b)
using the constraint of a sheath extension smaller than the mean free path
$\lambda$. It can be seen that microwave and dc discharges cannot be used for
applications requiring an implantation energy of more than 40 keV. Using hot
filaments can also result in additional contamination of the plasma and thereby
of the implanted workpiece.

Fig. 3. Ion fluence as a function of pulse length calculated from Equation (6)
for different plasma parameters: N$_2$ plasma, $T_e = 2$ eV, $V_p = 10$, 35,
100 kV; panels (a) and (b) correspond to two different plasma densities $n_0$.
The arrows indicate the ion flux from the initial matrix sheath with a
continuously distributed energy.

B.  High  voltage  equipment  and  pulse  generation 

Square pulse generators are widely used and available, so that the model
presented in Section II has practical relevance. However, it is only valid for
short rise times. The energy distribution deduced from this model presents a
reasonable approximation for the fraction of ions that are implanted with the
energy $eV_p$ for pulse rise times up to a few microseconds. A model for finite
pulse rise times on the order of or larger than $1/\omega_{pi}$ is available.
In that case, the fraction of ions with an energy lower than $eV_p$ is
increased, so that the model used in this paper yields an upper limit for the
fraction of ions impinging with the maximum energy.

In Fig. 3 the ion fluence depending on the pulse length is plotted for the same
plasma parameters as used in Fig. 1. In the beginning, the ions uncovered by
the moving sheath edge and the ions traversing the sheath contribute to the
flux. For longer pulses, the sheath becomes stationary and the flux is due to
the drift current, so that the fluence increases linearly with time. It can be
seen that the fraction of ions with a kinetic energy below $eV_p$ (the arrows
in Fig. 3) decreases with increasing ion density, corresponding to a decreasing
initial sheath width $x_0$. For an ion density of $10^{16}$ m$^{-3}$, pulse
lengths of 30 $\mu$s are sufficient to reduce the ion fraction from the initial
sheath below 10% of the total fluence. When the density decreases by a factor
of 100, the length must be 200 $\mu$s or greater to achieve the same effect.

The total current, which must be supplied by the pulse generator at the
constant voltage $V_p$, is composed of the following parts: the ion current
impinging on the target [Equation (3)] times the area of the workpiece; the
displacement current due to the capacitance of the system; and the electron
current generated at the surface by the ions with a secondary electron
coefficient of 5 to 10. This current can reach tens of amperes for large
workpieces and determines the dimensions of the high voltage feed. The time
integrated current and the implantation voltage yield the energy which must be
stored and recharged during the pulses.

The  depleted  plasma  near  the  workpiece  is  subsequently 
refilled  between  the  voltage  pulses  by  ambipolar  diffusion  of 
ions  and  electrons  from  regions  of  higher  density.  The  time 
evolution  of  the  density  can  be  calculated  to  be^^ 


with  a  time  constant 


(8) 


_  ^  TJ  kT, 

^  x{tp)TX2TTM 

is  the  density  at  the  end  of  the  pulse  of  width  tp .  For 
sheath widths of 0.5 m or larger, the time required to fill in the depleted
region up to 95% of the undisturbed plasma density, $t_{95\%} = 3/\tau$, can be
0.5 ms or longer. At high repetition rates the initial density of the ion
matrix at the beginning of each voltage pulse will decrease with each pulse. An
equilibrium will be reached in which the sheath width during the pulse will be
larger than predicted in Section II A, corresponding to a lower ion flux per
pulse and eventually the occurrence of collisions during the flight through the
sheath. Therefore, the repetition rate should be of the order of 1 kHz or less.

In a pulse with a duration of 100 $\mu$s, the fluence density varies between
$5 \times 10^{13}$ and $10^{16}$ ions per m$^2$, for plasma densities between
$10^{14}$ and $10^{16}$ m$^{-3}$ and voltages between 10 and 100 kV. An
implantation dose of $10^{21}$ m$^{-2}$ is reached after $10^5$ to
$2 \times 10^7$ pulses, corresponding to 100 s to 5 1/2 h at a repetition rate
of 1 kHz. The energy deposited by the implanted ions is 100 mJ-200 J per pulse
and per m$^2$ for voltages between 10 and 100 kV. Due to the higher current at
the beginning of a pulse, average power densities of 500 W/cm$^2$ can be
reached in short pulses. For commercial applications, a substrate area of
1 m$^2$ is reasonable. Therefore, the average power that must be dissipated by
cooling (unless implantation at elevated temperatures is desired) can reach a
few kilowatts.
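
The arithmetic behind these process times can be reproduced directly. The sketch below is illustrative and rests on assumed inputs: a target dose of $10^{21}$ m$^{-2}$, fluences per pulse of $5 \times 10^{13}$ and $10^{16}$ ions per m$^2$, a repetition rate of 1 kHz, and singly charged ions. The function names are hypothetical.

```python
E_CHARGE = 1.602e-19  # elementary charge (C)


def pulses_for_dose(dose, fluence_per_pulse):
    """Number of pulses needed to accumulate the dose (both in ions/m^2)."""
    return dose / fluence_per_pulse


def implantation_time(dose, fluence_per_pulse, rep_rate_hz):
    """Total implantation time in seconds at the given repetition rate."""
    return pulses_for_dose(dose, fluence_per_pulse) / rep_rate_hz


def energy_per_pulse(fluence_per_pulse, v_p):
    """Energy deposited per pulse and per m^2 (J/m^2) by singly charged ions."""
    return fluence_per_pulse * E_CHARGE * v_p


if __name__ == "__main__":
    dose = 1e21  # ions/m^2, assumed target dose
    for fluence, v_p in ((5e13, 10e3), (1e16, 100e3)):
        t = implantation_time(dose, fluence, 1e3)
        print(f"F = {fluence:.0e} m^-2: {pulses_for_dose(dose, fluence):.1e} pulses, "
              f"{t:.0f} s ({t / 3600:.2f} h), "
              f"{energy_per_pulse(fluence, v_p):.2f} J/m^2 per pulse")
```

With these inputs the two cases give 100 s and about 5.6 h, and about 0.08 J and 160 J per pulse and m$^2$, which brackets the ranges stated above.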


The increase in the target temperature during a pulse can be estimated.
Typical ion ranges are on the order of 100 nm. However, the ion energy during a
pulse of length $t_p$ is dissipated over a conduction length


(10) 


where $\kappa$, $\lambda$, $\rho$, and $c_p$ are, respectively, the thermal
diffusivity, the thermal conductivity, the density, and the specific heat
capacity at constant pressure. This equation yields conduction lengths of some
micrometers for pulse lengths of some tens of microseconds. Therefore, the
heating of the workpiece can be approximated by a semi-infinite plane with a
heat flux $F_0$ impinging on the surface during the time $t_p$. The solution of
the thermal conduction equation, neglecting loss due to radiation or
conduction, yields the temperature evolution:

$$T(x,t) - T(x,0) = \frac{2F_0}{\lambda} \left[ \sqrt{\frac{\kappa t}{\pi}} \exp\left( -\frac{x^2}{4\kappa t} \right) - \frac{x}{2}\,\mathrm{erfc}\left( \frac{x}{2\sqrt{\kappa t}} \right) \right], \qquad (12)$$

with erfc denoting the complementary error function. A flux
$F_0 = 500$ W/cm$^2$ with a pulse length $t_p = 100$ $\mu$s yields a
temperature increase at the surface of 5.4 K per pulse for a 304 stainless
steel target [$\kappa = 0.081$ cm$^2$/s, $\lambda = 0.30$ W/(cm K) (Ref. 23)],
which is not enough for partially melting the target during a pulse.
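
The temperature rise per pulse can be reproduced with the standard constant-flux solution for a semi-infinite solid, which reduces to $\Delta T = (2F_0/\lambda)(\kappa t/\pi)^{1/2}$ at the surface. The sketch below assumes that this is the form used for the estimate; units follow the text (W/cm$^2$, W/(cm K), cm$^2$/s), and the function names are illustrative.

```python
import math


def temperature_rise(x_cm, t_s, flux, conductivity, diffusivity):
    """Temperature rise (K) at depth x after time t under a constant surface flux.

    flux in W/cm^2, conductivity in W/(cm K), diffusivity in cm^2/s.
    """
    diffusion = math.sqrt(diffusivity * t_s)  # sqrt(kappa * t) in cm
    gaussian = math.sqrt(diffusivity * t_s / math.pi) * math.exp(
        -x_cm ** 2 / (4.0 * diffusivity * t_s))
    tail = 0.5 * x_cm * math.erfc(x_cm / (2.0 * diffusion))
    return 2.0 * flux / conductivity * (gaussian - tail)


def surface_temperature_rise(t_s, flux, conductivity, diffusivity):
    """Temperature rise at the surface (x = 0)."""
    return temperature_rise(0.0, t_s, flux, conductivity, diffusivity)


if __name__ == "__main__":
    # 304 stainless steel values as quoted in the text
    dt = surface_temperature_rise(100e-6, 500.0, 0.30, 0.081)
    print(f"Surface temperature rise per pulse: {dt:.2f} K")
```

For the stainless steel values above the sketch gives about 5.35 K, matching the quoted 5.4 K per pulse.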

For determining the equilibrium temperature during an implantation, the total
heat balance must be obtained, including all fluxes to and from the workpiece:

$$\rho c_p V \frac{dT}{dt} = \dot{Q} = A_h F_h - A_r F_r - \dot{Q}_c, \qquad (13)$$

where $V$ is the target volume, $A_h$ the heated area, $F_h$ the incident heat
flux (including ion flux times duty cycle $F_0 \times R$, plasma heating,
radiation from hot filaments, microwave heating, and x-ray absorption, where
applicable), $A_r$ the radiative area, $F_r$ the radiative flux, and
$\dot{Q}_c$ the loss due to conduction. The radiative heat flux is given by the
Stefan-Boltzmann law:

$$F_r = \epsilon \sigma \left( T^4 - T_c^4 \right), \qquad (14)$$

with $\epsilon$ the emissivity, $\sigma$ the Stefan-Boltzmann constant, and
$T_c$ the chamber temperature. Assuming an ion energy flux
$F_0 = 500$ W/cm$^2$ and a duty cycle $R = 10^{-3}$, which yields an average
flux of 0.5 W/cm$^2$, and an additional heating from the plasma source of the
same size, an incoming flux $F_h = 1$ W/cm$^2$ is reasonable. With an
emissivity $\epsilon = 0.4$, a radiative area twice the size of the heated
area, i.e., both sides of the implanted sample, and a chamber at room
temperature, the equilibrium temperature of the target is 405 °C in the worst
case of no thermal contact between the workpiece and the holder. This
temperature can be decreased by good thermal contact between the sample and the
mounting, and by additional cooling. In the limiting case of an ideal contact
to a heat sink at 20 °C, the temperature at the surface of a 10 cm thick
stainless steel target will be 53 °C.
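
The ideal-contact limit can be checked with steady one-dimensional conduction through the target, $T_\mathrm{surface} = T_\mathrm{sink} + F_h L/\lambda$. This is a minimal sketch under stated assumptions: an absorbed flux of 1 W/cm$^2$, the stainless steel conductivity of 0.30 W/(cm K) used for the single-pulse estimate, and no radiative loss. The function name is illustrative.

```python
def surface_temperature_ideal_contact(flux, thickness_cm, conductivity, t_sink_c):
    """Steady-state surface temperature (deg C) of a slab on an ideal heat sink.

    flux in W/cm^2, thickness in cm, conductivity in W/(cm K).
    """
    temperature_drop = flux * thickness_cm / conductivity  # K across the slab
    return t_sink_c + temperature_drop


if __name__ == "__main__":
    t_surf = surface_temperature_ideal_contact(1.0, 10.0, 0.30, 20.0)
    print(f"Surface temperature with ideal heat sinking: {t_surf:.1f} deg C")
```

For a 10 cm thick target this gives a temperature drop of about 33 K across the slab, i.e., a surface temperature of about 53 °C.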

For high voltage (75 kV and higher) and a given size of the workpiece, the
repetition frequency, the pulse length $t_p$, or both must be reduced.
Otherwise the power consumption and the cooling system become prohibitively
large and expensive. The pulse length, and with it the sheath extension, is
correlated with the size of the vacuum chamber, and the pulse length is also
correlated with the energy distribution of the implanted ions. A compromise
must be found between
a  small  system  and  a  monoenergetic  implantation.  Neverthe¬ 
less,  implantation  times  of  20  min  for  doses  of  10^^  m“^  with 
100  keV  for  areas  larger  than  0.1  m^ — ^three  8-in.  wafers — 
should  be  possible. 

Up to now, it was implicitly assumed that the applied voltage is present at the
surface of the target, corresponding to a very low specific resistivity of the
substrate. However, it is possible that insulating substrates, or conducting
substrates on which an insulating layer is deposited during implantation, are
used. A model was developed by Emmert for this case. The applied high voltage
drops across the insulator and the plasma sheath, depending on the respective
impedances. During the pulse, the target becomes more positively charged due to
the ion current, amplified by the emission of secondary electrons. Hence, the
average ion energy will decrease during the pulse. Between the pulses, the
positive surface attracts electrons, so that charge neutrality on the surface
is rapidly restored. The energy distribution is broadened and shifted to lower
energies, depending on the pulse length and the resistivity of the substrate.
Care must be taken for inhomogeneous substrates and surfaces where
insulator-metal junctions are exposed to the plasma. Due to the high electric
field, arcing is possible and must be avoided.

C.  Safety  considerations 

In a PIII system, there are three inherent sources of hazards that can lead to
serious injuries or death. All possible precautions must be taken to protect
the operators of commercial systems.

First, there is the high voltage generator that supplies the current for the
pulse generator. As shown above, the average current can be as high as 1 A at
100 kV with a peak current of up to 50 A. It is imperative that all high
voltage components of the system are screened to prevent accidental contact.

A second health hazard is the intensive bremsstrahlung. It is caused by
secondary electrons emitted from the substrate, which are accelerated through
the plasma to the walls of the vacuum chamber. Therefore, all windows of the
chamber should be made of lead glass. For voltages higher than 50 kV,
additional lead shielding is necessary, because the steel walls normally used
for vacuum chambers become partially transparent for x rays at this energy.

Fig. 4. Wall thickness necessary to reduce the radiation dose at the outer
surface of the chamber to 7 $\mu$Sv/h for Al, Fe, and Pb coatings, as a
function of the applied voltage (kV). The rising absorption above the Pb K edge
is responsible for the slight increase in the thickness for Pb shielding
between 90 and 150 kV.

Using the following assumptions, the thickness of the shielding can be
estimated: secondary electrons with a kinetic energy $eV_p$ are generated with
an average power density of 5 W/cm$^2$, corresponding to an ion flux
$F_0 = 500$ W/cm$^2$, a duty cycle $R = 10^{-3}$, and a secondary electron
coefficient of 10. For a distribution of the electrons to all chamber walls and
an isotropic x-ray emission, the power density and hence the dose will be
smaller at the surface of the chamber and will decrease with the distance from
it. The generated bremsstrahlung spectrum $I_\gamma(E)$ can be approximated by
the following formula:

$$I_\gamma(E) \propto I_e Z\,(eV_p - E), \qquad (15)$$

where $I_e$ is the secondary electron current and $Z$ the atomic number of the
decelerating material. This spectral distribution of x rays is attenuated
during the transit through the shielding, and the maximum intensity is shifted
to higher energies due to the stronger absorption for lower energies. The
shielding thickness in Fig. 4 was calculated as a function of the applied high
voltage to yield a radiation dose of 7 $\mu$Sv/h, which is, for instance, below
the maximum level of 7.5 $\mu$Sv/h allowed in Germany for systems that do not
require special authorization. As can be seen, a reduction of the intensity by
a factor of 10 will only reduce the required shielding by 33%, whereas a
decrease of the applied voltage by 50% will reduce the thickness by a factor of
8. Therefore, the thickness is mostly determined by the applied voltage and not
by the consumed power. Above the Pb K edge at an energy of 87.9 keV, the
rapidly rising absorption coefficient results in a shielding thickness nearly
independent of the voltage up to 150 kV.

The third source of danger is the microwave source that can be used as a plasma
generator in PIII systems. Microwaves endanger health, especially at the power
densities used in plasma sources. All nonmetallic parts of the system should
therefore be covered with a metallic mesh to absorb the emitted radiation.
IV. SUMMARY

PIII is a method for fast and cost-effective implantation of large nonplanar
areas. Essential requirements for monoenergetic implantations are a
collisionless transit through the positively charged ion sheath and a square
voltage pulse. A mean free path larger than twice the sheath width limits the
available pressure range in the vacuum system. To avoid arcing, the ion density
must be below $4 \times 10^{16}$ m$^{-3}$ for implantation energies of 100 keV.
At the same energy, a dose of
10^^  m“^  in  areas  of  0,1  m^  can  be  reached  in  20  min.  The 
repetition rate, and therefore the duration of one implantation run, is
restricted by the effectiveness of the cooling system. Additional lead
shielding is necessary for ion energies above 50 keV to screen the x rays
emitted by the generated secondary electrons.
X-ray photoelectron spectroscopy study on native oxidation

The native oxidation of As-implanted Si surfaces used in actual ultralarge scale integrated processes
is investigated. Quantitative analysis of oxidation is made possible by x-ray photoelectron
spectroscopy spectral decomposition of Si 2p into Si$^{0+}$, Si$^{x+}$, and Si$^{4+}$, and by calculation of SiO$_2$
and SiO$_x$ thicknesses using the decomposition results. Here, the sensitivity is such that less than 1
A  change  is  detectable.  The  native  oxidation  of  As-implanted  (IXlO^^-lXlO^^/cm^,  at  25  kV) 
Si(100) is compared to that of Si(100) without implantation. The results show that the oxidation rate
of As-implanted Si is higher than that of Si without implantation, that the native oxide on
As-implanted Si includes more suboxide than that on Si without implantation, and that As is
oxidized in deeper regions than Si. These results indicate that As implantation changes the Si native
oxidation mechanism itself. We propose an oxidation model of As-implanted Si to explain our
observations, and discuss the validity of this model. © 1996 American Vacuum Society.


I.  INTRODUCTION 

Native oxidation on Si surfaces causes problems in ultralarge scale integrated
(ULSI) processes and properties, including the metal/Si ohmic contact,
low-temperature epitaxial growth of silicide, and dielectric breakdown of thin
SiO$_2$. HF treatment is a very effective way to suppress the native oxidation
of Si; it removes surface oxide, makes the Si surface flat, and forms a stable
surface with hydrogen atoms terminating dangling bonds. Therefore, HF-treated
Si surfaces are widely investigated by x-ray photoelectron spectroscopy (XPS),
Fourier-transform infrared spectroscopy (FTIR), and other analytical methods.
However, most of these studies are directed toward Si surfaces that are ideally
flat, ideally crystallized, and without implanted impurities. There have been
far fewer investigations on realistic surfaces formed in ULSI fabrication.
These include ion-implanted Si surfaces and polycrystalline Si surfaces.

The present work investigates the native oxidation of
Si(100) surfaces having implanted As ions
(iXlO^^-lXlO^^/cm^)  in  anticipation  of  a  future  necessity 
for controlling native oxidation during semiconductor device fabrication. The
oxide thickness is quantitatively estimated by XPS with a sub-1 Å sensitivity.
Comparing the oxidation of As-implanted Si with that of Si without implantation
demonstrates that the implanted As atoms change the oxidation mechanism of Si.
This article proposes a model of the native oxidation mechanism of As-implanted
Si and discusses the validity of this model.

II.  EXPERIMENT 

Table I shows the sample preparation parameters. As$^+$ ions were implanted
into p-type 4° off Si(100) wafers at 25 keV (iXlO^^-lXlO^^/cm^). These wafers
were annealed at 850 °C for 30 min and cut into small pieces for convenience in
XPS measurement. They were dipped in 5 mol % HF solution for 10 min, blown dry
in N$_2$ gas, exposed to a clean-room atmosphere (40% relative humidity) for 0
to 48 h, and analyzed by XPS. For comparison, a Si wafer without implantation
was prepared in the same way. Ar$^+$-sputtered clean Si and thermally oxidized
Si were also analyzed as references.

The photoelectron spectra were measured with the Shimadzu-Kratos XSAM800
system, using a Mg K$\alpha$ x ray (12 kV, 10 mA) as the x-ray source. The
energy resolution of the measured spectrum was 0.96 eV, which was determined
as the FWHM (full width at half-maximum) of the Ag 3d$_{5/2}$
peak.  The  base  pressure  in  the  analysis  chamber  was  below 
7X10“*  Pa. 

For quantitative analysis, measured Si 2p spectra were decomposed into
Si$^{0+}$ (Si 2p from Si), Si$^{x+}$ ($x$ = 1, 2, and 3; Si 2p from Si bonding
with $x$ oxygen atoms and $4-x$ silicon atoms), and Si$^{4+}$ (Si 2p from
SiO$_2$). Then, the thickness of silicon dioxide ($d_o$) and that of silicon
suboxide ($d_x$) are calculated. The decomposition process and the calculation
are described briefly.

The decomposition process is shown in Fig. 1. We subtracted Si$^{0+}$ and
Si$^{4+}$ from the Si 2p spectra and the remainder was treated as the summation
of all Si$^{x+}$ components. For subtraction of Si$^{4+}$, the binding energy
of Si$^{4+}$ [$E_b$(Si$^{4+}$)] was calculated under the assumption of a
constant energy difference between the binding energies of O 1s
[$E_b$(O 1s)] and $E_b$(Si$^{4+}$):

$$E_b(\mathrm{Si}^{4+}) = E_b(\mathrm{O\,1s}) - \Delta E_b(\mathrm{ref}), \qquad (1)$$

$$\Delta E_b(\mathrm{ref}) = 429.25\ \mathrm{eV}. \qquad (2)$$
Here, $\Delta E_b$(ref) denotes the energy difference between $E_b$(O 1s) and
$E_b$(Si$^{4+}$) and is experimentally determined from an XPS spectrum of a
reference sample having a 100-nm-thick thermal oxide. The FWHMs and peak shapes
of Si$^{0+}$ and Si$^{4+}$ were also experimentally determined from XPS spectra
of reference samples, clean Si for Si$^{0+}$ and 100-nm-thick thermal Si oxide
for Si$^{4+}$. The FWHMs are 1.24 and 1.64 eV, respectively.

Table I. Sample preparation parameters.

Sample group        Wafer                      As implantation          Annealing         Treatment                           Exposure time in clean-room air (h)
Measured samples    Si(100), p type                                                                                           0-96
Measured samples    Si(100), p type, 4° off    25 kV, IXlO'^/cm^        850 °C, 30 min    10 min dipping in 5% HF solution    5
                                               25 kV, 2×10^15/cm^2                                                            0-24
                                               25 kV, IXlO'^/cm^                                                              5
Reference samples   Si(100), p type            Ar sputtering for Si$^{0+}$; thermal oxidation for Si$^{4+}$
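
The peak-position rule of Eqs. (1) and (2) amounts to a fixed offset from the measured O 1s line. The following sketch is illustrative only: the function name is hypothetical, and the O 1s binding energy used in the example is an assumed input, not a value reported in this work.

```python
DELTA_EB_REF = 429.25  # eV, reference O 1s to Si4+ energy difference, Eq. (2)


def si4_binding_energy(eb_o1s_ev, delta_ref_ev=DELTA_EB_REF):
    """Binding energy of the Si4+ component from the measured O 1s peak, Eq. (1)."""
    return eb_o1s_ev - delta_ref_ev


if __name__ == "__main__":
    eb_o1s = 532.90  # eV, hypothetical measured O 1s position
    print(f"E_b(Si4+) = {si4_binding_energy(eb_o1s):.2f} eV")
```

Tying the Si$^{4+}$ position to O 1s in this way removes one free parameter from the decomposition, since any rigid shift of the spectrum moves both peaks together.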

Layer thicknesses of a multilayered structure can be calculated
when the sensitivity, the escape depth, and the XPS
peak intensity for the typical element in each layer are
given. We assume the native oxide structure
SiO2/SiOx/Si. Then, the thickness of SiO2 (d_o) and the thickness of
SiOx (d_x) are given by Eqs. (3) and (4), using the Si^4+, Si^x+,
and Si^0+ peak intensities (I_o, I_x, and I_s) and the sensitivity
factors Q (sensitivity of Si^4+ relative to Si^0+) and Q' (Si^x+ relative to Si^0+),
which are defined in Eqs. (5) and (6):


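$$d_o = \lambda_{\mathrm{SiO_2}} \sin\theta \,\ln\!\left(\frac{1}{Q}\,\frac{I_o}{I_s + I_x/Q'} + 1\right), \tag{3}$$

$$d_x = \lambda_{\mathrm{SiO}_x} \sin\theta \,\ln\!\left(\frac{1}{Q'}\,\frac{I_x}{I_s} + 1\right), \tag{4}$$

$$Q = \frac{R_{\mathrm{SiO_2}}\, f_{\mathrm{SiO_2}}\, n_{\mathrm{SiO_2}}\, \sigma_{\mathrm{SiO_2}}\, \lambda_{\mathrm{SiO_2}}}{R_{\mathrm{Si}}\, f_{\mathrm{Si}}\, n_{\mathrm{Si}}\, \sigma_{\mathrm{Si}}\, \lambda_{\mathrm{Si}}}, \tag{5}$$

$$Q' = \frac{R_{\mathrm{SiO}_x}\, f_{\mathrm{SiO}_x}\, n_{\mathrm{SiO}_x}\, \sigma_{\mathrm{SiO}_x}\, \lambda_{\mathrm{SiO}_x}}{R_{\mathrm{Si}}\, f_{\mathrm{Si}}\, n_{\mathrm{Si}}\, \sigma_{\mathrm{Si}}\, \lambda_{\mathrm{Si}}}. \tag{6}$$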


Fig.  1.  Schematic  diagram  of  the  decomposition  process. 


Here, R is the surface roughness, f is the photoelectron
transmission function, n is the density of Si atoms, σ is the
atomic photoionization cross section, λ is the escape depth,
and θ is the photoelectron take-off angle. Q is determined to
be 0.67 based on XPS measurements of the reference
samples, clean Si and thermal Si oxide. Q' cannot be determined
experimentally because we have no standard samples
of SiOx. Therefore, we use a Q' of 0.84, which is interpolated
between the values for Si and SiO2. This may cause error in the calculated
results; however, the error in the total thickness is very
small because the amount of SiOx is relatively small. We
have also experimentally determined the escape depth λ of Si 2p
photoelectrons in SiO2 (2.7 nm) using the XPS results and a cross-sectional
TEM observation of thin thermal SiO2 on Si.
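The thickness calculation for a SiO2/SiOx/Si stack can be sketched with a standard exponential-attenuation model. The algebraic form below is an assumption derived from that model and is not a transcription of Eqs. (3) and (4); the suboxide escape depth is taken equal to the 2.7 nm (27 Å) value quoted for SiO2, and the 30° take-off angle and the layer thicknesses in the example are illustrative choices only. The forward model is included so that the inversion can be checked by a round trip.

```python
import math


def forward_intensities(d_o, d_x, lam_o, lam_x, theta_deg, Q, Q_prime):
    """Relative peak intensities (I_o, I_x, I_s) for a SiO2/SiOx/Si stack.

    The bulk Si signal is normalised to 1. Thicknesses and escape depths
    share one length unit (here angstrom).
    """
    s = math.sin(math.radians(theta_deg))
    a_o = math.exp(-d_o / (lam_o * s))   # attenuation through the SiO2 layer
    a_x = math.exp(-d_x / (lam_x * s))   # attenuation through the SiOx layer
    i_s = a_x * a_o                      # substrate signal passes both layers
    i_x = Q_prime * (1.0 - a_x) * a_o    # suboxide signal passes the SiO2 layer
    i_o = Q * (1.0 - a_o)                # dioxide signal from the top layer
    return i_o, i_x, i_s


def thicknesses(i_o, i_x, i_s, lam_o, lam_x, theta_deg, Q=0.67, Q_prime=0.84):
    """Invert the attenuation model: return (d_o, d_x)."""
    s = math.sin(math.radians(theta_deg))
    d_x = lam_x * s * math.log(i_x / (Q_prime * i_s) + 1.0)
    d_o = lam_o * s * math.log(i_o / (Q * (i_s + i_x / Q_prime)) + 1.0)
    return d_o, d_x


if __name__ == "__main__":
    # Illustrative stack: 3 A of SiO2 on 1 A of SiOx (assumed values).
    peaks = forward_intensities(3.0, 1.0, 27.0, 27.0, 30.0, 0.67, 0.84)
    print(thicknesses(*peaks, 27.0, 27.0, 30.0))
```

Because Q' enters only through the small SiOx term, changing it mainly moves thickness between the two layers, which is consistent with the remark that the total thickness is only weakly affected.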

III.  RESULTS 

Figures 2(a) and 2(b) show the native oxidation of Si(100)
without implantation and that of Si(100) with As
implantation, respectively. The open and filled circles denote
the total oxide thicknesses (SiO2+SiOx), and the open and
filled squares denote the SiOx thickness on the Si surface.

In Fig. 2(a), the total oxide thickness increases with the
exposure time in the clean-room atmosphere. It increases
slowly up to 3 h, increases abruptly after 3 h, and saturates
after 10 h at an oxide thickness of about 4 Å. The SiOx thickness
increases slightly up to 10 h but decreases after that.
This decrease shows that some of the SiOx turns into SiO2
with increasing exposure time. Based on the amount of SiOx,
SiOx is thought to be located only at the interface between Si
and SiO2. These results show layer-by-layer oxidation, a
well-known phenomenon. The saturation after 10 h indicates
an oxidation rate similar to that of the previous work,
and it can be explained with the structural model of SiO2/Si
by Herman et al. if Si atoms from two surface layers are
oxidized.

In Fig. 2(b), the total oxide thickness increases with the
exposure time. Here, there is no step of the kind that indicates
layer-by-layer oxidation. The SiOx thickness also increases with
the exposure time. This behavior is completely different from
that seen for Si(100) without implantation.

Figure 3 shows the dependence of the total oxide
thickness and the SiOx thickness on the As dose. The
samples had been exposed in a clean-room atmosphere for 5
h. With increasing As dose, in the range
1×10^??-1×10^??/cm^2, both thicknesses increase similarly.

J. Vac. Sci. Technol. B, Vol. 14, No. 4, Jul/Aug 1996

Yano et al.: XPS study on native oxidation process

Fig. 2. Native-oxide thickness on HF-treated Si as a function of exposure
time (hours). (a) The results for Si(100) without implantation. ○ SiO2+SiOx
thickness; □ SiOx thickness, showing layer-by-layer oxidation. (b) The results for
As-implanted Si(100) (2×10^15/cm^2). ● SiO2+SiOx thickness; ■ SiOx
thickness. This behavior is completely different from that seen in (a).

Summarizing these results, it becomes clear that the HF-
treated, As-implanted Si is oxidized faster than the Si without
implantation, and it does not exhibit layer-by-layer native
oxidation. Furthermore, the native oxide on the As-implanted
Si includes a thicker suboxide layer than the native oxide on
Si without implantation, and both the total oxide thickness
and the SiOx thickness increase as the As dose increases.


Fig. 3. Oxide thickness on Si(100) as a function of the As dose (/cm^2). The
As-implanted Si was dipped in a 5 mol % HF solution and exposed in a
clean-room atmosphere for 5 h. ● SiO2+SiOx thickness; ■ SiOx thickness.


Fig. 4. Models of the native oxidation process on Si without implantation
and on As-implanted Si (panel labels: oxidation mechanism, oxide structure).
In the case of Si without implantation, Si atoms are
oxidized layer by layer. The suboxide is located only at the interface
between Si and SiO2. In the case of As-implanted Si, the As atoms are
preferentially oxidized first, then the Si atoms near As are oxidized. Because
oxidation spreads from the oxidized As, more suboxide is created in the
oxide layer.

IV.  DISCUSSION 

A.  Model  of  the  native  oxidation  of  As-implanted
Si(100)

Figure 2(a) shows that the total oxide thickness and the
SiOx thickness of Si without implantation exposed for 5 h in
a clean-room atmosphere are 2.3 and 1.3 Å, respectively.
Comparing these thicknesses with Fig. 3, it becomes clear
that As implantation affects the native oxidation of Si when
the dose is 10^??/cm^2 or greater. It is well known that the
surface of ion-implanted Si is rougher than that of Si without
implantation. In addition, wafers misoriented 4° from
(100), which were used for the As-implanted samples, have
more steps than perfectly oriented (100) wafers. The roughness
and steps may accelerate oxidation; however, they are
not expected to change the mechanism of layer-by-layer
oxidation. The change of the native oxidation mechanism of Si
is caused by the high concentration of implanted As atoms.
To explain the oxidation of As-implanted Si surfaces, we
propose the model shown in Fig. 4.

In the case of Si without implantation, Si atoms are
oxidized layer by layer. Although the microscopic mechanism of
layer-by-layer oxidation is not firmly established, it is quite
natural to assume that the vertical oxide-growth rate is much
slower than the lateral oxide-growth rate in the layer-by-layer
oxidation process. As explained in Sec. III, SiOx is regarded
as being localized at the interface between Si and SiO2, as
depicted in the oxide structure shown in the upper right
corner of Fig. 4.

On the other hand, layer-by-layer oxidation is blurred in
the native oxidation of As-implanted Si. It is natural to
assume that this blurring results from a vertical
oxide-growth rate that is faster than the lateral oxide-growth rate. From
this point of view, we assumed a model of the native oxidation
of As-implanted Si in which implanted As atoms work
as oxidation initiators. This model includes the following
processes. First, As atoms are preferentially oxidized in Si.
After As oxidation, the Si atoms near the As atoms are also
easily oxidized, because Si bonds are known to be
easily oxidized after one of the bonds combines with an oxygen
atom. This makes the vertical oxide-growth rate faster
and blurs the layer-by-layer oxidation. As a consequence of
this model, more suboxide is produced around the oxidized
As atoms, contributing to the interfacial region between SiO2
and Si. The structure of this oxide is shown in the lower right
corner of Fig. 4, in which there is a layer of suboxide
between Si and SiO2.

Fig. 5. As 3d and Si 2p spectra of As-implanted Si (2×10^15/cm^2), dipped in
a 5 mol % HF solution and exposed in the clean-room atmosphere for 5 h.

B.  Examining  the  model 

In  order  to  verify  this  oxidation  model,  it  is  important  to 
show  that  the  As  atoms  are  preferentially  oxidized  in  As- 
implanted  Si.  For  this  purpose,  As  3d  and  Si  2p  XPS  spectra 
are  investigated  in  more  detail. 

Figure 5 shows the As 3d and Si 2p XPS spectra of the
As-implanted Si(100) surface after 5 h of exposure in the
clean-room atmosphere. There are two well-resolved peaks
in both the As 3d and Si 2p spectra. In the As 3d region, the
peak at the higher binding energy reflects oxidized As. The
lower energy peak reflects unoxidized As. However, it is
unclear whether this unoxidized As peak corresponds to As
bonded to Si atoms or As bonded to As atoms (As
clusters in Si). In the Si 2p region, the higher energy peak
corresponds to SiO2 and the lower one corresponds to Si in
the Si substrate. Roughly 65% of the As atoms and 30% of
the Si atoms are oxidized in the region probed by XPS. The
depth of As oxidation can be calculated when the As depth
profile in Si and the escape depth of As 3d photoelectrons in
Si and SiO2 are known. In calculating this As oxidation
depth, we make two assumptions.

Fig. 6. Si native-oxide thickness on As-implanted Si (2×10^15/cm^2) and the
oxidation depth of As. As is oxidized in a deeper region than Si, confirming
that As is preferentially oxidized in Si.

First, we assume that the As concentration is constant over
the depth probed by XPS (shallower than 50 Å when the
take-off angle is 30°). This is based on the As depth profile
measured by HG-ICPS (hydride generation inductively
coupled plasma spectrometry). In this profile, the topmost
Si layer contains 1.1×10^??/cm^3 of As. However, this layer is
removed during HF treatment. Below it, the As concentration
either remains constant or increases with increasing depth, in the range between
3.5×10^20/cm^3 and 7.5×10^20/cm^3, within
the depth sampled by XPS. So as not to overestimate the
oxidation depth of As, we assume a constant concentration of
As in the region probed by XPS.

Second, we assume that the escape depths of As 3d
photoelectrons in Si and SiO2 are approximately the same as
those of Si 2p. This assumption is based on the dependence of the
escape depth on the kinetic energy of the photoelectron. The
kinetic energy of As 3d is about 1210 eV and differs by only
5% from that of Si 2p, about 1150 eV.
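The size of the error introduced by this second assumption can be estimated with a simple power-law scaling of the escape depth with kinetic energy. The sketch below assumes λ ∝ E^p with p between 0.5 and 0.75; that exponent range is a common rule of thumb and is an assumption here, not a result of this work.

```python
def relative_energy_difference(e_ref_ev: float, e_other_ev: float) -> float:
    """Fractional difference of e_other relative to e_ref."""
    return (e_other_ev - e_ref_ev) / e_ref_ev


def relative_escape_depth_change(e_ref_ev: float, e_other_ev: float,
                                 exponent: float) -> float:
    """Fractional change in escape depth, assuming lambda ~ E**exponent."""
    return (e_other_ev / e_ref_ev) ** exponent - 1.0


if __name__ == "__main__":
    e_si2p, e_as3d = 1150.0, 1210.0  # kinetic energies quoted in the text, eV
    print(f"energy difference: {relative_energy_difference(e_si2p, e_as3d):.1%}")
    for p in (0.5, 0.75):            # assumed exponent range
        change = relative_escape_depth_change(e_si2p, e_as3d, p)
        print(f"p = {p}: escape depth differs by {change:.1%}")
```

Under these assumptions the escape depths differ by a few percent, which is small compared with the other uncertainties in the depth estimate.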

Based  on  these  assumptions,  the  As  oxidation  depth  is 
calculated  in  exactly  the  same  way  as  the  Si02  thickness  is 
calculated. 

Figure 6 shows the Si native-oxide thickness (total thickness
of SiO2 and SiOx) and the As oxidation depth in As-
implanted Si(100) (2×10^15/cm^2) as a function of the exposure
time in the clean-room atmosphere. As is oxidized in a
deeper region than Si, showing that the As atoms are preferentially
oxidized. At an exposure time of 24 h, As atoms
located 15 Å below the Si/SiO2 interface are oxidized. This
is a reasonable result for this oxidation model because it is
estimated that there is one As atom per 15 Å cube when the
As concentration is 5×10^20/cm^3.
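The link between concentration and the size of the cube that holds one As atom on average is just a cube root. A minimal sketch follows; it takes the As concentration to be of order 10^20 /cm^3, which is an assumption here, and the function name is illustrative.

```python
def cube_edge_angstrom(concentration_per_cm3: float) -> float:
    """Edge length (angstrom) of the cube containing one atom on average."""
    edge_cm = (1.0 / concentration_per_cm3) ** (1.0 / 3.0)
    return edge_cm * 1.0e8  # 1 cm = 1e8 angstrom


if __name__ == "__main__":
    for n in (3.0e20, 5.0e20, 7.5e20):  # assumed concentrations, /cm^3
        print(f"n = {n:.1e} /cm^3 -> cube edge = {cube_edge_angstrom(n):.1f} A")
```

For concentrations of a few 10^20 /cm^3 the edge length comes out between roughly 11 and 15 Å, the same order as the oxidation depth discussed above.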

There are several factors that could influence the native
oxidation of As-implanted Si: damage and surface roughness
caused by the As-implantation process, a chemical effect of
implanted As, or an electrical effect. The experimental
result of preferential As oxidation suggests that As
implantation affects the native oxidation of Si via its chemical
effect.

V.  CONCLUSION 

The authors have investigated the native oxidation
process of As-implanted (1×10^??-1×10^??/cm^2) Si(100) by
quantitative XPS analysis in which the thicknesses of SiO2
and SiOx are calculated with a sensitivity of a fraction of a
monolayer. The results show that the oxidation of As-
implanted Si is faster than that of Si without implantation,
that the native oxide on As-implanted Si contains more
suboxide than that on Si without implantation, and that As is
oxidized in a deeper region than Si is. This means that high-
concentration As implantation changes the native oxidation
process of Si itself.

From these results, we developed an oxidation model for
As-implanted Si in which As atoms are preferentially
oxidized in Si, acting as oxidation initiators, followed by
oxidation of the surrounding Si. The oxidation depth of
As-implanted samples was determined, confirming the
validity of this model.


Quantification of metal trace contaminants on Si wafer surfaces

We have shown that quantitative determination of metal trace impurities in the range between 10^?
cm^-2 and 10^?? cm^-2 on top of the native oxide of a Si wafer by laser postionization of sputtered
neutrals in combination with time-of-flight mass spectrometry (Laser-SNMS) and time-of-flight
secondary ion mass spectrometry (TOF-SIMS) is possible. Trace metal standards with
concentrations in the range between 10^? cm^-2 and 10^?? cm^-2 were used for calibration. These
standards were prepared by sputter deposition and were independently controlled by total
reflection x-ray fluorescence analysis. Relative sensitivity factors for 12 metals on Si for
Laser-SNMS were determined. Additionally, we compared postionization with different
wavelengths (193 nm and 248 nm). With Laser-SNMS it was then possible to determine the
influence of UV/ozone treatment on the measured surface concentration of metal species. The
UV/ozone treatment is necessary to achieve high and reproducible useful yields for the metal
species in TOF-SIMS. With this knowledge, we were able to determine relative sensitivity factors
for the metals on UV/ozone-treated Si wafer surfaces measured by TOF-SIMS. Detection limits
down to 10^? cm^-2 and 10^?? cm^-2 for sample surface areas of 100 µm in diameter and 1 µm in
diameter, respectively, were found for both Laser-SNMS and TOF-SIMS. © 1996 American
Vacuum Society.


I.  INTRODUCTION 

The  detection  and  quantitative  determination  of  metal 
trace  contaminants  is  an  important  issue  in  wafer  processing 
and  integrated  circuit  (IC)  fabrication.  Detection  limits  well 
below  1  ppm  in  a  single  monolayer  are  required.  From  the 
results  of  shallow  depth  profiling  of  oxide  layers  on  Si,  it  is 
known  that  these  surface  contaminants  are  preferentially  lo¬ 
cated  on  top  of  the  oxide  layer,  i.e.,  the  uppermost 
monolayer.^  Standard  techniques  for  surface  analysis  like 
Auger  electron  spectroscopy  and  x-ray  photoelectron  spec¬ 
troscopy  cannot  be  applied  because  of  their  limitations  in 
sensitivity.  The  same  holds  true  for  the  electron  microprobe. 
The  standard  technique  for  the  quantitative  determination  of 
metals  on  wafer  surfaces  is  total  reflection  x-ray  fluorescence 
spectroscopy  (TXRF),^  widely  applied  in  the  semiconductor 
industry  for  research  and  development  as  well  as  production 
control. The most serious limitations of this technique are the
detection limits (in the range of 10^?? cm^-2), the lack of
lateral resolution, and a larger information depth (about 10
monolayers).

We have demonstrated the usefulness of TOF-SIMS for
metal trace detection on GaAs, down to the ppm range, with
examined areas as small as 50 µm in diameter. More
recently these investigations were extended to Si wafer
surfaces. In this article we report on a combined Laser-
SNMS/TOF-SIMS study of metal contaminants on Si wafer
surfaces. For calibration, standards were produced by
controlled sputter deposition of metal submonolayers on Si
wafer surfaces under TXRF control. Quantitation was achieved
for both Laser-SNMS and TOF-SIMS with comparable
detection limits down to 10^? cm^-2 and 10^?? cm^-2 for sample
surface areas of 100 µm in diameter and 1 µm in diameter,
respectively.
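A lower detection limit for the larger spot and a higher one for the smaller spot is what one expects if a fixed minimum number of atoms must be present in the analyzed area. A minimal sketch of that scaling follows; the surface concentration in the example is a hypothetical placeholder rather than a value from this article, and a circular, uniformly covered spot is assumed.

```python
import math


def atoms_in_spot(surface_conc_per_cm2: float, diameter_um: float) -> float:
    """Number of atoms inside a circular spot of the given diameter."""
    radius_cm = diameter_um * 1.0e-4 / 2.0  # 1 um = 1e-4 cm
    return surface_conc_per_cm2 * math.pi * radius_cm ** 2


def scaled_limit(limit_per_cm2: float, d_ref_um: float, d_new_um: float) -> float:
    """Detection limit for a new spot size at constant number of atoms."""
    return limit_per_cm2 * (d_ref_um / d_new_um) ** 2


if __name__ == "__main__":
    conc = 1.0e9  # hypothetical surface concentration, atoms/cm^2
    print(f"atoms in a 100 um spot: {atoms_in_spot(conc, 100.0):.3g}")
    print(f"equivalent limit for a 1 um spot: {scaled_limit(conc, 100.0, 1.0):.3g} /cm^2")
```

Reducing the spot diameter by a factor of 100 raises the concentration needed for the same number of atoms by a factor of 10^4.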


II.  PROCEDURE 

In previous work it has been shown that quantitative,
high-sensitivity bulk analysis of metal alloys is possible with
Laser-SNMS by applying nonresonant multiphoton
postionization. The important quantity used in this context is the
relative sensitivity factor (RSF) S(A, B) of element A with
respect to element B. It is defined by the following equation:

$$S(A,B) = \frac{I(A)}{I(B)} \cdot \frac{c(B)}{c(A)}, \tag{1}$$

where I(X) is the detected photoion signal intensity of element X
and c(X) is the bulk concentration of element X.
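Once an RSF has been measured on a standard, the definition can be inverted to obtain an unknown concentration from a signal ratio. The following minimal sketch shows both directions; the intensities and concentrations in the example are made-up illustration values, and the function names are not from the original work.

```python
def rsf(i_a: float, i_b: float, c_a: float, c_b: float) -> float:
    """Relative sensitivity factor S(A, B) = [I(A)/I(B)] * [c(B)/c(A)]."""
    return (i_a / i_b) * (c_b / c_a)


def concentration_from_rsf(i_a: float, i_b: float, c_b: float,
                           s_ab: float) -> float:
    """Concentration of A from signals and a known S(A, B), by inverting the definition."""
    return (i_a / i_b) * c_b / s_ab


if __name__ == "__main__":
    # Calibration on a standard with known composition (illustrative numbers).
    s_ab = rsf(i_a=200.0, i_b=1000.0, c_a=0.01, c_b=0.99)
    # Apply the RSF to a hypothetical unknown sample with the same matrix element B.
    c_a = concentration_from_rsf(i_a=50.0, i_b=1000.0, c_b=0.99, s_ab=s_ab)
    print(f"S(A,B) = {s_ab:.3g}, c(A) = {c_a:.3g}")
```

The inversion is only meaningful when the RSF is transferable between the standard and the unknown, which is the point of the matrix-effect discussion that follows.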

RSFs  are  determined  by  the  sputtering  and  the  ionization 
process.  They  are  in  particular  influenced  by: 

(i)  Preferential sputtering. This means that from a multi-
component sample, some species are dominantly sputtered,
whereas other species are enriched on the surface.
Only when sputter equilibrium is reached (after
removing many, or in some cases several hundred,
monolayers) does the flux of sputtered particles
represent the composition of the sample material.

(ii)  Different angular and energy distributions for various
sputtered species. These distributions determine the
fraction of sputtered particles passing the ionization
volume. The ionization volume is the intersection
volume of the acceptance of the mass analyzer and the
portion of the laser focus in which the intensity is
high enough for ionization.

(iii)  Different  ionization  cross  sections  of  various  species 
at  a  given  laser  wavelength  and  laser  power  density. 

For  Laser-SNMS  with  nonresonant  multiphoton  ioniza¬ 
tion,  the  experimentally  determined  RSFs  of  different  ele¬ 
ments  are  within  approximately  one  decade.^’^ 

Variations of the RSF for a given pair of elements in
different samples can be partially explained by different
sputter yields (the average number of sputtered particles per
primary ion) due to preferential sputtering, and by changes in
the angular and energy distributions of the sputtered species.
In the case of sputter equilibrium, it has been shown in
Laser-SNMS that the RSFs remain constant within 20% in
most cases, even for strong matrix variations. In addition to
sputter-induced variations, changes in the ionization
probability are possible. When switching from a metal surface to an
oxidized surface, for example, the RSFs of most elements are
preserved within a factor of 1.5. Exceptions are elements
with a relatively low multiphoton ionization cross section.
Their RSFs can be enhanced by as much as a factor of
5 in the most dramatic case (e.g., by changing from Si to
PtSi). This is primarily caused by a large amount of
molecular sputtering and by dissociative ionization of sputtered
molecules or clusters, which have a higher ionization cross
section than the corresponding atomic species. All these effects
make calibration of Laser-SNMS with standards
necessary.

These results for Laser-SNMS bulk analysis were the
starting point for our attempt to quantify trace metals on Si
wafer surfaces by Laser-SNMS. Since the general features of
our samples [low concentrations (10^? cm^-2 up to 10^?? cm^-2)
of metals on top of the native oxide of a Si wafer] did not
change, variations of RSFs between samples are not
expected. As long as we remain within static sputtering
conditions (primary ion dose <4×10^?? cm^-2), the sputter-induced
changes in the surface structure and composition can be
neglected. Consequently, the influence of preferential sputtering
and of changes in the angular and energy distributions of the
sputtered particles on the RSFs is insignificant. Therefore,
good reproducibility of our measurements for various
samples was expected once a standard calibration for the
laser parameters had been carried out.

A technique was needed for the preparation of standards
that provides homogeneous surface coverages by
different metals in various combinations in a concentration
range between 10^? cm^-2 and 10^?? cm^-2. Additionally, the
amount of deposited material should be controllable and
reproducible. Sputter deposition meets these requirements. In
contrast to preparation from solution or by evaporation, this
technique supplies very homogeneous coverages. Furthermore, it
is possible to sputter virtually all metals and alloys. The
amount of deposited material can be controlled and
calculated from the total primary ion dose applied to the sputtered
target.

As an additional control, the standard wafers were analyzed
after metal deposition by TXRF to determine the
resulting surface concentrations of the metals. TXRF is the
standard surface analytical tool for research, development,
and control in IC manufacturing. It is the only reference
technique that is comparable to Laser-SNMS and TOF-
SIMS in terms of detection limits (between 10^?? cm^-2 and
10^?? cm^-2 for the transition metals) and information depth
(about 10 monolayers for TXRF; <3 monolayers for Laser-SNMS
and TOF-SIMS). The different lateral resolution of the techniques is
no problem for calibration because our samples are
homogeneously covered. The homogeneity was shown by Laser-
SNMS analysis of different spots on one sputter-deposited
wafer.

With  these  types  of  standards,  RSFs  and  detection  limits 
were  determined  for  Laser-SNMS.  The  RSFs  allow  correc¬ 
tion  for  the  effects  mentioned  above  arising  from  the  particu¬ 
lar  choice  of  samples.  Prior  to  these  studies,  the  sample 
preparation  itself  has  been  investigated  in  some  detail.  A 
comparison  of  postionization  with  different  wavelengths 
(193  nm  and  248  nm)  follows. 

In  TOF-SIMS,  a  greater  effort  to  achieve  quantification  is 
necessary  because  of  the  more  pronounced  matrix  effect. 
Here,  the  RSFs  can  vary  by  several  decades,  e.g.,  when  com¬ 
paring  a  metal  surface  to  an  oxidized  one.  In  these  investi¬ 
gations,  the  oxidation  of  the  surface  is  essential  to  achieve 
high  ionization  yields  for  the  metal  contaminants  and  a  well- 
defined  chemical  environment  for  these  species.  With  the 
Laser-SNMS  results,  it  was  possible  to  determine  the  influ¬ 
ence  of  UV/ozone  treatment  on  the  surface  concentrations  of 
metal  species.  Based  on  these  results,  RSFs  and  detection 
limits  for  various  metals  on  Si  surfaces  with  TOF-SIMS  have 
been  determined. 

III.  INSTRUMENTATION 
A.  Time-of-flight  mass  spectrometer

All measurements presented here were performed in a
reflectron-type time-of-flight mass spectrometer developed at
the University of Münster (TOF III type). Both techniques,
TOF-SIMS and Laser-SNMS, are implemented in this type
of spectrometer. A schematic drawing of the instrument is
shown in Fig. 1. The instrument is equipped with a 10 keV
Ar+ electron impact (EI) primary ion source, which is
coupled to a 90°-deflection pulsing unit.

In the Laser-SNMS mode, primary ion pulses with a pulse
width of 1 µs and a pulse current of 200 nA are used. The
beam diameter is about 100 µm. In the Laser-SNMS mode, it
is important to suppress the sputtered secondary ions. This is
achieved for the positive secondary ions by a repulsive
potential, which is applied to the extractor during the primary
ion pulse of length t_p and the time delay for the laser
pulse, according to the timing diagram of Fig. 2. The
generated negative secondary ions are suppressed by the analyzer,
which is held at a negative potential for the analysis of the
positive photoions. For postionization, a standard excimer
laser (Lambda EMG 103 MSC, pulse width 15 ns, maximum
pulse energy 120 mJ) is used at different wavelengths [193
nm (ArF) and 248 nm (KrF)]. The laser beam is focused by
a single quartz lens with f = 250 mm. The resulting ionization
volume has a diameter of about 200 µm, which is
comparable to the acceptance of the analyzer (Ø ≈ 500 µm). The
laser focus is located as close to the target surface as
possible, in order to maximize the portion of sputtered particles
in the ionization volume. Nearly simultaneously with the
laser pulse, the extraction voltage of 2 kV is switched on to
accelerate the generated photoions over the extraction
distance of 3 mm into the analyzer. The effective flight path of
the ions is 3 m. The energy spread is focused in second order
by a double-stage reflectron. The transmitted ions are
detected by a dual channelplate detector (chevron
arrangement). In Laser-SNMS, up to several hundred
photoions of one mass are generated in a single shot. Therefore, the
photoions must be detected in analog registration mode
rather than by single ion counting. The signals are recorded
by an 8-bit transient digitizer (Tektronix RTD 720) with a
maximum sampling rate of 2 GHz. For the results reported
here, the data transfer to the PC limited the repetition rate to
8 or 16 Hz, depending on the recorded mass range.

Fig. 1. Scheme of the time-of-flight mass spectrometer.
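The pulse parameters above fix the primary ion dose delivered per shot, which is the quantity that matters for staying within static sputtering conditions. The sketch below assumes singly charged Ar+ ions and a uniform circular beam; the dose limit passed in the example is a hypothetical placeholder, not the limit used in this work.

```python
import math

E_CHARGE_C = 1.602176634e-19  # elementary charge in coulomb


def ions_per_pulse(pulse_current_a: float, pulse_width_s: float) -> float:
    """Number of singly charged primary ions in one pulse."""
    return pulse_current_a * pulse_width_s / E_CHARGE_C


def dose_per_pulse(pulse_current_a: float, pulse_width_s: float,
                   beam_diameter_cm: float) -> float:
    """Primary ion dose per pulse (ions/cm^2) for a uniform circular beam."""
    area_cm2 = math.pi * (beam_diameter_cm / 2.0) ** 2
    return ions_per_pulse(pulse_current_a, pulse_width_s) / area_cm2


def pulses_to_reach_dose(dose_limit_per_cm2: float, pulse_current_a: float,
                         pulse_width_s: float, beam_diameter_cm: float) -> float:
    """Number of pulses after which the accumulated dose reaches the limit."""
    return dose_limit_per_cm2 / dose_per_pulse(
        pulse_current_a, pulse_width_s, beam_diameter_cm)


if __name__ == "__main__":
    current, width, diameter = 200e-9, 1e-6, 100e-4  # 200 nA, 1 us, 100 um
    print(f"ions per pulse: {ions_per_pulse(current, width):.3g}")
    print(f"dose per pulse: {dose_per_pulse(current, width, diameter):.3g} /cm^2")
    limit = 1.0e12  # hypothetical static limit, ions/cm^2 (assumption)
    print(f"pulses to limit: {pulses_to_reach_dose(limit, current, width, diameter):.0f}")
```

With these parameters each shot delivers on the order of 10^6 ions, so the number of shots available per spot is set directly by whichever static limit is adopted.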

For the TOF-SIMS measurements we have used a similar
spectrometer (TOF III type) which has been optimized for
TOF-SIMS applications. This instrument has an improved
primary ion gun (11 keV Ar+) whose 90°-deflection pulsing
unit produces, in combination with an electrodynamic
buncher, short mass-separated ion pulses of up to 3000 primary
ions per pulse with a minimum pulse width of 600 ps or a
minimum spot diameter of 5 µm. In the TOF-SIMS mode,
secondary ions are extracted into the mass analyzer by a
static extraction field in front of the target. Because of the
smaller energy spread of secondary ions compared with that
of photoions, which are generated in an extended volume in
front of the target, a single-stage reflectron for time focusing
is sufficient. This has the advantage of a higher transmission
(factor 2) than a double-stage reflectron. TOF-SIMS spectra
are recorded in single ion counting mode by a multistop
time-to-digital converter (TDC) with ultimate time resolution
(50 ps, 512 stops, 2 M channels) and a maximum repetition
rate of 20 kHz.

Fig. 2. Timing diagram of one acquisition cycle in Laser-SNMS.

The mass resolution R of a time-of-flight mass
spectrometer is given by

$$R = \frac{m}{\Delta m} = \frac{1}{2}\,\frac{t}{\Delta t} = \frac{1}{2}\,\frac{t}{\sqrt{\Delta t_{\mathrm{start}}^{2} + \Delta t_{\mathrm{ana}}^{2} + \Delta t_{\mathrm{stop}}^{2}}}.$$

In the following, the 50% definition of the mass resolution is
used. This means that Δm is the full width at half maximum
(FWHM) of the respective peak at mass m. In this equation,
Δt_start and Δt_stop are the time durations of the start event and
the stop event, respectively, and Δt_ana is the time dispersion
of the particles during their flight in the mass analyzer.
Whereas Δt_start and Δt_stop are independent of the observed
mass, Δt_ana is proportional to the total flight time of the ion.

If a static ion extraction is used, Δt_start is determined by
the duration of the ion formation and therefore in the case of
Laser-SNMS is comparable with the laser pulse width.
However, this duration can be eliminated by the use of a pulsed
and delayed extraction (Fig. 2), where the start moment of
the ions is given by the rising edge of the extraction pulse.
The duration Δt_stop is given by the pulse width of the
detector (800 ps) and the time resolution of the transient digitizer
(channel width 500 ps). The dispersion Δt_ana is caused by
the energy distribution of the photoions. This energy
distribution has two components. The first one is due to the initial
energy distribution of the sputtered particles, which is of the
order of a few eV. The second one originates from the spatial
extension of the ionization volume in the extraction field.
Depending on the extraction field strength and the ionization
volume, this second energy spread can be 50 times larger
than the initial energy spread of the sputtered particles.
Unfortunately, these two energy distributions cannot be focused
in time simultaneously, so that, in general, the double-stage
reflectron is used for second-order focusing of the energy
spread caused by the axial space distribution of the photoions
in the ionization volume.
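The resolution formula and the timing figures above can be combined into a rough numerical estimate. The sketch below treats the flight as a field-free drift over the 3 m effective path at 2 keV, ignoring the extraction region and the reflectron, and combines the 800 ps detector pulse width with the 500 ps channel width in quadrature; both simplifications, and the choice of mass 56 u, are assumptions made here for illustration.

```python
import math

E_CHARGE_C = 1.602176634e-19   # elementary charge in coulomb
AMU_KG = 1.66053906660e-27     # atomic mass constant in kg


def drift_time_s(mass_u: float, energy_ev: float, path_m: float) -> float:
    """Flight time of a singly charged ion in a field-free drift (assumption)."""
    speed = math.sqrt(2.0 * energy_ev * E_CHARGE_C / (mass_u * AMU_KG))
    return path_m / speed


def mass_resolution(t_s: float, dt_start_s: float, dt_ana_s: float,
                    dt_stop_s: float) -> float:
    """R = t / (2 * sqrt(dt_start^2 + dt_ana^2 + dt_stop^2))."""
    dt = math.sqrt(dt_start_s ** 2 + dt_ana_s ** 2 + dt_stop_s ** 2)
    return t_s / (2.0 * dt)


if __name__ == "__main__":
    t = drift_time_s(mass_u=56.0, energy_ev=2000.0, path_m=3.0)
    dt_stop = math.hypot(800e-12, 500e-12)  # detector and digitizer, in quadrature
    # Upper bound: start and analyzer contributions set to zero.
    print(f"t = {t * 1e6:.1f} us, R (stop event only) = "
          f"{mass_resolution(t, 0.0, 0.0, dt_stop):.0f}")
```

Since t scales with the square root of the mass while the stop term is fixed, this stop-limited bound on R also grows with the square root of the mass; the analyzer term, being proportional to t, sets a mass-independent ceiling instead.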

In TOF-SIMS, higher mass resolution is achieved than in
Laser-SNMS. The short time interval in which secondary
ions are generated (primary ion pulse width of 600 ps) does
not require a pulsed and delayed extraction. In addition, all
secondary ions start at the same potential. This means the
time-of-flight analyzer can be used to focus the initial energy
spread of the sputtered secondary ions. Secondary ion
spectra are recorded in single ion counting mode by a TDC with
an ultimate time resolution corresponding to the channel
width of 50 ps. This means that the time error of the stop
event is considerably smaller than that for Laser-SNMS
analysis. In addition, the determination of the stop
event with a constant fraction discriminator is very precise
and independent of the pulse width of the detector response.

Fig. 3. Scheme of the sputter device. The wafer surface is positioned at a
distance r to the sputter target and an angle relative to the normal of the
target.


B.  Preparation  of  surface  standards  by  sputter 
deposition 

Standards have been prepared by sputtering corresponding
metals or alloys by 10 keV Ar⁺ ions (primary ion current
200 nA dc) with an angle of incidence φ = 45°. Details of the
arrangement are given in Fig. 3. The primary ion beam was
scanned over an area of 0.5 mm² on the sputter target to
compensate for possible inhomogeneities of the alloys. Sputter
equilibrium was ensured by presputtering in the case of
alloys. The amount of deposited material was controlled by
the total primary ion dose N_PI applied to the sputter target.
Assuming sputter equilibrium, a cosine distribution dN/dΩ
of the sputtered particles, and a sticking probability of 1 for
the sputtered metal atoms on the Si surface, one can calculate
the surface coverages Θ on the wafer surface by the
following equation:


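$$\Theta(A) = a(A)\,c(A)\,N_{PI}\,Y_t(\varphi)\,\frac{dN}{d\Omega}\,\frac{\Omega}{F}
= a(A)\,c(A)\,N_{PI}\,\cos^{-f}(\varphi)\,Y_t(\varphi=0)\,\frac{\cos\vartheta}{\pi r^{2}} \tag{3}$$

where c(A) is the relative concentration of metal A in the
sputtered target, and Y_t(φ) the total sputter yield for
bombardment at angle φ. The values for Y_t(φ = 0) are taken
from Matsunami et al. Ω is the solid angle, F the area
of the wafer, and ϑ the angle of the wafer position relative to
the normal of the target (Fig. 3). The exponent f has the following value: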


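[Equation (4), garbled in extraction: a piecewise definition of the exponent f,
with separate cases for a mass ratio M_target/M_projectile below and above 3.
The numerical values of f are not recoverable from the source.]  (4)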


M_projectile and M_target are the masses of a projectile particle and a
target particle, respectively. The factor a in Equation (3) is a correction
factor that accounts for deviations from the assumptions made
above. Its value must be determined experimentally.
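
The coverage estimate described above can be sketched numerically. The following is a minimal Python sketch, assuming that Equation (3) has the form Θ = a c N_PI Y_t(0) cos^(-f)(φ) cos ϑ / (π r²), that the primary ions are singly charged, and that the dose follows from the dc current and the sputter time. The function names, the argument names, and the treatment of the exponent f as a free parameter are illustrative assumptions and are not taken from the original work.

```python
import math

# Elementary charge in coulombs (exact SI value).
E_CHARGE = 1.602176634e-19


def primary_ion_dose(current_a, time_s):
    """Number of primary ions delivered by a dc beam.

    Assumes singly charged ions (e.g. Ar+), so one ion per elementary charge.
    """
    return current_a * time_s / E_CHARGE


def sputter_yield_at_angle(y_normal, phi_deg, f):
    """Total sputter yield at incidence angle phi, scaled as cos^(-f)(phi).

    y_normal is the yield at normal incidence; f is the exponent of
    Equation (4), passed in as a parameter because its values are assumed here.
    """
    return y_normal * math.cos(math.radians(phi_deg)) ** (-f)


def surface_coverage(a, c, n_pi, y_normal, phi_deg, f, theta_deg, r_cm):
    """Surface coverage in cm^-2 for a cosine emission distribution.

    a: experimental correction factor, c: relative concentration of the metal,
    n_pi: primary ion dose, theta_deg: angle to the target normal,
    r_cm: target-to-wafer distance in cm.
    """
    y_phi = sputter_yield_at_angle(y_normal, phi_deg, f)
    return a * c * n_pi * y_phi * math.cos(math.radians(theta_deg)) / (math.pi * r_cm ** 2)
```

With a 200 nA beam, `primary_ion_dose(200e-9, 1.0)` gives roughly 1.2×10¹² ions per second of sputtering, which illustrates how the deposited amount can be set through the sputter time alone.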


IV.  RESULTS 

A.  Preparation  and  calibration  of  standards 

For the preparation of standards, 4 in. Si wafers [orientation
(100), thickness (525±15) µm, resistivity 16-24 Ω cm,
B-doped] were used without any pretreatment from the standard
production of Wacker Chemitronic GmbH, Burghausen,
FRG.

Qualitative  Laser-SNMS  investigations  show  that  the  in¬ 
tensities  of  hydrocarbon  peaks  in  a  mass  spectrum  of  a  wafer 
after  metal  deposition  by  sputtering  are  higher  than  those 
from  the  original  wafer.  The  surface  coverages  by  the  sputter 
deposited  metals  are  not  stable  over  time  or  at  higher  tem¬ 
peratures  without  any  further  treatment.  This  effect  for  wa¬ 
fers  after  metal  deposition  was  investigated  for  an  extended 
time  period  (six  months)  and  by  heating  a  metal  covered 
wafer  for  10  min  to  a  temperature  of  about  60  °C  under 
vacuum.  In  both  cases,  the  majority  of  the  deposited  metals 
disappeared  from  the  surface.  Hydrocarbon  adsorption  on  the 
freshly  deposited  metals  which  then  form  volatile  metallo- 
organic  compounds  may  explain  these  observations.  A  UV/ 
ozone  treatment  (30  min  in  a  commercial  device,  UV-1, 
SAMCO,  Japan)  of  the  wafers  immediately  after  sputter 
deposition  was  found  to  stabilize  the  metal  concentrations. 
We  assume  that  by  this  treatment,  organic  contaminants  are 
fragmented  and  oxidized  to  small  volatile  molecules  like 
CO2, H2O, or N2. A qualitative investigation of UV/ozone
treated  wafers  by  Laser-SNMS  has  shown  that  the  majority 
of  organic  contaminants  disappear,  whereas  the  surface  cov¬ 
erages  by  the  sputter  deposited  metals  change  only  slightly 
but  become  stable  even  over  long  times. 

For  calibration,  one  wafer  with  12  different  metals  on  its 
surface  was  prepared  (see  Table  I).  In  addition,  a  series  of  six 
wafers  was  prepared  with  the  four  metals  V,  Ni,  Co,  and  Cu 
in  different  surface  coverages  in  the  range  between  some 
10^  cm“^  and  10^^  cm“^  for  testing  the  functional  depend¬ 
ence  on  the  surface  coverage  of  the  measured  peak  intensi¬ 
ties  in  Laser-SNMS  and  TOF-SIMS.  Since  the  isotopes  of 
these  four  metals  have  no  mass  interferences  with  other  ele¬ 
ments  and,  additionally,  these  elements  are  either  mono¬ 
isotopic  or  their  main  isotope  accounts  for  over  65%  of  the 
element, the uncertainty caused by background in the spectrum
is small, even for low surface coverages. Absolute coverages
of the standards were determined by TXRF with an
ATOMIKA 8010 (excitation with Mo-Kα). They were found
to be in good agreement with the values calculated by Equation
(3) after setting the factor a = 2 (see also Table I). This
factor may be explained in the following manner. As the
collision cascade, in the case of nonperpendicular incidence
of the primary particles, reaches the surface sooner than in
case of normal incidence, the excitation direction cannot be
neglected. The angular distribution of sputtered particles is
more pronounced in this direction. Therefore the assumption
of a cosine distribution in Equation (3) results in too
small values of the calculated surface concentrations. For the
following investigations, the surface coverages determined
by TXRF were used. For Al, Ti, and Mo, the values calculated
by Equation (3) were used (factor a = 2), because they
could not be detected by TXRF in our investigations: the
surface coverages by Al and Ti on the samples were below
the detection limits of TXRF due to overlap of Al and Ti
peaks with the intense Si peak. Mo could not be detected in
TXRF because the samples were excited by Mo-Kα radiation.

Table I. Standard for metal trace determination on Si wafer surfaces.
Surface coverage in cm⁻² (exponents marked ? are illegible in the source).

Element   Calculated by Equation (3)   Determined by TXRF
          (factor a = 2)
Al        5.0×10^?                     -
Ti        7.0×10^?                     -
V         1.0×10^12                    1.0×10^?
Cr        8.0×10^?                     8.3×10^?
Fe        1.6×10^12                    1.1×10^?
Co        9.0×10^?                     1.0×10^?
Ni        2.6×10^12                    2.1×10^?
Cu        3.0×10^12                    2.2×10^?
Ga        1.4×10^12                    1.5×10^?
As        1.4×10^?                     8.0×10^?
Mo        9.2×10^?                     -
W         9.8×10^?                     4.0×10^?

B.  Laser-SNMS  results 

The typical mass resolution achieved in Laser-SNMS is
demonstrated in Fig. 4. Since pulsed and delayed extraction
of the photoions is performed, the width of the H peak (1.5
ns) is caused by the mass independent duration of the detector
pulse (800 ps) and the channel width of the transient
digitizer (500 ps). This leads to a time error Δt_stop ≈ 940 ps.
The time dispersion Δt_ana can be assumed to be about 670
ps since the overall flight time is known to be about 5.1 µs.
Therefore, the duration of the starting process Δt_start can be
estimated as about 950 ps [Equation (2)]. The pulsed and
delayed extraction minimizes the duration of the ionizing
event (laser pulse width 15 ns). In contrast to H, the time
width for masses above 12 u is mainly determined by the
time dispersion Δt_ana. In this mass range, the mass resolution
approaches the limit given by the time-of-flight analyzer
(R ≈ 3800). In Fig. 5 the experimentally determined mass
resolution of Laser-SNMS is plotted as a function of mass.
The solid curve represents the functional form of Equation
(2) fitted to the experimental values with the following parameters:
(Δt_start² + Δt_stop²)^(1/2) = 1.4 ns, t/(2Δt_ana) = 3800.

Fig. 4. Details of a Laser-SNMS spectrum demonstrating the high mass
resolution of the double reflectron TOF mass spectrometer in the Laser-
SNMS mode.

Fig. 5. Mass resolution of the mass spectrometer in the Laser-SNMS mode.
The solid curve is fitted to the experimental data.
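
The timing budget quoted above can be checked with a few lines of arithmetic. The following Python sketch assumes that Equation (2) combines the three time contributions in quadrature and relates them to the mass resolution by R = t/(2Δt), that the 5.1 µs flight time belongs to the 1.5 ns wide peak at 1 u, and that the flight time scales with the square root of the mass. These assumptions and all names in the code are illustrative; they reproduce the numbers in the text but are not taken from the original analysis.

```python
import math


def quadrature(*terms_ps):
    """Root of the sum of squares of independent time contributions (ps)."""
    return math.sqrt(sum(t * t for t in terms_ps))


# Stop event: detector pulse width and digitizer channel width.
dt_stop_ps = quadrature(800.0, 500.0)

# Analyzer dispersion from the flight time and the analyzer limit R = t / (2 dt).
FLIGHT_TIME_H_PS = 5.1e6      # 5.1 microseconds, assumed to refer to 1 u
ANALYZER_LIMIT = 3800.0
dt_ana_ps = FLIGHT_TIME_H_PS / (2.0 * ANALYZER_LIMIT)

# Start event: what remains of the measured 1.5 ns peak width.
PEAK_WIDTH_PS = 1500.0
dt_start_ps = math.sqrt(PEAK_WIDTH_PS ** 2 - dt_stop_ps ** 2 - dt_ana_ps ** 2)


def mass_resolution(flight_time_ps, dt_start, dt_stop, analyzer_limit):
    """Mass resolution R = t / (2 dt) with the three terms added in quadrature."""
    dt_ana = flight_time_ps / (2.0 * analyzer_limit)
    return flight_time_ps / (2.0 * quadrature(dt_start, dt_stop, dt_ana))


def flight_time_ps(mass_u):
    """Flight time for a given mass, assuming t scales with sqrt(mass)."""
    return FLIGHT_TIME_H_PS * math.sqrt(mass_u)


resolution_at_56u = mass_resolution(
    flight_time_ps(56.0), dt_start_ps, dt_stop_ps, ANALYZER_LIMIT
)
```

Under these assumptions the stop error comes out near 940 ps, the analyzer dispersion near 670 ps, and the start duration near 950 ps, and the resolution at 56 u lies only a few percent below the analyzer limit of 3800.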

In  Fig.  6  a  logarithmically  scaled  spectrum  of  a  Si  wafer 
covered  with  metals  (Ni,  Cu,  Zn)  at  a  total  surface  concen¬ 
tration  of  about  10^^  cm“^  is  shown.  It  was  acquired  under 
low  dose  sputtering  conditions  (primary  ion  fluence 
<4X10^^  cm“^,  equivalent  to  sputtering  of  less  than  10%  of 
a  monolayer)  over  10  000  cycles  (acquisition  time  21  min). 
The  sputtered  particles  were  postionized  with  a  laser  power 
density  of  about  10^®  W  cm“^  (wavelength  \=248  nm,  laser 
pulse  width  t  =  15  ns).  The  most  important  features  of  this 
spectrum  may  be  summarized  as  follows: 

(i)  The  dynamic  range  in  the  spectrum  is  about  six  orders 
of  magnitude. 

(ii)  There  is  almost  no  background  over  the  entire  mass 
range.  Obviously,  the  generated  secondary  ions  are 
effectively  suppressed  by  the  combination  of  the  re¬ 
pelling  field  during  sputtering  and  the  delayed  extrac¬ 
tion  after  postionization.  Since  the  majority  of  sput¬ 
tered  molecular  species  are  photodissociated  by 
interaction  with  the  laser  beam,  metastable  decay  in 
the  analyzer  does  not  contribute  to  the  background  in 
the  spectrum. 

(iii)  The  achieved  mass  resolution  is  sufficient  to  separate 
the  elemental  peaks  from  molecular  peaks  at  the  same 
nominal  mass.  Only  some  H-free  molecules,  mainly 
Fig. 6. Logarithmically scaled Laser-SNMS spectrum (primary ion fluence
4X10^^  cm“^,  postionization:  \=248  nm,  /-iX  10*®W  cm“^)  of  a  Si  wafer 
covered  by  various  metals  (Ni,  Cu,  Zn).  (a)  Overview  and  (b)  detail. 

monoxides  like  SO  or  VO,  show  interferences  with 
elemental  signals  and  require  higher  mass  resolution. 

(iv)  The  base  peak  in  the  spectrum,  originates  pri¬ 
marily  from  photodissociation  of  sputtered  hydrocar¬ 
bons.  The  contribution  to  this  peak  of  gas  phase  ion¬ 
ization  from  the  residual  gas  (total  pressure  <5X10“^ 
mbar)  or  thermally  desorbed  organic  molecules  was 
found  to  be  below  1%. 

(v) The matrix signals Si⁺ and SiO⁺ are relatively low. We
found the ionization cross section of SiO at 248 nm to
be low compared to those of metals like V, Cr, etc.
and that of Si to be even smaller. The Si⁺ signal is
mostly formed from SiO photofragmentation.

(vi) The intense S⁺ peak indicates a high concentration of
S on the wafer surface. S has not been deposited by
our preparation but was on the surface before, as indicated
by spectra of untreated wafers.

(vii)  Most  of  the  detected  molecular  photoions  are  of  the 

general  structures  C^OyH^  or  Due  to  the 

strong  dissociation  of  sputtered  molecules,  the  inter¬ 
pretation  of  this  part  of  the  spectrum  is  difficult. 

(viii)  Some  doubly  charged  ions  are  generated  for  several 
metals  like  V,  Cr,  Mn  10%  depending  on  the  laser 
power  density).  These  species  must  be  considered  for 
accurate  quantification  of  the  corresponding  elements. 
This  is  performed  by  summing  up  the  intensities  of 
the  singly  charged  and  doubly  charged  photoions,  be¬ 
cause  the  doubly  charged  ions  are  mainly  generated 
by  successive  absorption  of  additional  photons  by  pre¬ 
viously  formed  singly  charged  ions.^^ 

The  accuracy  and  reproducibility  of  Laser-SNMS  results 
have  been  demonstrated  by  the  analysis  of  a  series  of  Si 
Fig. 7. Linear correlation between the normalized Cu⁺ Laser-SNMS signal
and the calculated Cu surface coverage Θ_cal. The solid line is the linear
regression fit to the experimental data.


standards  covered  by  metals  in  the  range  of  some  10^  cm“^ 
to  10^^  cm“^.  In  Fig.  7  the  linear  relation  between  the  Cu 
surface  coverage  and  the  ^^Cu'*'  intensity  (intensity  corre¬ 
sponds  to  the  integrated  peak  area)  normalized  to  the  sum  of 
the  Si"^  and  the  SiO"^  intensities  is  shown.  The  excellent 
correlation  observed  (correlation  coefficient  R=0.997)  is 
also  confirmed  by  corresponding  results  for  V,  Co,  and  Ni. 

In  Table  II  the  RSFs  of  several  metals  deposited  on  Si 
wafers  and  the  detection  limits  are  listed  for  the  two  laser 
wavelengths  (193  nm  and  248  nm)  applied.  The  RSFs  are 
defined by Equation (1) wherein the bulk concentration c(A)
is replaced by the surface coverage Θ(A). The normalization
of the metal ion intensities to the sum of Si⁺ and SiO⁺ takes
into account small variations in experimental conditions (primary
ion current, transmission, position of the laser focus)
during data acquisition. Additionally, the effects of various
oxidation states of the surface are considered. Since the sputtered
SiO species are either photoionized to SiO⁺ or photofragmented
to atomic Si⁺, the sum of the Si⁺ and the SiO⁺
intensities is an approximate measure for the total number of
sputtered Si (elemental or oxidized).
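
The normalization described above, together with the summation of singly and doubly charged photoions used for quantification, can be written as a short helper. This is a minimal Python sketch; the function name, the argument names, and the example counts in the usage note are hypothetical and serve only to illustrate the arithmetic.

```python
def normalized_metal_signal(i_me_single, i_me_double, i_si, i_sio):
    """Metal signal normalized to the matrix signal.

    i_me_single, i_me_double: integrated peak areas of the singly and
    doubly charged metal photoions (summed for quantification).
    i_si, i_sio: integrated peak areas of Si+ and SiO+, whose sum is used
    as an approximate measure of the total sputtered Si.
    """
    matrix = i_si + i_sio
    if matrix <= 0:
        raise ValueError("matrix signal must be positive")
    return (i_me_single + i_me_double) / matrix
```

For hypothetical peak areas of 900 and 100 counts for the metal and 4000 and 1000 counts for Si⁺ and SiO⁺, the normalized signal is 0.2. Because the denominator scales with primary ion current and transmission in the same way as the numerator, such drifts cancel to first order.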

At 193 nm the ion yield for Si⁺ is enhanced by a factor of
20 compared to 248 nm, because of an accidental resonance.
Therefore the combined ion yield of Si⁺ and SiO⁺ is of the
same order as the ion yield of metals like Cr and Fe. Al and
Ga are ionized in a single photon process at 193 nm, which
leads to an enhancement of the ionization probability by one
decade for Al and two decades for Ga. This improves the
detection limits for these two metals accordingly. A slight
enhancement of the ion yield is observed for the other metals,
too, in agreement with the investigations of Hayashi
et al. All these facts result in an accidental improvement
of uniformity of the RSFs at 193 nm in contrast to 248 nm. A
disadvantage of the 193 nm wavelength is the higher ionization
probability for hydrocarbons. For this wavelength, even
in  the  higher  mass  range  (m>70  u),  organic  molecules  are 
detected  in  the  spectrum  with  high  intensities,  and  the  in¬ 
tense  peak  can  lead  to  detector  saturation,  making 
quantification  impossible. 
Table II. Laser-SNMS: Relative sensitivity factors and detection limits for metals
on Si wafer surfaces. Detection limits may originate from different factors (see Sec. V B 1).

Element   Relative sensitivity factor   Detection limits in cm⁻²   Limited by
          S(Me, ΣSi)
          193 nm      248 nm            193 nm      248 nm
Al        0.9         0.9               8×10^9      3×10^10        useful yield
Ti        1.9         9                 5×10^9      5×10^9         mass interference
V         2.1         42                3×10^8      3×10^8         data rates
Cr        2.9         28                3×10^9      3×10^9         mass interference
Fe        1.6         31                5×10^9      5×10^9         mass interference
Co        1.1         6.9               5×10^8      5×10^8         data rates
Ni        0.9         14                1×10^9      1×10^9         isotopy, data rates
Cu        1.2         24                5×10^9      5×10^9         mass interference
Ga        4.4         0.4               5×10^8      8×10^9         isotopy, data rates
As        1.9         10                3×10^8      3×10^8         data rates
Mo        1.8         32                3×10^8      3×10^8         isotopy, data rates
W         5.1         39                5×10^8      5×10^8         isotopy, data rates

The  detection  limits  for  the  respective  metals  are  also 
given in Table II. For a detailed discussion see Sec. V B 1.

C.  TOF-SIMS  results 

In  Fig.  8  the  mass  resolution  achieved  in  the  TOF-SIMS 
mode  is  plotted  as  a  function  of  mass.  The  scatter  in  the  data 
is  caused  by  large  differences  in  the  energy  distributions  of 
elemental  and  various  molecular  secondary  ion  species.  The 
resolution  of  the  mass  analyzer  is  about  12  000  and  the  time 
error  of  the  start  and  stop  events  can  be  estimated  as 
[value illegible in the source] ps. In Fig. 9 a detail of a positive secondary
ion spectrum from a Si wafer is shown demonstrating
the high mass resolution achieved. Eleven peaks at the
nominal mass 56 u can be identified. The mass difference
between ⁵⁶Fe⁺ and ²⁸Si₂⁺ is only 0.0189 u. A mass resolution
of more than 3000 is required to separate the corresponding
peaks, provided they have equal intensities.
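
The resolution requirement for this doublet follows directly from R = m/Δm. The short Python sketch below assumes the standard atomic masses of ⁵⁶Fe and ²⁸Si and neglects the electron mass, which cancels in the difference of two singly charged ions; the function and variable names are illustrative.

```python
# Atomic masses in u (standard tabulated values, rounded to six decimals).
MASS_FE56 = 55.934936
MASS_SI28 = 27.976927


def required_resolution(mass_a, mass_b):
    """Mass resolution m / delta_m needed to separate two neighbouring peaks."""
    mean_mass = 0.5 * (mass_a + mass_b)
    return mean_mass / abs(mass_a - mass_b)


mass_si2 = 2.0 * MASS_SI28                 # Si2 dimer
delta_m = mass_si2 - MASS_FE56             # mass difference of the doublet
resolution_needed = required_resolution(MASS_FE56, mass_si2)
```

The mass difference evaluates to about 0.0189 u and the corresponding m/Δm to slightly below 3000, so a resolution of about 3000 is the minimum for peaks of equal intensity; unequal intensities raise the requirement further.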

For  quantitative  determination  of  metal  contaminants  on 
Si  wafers  with  TOF-SIMS,  a  reproducible  oxidation  state  of 
the  metal  atoms  at  the  surfaces  is  required  in  order  to  control 
the  matrix  effect.  This  reproducible  oxidation  state  has  been 
achieved  by  a  30  min  UV/ozone  treatment  of  the  wafer  sur¬ 


face.  This  oxidation  procedure  results  in  a  high  and  repro¬ 
ducible  probability  that  sputtered  metal  atoms  are  emitted  as 
positive  ions.  In  addition,  surface  coverage  by  hydrocarbons 
is  considerably  reduced. 

The effect of the UV/ozone treatment is demonstrated in
Fig. 10. Molecular surface contaminations of a Co-covered
(Θ = 3×10¹⁰ cm⁻²) wafer result in a significant background
in the mass spectrum [Fig. 10(a)]. Co⁺ ions are not detectable.
Figure 10(b) shows the spectrum of the same wafer
after a 30 min UV/ozone treatment, resulting in a well separated
small Co⁺ peak. The remaining background originates
from  surface  adsorption  of  hydrocarbons  from  the  ambient 
air  during  transfer  of  the  wafer  from  the  UV/ozone  chamber 
to  the  mass  spectrometer.  In  principle,  this  re-coverage  could 
be  avoided  by  a  UV/ozone  treatment  directly  in  the  vacuum 
chamber  of  the  spectrometer. 

We have investigated the influence of the UV/ozone treatment
on the Laser-SNMS spectra, since this method is much
less sensitive to changes in the surface composition than
TOF-SIMS. There is only a change in the relative metal ion
intensities I(Me⁺)/I(ΣSi⁺) by a factor of 2 in the worst case,
and a considerable reduction of the hydrocarbon ion intensities.
These results justify the assumption that the trace metal
concentration is not affected by the UV/ozone pretreatment.




Fig.  8.  Mass  resolution  of  the  mass  spectrometer  in  the  TOF-SIMS  mode. 
The  solid  curve  is  fitted  to  the  experimental  data.  The  scattering  of  the 
values  is  caused  by  large  differences  in  the  energy  distributions  of  elemental 
and  various  molecular  secondary  ion  species. 


Nr.   Peak        Mass / u
1     ⁵⁶Fe        55.9343
2     ²⁸Si₂       55.9536
3     ²⁸SiCO      55.9706
4     ²⁹SiCNH     55.9869
5     ²⁸SiCNH₂    55.9956
6     ²⁹SiC₂H₃    55.9999
7     ²⁸SiC₂H₄    56.0079
8     C₃OH₄       56.0256
9     C₂N₂H₄      56.0385
10    C₃NH₆       56.0500
11    C₄H₈        56.0618

Fig. 9. Detail of a secondary ion spectrum demonstrating the high mass
resolution of the mass spectrometer in the TOF-SIMS mode.
Fig. 10. Details of secondary ion peaks at 59 u [Θ(Co) = 3×10¹⁰ cm⁻²]
(primary  ion  dose  1.25X10^,  analyzed  area  700  fxjx?).  (a)  Untreated;  (b)  30 
min  UV/ozone  treatment;  (c)  30  min  UV/ozone  treatment,  presputtered  (pri¬ 
mary  ion  fluence  2X10*^  cm”^);  and  (d)  30  min  UV/ozone  treatment,  pre¬ 
sputtered  (primary  ion  fluence  8X10*^  cm“^). 


To  reduce  the  remaining  background  in  the  secondary  ion 
spectrum  after  UV/ozone  treatment,  moderate  presputtering 
(primary  ion  fluence  <8X10^^  cm“^)  is  helpful.  This  can  be 
seen  from  Figs.  10(c)  and  10(d).  Presputtering  results  in  a 
significant  decrease  of  molecular  ions  with  only  negligible 
sputtering  of  metal  atoms.  One  exception  is  Cu:  a  possible 
explanation  is  the  assumption  of  preferential  sputtering  of  Cu 
from  a  Si  surface  or  a  strong  decrease  of  the  ionization  prob¬ 
ability  of  Cu  after  presputtering.  Unfortunately,  the  determi¬ 
nation  of  low  Cu  surface  coverages  is  not  possible  without 
presputtering. 


Fig.  11.  Logarithmically  scaled  secondary  ion  spectrum  (primary  ion  dose 
1.25X10^,  analyzed  area  700  fm^)  of  a  Si  wafer  covered  by  various  metals 
(Ni,  Cu,  Zn).  (a)  Overview  and  (b)  detail. 


Fig. 12. Linear correlation between the normalized Ni⁺ secondary ion signal
and the calculated Ni surface coverage Θ_cal. The solid line is the linear
regression fit to the experimental data.

Figure  11  presents  a  logarithmically  scaled  TOF-SIMS 
spectrum  from  the  same  wafer  as  Fig.  6.  The  wafer  was 
presputtered  with  a  primary  ion  fluence  of  5X 10^^  cm“^.  For 
spectral  acquisition,  the  surface  was  bombarded  with  a  total 
dose  of  1.25X10^  primary  ions  in  a  spot  size  of  700  /zm^  in 
15  min.  This  is  equivalent  to  an  erosion  of  about  40%  of  the 
outer  monolayer  assuming  a  sputter  yield  of  about  2.  The 
most  important  features  of  this  spectrum  may  be  summarized 
as  follows: 

(i)  The  dynamic  range  in  the  spectrum  is  about  six  orders 
of  magnitude. 

(ii)  The  background  in  the  spectrum  originates  from 
metastable  decay  of  highly  excited  molecular  second¬ 
ary  ions  in  the  analyzer. 

(iii)  The  high  mass  resolution  allows  separation  of  nearly 
all  peaks  at  each  nominal  mass. 

(iv) The base peak in the spectrum is the matrix signal
Si⁺. There are also intense peaks of SiH⁺ and
SiOH⁺.

(v) The Na⁺ and K⁺ peaks are relatively high, because of
the high ionization probabilities for alkali atoms in
SIMS.

(vi) Most of the molecular ions can be identified as ions of
the general compositions C_xNH_y, SiC_xH_y, C_xOH_y,
and Si_xO_yH_z. They can supply important information
on the molecular composition at the wafer surface.

The  quantification  of  metal  contaminants  is  based  upon 
the  assumption  of  a  functional  dependence  of  the  observed 
integrated  peak  intensities  on  the  corresponding  surface  cov¬ 
erage.  We  have  investigated  by  TOF-SIMS  the  same  metal 
covered Si wafers as by Laser-SNMS. Since the transformation
probability P(Me→Me⁺) is higher than P(Me→Me⁻)
for almost all metals, our investigations were restricted to
positive secondary ion spectra. The metal ion intensities
were normalized to the matrix signals ²⁸Si⁺ plus ²⁸Si¹⁶O⁺
(= ΣSi) in order to minimize the effect of small variations in the
experimental conditions. Figure 12 presents as an example the
results for Ni. A good linear correlation (correlation coefficient
R = 0.999) was found in the concentration range between
some  10^  cm“^  and  10^^  cm“^.  This  was  confirmed  also  for 
Table III. TOF-SIMS: Relative sensitivity factors and detection limits for metals on Si wafer surfaces (30 min
UV/ozone treated). Detection limits may originate from different factors (see Sec. V B 1). Entries marked ? are
illegible in the source.

Element   Relative sensitivity factor   Detection limits in cm⁻²   Limited by
          S(Me, ΣSi)
Al        1.6                           5×10^8                     background
Ti        0.9                           3×10^?                     background
V         1.1                           2×10^9                     useful yield
Cr        1.5                           1×10^9                     useful yield
Fe        1                             3×10^9                     useful yield
Co        0.5                           2×10^9                     useful yield
Ni        0.4                           4×10^9                     useful yield, isotopy
Cu        0.5                           5×10^9                     useful yield, isotopy
Ga        1.8                           2×10^9                     useful yield, isotopy
As        0.1                           2×10^10                    useful yield
Mo        0.2                           ?                          useful yield, isotopy
W         -                             >1×10^12                   useful yield, isotopy

V, Co, and Cu. Assuming the same linear behavior for the
other investigated metals one can define RSFs S(Me, ΣSi) as
in Laser-SNMS. Table III presents the values for S(Me,
ΣSi) and the corresponding detection limits.

The  concentration  of  W  on  the  reference  wafer  is  below 
the  TOF-SIMS  detection  limit  for  this  metal.  A  higher  sec¬ 
ondary  ion  yield  is  expected  for  the  oxidized  W  atoms 
(WO~)  in  the  negative  spectrum.  Similar  considerations  hold 
for  As. 

The  RSFs  presented  in  Table  III  are  valid  only  for  the 
chemically  well  defined  oxidized  Si  surface  with  metal  cov¬ 
erages  in  the  submonolayer  range.  For  higher  surface  con¬ 
centrations  of  the  metals  (>1X10^^  cm“^),  sputter  yields  as 
well  as  ionization  probabilities  may  be  affected. 

V.  DISCUSSION 
A.  Fundamental  aspects 
1. Sample preparation

Controlled  ion  beam  sputter  deposition  was  applied  for 
preparation  of  all  investigated  standards.  The  surface  concen¬ 
tration  of  metals  prepared  by  such  a  sputter  deposition  is 
determined  by: 

(i)  The  flux  of  sputtered  particles. 

(ii)  The  sticking  probability  of  the  metal  atoms  arriving  at 
the  surface. 

(iii)  The  behavior  of  the  metal  atoms  on  the  surface  after 
their  deposition  (surface  diffusion,  cluster  formation, 
diffusion  into  the  surface  oxide,  desorption,  etc.). 

For  describing  the  coverage  of  the  wafer  surface  by  sput¬ 
tered  metal  atoms  [Equation  (3)]  the  following  assumptions 
were  made: 

(i)  The  flux  of  sputtered  metal  atoms  is  determined  by  the 
composition  of  the  target,  the  sputter  yield  of  the  respective 
species,  and  their  angular  distribution.  Sputter  yields  have 
been applied according to Matsunami et al. These yields
are calculated by an empirical formula with best fit of the
parameters to the available experimental data. They are assumed
to be accurate within 20%. The calculated total sputter
yields are valid for normal incidence of the projectile


particles.  The  dependence  of  these  yields  on  the  angle  of 
incidence  of  the  projectiles  was  considered  as  well  as  the 
dependence  of  the  angular  distribution  of  sputtered  particles 
by  the  factor  a  in  Equation  (3).  The  value  of  a  has  been 
found  to  be  about  2  for  all  sputtered  targets  by  comparing  the 
calculated  surface  coverages  and  those  determined  by  TXRF. 
The  good  agreement  between  the  calculated  values  for  the 
surface  coverages  according  to  Equation  (3)  and  those  deter¬ 
mined  by  TXRF  confirms  an  adequate  modelling  of  sputter¬ 
ing  by  the  equations  used. 

(ii)  The  assumption  of  sticking  probabilities  near  1  for  all 
sputter  deposited  metals  on  a  Si  wafer  surface  is  a  priori  not 
justified.  However,  the  good  agreement  between  the  calcu¬ 
lated  surface  coverages  and  those  determined  by  TXRF  sup¬ 
ports  this  assumption.  A  possible  sticking  probability  of  less 
than  1  is  considered  by  the  correction  factor  a  in  Equation 
(3).  This  factor  has  been  found  to  be  nearly  the  same  for  all 
deposited  metals  indicating  that  the  sticking  probability  does 
not  depend  on  the  metal  species.  This  all  indicates  a  value  of 
1  for  the  sticking  probability. 

(iii)  Thermal  desorption  of  sputter  deposited  submonolay¬ 
ers  of  metal  atoms  on  the  Si  wafer  surface  was  observed.  To 
understand  this,  the  structure  of  the  wafer  surface  must  be 
considered:  the  Si  wafer  is  covered  by  a  native  oxide,  which 
is  covered  by  a  so-called  contamination  layer,  containing  a 
wide  variety  of  hydrocarbons.  The  desorption  of  metal  atoms 
may  be  enhanced  by  the  formation  of  volatile  metallo- 
organic  compounds  on  this  type  of  surface.  This  is  supported 
by  the  observation  that  UV/ozone  treatment,  resulting  in  the 
transformation  of  all  metal  atoms  on  the  surface  into  corre¬ 
sponding  oxides,  stops  this  loss  of  metal  atoms. 

(iv)  The  metal  covered  Si  wafers  were  calibrated  by 
TXRF.  This  technique  features  an  information  depth  of  about 
10  monolayers,  compared  with  an  information  depth  of  the 
sputter  based  techniques  (Laser-SNMS  and  TOF-SIMS)  of 
less than 3 monolayers. This means that the results of TXRF
and  Laser-SNMS/TOF-SIMS  are  only  comparable  if  the 
metal  atoms  are  present  only  in  the  uppermost  monolayer 
and  not  in  or  under  the  oxide  layer.  This  has  been  confirmed 
by  appropriate  depth  profiling.  The  sputter  deposited  material 
was only detected in the uppermost monolayer. As a result of
this observation, diffusion of the sputter deposited metal atoms
into the native oxide could be excluded even after oxidation
by UV/ozone treatment.

With the exception of W, the surface coverages by all
deposited metals are adequately described by Equation (3),
i.e., they are in good agreement with the TXRF results. For
W, the TXRF measurements give a much lower surface concentration
than calculated. From the experimental results, it
could not be decided whether this difference originates from a
sticking probability of W atoms on the Si wafer surface
below 1, or whether the TXRF results are not correct in the case
of W. The latter may be possible, since for the TXRF measurements,
the samples were excited by Mo-Kα radiation,
and therefore, in contrast to other elements, W must be determined
by the weak Lβ line.

2.  Laser-SNMS  features 

The useful yield Y_u(A, X_i^q) is the most important quantity
describing the fraction of removed particles A which are
transformed into ions X_i^q available for the measurement. It is
defined by the following equation:

$$Y_u(A, X_i^q) = \frac{\text{number of detected ions of type } i \text{ and charge } q \text{ formed from } A}{\text{number of removed particles } A} \tag{5}$$


The RSFs give the relationship between the useful yields of
the respective metal and the useful yield of Si (Si⁺ plus
SiO⁺), if one neglects effects like selective sputtering. The
useful  yield  of  Si  from  a  UV/ozone  treated  wafer  surface  is 
assumed  to  be  about  2X10“"^  in  Laser-SNMS.  The  useful 
yields can be improved by one order of magnitude by reducing
the primary ion pulse length considerably below 1 μs.
The  fraction  of  sputtered  particles  in  the  ionization  volume  at 
the  moment  of  interaction  with  the  laser  pulse  is  then  opti¬ 
mized.  With  reduction  of  the  primary  ion  pulse  length,  how¬ 
ever,  the  absolute  number  of  particles  in  the  ionization  vol¬ 
ume  decreases  accordingly.  This  results  in  a  longer 
acquisition  time  for  a  spectrum  if  the  same  dynamic  range  is 
required. A primary ion pulse length of 1 μs used in our
experiments  is  a  compromise  between  a  reasonable  acquisi¬ 
tion  time  and  a  high  useful  yield. 

The  differences  in  RSFs  for  different  metals  stem  from 
three  phenomena:  preferential  sputtering,  differences  in  an¬ 
gular  and  energy  distribution,  and  variations  in  ionization 
probabilities  (see  Sec.  II).  Because  the  surface  coverage  by 
the  metals  on  the  samples  is  below  1X10^^  cm“^,  the  sputter 
process  is  dominated  by  the  silicon  oxide  matrix.  Therefore, 
differences  in  the  RSFs  for  different  metals  can  be  explained 
mainly  by  different  densities  of  sputtered  particles  in  the  ion¬ 
ization  volume  and  by  different  ionization  probabilities: 

(i)  The  density  of  sputtered  particles  is  inversely  propor¬ 
tional  to  the  velocity  of  the  sputtered  particles.  This  velocity 
is  inversely  proportional  to  the  square  root  of  the  mass,  as¬ 
suming  the  same  energy  distribution  for  the  sputtered  par¬ 
ticles.  Therefore,  the  RSFs  of  elements  with  higher  atomic 
number  are  in  general  larger  than  those  of  lighter  elements. 

(ii)  To  achieve  equal  ionization  probabilities  for  all  ele¬ 
ments  it  is  necessary  to  reach  ionization  saturation  levels.  In 
that  case,  all  particles  in  the  interaction  volume  are  available 
for the mass analysis, independent of the respective ionization
potential E or the generalized n-photon ionization cross
section $\sigma^{(n)}$. Since we are limited by the maximum laser
power density of approximately 1×10^^ W cm⁻² (248 nm) in
the focus, it was not possible to reach saturation levels for the
ionization of all elements, e.g., Al.

(iii)  When  the  ionization  saturation  level  is  obtained,  the 
ionization  probability  is  mainly  determined  by  the  size  of  the 
ionization  volume.  The  laser  power  density  in  the  wings  of 
the  focus  outside  the  maximum  focus  point  can  be  high 
enough to ionize elements with a high ionization cross section
$\sigma^{(n)}$. Therefore, the ionization volume for these elements
is larger than that for elements with a low ionization cross
section.
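
Point (i) can be illustrated with a short numerical sketch. It assumes, as the text does, the same energy distribution for all sputtered species, so that the density in the ionization volume scales with the square root of the mass. The function name, the choice of Si as the reference, and the tabulated atomic masses (standard values, not data from this work) are illustrative assumptions.

```python
import math

# Standard atomic masses in u (assumed reference values, not taken from this paper).
ATOMIC_MASS_U = {"Al": 26.98, "Si": 28.09, "Fe": 55.85, "Mo": 95.95, "W": 183.84}


def relative_density(element, reference="Si"):
    """Density of sputtered atoms in the ionization volume relative to the
    reference element, assuming equal energy distributions:
    density ~ 1/velocity ~ sqrt(mass)."""
    return math.sqrt(ATOMIC_MASS_U[element] / ATOMIC_MASS_U[reference])


if __name__ == "__main__":
    for el in ("Al", "Fe", "Mo", "W"):
        print(f"{el}: {relative_density(el):.2f} x the Si density")
```

This covers only the kinematic contribution to the RSFs; the ionization effects described in (ii) and (iii) are not modeled.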

The  RSFs  are  determined  by  calibration  with  the  stan¬ 
dards.  Problems  with  quantification  will  arise  if  the  investi¬ 
gated  sample  deviates  too  much  from  the  calibrated  standard, 
or  the  experimental  conditions  are  changed.  Because  only  Si 
wafer  surfaces  with  metal  contaminants  below  IX 10^^ 
cm”^  are  investigated,  the  first  limitation  is  not  important. 
Since  the  number  of  sputtered  particles  is  proportional  to  the 
primary  ion  dose  for  all  elements,  variation  of  the  primary 
ion dose has no influence on the RSFs. On the other hand,
variations  of  the  laser  power  density  between  different  mea¬ 
surements  may  influence  the  RSFs.  The  ionization  probabil¬ 
ity  of  elements,  whose  level  of  ionization  is  not  saturated, 
varies  more  with  the  laser  power  density  than  that  of  ele¬ 
ments  with  saturated  levels.  Another  influence  on  the  RSFs  is 
given  by  the  dependence  of  the  size  of  the  ionization  volume 
on  the  laser  power  density.  The  ionization  volume  increases 
with  greater  laser  power  density  in  the  wings  of  the  focus. 
This  effect  is  more  pronounced  for  elements  with  a  high 
ionization cross section $\sigma^{(n)}$ once the ionization is saturated
in  the  focus.  A  third  effect  influencing  the  sensitivity  of  an 
element  is  the  degree  of  dissociative  ionization  if  this  ele¬ 
ment  is  sputtered  as  a  molecule,  particularly  as  an  oxide,  e.g., 
SiO  and  Si.  The  extent  of  dissociation  increases  with  higher 
laser  power  density.  Under  defined  conditions  (laser  energy 
95-105 mJ at 248 nm, the same focusing optic for each
measurement), changes in the RSFs have been found to be
below 20%.

3. TOF-SIMS features

The  RSFs  for  TOF-SIMS  describe  the  relationship  be¬ 
tween  the  useful  yield  of  the  respective  metal  and  the  useful 
yield of Si (Si⁺ plus SiO⁺), if the differences in the sputter
yields  for  different  metal  atoms  are  neglected.  The  estimated 
useful  yield  for  Si  (UV/ozone  treated  wafer  surface)  is  found 
to  be  about  5X10”^.  In  contrast  to  Laser-SNMS,  this  yield 
strongly  depends  on  the  chemical  environment  of  the  sput¬ 
tered  atoms.  By  UV/ozone  treatment  this  chemical  environ¬ 
ment  was  stabilized  in  all  the  described  experiments.  Repro¬ 
ducible  oxidation  of  the  Si  wafer  surface  as  well  as  of  the 
metal  atoms  on  its  surface  was  achieved  by  UV/ozone  oxi¬ 
dation,  resulting  in  a  high  metal  ion  yield.  Under  these  con¬ 
ditions,  the  uncertainty  of  RSFs  was  below  10%. 


B.  Analytical  aspects 

The  usefulness  of  a  microanalytical  technique  is  mainly 
determined  by  its  detection  limits,  the  possibility  of  quanti¬ 
fication,  the  lateral  resolution,  and  the  acquisition  time.  The 
detection  limits  will  be  discussed  in  more  detail,  and  Laser- 
SNMS,  TOF-SIMS,  and  the  applied  reference  technique 
TXRF  will  be  compared  with  regard  to  the  features  men¬ 
tioned  above.  For  a  discussion  of  quantification  see  Secs. 
V  A  2  and  3. 


1. Detection limits

Absolute  detection  limits  in  Laser-SNMS  and  SIMS  are 
determined  by  the  total  amount  of  sample  material  A  avail¬ 
able  for  the  analysis.  The  fraction  of  this  sample  material 
which can be transformed into detected ions $X_i^q$ (after sputtering,
ionization, and transfer through the spectrometer) is
described by the useful yield $Y_u(A, X_i^q)$. The minimal detectable
surface coverage (detection limit) by an element A in a
matrix M by sputtering of one monolayer is given by

$$\theta_{\min}(A) = \frac{N_{\min}(X_i^q)}{Y_u(A, X_i^q)\,F}. \tag{6}$$

$N_{\min}(X_i^q)$ is the minimal number of detected ions $X_i^q$ required
to  identify  the  signal,  and  F  is  the  analyzed  area.  By  sputter¬ 
ing  of  one  monolayer  from  a  relatively  large  analyzed  area, 
F==1X10~^  cm^  and  an  useful  yield  of  T(^(A,Xf)  =  1 
X  10“^,  the  detection  limit  is  cm“^,  assuming  that 

10  ions  will  be  sufficient  to  identify  the  corresponding  peak. 
For  a  given  amount  of  sample  material,  this  detection  limit 
can  be  improved  by  two  means:  by  improving  the  overall 
transmission  of  the  spectrometer,  which  means  the  probabil¬ 
ity  that  a  generated  ion  is  detected  after  mass  separation,  or 
by  increasing  the  probability  for  a  sample  atom  to  be  trans¬ 
formed  into  an  ion  (transformation  probability). 
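
Equation (6) can be evaluated directly, as in the sketch below. The function name is illustrative, and the input values in the example run are hypothetical placeholders chosen only to show the scaling; they are not the values of this work.

```python
def detection_limit(n_min, useful_yield, area_cm2):
    """Minimal detectable surface coverage (atoms per cm^2) following Eq. (6):
    theta_min = N_min / (Y_u * F)."""
    if useful_yield <= 0 or area_cm2 <= 0:
        raise ValueError("useful yield and analyzed area must be positive")
    return n_min / (useful_yield * area_cm2)


if __name__ == "__main__":
    # Hypothetical inputs: 10 ions needed, useful yield 1e-3, analyzed area 1e-3 cm^2.
    base = detection_limit(10, 1e-3, 1e-3)
    better_yield = detection_limit(10, 1e-2, 1e-3)  # tenfold higher useful yield
    smaller_area = detection_limit(10, 1e-3, 1e-4)  # tenfold smaller analyzed area
    print(f"baseline:        {base:.2e} cm^-2")
    print(f"10x yield:       {better_yield:.2e} cm^-2")
    print(f"10x smaller area: {smaller_area:.2e} cm^-2")
```

A tenfold gain in useful yield (through transmission or transformation probability) lowers the detection limit tenfold, while a tenfold smaller analyzed area raises it by the same factor.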

Additional  limitations  in  sensitivities  are  given  by  back¬ 
ground  or  peak  interferences  in  the  spectrum,  and  the  acqui¬ 
sition  time  required  for  sputtering  of  the  sample  material 
available. These effects can considerably worsen (i.e., raise) the
detection limits for some elements.

In Laser-SNMS the quantification of the main isotopes of
Ti [Fig. 13(a)], Cr, and Fe [⁵⁶Fe⁺/²⁸Si₂⁺, Fig. 13(c)] is hampered
by mass interferences. For elements with higher masses,
mass interferences become less important since the transformation
probabilities of complex molecules into large molecular
ions decrease. Therefore, one is not restricted by
mass interferences for As⁺, Mo⁺ [Fig. 13(d)], and W⁺. The
detection limits of elements which are unhampered by background
and mass interferences are only limited by the available
data rates (≈4×10^ counts/s) [Fig. 13(b)]. For the experiments
reported here we were restricted by the time
needed to process the data. With faster data registration and
processing, it is now possible to improve the detection limits
for Laser-SNMS by more than one decade, provided one has
enough sample material and is not restricted by one of the
other effects mentioned above.

Fig. 13. Details of a Laser-SNMS spectrum (primary ion dose 4×10^®,
analyzed area 1×10“^ cm², postionization: λ = 248 nm,
I = 1×10^® W cm⁻²). The detection limits of V and Mo are limited by the
useful yields of these elements and the available data rates. The detection
limits of Ti and Fe are limited by mass interference.

In  TOF-SIMS,  one  is  hampered  by  a  significant  back¬ 
ground  in  the  spectrum  caused  by  metastable  ions  (Fig.  10). 
To  minimize  this  effect,  the  surface  concentration  of  organic 
compounds  can  be  reduced  by  UV/ozone  treatment.  Also, 
cleaning  by  moderate  presputtering  (primary  ion  fluence 
^2X 10^^  cm~^)  has  proven  to  be  useful  and  acceptable.  The 
detection limits in TOF-SIMS are then restricted by the
maximum achievable dynamic range of six decades in the spectrum.


2. Comparison of Laser-SNMS, TOF-SIMS, and TXRF

Regarding quantification of metal trace impurities on Si
wafers, there are only marginal differences between TOF-SIMS
and Laser-SNMS. The detection limits are comparable,
with some advantages for the heavier elements (m > 70 u) in
Laser-SNMS. The accuracy of the measurements is slightly
better in TOF-SIMS (error < 10%) than in Laser-SNMS (error
< 20%). This is mainly caused by the more critical experimental
setup of Laser-SNMS: variations in the laser
power density or in the location and size of the laser focus
may affect the measurements. TOF-SIMS
additionally supplies information on molecular surface species,
but spectral interpretation may be more difficult for
complex surface compositions. The main advantage of
Laser-SNMS is that even for drastic changes in the chemical
environment of the sputtered particles, this technique still
allows a straightforward quantification in element analysis.

Fig. 14. High mass resolution secondary ion spectrum at 56 u (Ga⁺ liquid
metal ion source, primary ion fluence 2×10^^ cm⁻²) from a 1 μm² area of
a Si wafer covered by 1.4×10^^ cm⁻² Fe atoms.

Comparing  the  surface  mass  spectrometries  Laser-SNMS 
and  TOF-SIMS  with  TXRF,  the  detection  limits  of  the  sput¬ 
tering  based  techniques  for  metal  impurities  on  Si  wafers  are 
lower  than  those  of  TXRF.  The  sensitivities  are  only  compa¬ 
rable  for  the  transition  metals  Fe  up  to  Zn.  For  the  light 
elements, TXRF is restricted by the signal-to-noise ratio due
to  the  tailing  of  the  Si  peak.  The  sensitivity  for  the  heavy 
elements  is  low  because  they  must  be  detected  by  their 
Lβ line when the sample is excited by Mo-Kα or W-Lβ.
The  accuracy  of  TXRF  is  slightly  better  than  that  of  Laser- 
SNMS  and  TOF-SIMS.  Laser-SNMS  and  TOF-SIMS  have 
monolayer  sensitivity  whereas  the  information  depth  of 
TXRF  is  about  3  nm.  In  addition,  TOF-SIMS  also  gives  im¬ 
portant  information  on  the  molecular  surface  composition. 
An  advantage  of  TXRF  is  the  easy  implementation  of  the 
technique.  There  is  no  need  to  place  the  sample  under 
vacuum,  and  the  analysis  is  almost  automatic.  Additionally, 
TXRF  provides  a  nearly  standard-free  calibration.  Besides 
their  high  sensitivities  for  almost  all  elements,  an  important 
advantage  of  the  sputtering  based  techniques  is  the  high  lat¬ 
eral  resolution  which  is  only  limited  by  the  primary  ion  spot 
diameter (<100 nm for a Ga⁺ liquid metal ion source) in
contrast to the large area (about 1 cm²) required in TXRF.
The detection limits in Laser-SNMS and TOF-SIMS scale
inversely with the size of the analyzed area [Eq. (6); Fig.
14]. Both techniques allow surface imaging,^^ depth profiling,
and 3D analysis of microstructures.

VI.  OUTLOOK 

The  calibration  procedure  described  in  this  paper  is  not 
restricted  to  metal  contaminants  on  silicon  wafer  surfaces.  It 
can  easily  be  extended  to  the  quantitative  determination  of 
trace metals on other semiconductors like GaAs.^^ For preliminary
quantification, it has been shown to be sufficient to
calibrate  standards  for  TOF-SIMS  by  Laser-SNMS  with  the 
RSFs  determined  for  metal  contaminants  on  Si.  For  more 
accurate  quantification,  it  is  necessary  to  calibrate  the  stan¬ 
dards  by  an  additional  reference  technique,  such  as  TXRF. 
For  other  substrates  like  biological  samples  or  polymers, a 
similar  calibration  procedure  may  be  applied.  Future  investi¬ 
gations  have  to  show  the  possibility  of  producing  metal  stan¬ 
dards  by  sputter  deposition  on  these  kinds  of  surfaces.  A 
further  lowering  of  detection  limits  of  trace  metals  on  wafer 
surfaces  may  be  achieved  by  the  combination  of  TOF-SIMS 
and  Laser-SNMS  with  the  technique  of  vapor  phase  decom¬ 
position  (VPD)  as  it  is  done  with  TXRF.^^ 


The scattering effects (specifically LO-phonon scattering) in a 45 Å AlAs/80 Å GaAs/33 Å AlAs
asymmetric double barrier resonant tunneling (ADBRT) structure with a short period
GaAs/Al0.3Ga0.7As superlattice incorporated on one side of the double barrier have been studied and
characterized. Enhanced levels of current conduction were produced in the ADBRT due to the
superlattice miniband electron transport under forward bias. The effect of the superlattice
on the phonon scattering phenomena exhibited by the entire device was subsequently examined.
Magnetic field fan diagrams at 4.2 K under reverse bias showed a new feature at an energy of 22
meV that could be explained on the basis of previously unreported GaAs LO-phonon scattering
processes from the first excited emitter level. Finally, quenching of phonon-assisted tunneling in
reverse bias on decreasing the period of the superlattice was also observed. © 1996 American
Vacuum Society.


I.  MOTIVATION 

The  concept  of  double  barrier  resonant  tunneling  (DBRT) 
devices  has  undergone  significant  development  since  the  first 
theoretical work by Esaki and Tsu in 1970 and the first prototype
GaAs/Al0.4Ga0.6As resonant tunneling diode in 1974.^
Since  then,  DBRT  structures  have  found  potential  use  in 
many  devices  such  as  high-frequency  oscillators,^  resonant 
tunneling  transistors  for  binary  as  well  as  multivalued  logic, 
quantum  interference  devices,^  photodetectors,  light-emitting 
diodes,  and  laser  diodes."*  The  models  used  to  analyze  and 
explain  the  physics  behind  the  unique  property  of  carrier 
(electron)  tunneling  have  grown  more  sophisticated,  and  at¬ 
tempts  have  been  made  to  include  all  the  various  effects  that 
contribute  to  the  experimentally  observed  current-voltage 
characteristics — particularly  the  property  of  negative  differ¬ 
ential  resistance  (NDR). 

The  numerically  calculated  values  of  the  peak-to-valley 
current  ratios  (PVCR)  of  DBRT  devices  are  normally  higher 
than  the  experimentally  observed  values  by  an  order  of  mag¬ 
nitude  (or  more).  This  is  due  to  the  fact  that  the  theoretical 
models  of  double  barrier  structures  do  not  generally  take  into 
account  all  of  the  real  scattering  processes  during  tunneling. 
Not  only  do  these  processes  produce  lower  peak  currents  but 
they  also  have  been  thought  to  be  the  cause  of  excess  valley 
currents — a  phenomenon  that  has  not  yet  been  completely 
understood.  Some  phenomenological^’^  and  some  highly  rig¬ 
orous  models^’^  have  recently  achieved  moderate  success  in 
explaining  these  processes.  Therefore,  characterization  and 
study  of  these  phenomena  are  needed  in  order  to  correctly 
incorporate  their  effects  into  the  theory  of  resonant  tunnel¬ 
ing.  The  study  of  phonon  scattering  is  important  because  it  is 
the  dominant  scattering  mechanism  in  electron  transport.^’^ 
The  study  of  phonon  scattering  effects  using  asymmetric 
double  barrier  (ADBRT)  structures  is  particularly  suitable 


because  these  structures  have  been  found  to  exhibit  enhanced 
phonon-assisted  tunneling.**^  It  has  been  seen  that  the  PVCRs 
of  symmetric  double  barrier  structures  can  be  increased  by 
engineering  their  emitter  state — e.g.,  by  growing  a  short- 
period  superlattice  in  the  emitter  region.**  Although  ADBRT 
structures  normally  do  not  exhibit  high  PVCRs,  they  show 
enhanced  PVCRs  when  grown  with  a  short  period  superlat¬ 
tice  in  their  emitter  structure.*^  Under  this  new  configuration, 
however, the question of how phonon scattering is manifested
has not been examined. Therefore, this article presents
a first study of phonon scattering in an asymmetric
double barrier structure grown with a variable-period superlattice
on one side of the double barrier. In this configuration,
under  forward  bias,  the  electron  current  flows  through  the 
superlattice  before  the  ADBRT  structure — thus  the  superlat¬ 
tice  is  in  the  emitter  region  of  the  device.  However,  under 
reverse  bias,  the  electron  current  flows  through  the  superlat¬ 
tice  after  the  ADBRT  structure — the  superlattice  now  falls  in 
the  collector  region  of  the  device.  The  effect  of  both  bias 
conditions  has  been  studied  in  this  experiment. 


II.  THEORETICAL  BACKGROUND 

Fig. 1. Layer schematic of the asymmetric DBRTDs with superlattice emitters. Three types of five-period GaAs/Al0.3Ga0.7As superlattices were grown with
layer thicknesses of 50, 30, and 15 Å, respectively. (Layer labels legible in the schematic: 300 Å n+ GaAs, Si doped; 2 μm n+ GaAs, Si doped; n+ GaAs substrate, (100) direction.)

Phonon-assisted tunneling (PAT) can be explained as the
tunneling of electrons with emitter state energies greater than
that of the well state by losing the excess energy through the
emission of a phonon. These are optical phonons, and their
dispersion curve exhibits a nonzero energy at zero wave
vector. These phonons are primarily longitudinal in character,
as shown in Ref. 13. In the I-V characteristics, the presence
of satellite peaks (shoulders) beside the main resonant peak
has been attributed to PAT.*"* This phonon contribution in the
valley current can be further studied under the effect of a
quantizing magnetic field.^’^^ The Schrödinger equation in
the presence of a magnetic field B in the growth direction z
is written as follows:

$$-\left(\frac{[P_x - (e/c)BY]^2}{2m^*(z)} + \frac{P_y^2}{2m^*(z)} + \frac{P_z^2}{2m^*(z)}\right)\psi + [E - V(z)]\psi = 0, \tag{1}$$

where  P  is  the  momentum  operator  and  Y  is  the  y  component 
of  the  position  operator. 

The  total  energy  is  now  given  as  follows: 

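$$\epsilon = \epsilon_l + \left(n + \tfrac{1}{2}\right)\hbar\omega_c, \qquad n = 0, 1, 2, \ldots, \tag{2}$$

where $\omega_c = (e/m^*)B$ is the cyclotron frequency and $\epsilon_l$ is
the energy of the quantum well state.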
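
As a rough numerical illustration of Eq. (2), the sketch below evaluates the Landau level spacing in SI units. The GaAs effective mass of 0.067 m_e is a standard literature value assumed here, not a parameter quoted in this article, and the function names are illustrative.

```python
HBAR = 1.054571817e-34       # reduced Planck constant, J s
E_CHARGE = 1.602176634e-19   # elementary charge, C
M_E = 9.1093837015e-31       # electron rest mass, kg


def cyclotron_energy_mev(b_tesla, m_eff=0.067):
    """hbar*omega_c in meV, with omega_c = e*B/m* in SI units.
    m_eff is the effective mass in units of m_e (0.067 assumed for GaAs)."""
    omega_c = E_CHARGE * b_tesla / (m_eff * M_E)
    return HBAR * omega_c / E_CHARGE * 1e3


def landau_energy_mev(n, b_tesla, e_well_mev=0.0, m_eff=0.067):
    """Total energy of Eq. (2): well-state energy plus (n + 1/2)*hbar*omega_c."""
    return e_well_mev + (n + 0.5) * cyclotron_energy_mev(b_tesla, m_eff)


if __name__ == "__main__":
    for b in (1.0, 4.0, 7.0):
        print(f"B = {b:.0f} T: hbar*omega_c = {cyclotron_energy_mev(b):.2f} meV")
```

Under this assumption the level spacing grows by roughly 1.7 meV per tesla, which sets the scale for the field-induced splittings discussed below.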

When  phonon  emission  (scattering)  takes  place,  it  can  be 
modeled  as  a  change  p  =  n  —  n'  in  the  eigenstate  of  the  har¬ 
monic  oscillator  derived  above.  The  difference  in  the  ener¬ 
gies  of  the  phonon  scattering  events  and  the  resonant  tunnel¬ 
ing  events  is  related  to  the  phonon  energy  as  follows: 

$$\Delta\epsilon = p\hbar\omega_c + i\hbar\omega_{LO}, \tag{3}$$

where  i  is  an  integer  equal  to  0  for  elastic  scattering  and  1  for 
LO  phonon  scattering  and  p  is  an  integer  corresponding  to 
the  change  in  Landau  level  due  to  scattering  effects.  The 
resonant tunneling peak in the I-V characteristics corresponds
to i = 0 and p = 0. (The case p = 1 in resonant tunneling
has also been observed and has been attributed to interface
scattering.) Phonon scattering peaks, on the other hand,
correspond to i = 1 and any integer p. Multiphonon peaks
are also possible (i > 1), but no experimental evidence has
been found for them as yet.
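
The structure of a fan implied by Eq. (3) can be sketched as follows. The cyclotron energy per tesla (1.73 meV/T, corresponding to an assumed GaAs effective mass of 0.067 m_e) is an assumption, and the default LO-phonon energy of 36 meV is the GaAs value used later in this article; the function names are illustrative.

```python
HW_C_PER_TESLA_MEV = 1.73  # assumed hbar*omega_c per tesla for m* = 0.067 m_e
HW_LO_GAAS_MEV = 36.0      # GaAs LO-phonon energy in meV


def fan_energy_mev(p, i, b_tesla, hw_lo_mev=HW_LO_GAAS_MEV):
    """Eq. (3): energy offset from the main resonance,
    p*hbar*omega_c + i*hbar*omega_LO, in meV."""
    return p * HW_C_PER_TESLA_MEV * b_tesla + i * hw_lo_mev


def fan(i, p_values, b_values):
    """One fan: for each Landau-index change p, the offsets along a field sweep."""
    return {p: [fan_energy_mev(p, i, b) for b in b_values] for p in p_values}


if __name__ == "__main__":
    fields = [0.0, 2.0, 4.0, 6.0]
    for p, offsets in fan(1, [0, 1, 2], fields).items():
        print(p, [round(x, 1) for x in offsets])
```

All branches of a fan converge at zero field on the energy i·ħω_LO, which is why the zero-field origin of a fan identifies the phonon involved.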

The energy of the electron is proportional to the total
voltage drop across the device (taking the accumulation and
depletion drops into account). The difference Δε is therefore
proportional to the difference in the applied voltage (ΔV)
between the resonant tunneling and PAT peaks; taking this
difference eliminates the accumulation and depletion drops, provided the
space-charge effects and charge buildup in the well are not
large.^’^^ On plotting the applied voltage at the position of the
PAT peaks versus the magnitude of the magnetic field (B),
we obtain experimental fan diagrams that yield the linear
relation between Δε and ΔV. In addition, the phonon energies
calculated from the fan diagrams (discussed in a later
section) represent the different types of phonon scattering
processes taking place in these structures.
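
One simple way to locate the zero-field origin of a fan is to fit a straight line to each branch and average the intercepts, as sketched below. This is an illustrative procedure under the assumption that each branch is linear in B; it is not the numerical method of Ref. 16, and the conversion from intercept voltage to energy is not reproduced here. All names are hypothetical.

```python
def fit_line(x, y):
    """Ordinary least-squares fit y = a + b*x; returns (a, b)."""
    n = len(x)
    if n < 2 or n != len(y):
        raise ValueError("need at least two points and equal-length inputs")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    if sxx == 0:
        raise ValueError("x values must not all be identical")
    slope = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y)) / sxx
    return mean_y - slope * mean_x, slope


def fan_origin(branches):
    """Average zero-field intercept (same unit as the voltages) of the
    branches of one fan; branches is a list of (fields, voltages) pairs."""
    intercepts = [fit_line(fields, volts)[0] for fields, volts in branches]
    return sum(intercepts) / len(intercepts)


if __name__ == "__main__":
    # Synthetic branches (not measured data): common origin 200 mV, different slopes.
    fields = [2.0, 4.0, 6.0]
    branches = [(fields, [200.0 + s * b for b in fields]) for s in (10.0, 20.0, 30.0)]
    print(f"fan origin: {fan_origin(branches):.1f} mV")
```

The averaged intercept plays the role of ΔV for the fan, measured from the main resonant peak.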

III.  EXPERIMENT 

In this study, three sample structures were grown, as
shown in Fig. 1. Briefly, each structure consisted of a graded
doped emitter region, a variable composition five-period superlattice,
the ADBRT structure of an 80 Å GaAs well with
45 and 33 Å AlAs barriers, and a graded doped collector
(listed in the order of growth). The superlattice had five periods
of 50 Å GaAs/50 Å Al0.3Ga0.7As layers or 30 Å
GaAs/30 Å Al0.3Ga0.7As layers or 15 Å GaAs/15 Å
Al0.3Ga0.7As layers in the three samples, respectively. Therefore,
the structures are essentially distinct from one another
as each has a uniquely designed superlattice grown on one
side of the double barrier. As the final step, a 1 μm heavily
doped GaAs cap layer was deposited to allow ohmic contact
to be made to the device.

Fig. 2. Measured I-V curve of the ADBRT structure with a 50 Å GaAs/50
Å Al0.3Ga0.7As superlattice at 77 K showing the forward and reverse bias
characteristics.

J. Vac. Sci. Technol. B, Vol. 14, No. 4, Jul/Aug 1996

Banoo et al.: Phonon scattering in ADBRT structure

I-V characteristics were measured for all the samples
described above both at 77 K (without magnetic field) and at 4 K
(with magnetic field). The bias conditions used in these measurements
have been defined at the end of Sec. I. In the case
of forward bias, the superlattice forms a part of the emitter,
and in the case of reverse bias it falls in the collector region.
All structures probed had a device area of 1.963×10“^ cm²,
defined by the deposition of Ge-Ni-Au contacts; the area is
needed to calculate current densities from the absolute values
of the measured current. Measurements at 77 K showed resonant tunneling
and phonon-assisted tunneling in the reverse bias. None of
the samples showed resonant tunneling or phonon-assisted
tunneling in the forward bias (see Fig. 2).


Detailed  magnetic  field  measurements  were  taken  at  4.2  K 
under  reverse  bias  to  study  the  effect  of  the  superlattice  on 
phonon  scattering.  These  measurements  were  not  done  in 
forward  bias  due  to  the  enhanced  forward  current  conduction 
through  the  samples — which  has  been  further  described  in 
the  results. 

Under reverse bias the effect of the quantizing magnetic
field on phonon-assisted tunneling in the 50 Å superlattice
sample can clearly be seen in Fig. 3. The position of the
peaks in the I-V characteristics is obtained by calculating
the second derivative d²I/dV² to more clearly distinguish the
features in current variation, as shown in Fig. 4. Thus the
positions of the phonon peaks can be noted simply from the
position of the maxima in the second derivative. These calculations
are repeated for each I-V curve obtained with different
magnitudes of B, and the evolution of the phonon
peaks with magnetic field is plotted as a fan diagram. The fan
diagram of the ADBRT structure with a 50 Å GaAs/50 Å
Al0.3Ga0.7As superlattice is shown in Fig. 5. Three fans are
observed starting at ΔV of 196, 343, and 489 mV, respectively,
from the main resonant peak. These correspond to
energies Δε of 20.2, 36.7, and 47.3 meV using the numerical
method described in Ref. 16. Here we clearly see evidence of
GaAs LO-phonon scattering ($\hbar\omega_{LO} \approx 36$ meV) and AlAs LO-phonon
scattering ($\hbar\omega_{LO} \approx 50$ meV) events.
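
The peak-finding step can be sketched with a central-difference second derivative on a uniformly spaced voltage grid. The sketch follows the text in reading peak positions from maxima of the plotted second derivative; the uniform grid, the absence of smoothing, and the function names are assumptions, and measured data would normally need noise filtering first.

```python
def second_derivative(v, i):
    """Central-difference estimate of d2I/dV2 at the interior points of a
    uniformly spaced voltage grid v (result has len(v) - 2 entries)."""
    if len(v) != len(i) or len(v) < 3:
        raise ValueError("need equal-length inputs with at least three points")
    h = v[1] - v[0]
    return [(i[k + 1] - 2.0 * i[k] + i[k - 1]) / h**2 for k in range(1, len(v) - 1)]


def local_maxima(v, d2):
    """Voltages at which d2 (defined on v[1:-1]) has a strict local maximum."""
    inner_v = v[1:-1]
    return [
        inner_v[k]
        for k in range(1, len(d2) - 1)
        if d2[k - 1] < d2[k] > d2[k + 1]
    ]


if __name__ == "__main__":
    # Synthetic curve (not measured data) whose second derivative peaks at V = 0.
    volts = [k * 0.1 for k in range(-10, 11)]
    current = [-(x**4) for x in volts]
    print(local_maxima(volts, second_derivative(volts, current)))
```

Repeating this for each magnetic field value gives the peak positions that are collected into a fan diagram.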

Phonon-assisted tunneling in the ADBRT structure with a
30 Å GaAs/30 Å Al0.3Ga0.7As superlattice is illustrated in its
fan diagram (Fig. 6). Two fans are observed starting at 184.8
and 351.2 mV from the main resonant peak, which yield Δε
of 21.9 and 36.6 meV, respectively. Again $\hbar\omega_{LO} \approx 36$ meV
corresponds to GaAs LO-phonon scattering events. Likewise,
phonon-assisted tunneling in the ADBRT structure with
a 15 Å GaAs/15 Å Al0.3Ga0.7As superlattice is illustrated in
its fan diagram (Fig. 7). Two fans are observed starting at
137 and 254 mV, yielding Δε of 22.3 and 33.9 meV, respectively.
The presence of GaAs LO-phonon scattering is shown
by the second fan.

In each case, an anomalous fan of energy around 22 meV
(20.2, 21.9, and 22.3 meV for the 50, 30, and 15 Å samples,
respectively) was observed. These fan energies have not been
previously reported in the literature. The presence of an
anomalous line in the fan diagram may be explained as an
artifact of the mathematical analysis of the data. It is possible
for the second derivative of the current to show a local maximum
when the phonon-assisted tunneling starts to contribute
to the valley current. These peaks in the second derivative
would therefore appear as a line in the fan diagram. However,
the anomalous effect observed in our study was a collection
of lines that showed the characteristic structure of a
fan exhibiting an energy of 22 meV. Clearly, this indicates a
phonon scattering process. The difference in energy can be
explained by considering PAT not from the ground emitter
energy level but from the first excited emitter level into the
well state^^ when the superlattice is in the collector. It is
known that the energies of the incoming electrons are those
of the bound states formed due to band bending in the emitter.
By accounting for the energy of the first excited state in
the emitter of the ADBRT structure (≈12 meV with respect
to the ground state in the emitter), the phonon energy obtained
is 34 meV, which is evidently the GaAs LO-phonon
energy. Therefore, this analysis strongly suggests the occurrence
of phonon scattering events from the first excited state
of the emitter into the well state. Such an effect is almost
insignificant in the absence of a superlattice in the collector.

Fig. 3. Reverse bias I-V characteristics of the ADBRT structure with a 50 Å GaAs/50 Å Al0.3Ga0.7As superlattice at 4.2 K for magnetic fields varied from
0 to ^7 T.

Fig. 4. Plots of d²I/dV² of the ADBRT structure with a 50 Å GaAs/50 Å Al0.3Ga0.7As superlattice in reverse bias at 4.2 K with different magnetic fields.
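
The arithmetic behind this assignment can be checked per sample. The sketch below adds the roughly 12 meV offset of the first excited emitter state to each anomalous fan energy and compares the result with the GaAs LO-phonon energy of about 36 meV; the dictionary keys and function name are illustrative.

```python
# Anomalous fan energies in meV for the 50, 30, and 15 A superlattice samples.
ANOMALOUS_FAN_MEV = {"50 A": 20.2, "30 A": 21.9, "15 A": 22.3}
EXCITED_LEVEL_OFFSET_MEV = 12.0  # first excited emitter state above the ground state
HW_LO_GAAS_MEV = 36.0            # GaAs LO-phonon energy


def phonon_energy_from_excited_level(fan_energy_mev, offset_mev=EXCITED_LEVEL_OFFSET_MEV):
    """Phonon energy implied when the tunneling electron starts from the
    first excited emitter level instead of the ground level."""
    return fan_energy_mev + offset_mev


if __name__ == "__main__":
    for sample, fan_mev in ANOMALOUS_FAN_MEV.items():
        e_ph = phonon_energy_from_excited_level(fan_mev)
        print(f"{sample}: {e_ph:.1f} meV (GaAs LO reference {HW_LO_GAAS_MEV:.0f} meV)")
```

All three samples then give phonon energies within a few meV of the GaAs LO-phonon energy, consistent with the interpretation above.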

IV.  DISCUSSION 

Realistically, the resonant tunneling process through a
complex structure such as a superlattice-ADBRT diode is
extremely complicated.^^ It occurs under the alignment of
the energy levels in different regions of the entire
structure, a critical condition that is affected by many factors
besides the applied bias. First, the energy level configuration
of the entire system has a crucial dependence on the
quality of the interface (its roughness) and fluctuations in
layer thicknesses. Variations in these parameters tend to prevent
the formation of a miniband and localize the electron
wave functions in the wells. The electron tunneling in such
a case will be incoherent (or sequential) if at all. Another
aspect to consider is the coupling between the superlattice
and the ADBRT structure. From the measured reverse bias
characteristics of the device, we can conclude that the superlattice
and the ADBRT structure are essentially decoupled.
(For example, this decoupling is very important in the use of
the superlattice as an electron launching mechanism, independent
of the operation of the ADBRT diode.)

Fig. 5. Fan diagram of the phonon peaks of the ADBRT structure with a 50 Å GaAs/50 Å Al0.3Ga0.7As superlattice calculated at 4.2 K. The arrows indicate
phonon energies of the respective fans obtained after calculating them from the voltage intercept.

Fig. 6. Fan diagram of the phonon peaks of the ADBRT structure with a 30 Å GaAs/30 Å Al0.3Ga0.7As superlattice calculated at 4.2 K. The arrows indicate
phonon energies of the respective fans obtained after calculating them from the voltage intercept.

Under forward bias, the absence of NDR may be due to
increased current levels caused by collective transport through
the superlattice minibands. If the NDR peaks at a current
much higher than the current measuring capability of our
apparatus, then only the onset of resonant tunneling in the
forward direction can be observed, as an exponentially
increasing I-V curve. This is seen to be the case for the I-V
characteristics of all our samples in forward bias.

Under reverse bias, all the samples show resonant tunneling,
which is to be expected because the ADBRT structure has a
predominant effect on the electron current in the reverse
bias configuration. The presence of the superlattice in the
collector region makes the "effective" collector of the
double barrier well weakly transmitting. Small collector
transmissivity has been found to produce enhanced scattering
in the ADBRT structure and could therefore explain
the prominent PAT observed in reverse bias in our
samples. Enhanced scattering could also explain the indications
of phonon scattering from the first excited emitter level,
which has not been observed previously.

The fan diagrams also show decreasing levels of PAT in
reverse bias with decreasing layer thickness of the
GaAs/Al0.3Ga0.7As superlattice. Therefore, it can be concluded
that a superlattice with an optimally thin period can quench
PAT in reverse bias while producing high PVCRs in
forward bias.

V.  CONCLUSIONS 

We studied the I-V characteristics of an ADBRT structure
with three superlattice structures of different periods in


Fig. 7. Fan diagram of the phonon peaks of the ADBRT structure with a 15 Å GaAs/15 Å Al0.3Ga0.7As superlattice calculated at 4.2 K. The arrows indicate
phonon energies of the respective fans obtained after calculating them from the voltage intercept.
the emitter as well as the collector. Collective transport
through the superlattice minibands in the emitter was found
to produce enhanced current conduction in forward bias,
and thus no NDR could be observed. Resonant tunneling and
PAT were observed in reverse bias, with PAT quenched
as the period of the superlattice in the collector was
decreased. Magnetic field studies of the reverse bias PAT
exhibited a new feature in the fan diagram that could be
explained on the basis of phonon scattering processes from
the first excited emitter level.
Extra facet generation during molecular beam epitaxy of GaAs/AlGaAs multilayers on (001) GaAs
substrates patterned with stripes running in the [1̄10], [110], and [100] directions and having various
slopes, designated as "[1̄10] stripe," "[110] stripe," and "[100] stripe," respectively, has
been systematically investigated for the first time. It has been confirmed that extra (111)A and
(114)A facets are generated on the [1̄10] stripes, extra (111)B and (113)B facets on the [110] stripes,
and extra (031), (011), (045), and (013) facets on the [100] stripes, depending on the intersection
angle θ of the sidewall and the substrate plane. Among them, the (114)A, (113)B, and (013) facets
are important because they persist over a wide range of θ and therefore play a detrimental role in
forming flat and uniform sidewall layers on the (001) patterned substrates. It has been made clear
that no extra facets are generated on the sidewalls, and flat and uniform layers maintaining the initial
as-etched stripe patterns can be grown, for the [1̄10] stripes with 10°≥θ, the [110] stripes with
24°≥θ≥20°, and the [100] stripes with 19°≥θ. Extra (111)A-related facets have developed on the
(111)A-related intersection of the equivalent [100] and [010] stripes, while no extra facets have
developed on the (111)B-related intersection. The facet generation behavior has been compared
between the (001) patterned substrates and previously studied (111)A patterned substrates. © 1996
American Vacuum Society.


I.  INTRODUCTION 

In our recent article, facet generation behavior during
molecular beam epitaxy (MBE) of GaAs/AlGaAs multilayers on
GaAs (111)A substrates patterned with ridge-type triangles
was investigated for the first time, and it was elucidated how
the behavior could be utilized, under sidewall control, for the
formation of novel device structures on the (111)A patterned
substrates. In this respect, many articles have been published
on crystal growth and microstructure formation on
(001) patterned substrates. Almost all of these articles
have dealt with growth on stripes running in the [1̄10]
direction with exact (111)A or (112)A-(113)A sidewalls.
Several articles have discussed growth on stripes running in
the [110] direction and, very recently, on stripes running
in the [100] direction with the (010) sidewall. To the
best of our knowledge, however, no articles have been published
that systematically study growth on stripes with a wide range
of sidewall slopes, or on intersections of two stripes running
in orthogonal directions, from the point of view of facet
generation control for device application.

In the present article, facet generation during MBE of
GaAs/Al0.3Ga0.7As multilayers on the stripes running in the
Itami, Hyogo 664, Japan; Electronic mail: takebe@ritami.sumiden.co.jp
Present address: Fundamental Research Department, Central Research
Laboratory, Technology Division, Mitsubishi Cable Industries, Ltd., 4-3
Ikejiri, Itami, Hyogo 664, Japan.

Present address: Development Section I, Devices Development Department,
Technical Administration Division, Murata Manufacturing Co., Ltd.,
2288 Oshinohara, Yasu-cho, Yasu-gun, Shiga 520-23, Japan.


[1̄10], [110], and [100] directions with various slopes and
their intersections on (001) substrates is investigated for the
first time. By comparing the present results with those on
(111)A patterned substrates, a clearer image and deeper
understanding of crystal growth on nonplanar substrates are gained
on the basis of the differences in the surface properties, the
position of a sidewall relative to the substrate plane, and the
incident effective molecular beam fluxes.

II.  EXPERIMENT 

Stripes running in the [1̄10], [110], and [100] directions
with heights of 5~1 μm, whose sidewalls are composed of
(111)A-related, (111)B-related, and (010)-related surfaces,
respectively, were formed on the (001) substrates using
photolithography and selective etching techniques. They are
briefly designated as [1̄10] stripe, [110] stripe, and [100]
stripe, respectively. These stripes are schematically shown
with respect to the configuration of the (001) surface in Fig.
1. The [010] stripe is a mirror image of the [100] stripe with
respect to the (110) plane and has sidewall orientations
equivalent to those of the [100] stripe. The intersection angle
θ of the sidewall and the (001) substrate plane was varied over
a wide range using H2O2-excess HF+H2O2+H2O mixtures
of various compositions. After pregrowth chemical and
thermal treatments identical to those for the (111)A patterned
substrates, the (001) patterned substrates were subjected to
the same MBE growth runs as the (111)A patterned substrates
discussed in Ref. 1: The Ga and Al cells were located
side by side and the As4 cell was located opposite to them
with respect to the sample substrate. All the molecular beams
were incident on the substrate at an angle of 60°. Five pairs
Fig. 1. Schematic presentation of [1̄10], [110], [100], and [010] stripes
formed on the GaAs (001) surface. [Figure legend: cross-sectional view; top view.]

of undoped 0.2-μm-thick GaAs/0.2-μm-thick Al0.3Ga0.7As
layers and an undoped 0.3-μm-thick GaAs cap were successively
grown by MBE on the (001) patterned substrates at a
substrate temperature of 620 °C, an As4 pressure of
3.3  X 10“^  Torr,  a  V/III  flux  ratio  of  7.4  for  GaAs  and  6.2  for 
AlGaAs, and a substrate rotation speed of 60 rpm. The
growth rate was 0.77 μm/h for GaAs and 1.10 μm/h for
AlGaAs, corresponding to 0.76 and 1.08 ML/s, respectively.
The AlGaAs layers were used as "markers" in order to
observe how the growth proceeded. The layers grown on the
stripes were closely examined by scanning electron microscopy
(SEM) at the positions shown in Fig. 1. No variations
in the as-etched sidewall profile, growth rate, or aftergrowth
surface morphology were observed across the whole
sample (20×25 mm²) for any of the samples investigated.
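
The correspondence between the quoted growth rates in μm/h and ML/s can be checked with a short sketch. It assumes, which the text does not state, that one monolayer on the (001) surface is half the cubic lattice constant and that the room-temperature GaAs value of 0.56533 nm is adequate for both GaAs and Al0.3Ga0.7As; the function name is illustrative only.

```python
# Convert an MBE growth rate from micrometres per hour to monolayers per second.
# Assumption (not from the article): one (001) monolayer is a/2, with the GaAs
# lattice constant a = 0.56533 nm used for both GaAs and Al0.3Ga0.7As.

A_GAAS_NM = 0.56533                 # assumed room-temperature lattice constant
ML_THICKNESS_NM = A_GAAS_NM / 2.0   # thickness of one (001) monolayer


def um_per_hour_to_ml_per_s(rate_um_per_h: float) -> float:
    """Return the growth rate in monolayers per second."""
    nm_per_s = rate_um_per_h * 1000.0 / 3600.0
    return nm_per_s / ML_THICKNESS_NM


for label, rate in (("GaAs", 0.77), ("AlGaAs", 1.10)):
    print(f"{label}: {rate:.2f} um/h = {um_per_hour_to_ml_per_s(rate):.2f} ML/s")
```

Under these assumptions the two rates come out at about 0.76 and 1.08 ML/s, in line with the values quoted above.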

III.  RESULTS 

Although the epilayer-substrate interface was not clearly
observed since the first layer grown was GaAs, a definite
interface maintaining the initial etching profile was confirmed
on Si-doped GaAs single layers grown under similar
conditions and deeply stain-etched.

A. [1̄10] stripe

Figure 2 shows (1̄10) cross-sectional views of the aftergrowth
profiles of the (111)A-related sidewalls of the [1̄10]
stripes with various values of θ. The surface index (lmn)
corresponding to each value of θ is also shown (within ±2°)
with  /,  m,  The  brighter  layers  correspond  to  GaAs  and 
the darker layers to AlGaAs. Generation of extra facets on
the sidewalls is described as follows: (a)-(d) For θ>21°, two
extra facets with facet angles θf=21° and 55° to the (001)
substrate plane were identified and were assigned to (114)A
and (111)A, respectively. The geometrical value of θf for the
(114)A facet is 19°, not the 21° measured in the present
study. The value of 21° was obtained by averaging the values
measured over all the samples with the (114)A facet. The
scatter of the measured values was very small, within ±1°.
The value of θf for the (114)A facet on the (111)A patterned


Fig. 2. (1̄10) cross-sectional views of the aftergrowth profiles of the (111)A-
related sidewalls of the [1̄10] stripes with various slopes. Growth temperature = 620 °C;
V/III flux ratio = 7.4 (GaAs)/6.2 (Al0.3Ga0.7As). Panels: (a) θ=54°/(111)A,
(b) θ=48°/(445)A, (c) θ=37°/(112)A, (d) θ=24°/(113)A.

substrates grown simultaneously with the (001) substrates
was 33°±1° [to the (111)A substrate plane] and not the
geometrical value of 35°. The sum of these two values is 54°,
which corresponds to the angle between the (001) plane and the
(111)A plane. The source of the deviation is not clear, but it is
possible that a (114)A facet misoriented by 2° towards the
[111]A direction is actually grown, for example, in a step-flow
mode. The values of θf for the other facets evaluated in the
same way were, except for the (111)B facet discussed below,
in accordance with the geometrical values within the
uncertainty in evaluation mentioned above. In the present article,
the measured values of θf are used, and the (114)A facet
misoriented by 2° towards the [111]A direction is simply
expressed as the (114)A facet.

As θ approached 21°, the θf of the (114)A facet, the (114)A
facet developed more towards the sidewall, while the (111)A
facet receded towards the lower (001) substrate plane.

(e) For θ≤21°, both the (114)A and (111)A facets disappeared
and no extra facets grew, leading to a flat and uniform
sidewall that maintained the initial as-etched profile.

Table I summarizes the facet generation behavior for the
[1̄10] stripe. In order to obtain flat and uniform layers on the
sidewall that maintain the initial as-etched pattern, it is
necessary to keep θ smaller than 21°. This result is close to that
suggested in Refs. 7 and 8 under a higher growth temperature
of 720 °C and a lower V/III flux ratio of 1, which indicates
that (114)A facet generation cannot be well controlled
by the growth conditions.
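
The geometrical facet angles quoted in this section can be reproduced from the Miller indices alone. The following is a minimal sketch for a cubic crystal; the function name is illustrative, and the A/B polarity of the facets is ignored because it does not affect the angle.

```python
import math


def angle_between_planes(p, q):
    """Angle in degrees between two cubic lattice planes given by Miller indices."""
    dot = sum(a * b for a, b in zip(p, q))
    norm = math.sqrt(sum(a * a for a in p)) * math.sqrt(sum(b * b for b in q))
    return math.degrees(math.acos(dot / norm))


SUBSTRATE = (0, 0, 1)

# Facets discussed in the text, measured from the (001) substrate plane.
for facet in [(1, 1, 1), (1, 1, 4), (1, 1, 3), (0, 3, 1), (0, 1, 1), (0, 4, 5), (0, 1, 3)]:
    print(facet, round(angle_between_planes(facet, SUBSTRATE), 1))

# (114) measured from a (111) substrate plane, as on the (111)A patterned substrates.
print(round(angle_between_planes((1, 1, 4), (1, 1, 1)), 1))
```

The geometrical values for (114) are about 19.5° from (001) and 35.3° from (111), which add up to the (001)/(111) angle of about 54.7°. The measured pair, 21° and 33°, preserves the same sum.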

Table I. Summary of facet generation behavior during MBE on GaAs (001) patterned substrates.

Substrate        Sidewall slope   Extra facet on sidewall      Extra facet on corner
                                  Slope    Orientation         Slope    Orientation
[1̄10] stripe     ≥21°             55°      (111)A
                                  21°      (114)A(a)
                 <21°             No facets
[110] stripe     ≥53°             53°      (111)B(b)
                                           (001)
                 53°-24°          24°      (113)B
                 <24°             No facets
[100] stripe     ≥45°             72°      (031)               No facets
                                  45°      (011)
                 45°-19°          38°      (045)               No facets
                                  19°      (013)
                 <19°             No facets                    ≤13°     (116)A, etc.

(a) Actually, (114)A misoriented by 2° towards the [111]A direction.
(b) Actually, (111)B misoriented by 2° towards the [001] direction.

B. [110] stripe

Figure 3 shows (110) cross-sectional views of the aftergrowth
profiles of the (111)B-related sidewalls of the [110]
stripes with various values of θ. The HF+H2O2+H2O mixtures
used produced an inverted (111)A-related mesa together
with the normal (111)B-related sidewall for large θ.
Generation of extra facets on the sidewalls is described as
follows: (a),(b) For θ≥53°, an extra (111)B facet with
θf=53° was observed at the edge of the inverted mesa. For
this facet, the measured value of θf is smaller by 2° than the
geometrical value of 55°, that is, the facet is actually
misoriented by 2° towards the [001] direction. This may be related
to the spontaneous evolution of (111)B facets tilted by
2.5° towards the [112]A direction during MBE of GaAs on
exact (111)B substrates. The situation is similar to that of the
(114)A facet, but θf of the (111)B facet on the (111)A
patterned substrates was exactly the geometrical value of 71°.
The reason for the discrepancy is still unclear.

(c)-(e) For 53°>θ≥24°, the (111)B facet disappeared and
an extra (113)B facet with θf=24° developed as θ decreased.


Fig. 3. (110) cross-sectional views of the aftergrowth profiles of the (111)B-
related sidewalls of the [110] stripes with various slopes. Growth temperature = 620 °C;
V/III flux ratio = 7.4 (GaAs)/6.2 (Al0.3Ga0.7As). Panels: (a) θ=57°/(998)B,
(b) θ=52°/(889)B.


(f) For 24°>θ, the (113)B facet disappeared and no extra
facets were observed.

The generation of the (111)B facet on the [110] stripe
with an inverted mesa has been reported in the literature
under different growth conditions. The present study
provides the first systematic data on the growth behavior for
the [110] stripes over a wide θ range, especially for those
without inverted mesas.

Table I summarizes the facet generation behavior for the
[110] stripe. In order to obtain layers on the sidewall that
maintain the initial as-etched pattern, it is necessary to keep
θ smaller than 24°. In fact, taking the surface morphology of
the sidewall layers into account, the usable θ range is further
restricted, as discussed below.

C.  [100]  stripe 

Figure 4 shows (110) cross-sectional views of the aftergrowth
profiles of the sidewalls of the [100] stripes with
various values of θ. The HF+H2O2+H2O mixtures used
produced round intersection profiles between the sidewall and
the lower substrate plane for large θ. Since the [100] stripe
does not intersect the (110) cleavage plane at right angles, as
shown in Fig. 1, the actual value of θ was evaluated from the
experimentally observed apparent value θ' on the SEM
photographs using the equation θ=tan⁻¹(tan θ'/cos 45°).
Generation of extra facets on the sidewalls is described as
follows: (a)-(c) For θ>45°, extra (031) and (011) facets with
θf=72° and 45° were clearly identified. As θ became closer
to 45°, the (011) facet developed more towards the sidewall,
while the (031) facet diminished and finally disappeared.

(d)-(f) For 45°≥θ>19°, the (011) facet disappeared and
an extra (013) facet with θf=19° appeared. As θ approached
19°, the (013) facet also developed more towards the
sidewall. The generation of an extra (045) facet with θf=38°, as
shown in (e), is interesting in that the facet exists on
sidewalls with θ<38°, that is, below θf of the facet, whereas the
other facets observed for θ≥θf vanish for θ<θf, except for
the (111)A facet. We confirmed from a close inspection of
Fig. 4. (110) cross-sectional views of the aftergrowth profiles of the (010)-related sidewalls of the [100] stripes with various slopes.
Growth temperature = 620 °C; V/III flux ratio = 7.4 (GaAs)/6.2 (Al0.3Ga0.7As); scale bars: 0.2 μm; θ=tan⁻¹(√2 tan θ').
Panels: (a) θ=89°/(010), (b) θ=71°/(031), (c) θ=47°/(098), (d) θ=39°/(045), (f) θ=21°/(025).


the SEM photographs that the (045) facet actually exists for
θ=43° (the growth profile was similar to that for θ=39° and
is not shown here), 39°, and 21° [(d) and (f) of Fig. 4], that is,
the (045) facet accompanies the (013) facet.

(g) For θ≤19°, the (013) facet [and the (045) facet] also
disappeared and no other extra facets were observed.
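
The conversion from the apparent angle θ' seen on the oblique (110) cleavage section to the true sidewall angle θ of a [100] stripe, used throughout this subsection, can be sketched as follows. The function and parameter names are illustrative; the 45° default reflects the angle between the [100] stripe direction and the normal of the (110) cleavage plane.

```python
import math


def true_sidewall_angle(apparent_deg: float, oblique_deg: float = 45.0) -> float:
    """True slope angle (degrees) from the apparent angle on an oblique section.

    oblique_deg is the angle between the stripe direction and the normal of the
    cleavage plane: 45 degrees for a [100] stripe cleaved on (110), and 0 degrees
    for a section cut perpendicular to the stripe.
    """
    t = math.tan(math.radians(apparent_deg)) / math.cos(math.radians(oblique_deg))
    return math.degrees(math.atan(t))


# An apparent angle of about 35.3 degrees corresponds to a true 45 degree sidewall.
print(round(true_sidewall_angle(35.2644), 1))
```

Because the oblique section stretches horizontal distances by a factor of √2 while heights are unchanged, the apparent angle is always smaller than the true one.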

Table I summarizes the facet generation behavior for the
[100] stripe. In order to maintain the initial as-etched profile
of the sidewall after growth, it is necessary to keep θ smaller
than 19°. It was, however, impossible to obtain flatness and
uniformity of the sidewall layers comparable to those for the
[1̄10] stripes with θ≤21°, probably due to the high-index
nature of the sidewalls.
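
The sidewall part of Table I can be read as a simple lookup from sidewall type and as-etched slope to the expected extra facets. The sketch below is one possible encoding, not the authors' procedure: the string keys, the function name, and the handling of the boundary angles are assumptions, and it ignores details such as the gradual disappearance of the (031) facet as θ approaches 45°.

```python
def expected_sidewall_facets(sidewall: str, theta_deg: float) -> list:
    """Extra sidewall facets expected for a given sidewall type and slope.

    The thresholds follow the ranges given in the text for the present growth
    conditions; which side each boundary value falls on is an assumption.
    """
    if sidewall == "(111)A":      # stripes with (111)A-related sidewalls
        return ["(111)A", "(114)A"] if theta_deg > 21 else []
    if sidewall == "(111)B":      # stripes with (111)B-related sidewalls
        if theta_deg >= 53:
            return ["(111)B"]
        return ["(113)B"] if theta_deg >= 24 else []
    if sidewall == "(010)":       # [100] stripes
        if theta_deg > 45:
            return ["(031)", "(011)"]
        return ["(045)", "(013)"] if theta_deg > 19 else []
    raise ValueError(f"unknown sidewall type: {sidewall}")


print(expected_sidewall_facets("(111)A", 37))
print(expected_sidewall_facets("(111)B", 20))
print(expected_sidewall_facets("(010)", 39))
```

A lookup of this kind only restates the observed ranges; it does not capture the surface morphology, which further restricts the usable slopes.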

D.  Intersections  of  orthogonal  stripes 

In Ref. 1, the growth behavior on the corners of ridge-type
triangles on the (111)A substrates, in other words, on the
intersection of two crystallographically equivalent sidewalls,
was investigated. Since the (001) surface has twofold
rotational symmetry (Fig. 1), it is interesting to study the
growth behavior on the intersection of the nonequivalent
[1̄10] and [110] stripes and on the two nonequivalent
intersections of the equivalent [100] and [010] stripes, that is, the
"(111)A-related corner" and the "(111)B-related corner."

Figure 5 shows top views of the aftergrowth profiles of
the intersections of the [1̄10] and [110] stripes with various
values of θ. Generation of extra facets on the corners was
observed. Although they could not be definitely identified,
owing to the nonflat and different as-etched profiles of the
orthogonal sidewalls and to the occurrence of complicated
sidewall profiles at the intersection, the following assignments
are reasonable from the discussion in the previous
section: the extra facet that appeared for (a),(b) seems to be
(011) and the extra facet that appeared for (c),(d) seems to be
(013). No extra facets are observed for (e),(f).

Figure 6 shows top views of the aftergrowth profiles of
the intersections of the [100] and [010] stripes with various
values of θ. No difference was observed between the
as-etched (111)A-related and (111)B-related corners. Fig. 6
confirms the poor morphologies of the sidewall layers compared
to those for the [1̄10] stripes, the equivalency of the growth
behaviors on the [100] and [010] stripes and, in sharp contrast
to this, the nonequivalency of the growth behaviors on the
(111)A-related and (111)B-related corners. It should be
especially noted in (d),(e) that extra facets are generated on the
(111)A-related corners but not on the (111)B-related corners.

IV.  DISCUSSION 
A.  (114)A  and  (111)A  facets 

The generation of the (114)A facet on the [1̄10] stripe has
also been reported in the literature under various growth
conditions, but only for exact (111)A (Refs. 2, 3, 5, 6, 13, and 22)
or (112)A-(113)A sidewalls. In Ref. 1, the (114)A facet
was shown to also develop on the (111)A patterned
substrates. Therefore, it was confirmed for the first time in the
present study that the generation of the (114)A facet is a
common feature independent of the substrate and sidewall
orientations. The development of the (114)A facet towards
Fig. 5. Top views of the aftergrowth profiles of the intersections of the [1̄10] and [110] stripes with various sidewall slopes.
Panels: (a) θ=54°/(111)A; θ=57°/(998)B, (b) θ=49°/(445)A; θ=47°/(334)B, (f) θ=13°/(116)A; θ=12°/(117)B.
Sidewalls: 1) [1̄10] stripe, 2) [110] stripe.


Fig. 6. Top views of the aftergrowth profiles of the intersections of the [100] and [010] stripes with various sidewall slopes.
Panels: (a) θ=89°/(100), (b) θ=47°/(908), (e) θ=13°/(104). Axes: [001], [100], [010].
Corners: 1) (111)A-related corner, 2) (111)B-related corner.
the sidewall with decreasing θ [Figs. 2(a)-2(d)] was also
observed on the (111)A patterned substrates. As was mentioned
in Ref. 1, it seems to be caused by enhancement of the
lateral growth due to the increase in the terrace width of
microsteps formed on the as-etched sidewall.

For 54°≥θ≥48°, the layers grown on the sidewall exhibited
step structures composed of the equivalent (011)-related
and (101)-related facets. The average interstep distance
shortened from 1.5 to 0.5 μm as θ decreased from 54° to 48°.
We also observed an Al composition variation (seen as a
lateral gradation in the AlGaAs layers) corresponding to the
step structures. Since these sidewalls correspond to
(111)A planes misoriented by 1°-7° towards the [001] direction,
it is natural that the sidewall layers exhibited step
structures identical to those grown on planar (111)A
substrates misoriented towards the [001] direction. For
41°≥θ≥21°, the surface morphology of the sidewall layers
was poor except for the (114)A facet, which showed a smooth
surface. After the (114)A facet vanished for θ≤21°, the
sidewall layer exhibited a smooth surface suitable for device
application.
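
The 1°-7° misorientation quoted for the 54°-48° sidewalls follows from subtracting the sidewall angle from the ideal (111)/(001) angle. A minimal sketch, with an illustrative function name:

```python
import math

# Ideal angle between (111) and (001) in a cubic crystal, about 54.7 degrees.
THETA_111 = math.degrees(math.acos(1.0 / math.sqrt(3.0)))


def misorientation_from_111(theta_deg: float) -> float:
    """Tilt of a sidewall with slope theta away from the exact (111) plane."""
    return THETA_111 - theta_deg


for theta in (54, 48):
    print(theta, round(misorientation_from_111(theta), 1))
```

For θ=54° and 48° this gives roughly 0.7° and 6.7°, consistent with the stated range.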

The development of the (111)A facet at the intersection of
the sidewall and the lower substrate plane was mentioned in
Ref. 26 and is obvious in the SEM photographs shown in, for
example, Refs. 7, 8, and 14. The (111)A facet appears even
for θ smaller than θf of the facet because of the low
growth rate in the [111]A direction, which results from the long
Ga diffusion length on the (111)A plane, and because of its
location at the concave intersection of the substrate and the sidewall.

In contrast to the (001) triangles on (111)A substrates, the
exponential thickness variation of the layers grown on the
(001) substrate plane persists even after flat and uniform layers
are grown on the sidewall for θ≤21°, as shown in Figs. 2(e) and
5(f), owing to the lateral Ga diffusion from the sidewall to the
(001) substrate plane. Exponential thickness variations of the
sidewall layers adjacent to the (111)A facet, arising from
the same mechanism, are also observed in Figs. 2(b) and 2(c).
In order to obtain flat and uniform layers across the
sidewall-substrate plane intersection, θ must actually be
smaller than 10°. This point is critically important for device
applications such as lateral p-n junctions and carrier
confinement structures using the amphoteric Si doping
characteristics, because they need a steep p-n
transition across the junction. For such small angles, that is,
a small difference in orientation between the substrate
plane and the sidewall, steep p-n transitions will not be
easily obtained without careful consideration of the growth
conditions. This point has been ignored so far. For (111)A
patterned substrates, the combination of the (111)A substrate plane
and the (113)A sidewall (θ=30°) gave rise to a simple p-n
junction with a planar interface and without extra facets or
exponential thickness variations, which showed excellent
current blocking characteristics.

B. (111)B and (113)B facets

It should be noted that the layers grown on the sidewalls
with θ around θf of the (111)B plane exhibited a very rough
surface morphology, whereas the (111)B facet with a
relatively smooth surface grew at the edge of the (001) surface
without underlying (111)B surfaces [Fig. 5(a)]. One reason
is that the present growth conditions of high
temperature and high As4 pressure probably did not match
those for growth on the (111)B surface, which favors high
temperatures, low As4 pressures, and low growth rates. The round
as-etched profiles of the sidewalls may be another cause of
such a faceted surface. As to the (111)B facet, a
lateral flow of excess Ga adatoms not incorporated on the
initially formed (111)B plane to the substrate plane resulted
in a low growth rate in the [111]B direction, hence the
formation of the (111)B facet with a relatively smooth
surface. In Ref. 1, the (111)B facet has been reported to also
develop on the (111)A patterned substrates.

It is observed in Figs. 5(c) and 5(d) that the (113)B facet
showed a smooth surface and developed towards the
sidewall with the rough surface as θ decreased. The whole
sidewall is expected to be covered with the smooth (113)B facet
for θ around 24°, the θf of the (113)B facet (not checked
experimentally). The generation of the (113)B facet has been
reported in Ref. 42 in the lateral growth of GaAs on (111)B
patterned substrates by metalorganic MBE (MOMBE) under
high As4 pressures. This and the discussions in Refs. 35-37
clearly indicate that growth proceeds more easily on the
(113)B surface than on the (111)B surface, owing to freedom from
the growth-hampering surface As trimer formation specific to
the (111)B surface. The smooth surface was maintained after
the (113)B facet vanished for θ=20°, as shown in Fig. 5(e),
but the surface morphology worsened again as θ decreased
further. Consequently, [110] stripes with flat and uniform
sidewall layers maintaining the initial as-etched pattern can
be obtained in the narrow θ range of 24° to (at least) 20°.
The apparently irregular cross-sectional growth pattern
reflecting the rough surface, as shown in Fig. 3(c), is actually
composed of the (111)B and (113)B facets.


C.  (011)  and  (013)  facets 

Recently, growth of the (011) facet on the [100] stripe
with the (010) sidewall, and quantum wires formed by taking
advantage of the (011) facet growth, were reported in Refs.
18-21 and 25. In these references, the as-etched profile of
the sidewall was limited to the type shown in Fig. 4(a).
Therefore, the present study provides the first experimental
results on the θ dependence of the growth on the [100] stripe.
In the present study, the (013) facet developed after
the (011) facet vanished for θ<45° under high temperature
and As4 pressure, while it has been shown in Ref. 18 that the
(011) and (013) facets coexist under low temperatures and/or
high As4 pressures. It has been shown in Refs. 10, 11, 13, 23,
and 25 that the Ga diffusion length on the (001) surface shortens
as the As4 pressure increases and/or the temperature
decreases. If this holds for the (011) and (013) surfaces and the
Ga diffusion length on the (011) surface shortens more
steeply than that on the (013) surface as the temperature
decreases, the growth rate of the (011) facet relative to that
of the (013) facet increases, hence the contribution of the
(013) facet becomes larger. This qualitatively explains the
difference in the two results.

The (011) facet has been shown to also develop on the
(111)A patterned substrates.

D. Growth behavior contrast on [1̄10] and [110] stripes

The growth behaviors on the [1̄10] and [110] stripes can
be more distinctly compared in Fig. 5. The surface morphology
is better for the layers on the (111)A-related sidewalls
than for those on the (111)B-related sidewalls. This may be
attributed, first, to the present growth conditions being optimized
for the (111)A surface and, second, to the anisotropic
structure of the (2×4)-reconstructed (001) surface during MBE.
The latter has been demonstrated by a distinct difference in the
straightness of microsteps produced by misorientations
towards the [1̄10] and [110] directions using scanning tunneling
microscopy. It should, however, be noted from (e)
that flat and uniform layers can be simultaneously grown on
both types of sidewalls only for θ around 20°, that is, on the
(114)A and (114)B sidewalls. This suggests that the (113)B
and (114)B surfaces may have wider optimum growth
conditions, which are closer to the optimum growth conditions
for the (001) surface, than the (111)B surface does.

E.  Intersection  of  [100]  and  [010]  stripes 

The identification of the facets on the (111)A-related corners was made on the basis of their orientations and the values of θ_f evaluated as follows: the angle θ_i of the intersection line to the (001) substrate plane can be evaluated from the experimentally determined value of θ using the simple relation $\theta_i=\tan^{-1}(\tan\theta/\sqrt{2})$. The value of θ=19°, estimated as corresponding to the onset boundary of the facets, gives the value of θ_i=13°. This value corresponds to θ_f of the (116)A plane. The (111)A-related facet presented in Fig. 6(e) corresponds to (119)A. Table I also summarizes the facet generation behavior at the intersections of the [100] and [010] stripes. This result implies that it is impossible to grow GaAs and AlGaAs layers by MBE on ridge-type squares formed by the [100] and [010] stripes without modifying the initial as-etched pattern, at least under the present growth condition. In Ref. 21, a ridge-type square of the same type encircled with four equivalent (011) facets was proposed for a novel quantum dot. The structure shown in Fig. 1 of Ref. 21 exhibits the nonequivalent (111)A-related and (111)B-related corners, although the authors did not mention it.
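
As a quick numerical check of the relation used in this subsection, the following Python sketch evaluates the intersection-line angle for θ = 19° and compares it with the inclination of the (116) and (119) planes to (001). The function names are hypothetical, and the plane inclination is taken from the standard angle between plane normals in a cubic crystal; this is an illustration under those assumptions, not the authors' own procedure.

```python
import math


def intersection_line_angle(theta_deg):
    """Angle (degrees) of the corner intersection line to the (001) plane,
    using the relation quoted in the text: atan(tan(theta) / sqrt(2))."""
    theta = math.radians(theta_deg)
    return math.degrees(math.atan(math.tan(theta) / math.sqrt(2)))


def plane_angle_to_001(h, k, l):
    """Angle (degrees) between the (hkl) plane and the (001) plane in a
    cubic crystal, from the angle between the two plane normals."""
    cos_angle = l / math.sqrt(h * h + k * k + l * l)
    return math.degrees(math.acos(cos_angle))


if __name__ == "__main__":
    angle = intersection_line_angle(19.0)
    print(f"theta = 19 deg -> intersection-line angle = {angle:.1f} deg")
    for hkl in [(1, 1, 6), (1, 1, 9)]:
        print(f"{hkl}: {plane_angle_to_001(*hkl):.1f} deg from (001)")
```

With θ = 19° the relation gives an angle between 13° and 14°, which lies close to the inclination of the (116) plane and well above that of the (119) plane.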

F.  Phenomena  common  to  three  stripe  patterns 

Although the three stripes showed their respective facet generation behaviors as discussed above, the following point is worth mentioning as common to the three stripes: the persistent facets generated on the three stripes, that is, the (114)A facet on the [110] stripe, the (113)B facet on the [110] stripe, and the (013) facet on the [100] stripe, have θ_f in the narrow range of 19°-24°. This point seems to be related to the crystal structure of GaAs composed of the tetrahedral Ga-As bonding, as was also suggested for the (111)A patterned substrate in Ref. 1.

G.  Comparison  with  (111)A  patterned  substrates 

A comparison of the facet generation behavior during MBE between the (001) patterned substrate described in the present article and the (111)A patterned substrate described in Ref. 1 reveals the following common and different features:

(1) Growth of extra (114)A, (111)B, and (110) facets and their lateral development with θ approaching their θ_f's are common to the (111)A and (001) patterned substrates. The directions of the lateral development of the (114)A, (111)B, and (110) facets are [221]B, [112]A, and [011], respectively, for the (001) substrate and [221]A, [112]B, and [001], respectively, for the (111)A substrate. Thus, the lateral development does not depend on the difference in the microstep properties of the as-etched sidewalls.

(2) Growth of extra (001), (113)A, (159), (238), and (125) facets is specific to the (111)A patterned substrates, and growth of extra (111)A, (113)B, (031), (045), and (013) facets is specific to the (001) patterned substrates.

(3) There is a critical value of θ, θ_crit, below which flat and uniform layers on the sidewall and substrate plane grow maintaining the initial as-etched profile, with neither extra facets nor exponential thickness variations of layers on the substrate plane near the sidewall. The value of θ_crit is located in the narrow range of 33°-35° independent of the sidewall orientation for the (111)A substrate, whereas it varies between 10° and 24° depending on the sidewall orientation for the (001) substrate.

(4) As long as θ>θ_crit, the interfaces between the layers on the (001) substrate plane and the adjacent extra facets are curved towards the layers on the substrate plane, whereas the interface was flat and perpendicular to the substrate plane, irrespective of θ, for the (111)A substrate. This point may hinder the formation of lateral p-n junctions having a steep p-n transition region with structural simplicity by utilizing the amphotericity of Si doping on the (001) patterned substrate.

(5) Steps with θ_f of 16°-17° appear near the sidewall-substrate plane boundary independent of the sidewall orientation for the (111)A substrates, while no steps are observed for the (001) substrates.

These points reflect not only differences in the crystallographic and chemical properties between the two substrates but also differences in the interaction of adatoms between the substrate plane and adjacent sidewall. These points will be discussed in more detail in Ref. 46.

V.  SUMMARY  AND  CONCLUSION 

Extra facet generation during MBE growth of GaAs/AlGaAs multilayers on (001) GaAs substrates patterned with stripes running in the [110], [110], and [100] directions and having various slopes, designated as [110] stripe, [110] stripe, and [100] stripe, respectively, has been systematically investigated. It has been confirmed that extra (111)A and (114)A facets are generated on the [110] stripes, extra (111)B and (113)B facets on the [110] stripes, and extra (031), (011), (045), and (013) facets on the [100] stripes, depending on the intersection angle θ of the sidewall and the substrate plane.

Among them, the (114)A, (113)B, and (013) facets are important because they persist for a wide range of θ and, therefore, play a detrimental role in forming flat and uniform sidewall layers on the (001) patterned substrates. It has been made clear that no extra facets are generated on the sidewalls and that flat and uniform layers maintaining the initial as-etched stripe patterns can be grown for the [110] stripes with 10°≥θ, the [110] stripes with 24°≥θ≥20°, and the [100] stripes with 19°≥θ. Extra (111)A-related facets have developed on the (111)A-related intersection of the equivalent [100] and [010] stripes, while no extra facets have developed on the (111)B-related intersection. The (114)A, (111)B, and (110) facets have been confirmed as common to the (111)A and (001) patterned substrates.

ACKNOWLEDGMENTS 

The  authors  would  like  to  thank  Dr.  Y.  Furuhama  and  Dr. 
H.  Inomata  for  their  encouragement  throughout  this  work. 

¹T. Takebe, M. Fujii, T. Yamamoto, K. Fujita, K. Kobayashi, and T. Watanabe, J. Cryst. Growth 162, 31 (1996).
²W. T. Tsang and A. Y. Cho, Appl. Phys. Lett. 30, 293 (1977).
³J. S. Smith, P. L. Derry, S. Margalit, and A. Yariv, Appl. Phys. Lett. 47, 712 (1985).
⁴D. L. Miller, Appl. Phys. Lett. 47, 1309 (1985).
⁵T. Yuasa, M. Manno, T. Yamada, S. Narituka, K. Shinozaki, and M. Ishii, J. Appl. Phys. 62, 764 (1987).
⁶E. Kapon, M. C. Tamargo, and D. M. Hwang, Appl. Phys. Lett. 50, 347 (1987).
⁷H. P. Meier, R. F. Broom, P. W. Epperlein, E. van Gieson, Ch. Harder, H. Jackel, W. Walter, and D. J. Webb, J. Vac. Sci. Technol. B 6, 692 (1987).
⁸H. P. Meier, E. van Gieson, R. F. Broom, W. Walter, D. J. Webb, C. Harder, and H. Jackel, Inst. Phys. Conf. Ser. 91, 609 (1988).
⁹S. Nilsson, E. van Gieson, D. J. Arent, H. P. Meier, W. Walter, and T. Forster, Appl. Phys. Lett. 55, 972 (1989).
¹⁰M. Hata, T. Isu, A. Watanabe, and Y. Katayama, Appl. Phys. Lett. 56, 2542 (1990).
¹¹M. Hata, T. Isu, A. Watanabe, and Y. Katayama, J. Vac. Sci. Technol. B 8, 692 (1990).
¹²Y. Nakamura, S. Koshiba, M. Tsuchiya, H. Kano, and H. Sakaki, Appl. Phys. Lett. 59, 700 (1991).
¹³X. Q. Shen, M. Tanaka, and T. Nishinaga, 10th Symposium Record on Alloy Semiconductor Physics and Electronics, 1991 (unpublished), p. 65.
¹⁴S. Shimomura, S. Ohkubo, Y. Yuba, S. Namba, S. Hiyamizu, M. Shigeta, T. Yamamoto, and K. Kobayashi, Surf. Sci. 267, 13 (1992).
¹⁵Y. Morishita, Y. Nomura, S. Goto, Y. Katayama, and T. Isu, Inst. Phys. Conf. Ser. 120, 19 (1992).
¹⁶M. Walther, T. Rohr, G. Böhm, G. Trankle, and G. Weimann, J. Cryst. Growth 127, 1045 (1993).
¹⁷H. Saito, M. Sugimoto, M. Anan, and Y. Ochiai, Jpn. J. Appl. Phys. 32, L1034 (1993).
¹⁸M. López, T. Ishikawa, and Y. Nomura, Jpn. J. Appl. Phys. 32, L1051 (1993).
¹⁹K. Eberl, P. Grambow, A. Lehmann, A. Kurtenbach, K. v. Klitzing, D. Heitmann, M. Dilger, and M. Hohenstein, Appl. Phys. Lett. 63, 1059 (1993).
²⁰Y. Liu, S. Shimomura, N. Sano, K. Gamo, A. Adachi, and S. Hiyamizu, Semicond. Sci. Technol. 8, 2197 (1993).
²¹M. López, T. Ishikawa, and Y. Nomura, Electron. Lett. 29, 2225 (1993).
²²X. Q. Shen, M. Tanaka, K. Wada, and T. Nishinaga, J. Cryst. Growth 135, 85 (1994).
²³X. Q. Shen, D. Kishimoto, and T. Nishinaga, Jpn. J. Appl. Phys. 33, 11 (1994).
²⁴S. Koshiba, Y. Nakamura, M. Tsuchiya, H. Noge, Y. Nagamune, T. Noda, and H. Sakaki, J. Appl. Phys. 76, 4138 (1994).
²⁵M. López and Y. Nomura, J. Cryst. Growth 150, 68 (1995).
²⁶T. Takamori and T. Kamijo, Appl. Phys. Lett. 77, 187 (1995).
²⁷T. Takebe, T. Yamamoto, M. Fujii, and K. Kobayashi, J. Electrochem. Soc. 140, 1169 (1993).
²⁸L. J. Schowalter, K. Yang, and T. Thundat, J. Vac. Sci. Technol. B 12, 2579 (1994).
²⁹T. Yamamoto, M. Inai, T. Takebe, M. Fujii, and K. Kobayashi, J. Cryst. Growth 127, 865 (1993).
³⁰M. Fujii, T. Yamamoto, M. Shigeta, T. Takebe, K. Kobayashi, S. Hiyamizu, and I. Fujimoto, Surf. Sci. 267, 26 (1992).
³¹K. Kobayashi, T. Takebe, T. Yamamoto, M. Fujii, M. Inai, and D. Lovell, J. Electron. Mater. 22, 161 (1993).
³²M. Inai, T. Yamamoto, M. Fujii, T. Takebe, and K. Kobayashi, Jpn. J. Appl. Phys. 32, 523 (1993).
³³T. Yamamoto, M. Inai, T. Takebe, and T. Watanabe, Jpn. J. Appl. Phys. 32, L28 (1993).
³⁴K. Tsutsui, H. Mizukami, O. Ishiyama, S. Nakamura, and S. Furukawa, Jpn. J. Appl. Phys. 29, 468 (1990).
³⁵P. Chen, K. C. Rajkumar, and A. Madhukar, Appl. Phys. Lett. 58, 1771 (1991).
³⁶T. Hayakawa, M. Nagai, M. Morishima, H. Horie, and K. Matsumoto, Appl. Phys. Lett. 59, 2287 (1991).
³⁷T. Hayakawa, M. Morishima, and S. Chen, Appl. Phys. Lett. 59, 3321 (1991).
³⁸T. Hayakawa, M. Morishima, M. Nagai, H. Horie, and K. Matsumoto, Surf. Sci. 267, 8 (1992).
³⁹Fu, K. Zhang, and D. L. Miller, J. Vac. Sci. Technol. B 10, 779 (1992).
⁴⁰A. Woolf, Z. Sobiesierski, D. I. Westwood, and R. H. Williams, J. Appl. Phys. 71, 4908 (1992).
⁴¹A. Woolf, J. P. Williams, D. I. Westwood, Z. Sobiesierski, J. E. Aubrey, and R. H. Williams, J. Cryst. Growth 127, 913 (1993).
⁴²T. Isu, Y. Morishita, S. Goto, Y. Nomura, and Y. Katayama, J. Cryst. Growth 127, 942 (1993).
⁴³M. D. Pashley, K. W. Haberern, W. Friday, J. M. Woodall, and P. D. Kirchner, Phys. Rev. Lett. 60, 2176 (1988).
⁴⁴M. D. Pashley, K. W. Haberern, and J. M. Gaines, Appl. Phys. Lett. 58, 406 (1991).
⁴⁵M. D. Pashley, K. W. Haberern, and J. M. Gaines, Surf. Sci. 267, 153 (1992).
⁴⁶T. Takebe, M. Fujii, T. Yamamoto, K. Fujita, and T. Watanabe (in preparation).


J.  Vac.  Sci.  Technol.  B,  Vol.  14,  No.  4,  Jul/Aug  1996 


InP/InGaAs  single  heterojunction  bipolar  transistors  grown  by  solid-source 
molecular  beam  epitaxy  using  a  phosphorus  valved  cracker 

W. L. Chen^a)

Solid-State  Electronics  Laboratory,  Department  of  Electrical  Engineering  and  Computer  Science, 

The  University  of  Michigan,  Ann  Arbor,  Michigan  48109-2122 

T. P. Chin and J. M. Woodall

School  of  Electrical  and  Computer  Engineering,  Purdue  University,  West  Lafayette,  Indiana  47907-1285 

G.  I.  Haddad 

Solid-State  Electronics  Laboratory,  Department  of  Electrical  Engineering  and  Computer  Science, 

The  University  of  Michigan,  Ann  Arbor,  Michigan  48109-2122 

(Received  1  March  1996;  accepted  1  May  1996) 

In this work, InP/InGaAs single heterojunction bipolar transistors (SHBTs) with excellent performance were grown by solid-source molecular beam epitaxy using a phosphorus valved cracker. The maximum dc and differential current gains are ~27 and 32, with breakdown voltages BV_CEO '^6 V and BV_CBO ^ for the SHBT with a 5000 Å InGaAs collector layer doped at 4×10^^ cm⁻³. The ideality factors for Ib and Ic are ~1.12 and 1.07, and the transistors also show very uniform current gain down to the 10“^® A range. For high frequency performance, the maximum f_T and f_max are around 80 and 125 GHz for the SHBT with an emitter area of 32 μm². © 1996 American Vacuum Society.


I. INTRODUCTION

InP/InGaAs heterojunction bipolar transistors (HBTs) have demonstrated excellent high frequency performance, and they are promising candidates for integration into optoelectronic lightwave communication systems. Most of the reported InP-based HBTs were grown by gas-source MBE (GSMBE), chemical beam epitaxy (CBE), and metalorganic chemical vapor deposition (MOCVD) using highly toxic hydrides, e.g., PH3 and AsH3, which require complicated and costly utilities to handle. It is desirable to use solid-source phosphorus to grow P-containing compounds for device and circuit applications to improve safety. There have been some attempts to use solid phosphorus as the source material in an MBE system. However, the main problems were (1) the impurities in the source material and (2) the fire hazard caused by white phosphorus. To reduce the fire hazard, even GaP and InP chunks were evaporated to obtain P2, but the P2 was accompanied by Ga and In impurity atoms. Recently, a valved phosphorus cracker with improved designs was used to generate P2 successfully with reduced impurities and fire hazard. As a result, good results for optoelectronic and electronic devices were reported without using hydrides. There have been few results of InP/InGaAs HBTs reported using solid phosphorus in an MBE system.^’^

In this article, a solid-source MBE system is used to grow InP/InGaAs HBTs which show excellent dc and high frequency performance. The article starts with the growth and processing techniques. Then material characterization results are discussed, including the quality of the InP/InGaAs heterojunction and its etching characteristics. After the discussion of material quality, dc and microwave device results are described along with a discussion of device quality. Finally, a brief summary is given at the end of the article.

II.  GROWTH  AND  FABRICATION 

The growth was performed in a modified Varian GEN-II CBE system, which has been described elsewhere.^ A valved cracker is used as the phosphorus source while standard effusion cells are used for As, Be, and group-III metals. A Si filament is used as the n-type doping source. P4 molecules are thermally cracked into P2 at 900 °C before being injected into the growth chamber. The typical phosphorus beam equivalent pressure is 1-2×10“^ Torr. Reflection high-energy electron diffraction (RHEED) oscillations are used to calibrate growth rates, compositions, and surface V/III ratio. The typical growth temperature for InP and InGaAs is 480 °C. The double crystal x-ray rocking curve linewidth of a 1 μm InGaAs layer is 22 arcsec.
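
The growth-rate calibration mentioned above can be sketched as follows, assuming the usual interpretation that one RHEED intensity oscillation corresponds to the growth of one monolayer. The oscillation period in the example is a hypothetical value, not a measurement from this work, and the function names are illustrative only. The InP lattice constant is the room-temperature literature value.

```python
INP_LATTICE_CONSTANT_ANGSTROM = 5.8687  # room-temperature literature value for InP


def monolayer_thickness(lattice_constant_angstrom):
    """Thickness of one (001) monolayer of a zinc-blende crystal: a / 2."""
    return lattice_constant_angstrom / 2.0


def growth_rate_from_rheed(period_s, lattice_constant_angstrom=INP_LATTICE_CONSTANT_ANGSTROM):
    """Return (monolayers per second, angstroms per second), assuming one
    RHEED oscillation period equals the time to grow one monolayer."""
    ml_per_s = 1.0 / period_s
    return ml_per_s, ml_per_s * monolayer_thickness(lattice_constant_angstrom)


if __name__ == "__main__":
    period = 1.5  # hypothetical oscillation period in seconds
    ml_rate, angstrom_rate = growth_rate_from_rheed(period)
    microns_per_hour = angstrom_rate * 3600.0 * 1e-4  # 1 angstrom = 1e-4 micron
    print(f"{ml_rate:.3f} ML/s = {angstrom_rate:.3f} A/s = {microns_per_hour:.3f} um/h")
```

Because InGaAs lattice matched to InP shares the same lattice constant, the same monolayer thickness applies to both layers under this assumption.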

Special care is taken at the InP (top)/InGaAs (bottom) interface. A shutter sequence is designed so that the InGaAs is not exposed to phosphorus flux: the arsenic shutter closes first, then one monolayer of indium metal is deposited, and finally the phosphorus shutter opens. As shown in Fig. 1(a), the InP/InGaAs interface of the 40 Å well is grown with such a sequence while the 50 Å well is grown with the normal shutter sequence, in which the phosphorus shutter opens before the In shutter. Low temperature (10 K) photoluminescence (PL) spectra of the 40 and 50 Å InP/InGaAs wells are shown in Fig. 1(b). The significant improvement of the PL intensity and the narrow linewidth (4.7 meV) are attributed to the smooth InP/InGaAs interface without the formation of InGaAsP. The improved InP/InGaAs interface also results in a very uniform surface after selectively etching InP away from InGaAs.

Fig. 1. (a) InP/InGaAs single quantum wells (SQWs) with 40 and 50 Å wells which are grown using different shutter sequences. (b) 10 K PL of the two InP/InGaAs SQWs shown in (a).

The typical InP/InGaAs HBT structure, as shown in Fig. 2, consists of a 1500 Å InGaAs n+ emitter contact layer, a 500 Å n+ InP layer, a 1000 Å InP emitter layer doped at 5×10^^ cm⁻³, a 50 Å undoped InGaAs spacer, a 650 Å InGaAs base layer doped at 5×10^^ cm⁻³, a 5000 Å InGaAs collector layer doped at 4×10^^ cm⁻³, and a 6000 Å n+ InGaAs collector contact layer. Si and Be were used as the n-type and p-type dopants, respectively.

The InP/InGaAs HBTs were processed using a self-aligned process. In the process, HCl:H3PO4 solution was used for etching the InP layer selectively over the InGaAs layer, and H3PO4:H2O2:H2O solution was adopted for etching the InGaAs layer selectively over the InP layer. The separation between the emitter mesa and the base metal is less than 0.2 μm. Then, Ti/Pt/Au was used as the emitter and collector ohmic contacts and Pt/Ti/Pt/Au was used as the base ohmic contact, resulting in contact resistivities of ^7×10~^ Ω cm² (emitter), 6×10“^ Ω cm² (collector), and 3×10“^ Ω cm² (base). After the deposition of the ohmic contacts, PECVD SiO2 was deposited all over the wafer, followed by via hole openings using RIE. Finally, Ti/Al/Ti/Au was used as the interconnect metal for dc and microwave probing.

Fig. 2. The InP/InGaAs HBT structure grown by solid-source MBE.

III.  TRANSISTOR  CHARACTERISTICS 

The typical common emitter I-V characteristics are shown in Fig. 3 for the HBTs with emitter sizes of 32 μm². The maximum dc current gain (Ic/Ib) and differential current gain (dIc/dIb) are around 27 and 32, which are commonly observed for HBTs grown by other growth techniques with similar base doping and base thickness. The current gain is also very uniform down to 10“^^ A as shown in Fig. 4. The ideality factors for Ib and Ic are ~1.12 and 1.07. Based on the common emitter I-V characteristics and the Gummel plots, the surface recombination current is not a dominant factor even in the range of low current densities, and there is no significant leakage path in the emitter layer, the base layer, or the emitter/base junction, indicating the good quality of the InP/InGaAs HBTs grown by solid-source MBE. If there were any significant leakage path caused by either poor quality of the emitter/base junction or high defect densities in the emitter or base layers, the current gain would be very nonuniform in the common emitter I-V characteristics and a high intrinsic base leakage current would be seen in the Gummel plots.
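
An ideality factor such as those quoted above is commonly extracted from the slope of ln(I) versus the base-emitter voltage in a Gummel plot, using I proportional to exp(qV/nkT). The sketch below illustrates this with synthetic data; the saturation current, voltage range, and temperature are hypothetical, and the function names are illustrative rather than taken from this work.

```python
import math

K_B_OVER_Q = 8.617333e-5  # Boltzmann constant divided by elementary charge, V/K


def ideality_factor(voltages, currents, temperature_k=300.0):
    """Least-squares slope of ln(I) versus V, converted to n = 1 / (slope * kT/q)."""
    logs = [math.log(i) for i in currents]
    count = len(voltages)
    v_mean = sum(voltages) / count
    log_mean = sum(logs) / count
    numerator = sum((v - v_mean) * (lg - log_mean) for v, lg in zip(voltages, logs))
    denominator = sum((v - v_mean) ** 2 for v in voltages)
    slope = numerator / denominator
    return 1.0 / (slope * K_B_OVER_Q * temperature_k)


def synthetic_current(voltage, n, i_sat=1e-15, temperature_k=300.0):
    """Ideal exponential current with a hypothetical saturation current i_sat."""
    return i_sat * math.exp(voltage / (n * K_B_OVER_Q * temperature_k))


if __name__ == "__main__":
    v_be = [0.30 + 0.02 * step for step in range(16)]  # 0.30 V to 0.60 V
    i_b = [synthetic_current(v, n=1.12) for v in v_be]
    i_c = [synthetic_current(v, n=1.07, i_sat=3e-14) for v in v_be]
    print(f"n(Ib) = {ideality_factor(v_be, i_b):.2f}")
    print(f"n(Ic) = {ideality_factor(v_be, i_c):.2f}")
```

On measured data the fit would be restricted to the voltage range where the curve is a straight line on the semilog plot, since series resistance and leakage bend it at the extremes.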

The other important device parameters are the offset and breakdown voltages. The offset voltage is less than 0.2 V in the common emitter I-V characteristics, indicating a normal emitter/base junction. The breakdown voltages BV_CEO and BV_CBO ^; both voltages are defined at Ic=100 μA for the HBTs with 32 μm² emitter sizes. Moreover, temperature dependent measurements were also performed to verify the breakdown mechanism as a tunneling process, which is normally observed for InP/InGaAs HBTs grown by other techniques.

Fig. 4. The Gummel plot.

For high frequency performance, the HBTs with 32 μm² emitter sizes were biased at different output current densities. As shown in Figs. 5 and 6, the highest f_T (80 GHz) and f_max (125 GHz) are obtained at an output current density of around 1×10^ A/cm². Such a combination of f_T and f_max corresponds to a time constant of 0.21 ps. When the HBT is biased beyond this current density, f_T and f_max start to fall off due to the Kirk effect. The HBTs seem to have stable characteristics even under high output current densities, and no rapid degradation of transistor characteristics was observed during the dc and microwave measurements.
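
The name of the time constant quoted above did not survive in this copy. One plausible reading, stated here as an assumption, is the product of base resistance and base-collector capacitance obtained from the widely used approximation f_max = sqrt(f_T / (8π R_b C_bc)). The Python sketch below evaluates that product for the reported f_T and f_max, together with the total emitter-to-collector delay 1/(2π f_T); the function names are illustrative.

```python
import math


def rb_cbc_time_constant(f_t_hz, f_max_hz):
    """R_b * C_bc in seconds from f_max = sqrt(f_T / (8 * pi * R_b * C_bc))."""
    return f_t_hz / (8.0 * math.pi * f_max_hz ** 2)


def emitter_collector_delay(f_t_hz):
    """Total emitter-to-collector delay in seconds, 1 / (2 * pi * f_T)."""
    return 1.0 / (2.0 * math.pi * f_t_hz)


if __name__ == "__main__":
    f_t = 80e9     # Hz, value reported in the text
    f_max = 125e9  # Hz, value reported in the text
    print(f"R_b*C_bc = {rb_cbc_time_constant(f_t, f_max) * 1e12:.2f} ps")
    print(f"tau_ec   = {emitter_collector_delay(f_t) * 1e12:.2f} ps")
```

Under this assumption the product comes out near 0.2 ps, the same order as the value given in the text; the small difference could reflect rounding of the reported frequencies.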


Fig.  5.  Gain  vs  frequency  plots. 
Fig. 6. f_T and f_max at different bias conditions.

IV.  CONCLUSIONS 

High performance InP/InGaAs HBTs have been grown successfully by solid-source MBE using a phosphorus valved cracker. The HBT performance is competitive with that obtained by other growth techniques. To our knowledge, the high frequency performance (f_max) of the HBTs is the highest value reported for devices grown using this growth technique. In conclusion, the P-valved cracker could be used as an alternative source to significantly reduce the safety concerns associated with conventional P source materials such as PH3.

Arsenic incorporation during GaAs/GaAs(100) molecular beam epitaxy is studied in situ with laser single-photon ionization time-of-flight mass spectrometry and reflection high-energy electron diffraction (RHEED). Incident and scattered fluxes of Ga and As_n species in front of the growing GaAs wafer are ionized repetitively by a pulsed laser beam of 118 nm (10.5 eV) photons. The methods to obtain and interpret time-of-flight mass spectra and the simultaneous RHEED measurements are described. The real time behaviors of incident Ga and desorbing As2 and As4, obtained without mass spectral cracking, are studied during growth of GaAs layers with As4 and when growth is arrested as a function of substrate temperature and Ga/As4 flux ratio. During growth only with As4, both As2 and As4 are desorbed or scattered in varying amounts depending on flux and substrate temperature conditions. Without an incident gallium flux, desorbing As4 decreases while desorbing As2 increases with increasing surface temperature. During gallium deposition and GaAs growth, the amounts of desorbing arsenic fluxes decrease linearly with increasing Ga/As4 flux ratio, but the arsenic incorporation rate saturates at a Ga/As4 flux ratio ^2, i.e. (Ga/As^^). The total integrated incorporation of arsenic increases linearly with increasing Ga/As4 flux ratio when the surface is allowed to recover with an incident arsenic flux after the gallium flux is terminated. In the range of substrate temperatures optimum for layer-by-layer GaAs growth with As4, As4 incorporation dominates at low temperatures, while As4 and As2 incorporations contribute equally at high temperatures. Surface reaction sequences and mechanisms of arsenic incorporation are discussed and compared with measured RHEED results and previous experimental and theoretical results. © 1996 American Vacuum Society.

I.  INTRODUCTION 

The  growth  of  GaAs  by  molecular  beam  epitaxy  (MBE) 
has  been  studied  extensively  by  in  situ  tools  over  the  past 
thirty  years.  After  the  discovery  that  oscillations  in  the  re¬ 
flection  high-energy  electron  diffraction  (RHEED)  specular 
beam  intensity  correspond  to  layer-by-layer  epitaxy  of 
GaAs,^  the  field  exploded  with  new  investigations  of  Ga  sur¬ 
face  diffusion,^  Ga-stabilized  growth,^"^  surface 
coverages, and  kinetics  of  surface  relaxation  after 
growth. These  experiments  measured  microscopic 
properties  based  on  surface  reconstructions,  which  give  in¬ 
formation  about  the  arrangement  of  surface  species  averaged 
over  a  wide  temperature  range  and  surface  area.  Specular 
beam  intensity,  which  is  a  measure  of  the  surface  roughness, 
however,  is  not  necessarily  a  linear  function  of  the  surface 
coverage.  While  much  information  about  the  surface 
growth  has  been  deduced  from  RHEED  and  other  methods, 
basic  mechanisms  and  rates  for  GaAs  growth  are  still  inac¬ 
cessible:  more  work  is  needed  to  understand  the  reaction 
kinetics  and  incorporation  mechanisms  of  gallium  and  ar¬ 
senic  surface  species  on  a  molecular  level. 

In  situ  tools  that  are  species  specific  are  important  to  gain 
a better understanding of the microscopic picture of GaAs growth mechanisms. Early studies using modulated molecular beam techniques provide the base of kinetic information on GaAs growth, including proposed orders of the reactions involved for growth with As2 (Ref. 15) and As4 (Ref. 16).
Considerable  insight  on  growth  mechanisms  of  GaAs  since 
these  early  studies  has  been  provided  by  Monte  Carlo 
simulations^^“^^  and  other  modeling  work  based  on  RHEED 
reconstructions.^^”^^  Results  from  these  studies  linked  Ga 
migration  to  reactivity,  indicating  that  formation  of  a  surface 
Ga  cluster  is  key  to  GaAs  growth,  but  the  availability  of 
basic  rate  data  of  Ga-As4/Ga-As2  growth  is  still  limited. 

Recent  mass  spectrometric  data  show  promise  that  non- 
intrusive  real  time  monitors  of  specific  surface  species  can 
give  a  better  microscopic  picture  of  the  epitaxy,  providing 
surface  incorporation  mechanisms  and  rates.  Reflection  mass 
spectrometry^"^’^^  (REMS)  has  emerged  as  one  such  method 
to  obtain  information  by  monitoring  gaseous  fluxes  scatter¬ 
ing  or  desorbed  from  a  surface.  REMS  has  been  used  to 
study  gallium  desorption,  growth  of  multiple  quantum  well 
structures,^^  InAs  growth, cation  incorporation  in  other 
III-V  structures, and  GaAs  growth.^^’^^  A  comprehensive 
study on GaAs growth by Brennan et al.^ has provided insight into As4 and As2 incorporation during GaAs MBE growth. Drawbacks of this method include the difficulty of interpreting mass spectra due to fragmentation of molecules by the electron impact ionizer and the inability to measure incident growth species.

This article describes a similar mass spectrometric study of the GaAs growth interface using single-photon ionization time-of-flight mass spectrometry (SPI-TOFMS) concurrently with RHEED. The single-photon ionization method provides a non-intrusive monitor that is species-specific and complements the macroscopic information provided by RHEED. The method detects both incident and scattered or desorbed fluxes. Mass spectra are easily interpreted since the photons used for ionization provide enough energy to ionize, but not fragment, the molecules involved in growth. Quantification of mass signals is provided by relative ionization probabilities and detection efficiencies of incident Ga and As4 and scattered As2 and As4 species. The real time changes in mass signals during growth are related to incorporation of species into the growing wafer and to changes in reactive sticking probabilities on the surface.
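
The photon energy quoted for the 118 nm ionizing light can be checked with E = hc/λ. The short Python sketch below does this and compares the result with the first ionization energy of atomic Ga, taken as the literature value of about 6.0 eV. Thresholds for As2 and As4 are deliberately not included, since they are not given in the text; the function names are illustrative.

```python
PLANCK_EV_S = 4.135667696e-15   # Planck constant in eV*s
SPEED_OF_LIGHT_M_S = 299792458.0

GA_IONIZATION_ENERGY_EV = 5.999  # literature value for atomic Ga


def photon_energy_ev(wavelength_nm):
    """Photon energy in eV for a vacuum wavelength given in nanometers."""
    return PLANCK_EV_S * SPEED_OF_LIGHT_M_S / (wavelength_nm * 1e-9)


def can_ionize(wavelength_nm, ionization_energy_ev):
    """True if a single photon carries at least the ionization energy."""
    return photon_energy_ev(wavelength_nm) >= ionization_energy_ev


if __name__ == "__main__":
    energy = photon_energy_ev(118.0)
    print(f"118 nm photon energy: {energy:.2f} eV")
    print(f"Ionizes Ga in one photon: {can_ionize(118.0, GA_IONIZATION_ENERGY_EV)}")
```

The result agrees with the 10.5 eV stated in the text and comfortably exceeds the Ga threshold.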

In this article, we examine these parameters as a function of both substrate temperature and Ga/As4 flux ratio. During gallium deposition in the range of substrate temperatures studied, the reactive sticking of As4 dominates arsenic incorporation at lower surface temperatures, while the incorporation of both As2 and As4 during the growth with As4 contributes to arsenic incorporation at higher surface temperatures. The reactive sticking of both As2 and As4 increases with increasing incident Ga flux until a gallium flux is reached at which the gallium-rich surface saturates the arsenic sticking kinetics on the surface. An additional arsenic flux is required after Ga deposition is stopped to consume the excess surface gallium and produce a stoichiometric film of GaAs. Mechanisms are proposed to describe the observed changes in sticking probabilities and desorption and incorporation rates. The SPI-TOFMS measurements are compared with the RHEED data as well as with prior theoretical and experimental results.

II.  EXPERIMENTAL  CONFIGURATION 

A schematic of the experiment is shown in Fig. 1. The ultra-high vacuum chamber is equipped with a 400 l/s ion pump and a Ti sublimator to obtain base pressures of 1×10“^ Pa (8×10“^^ Torr). The chamber is equipped with an Auger electron spectrometer, a RHEED spectrometer, a laser ionization source and time-of-flight mass spectrometer, an Ar ion sputter gun, a residual gas analyzer, a nude ionization gauge, and a load lock for sample transfer.

The cryoshielded epitaxial source region contains three
shuttered effusive ovens that are differentially pumped with a
30 l/s ion pump. The arsenic source contains a second stage
of heating for cracking of the As4 molecular beam to an
As2 molecular beam. This oven is positioned 22 degrees below
the surface normal. Although only As4 is used for the
GaAs growth studies, this cracking feature is used to calibrate
the laser ionization efficiencies of As4 vs. As2. The two
ovens containing gallium and silicon (not used) are positioned
at an angle of 42 degrees below the surface normal.

The RHEED gun is positioned so that high energy (20
keV) electrons graze the sample surface and strike a custom
fluorescent screen. The fluorescent screen is positioned
opposite the gun for monitoring the diffraction pattern and
specular beam intensity during growth, but the screen has a
cutaway portion to transmit the laser beam in the opposite
direction to the RHEED electrons. A charge coupled device
(CCD) camera and television monitor, together with a
photodiode positioned to sample the monitor screen, allow
real time measurement of changes in the specular RHEED
beam intensity.

Fig. 1. Schematic illustration of the experimental apparatus.

Single-photon ionization is accomplished by passing a
pulsed beam of 118 nm photons (which have 10.5 eV of
energy) approximately 1 cm in front of the GaAs substrate at
a repetition rate of 10 Hz. Here, the photons ionize a small
fraction of the gaseous incoming, scattering, and desorbing
growth species. The substrate is positioned between the
extraction plates of a Wiley-McLaren TOFMS.^^ The ions
created are extracted perpendicular to the substrate and up a
1 meter time-of-flight tube, which is held at a -2000 V bias.
The ions then come into contact with a focused mesh electron
multiplier detector, which is differentially pumped by a
30 l/s ion pump and is biased at -2600 V. This contact
causes a cascading of electrons whose amplification is directly
proportional to the number of ions. The signal is amplified
and converted to a voltage by a charge-sensitive pre-amp,
and is then sent to the gated integrating boxcars for real
time monitoring of mass peaks or to a digital oscilloscope
and laboratory computer for recording of a mass spectral
trace resulting from a single laser pulse.

The photons used for ionization are created by first
tripling the 1064 nm output of a Nd:YAG laser in KDP crystals.
The characteristics of the resulting 355 nm light are 5 ns
pulse duration, 20-60 mJ/pulse, and 10 Hz repetition rate.
The 355 nm light is focused into a static cell of Xe:Ar
(1400:10300 Pa), where the photon frequency is again tripled
by a four wave mixing process.^^"^^ The conversion efficiency
of this process is estimated from the Pt photodiode
signal to be approximately 10⁻⁵, resulting in hundreds of nJ
of 118 nm light. The 118 nm photons pass through a LiF lens
and window into the chamber and are either focused or
collimated into the extraction region of the TOFMS by the LiF
lens. The remaining 355 nm photons are dispersed in the
UHV chamber. The 118 nm photons enter the chamber next
to the RHEED screen and opposite the RHEED gun. This
coaxial arrangement allows for simultaneous SPI-TOFMS
and RHEED monitoring of epitaxial growth. Other geometries
are also possible in the future, such as a remote location
of the TOFMS to avoid surrounding the GaAs wafer
region with grids and extraction plates.
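The wavelengths and photon energy quoted above can be checked with a few lines of arithmetic. The following is a minimal sketch, not part of the original apparatus description: the function names are illustrative, and the 200 nJ pulse energy is an assumed value chosen only because it falls within the "hundreds of nJ" stated in the text.

```python
# Physical constants (SI, exact by definition).
PLANCK_J_S = 6.62607015e-34
LIGHT_SPEED_M_S = 299_792_458.0
ELECTRON_VOLT_J = 1.602176634e-19


def harmonic_wavelength_nm(fundamental_nm, order):
    """Wavelength of the given harmonic (frequency multiplied by `order`)."""
    return fundamental_nm / order


def photon_energy_ev(wavelength_nm):
    """Photon energy E = h*c/lambda, expressed in eV."""
    energy_j = PLANCK_J_S * LIGHT_SPEED_M_S / (wavelength_nm * 1e-9)
    return energy_j / ELECTRON_VOLT_J


def photons_per_pulse(pulse_energy_j, wavelength_nm):
    """Number of photons carried by a pulse of the given energy."""
    return pulse_energy_j / (photon_energy_ev(wavelength_nm) * ELECTRON_VOLT_J)


# Tripling 1064 nm gives about 355 nm; tripling again gives about 118 nm.
uv_nm = harmonic_wavelength_nm(1064.0, 3)
vuv_nm = harmonic_wavelength_nm(uv_nm, 3)

# Assumed (hypothetical) 118 nm pulse energy of 200 nJ.
n_photons = photons_per_pulse(200e-9, 118.0)
```

Under these assumptions the photon count per pulse comes out on the order of 10¹¹, which gives a sense of scale for the photon number p used in the signal equation later in the article.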

Fluctuations  in  the  power  of  the  118  nm  beam  are  moni¬ 
tored  by  a  platinum  photodiode  placed  near  the  RHEED  gun. 
The  photons  impinge  on  a  piece  of  platinum,  creating  elec¬ 
trons  via  the  photoelectric  effect.  The  electrons  are  collected 
by  a  surrounding  stainless  steel  mesh.  The  fluctuations  in  the 
signal  output  of  this  photodiode  are  used  to  normalize  the 
amplitude  fluctuations  in  the  intensity  of  the  mass  signals. 

Integrating boxcars are gated on the arrival time of
Ga+, As4+, and As2+ ions and the output of the platinum
photodiode.  The  area  under  each  peak  is  calculated  by  the 
boxcar  integrator  on  each  laser  pulse  and  is  recorded  by  the 
laboratory  computer.  Normalization  of  the  mass  signals  is 
discussed  in  detail  below.  The  output  of  the  RHEED  photo¬ 
diode  is  simultaneously  recorded  by  the  computer  to  produce 
a  real  time  record  of  the  surface  smoothness  oscillations  dur¬ 
ing  growth  from  the  corresponding  behavior  of  the  RHEED 
specular  beam  intensity. 

Samples of GaAs (100) are approximately 8 mm × 21 mm
in dimension and are 0.4-0.5 mm thick. The wafers are
cleaned by Ar ion sputtering of the surface to remove carbon
and oxygen contaminants and then heating the wafers radiatively
under a flux of As4 to remove the remaining oxide
layer. Auger electron spectra confirm the removal of carbon
and oxygen from the GaAs surface. The RHEED pattern is
monitored until a 2×4 arsenic-stable reconstruction appears.
A  buffer  layer  of  GaAs  is  grown  before  each  experiment  to 
create  a  smooth  substrate.  The  temperature  is  measured  by  a 
type  K  thermocouple  clamped  to  the  GaAs  surface. 

III.  RESULTS 

A.  Mass  spectra  and  flux  calibrations 

Figure 2(a) shows a time-of-flight mass spectrum resulting
from a summation average of 200 laser pulses monitoring
the gaseous species in front of a GaAs substrate bathed in
As4 and held at 757±5 K. The As4 peak at mass 300 results
from four contributions: incident, specularly scattering,
desorbing, and background tetramer species in the chamber. The
As2 peak results from desorption from the heated substrate
and a small amount of cracking of the As4 by scattered 355
nm photons (<0.4% of As4).^^ In this trace, the As4 incoming
flux comprises (97±3)% of the As4 signal, while the
scattered and desorbed portion represents (3±3)% of the signal.
The signal from background As4 has been subtracted.
Because the laser beam is collimated and positioned low in
front of the substrate in this experiment, the photons intersect
a large portion of the incoming As4 flux. The small peaks to
the right of the arsenic signals are due to ringing in the
electronics during the mass sweep.

Figure 2(b) shows a time-of-flight mass spectrum taken
with 200 laser pulses during gallium deposition. The two
isotopes of Ga appear at masses 69 and 71 u when the shutter
is opened. Arsenic dimer signal is still present, but this signal
has decreased considerably due to incorporation of arsenic
into the growing wafer. The As4 signal has also decreased
due to the increased sticking and incorporation of As4 on
available surface gallium.

Fig. 2. (a) Time-of-flight mass spectrum of a GaAs wafer held at 757 K and
bathed in an As4 flux of 0.1 ML/s, and (b) time-of-flight mass spectrum of a
GaAs wafer held at 757 K under As4 and Ga fluxes (Ga flux = 0.2 ML/s,
As4 flux = 0.1 ML/s).

In the limit of weak photoionization, the mass signals can
be related to the density of molecules according to the equation:

I = n × σ × p × l × E,

where I = number of ions detected, n = density of molecules,
σ = 118 nm ionization cross section, p = the number of photons
per laser pulse, l = the path length of the photons, and
E = collection efficiency of the time-of-flight detector. Typical
values of these parameters for an incoming flux of As4
are as follows: n(As4) = 3 × 10^ molecules/cm³, σ(As4) = 5
× 10~^^ cm², p = 1.5 × 10^^, l = 1 cm, E = 0.005, and
I = 50 ions detected. The number of ions detected was calculated
from the current produced by the electron multiplier
using its specified gain, and the efficiency was derived from
this result.

Fig. 3. Plot of Ga flux determined by the period of RHEED oscillations vs
velocity corrected integrated time-of-flight gallium signal.

Note that the TOF signal is a measure of the density of
molecules. To determine the corresponding flux, one must
apply a velocity correction:

j = n × v,

where j = flux and v = velocity. The As4 molecules emanating
from the oven source travel at a velocity of approximately
200 m/s. In typical experiments, the As4 flux is equivalent to
a deposition rate at the GaAs wafer of 0.1 ML s⁻¹.
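The velocity correction can be illustrated with a short calculation. This is a sketch under stated assumptions rather than the procedure actually used: it takes the Maxwell-Boltzmann mean speed as the beam velocity (an effusive beam has a somewhat different flux-weighted speed), and the 566 K oven temperature is a hypothetical value chosen only because it reproduces the quoted speed of about 200 m/s for As4.

```python
import math

BOLTZMANN_J_K = 1.380649e-23      # Boltzmann constant, J/K
ATOMIC_MASS_KG = 1.66053907e-27   # unified atomic mass unit, kg
MASS_AS4_U = 299.69               # 4 x 74.92 u


def mean_thermal_speed(mass_u, temperature_k):
    """Maxwell-Boltzmann mean speed sqrt(8kT / (pi m)) in m/s."""
    mass_kg = mass_u * ATOMIC_MASS_KG
    return math.sqrt(8.0 * BOLTZMANN_J_K * temperature_k / (math.pi * mass_kg))


def flux_from_density(density_per_cm3, speed_m_s):
    """Flux j = n * v in molecules cm^-2 s^-1 (speed converted to cm/s)."""
    return density_per_cm3 * speed_m_s * 100.0


# Hypothetical oven temperature; not a value reported in the article.
as4_speed = mean_thermal_speed(MASS_AS4_U, 566.0)
```

Because the signal is proportional to density, two species arriving with the same flux but different speeds produce different signals, which is why the correction is applied before signals are compared.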

An advantage of the laser TOF method is that the TOF
signal  is  sensitive  to  slight  changes  in  the  flux  that  are  not 
visible  using  the  thermocouple  readings  on  the  oven  sources 
or  the  chamber  pressure  ion  gauge.  One  can  therefore  closely 
track  sudden  pressure  bursts  from  a  source  or  depletion  of  an 
oven  species. 

Because  the  collection  efficiency  of  the  experimental 
setup  can  vary  from  day  to  day,  calibration  of  the  fluxes  is 
necessary.  The  flux  of  Ga  atoms  impinging  on  the  surface 
was  determined  using  the  RHEED  oscillation  periods  for 
GaAs growth measured over a range of Ga fluxes. Figure 3 is
a plot of the Ga flux, determined from the RHEED oscillation
period, as a function of integrated gallium time-of-flight signal,
corrected for velocity as determined from the gallium source
temperature. The linear behavior of this plot indicates that
the time-of-flight signal accurately measures the incoming
flux of gallium.
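The calibration step above can be sketched as follows, assuming the usual interpretation that one RHEED intensity oscillation corresponds to the growth of one monolayer, so that the growth rate in ML/s is the reciprocal of the period. The periods and signal values below are invented for illustration only and are not data from this work; the function names are likewise hypothetical.

```python
def growth_rate_ml_per_s(period_s):
    """Growth rate in ML/s, assuming one RHEED oscillation per monolayer."""
    return 1.0 / period_s


def slope_through_origin(x, y):
    """Least-squares slope of y = k * x for a line forced through the origin."""
    return sum(a * b for a, b in zip(x, y)) / sum(a * a for a in x)


# Hypothetical example data: RHEED periods (s) and the velocity-corrected
# integrated Ga TOF signal (arbitrary units) recorded at the same oven settings.
periods_s = [10.0, 5.0, 4.0]
ga_signal = [2.0, 4.0, 5.0]

ga_flux = [growth_rate_ml_per_s(p) for p in periods_s]

# Calibration factor converting TOF signal into Ga flux (ML/s per unit signal).
calibration = slope_through_origin(ga_signal, ga_flux)
```

A single proportionality constant describes the data only if the plot is linear, which is the check that Fig. 3 provides.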

The incident As4 flux and contribution of scattered As4
signal were calibrated using the saturation of the As4 sticking
probability on a gallium-rich surface. Second order kinetics
dominates the process, which dictates a maximum sticking
probability for the As4 of 0.5 on a gallium-rich surface. The
change to a gallium-rich surface, in this case, occurs when
the As4 incident flux is only half the incident Ga flux. The
decrease in scattered As4 on a gallium-rich surface should be
one-half of the total amount of scattered As4. The rest of the
signal is assumed to result from incoming and background
tetramer signal.

Based on the incident fluxes of Ga and As4 determined by
RHEED and their respective integrated TOF signals, a relative
flux sensitivity ratio measured with the laser is obtained
for As4:^^Ga; this value is (375±25) for one specific geometry
(see, for example, Fig. 2). The result is reasonable since
the laser is positioned to intersect a greater portion of the
incident As4 beam than the incident Ga beam and the cross
section for ionization of Ga is smaller than for As4. Note,
however, that this ratio is not the difference in ionization
cross sections.

The quantification of scattered fluxes of As2 and As4 is
accomplished by using a measured relative density sensitivity
ratio that relates the TOF signals to the product of their
118 nm ionization cross sections (σ), densities (n), and
collection efficiencies (E):

TOF(As4)/TOF(As2)

= [σ(As4) × E(As4) × n(As4)] / [σ(As2) × E(As2) × n(As2)].

The As4:As2 sensitivity ratio has been determined using two
methods in our laboratory. In the first method, described
previously,^^ the temperature of the cracker stage of the arsenic
oven is varied to produce different relative amounts of
As2 and As4 coming from the oven. Two As2 molecules are
formed from the dissociation of each As4 molecule. When
the temperature of the cracker is raised, any decrease in the
As4 signal is assumed to correspond to a factor of two increase
in the As2 signal. Using this method under a variety of
oven settings, a sensitivity ratio of 2.1±0.4 is measured. This
value  differs  substantially  from  previously  reported  numbers 
from  our  laboratory  due  to  modifications  to  the  time-of-flight 
tube  and  extraction  region,  the  removal  of  an  orifice  in  the 
arsenic  source,  and  the  current  use  of  lower  arsenic  fluxes 
and  higher  laser  powers.  When  prior  experimental  conditions 
were  matched,  a  sensitivity  ratio  of  9±1  was  measured, 
much  closer  to  the  previously  reported  density  sensitivity 
ratio  of  1?^  The  previously  measured  ratio  is  discarded  due 
to  the  many  chamber  modifications,  and  we  believe  this 
newly  measured  ratio  reflects  the  approximate  ratio  of  ion¬ 
ization  cross  sections. 

The  second  method  employed  to  determine  the  sensitivity 
ratio  involves  varying  the  surface  temperature  of  a  molybde¬ 
num  foil  sample  and  comparing  the  relative  amounts  of 
As4 and As2 scattering or desorbing from the substrate.
Again, a decrease in scattering or desorbing As4 is assumed
to correspond to an increase in desorbing As2, due to increased
cracking of As4 to As2 on the heated surface. Using
this  method,  the  sensitivity  ratio  As4:As2  was  found  to  be 
2.2±0.3.  The  measured  density  sensitivity  ratio  for 
As4:As2  is  also  consistent  with  arguments  based  on  the  elec¬ 
tron  configurations  of  the  molecules.^^ 

Fig. 4. Real time monitoring of As4+, As2+, and ^^Ga+ time-of-flight and RHEED photodiode signals. The gallium oven shutter is opened at 20 s and closed
at 45 s. The wafer is bathed in As4 until 85 s, when the arsenic oven shutter is closed.

To convert this sensitivity ratio to a flux ratio, the relative
velocities of scattering As2 and As4 are needed. Corrections
for differences in mass, translational temperature, and angular
distributions of desorbing As2 and As4 are needed to calculate
the relative velocities accurately. The angular distributions
of desorbing As2 and As4 probably do not differ
enough to affect the results within experimental uncertainty.
The temperature of the molecules, however, can affect the
relative velocities more strongly. Foxon and Joyce reported
that desorbing As4 molecules in the temperature range of
300-600 K are completely thermally accommodated to the
GaAs (100) surface. Arthur and Brown investigated the
velocities of As2 and As4 desorbing from a GaAs (111) surface
and found that the velocity distribution of As2 is at a
lower temperature than the surface, while As4 is even lower
in temperature.^^ Despite this result, over the temperature
range we studied, the difference in As2 and As4 velocities
relative to the surface temperature would not affect the final
sensitivity ratio outside the experimental uncertainty, provided
we assume the translational temperatures of the molecules
are within 100 K of one another. The mass correction
is applied and a relative flux sensitivity ratio of 1.5±0.3 is
used to convert the desorbed As2 data to provide the correct
relative fluxes of arsenic. This allows the direct comparison
of trends in As2 and As4 behavior as well as accurate
measurement of the behavior of the total arsenic signal.
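The text does not spell out the arithmetic of the mass correction. One reading that is consistent with the quoted numbers, offered here only as an assumption, is that the density sensitivity ratio is divided by the As2:As4 speed ratio, with thermal speed scaling as the square root of temperature over mass. The sketch below uses that assumed form; the function names are illustrative.

```python
import math

MASS_AS2_U = 149.84   # 2 x 74.92 u
MASS_AS4_U = 299.69   # 4 x 74.92 u


def speed_ratio_as2_to_as4(temp_as2_k, temp_as4_k):
    """Ratio v(As2)/v(As4), with thermal speed proportional to sqrt(T/m)."""
    return math.sqrt((temp_as2_k / MASS_AS2_U) / (temp_as4_k / MASS_AS4_U))


def flux_sensitivity(density_sensitivity, temp_as2_k, temp_as4_k):
    """Assumed conversion: density ratio divided by the As2:As4 speed ratio."""
    return density_sensitivity / speed_ratio_as2_to_as4(temp_as2_k, temp_as4_k)


# Equal translational temperatures: only the mass correction (sqrt(2)) applies.
equal_temperature = flux_sensitivity(2.1, 757.0, 757.0)

# As4 taken 100 K colder than As2, the largest difference the text allows.
offset_temperature = flux_sensitivity(2.1, 757.0, 657.0)
```

With the measured density ratio of 2.1, the equal-temperature case gives a value close to 1.5, and a 100 K difference in translational temperature shifts it by less than the stated ±0.3, in line with the argument made in the text.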

In  summary,  quantification  of  the  relative  fluxes  has  been 
accomplished by calibration of the incoming Ga and As4
fluxes using RHEED and known surface kinetics. Relative
fluxes of As2 and As4 were determined using the measured
density  sensitivity  ratio  and  appropriate  velocity  corrections. 

B.  Real  time  monitoring  of  growth  species 

The gated mass signals provide information on the real
time behavior of the surface growth process. Figure 4 shows
a typical average of four data runs recorded in real time
under closely reproduced conditions of constant oven fluxes
(Ga/As4 = 2.3) and constant substrate temperature (757±5
K). Boxcar integrators are used to measure the ^^Ga, As2,
As4 TOFMS signals, the Pt photodiode signal, and the
RHEED photodiode signal. The mass signals shown have
been normalized to the 118 nm laser power fluctuations. In
this run, the GaAs substrate is initially bathed in a flux of
As4. A desorbed arsenic dimer signal is present due to the
elevated temperature of the GaAs wafer. Both As4 and As2
signals are constant in time, indicating the steady-state
concentration of the surface desorbed and scattered species. At
time=20  s,  the  gallium  shutter  is  opened  and  growth  com¬ 
mences.  The  gallium  time-of-flight  signal  increases  to  a 
steady-state  level,  corresponding  directly  to  the  flux  from  the 
oven. The As2 and As4 signals both decrease and, after passing
through the transient change, reach new steady-state levels.
The RHEED intensity oscillations shown indicate that
Ga-stable growth is occurring, which is further evidenced by
a change from the arsenic-stable 2×4 reconstruction to the
gallium-stable 4×2 reconstruction. When the gallium oven is
closed  at  time=45  s,  the  gallium  signal  drops  to  zero  and  the 
arsenic  signals  recover  to  their  initial  steady-state  values. 
This  recovery  is  slow  due  to  the  excess  surface  gallium 
present  when  growth  is  terminated,  and  indicates  the  con¬ 
tinuing  incorporation  of  arsenic  into  the  growing  layers  of 
GaAs. At 85 s, the arsenic shutter is closed to obtain a measure
of background As4 and As2 in the chamber. After each
run, the surface is annealed to 810±5 K in a flux of arsenic
and is cooled to the temperature of interest at a rate of
approximately 1 K/s. Subsequent runs of data under nearly
identical conditions are performed after the initial RHEED
intensities match. Note that the scatter in the As4 signal is
clearly larger than that in the As2 signal. We believe that the
large flux of the tetramer molecule is affecting the noise
created at the multiplier detector, possibly due to breakup of
the larger tetramer molecules into multiple fragments. The
gain on the detector, however, must be maintained at a level
at which the As2 and Ga mass signals can also be observed.
Therefore, all of the data presented in this article will show a
relatively large uncertainty for the tetramer molecules.

Fig. 5. Real-time behavior of a hypothetical arsenic time-of-flight signal.
The gallium shutter is opened at 20 s and closed at 45 s. Level
1 - Level 3 = the steady-state decrease in arsenic signal, and area 2 = the
total incorporation of arsenic into the growing wafer.

Figure 5 shows an example of a hypothetical arsenic
TOFMS signal during deposition to explain the terminology
used for the remainder of the results. The signal shown is
analogous to either of the As2 or As4 signals displayed in
Fig. 4. The “initial steady-state signal” is shown by the line
marked 1 and describes the amount of arsenic present in the
gas phase while the wafer is being bathed in an As4 flux prior
to gallium deposition. The level of the signal marked 3 is
termed “steady-state signal during growth” and refers to the
amount of arsenic present in the gas phase while GaAs
growth is occurring. The term “steady-state decrease in signal”
refers to the quantity obtained when the level of the
dashed line marked 3 is subtracted from the level of the
dashed line marked 1. This quantity can be likened to a change
in the reactive sticking on the surface or a change in desorption
rate, both of which indicate incorporation into the growing
wafer. A transient region of arsenic incorporation is also
visible, but it is not considered here. Level 3 only includes
the portion of signal that has reached a new steady-state level
with time. The term “total integrated incorporation” refers to
area 2 in the figure. This value is a measure of all the arsenic
incorporated into the GaAs layers grown during and after
gallium deposition. The empirical units of measured steady-state
levels of arsenic are integrated TOFMS signal. Arsenic
signals are normalized using the relative sensitivity ratios
described earlier. The empirical units of incorporation are
integrated TOFMS signal × time. The relative flux and mass
corrections are made so that the steady-state levels of arsenic
are directly proportional to the flux of arsenic atoms and the
measured total integrated incorporation of arsenic is proportional
to coverage of arsenic atoms.
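The two quantities defined with reference to Fig. 5 can be written down compactly. The sketch below is illustrative only: it assumes area 2 is the area between the initial steady-state level and the measured trace, integrated by the trapezoidal rule, and it uses an invented trace rather than data from this work.

```python
def steady_state_decrease(level_before, level_during):
    """Level 1 minus level 3: the drop in gas-phase arsenic signal during growth."""
    return level_before - level_during


def total_integrated_incorporation(times_s, signal, level_before):
    """Area between the initial level and the trace (signal x time), trapezoidal rule."""
    area = 0.0
    for i in range(1, len(times_s)):
        dt = times_s[i] - times_s[i - 1]
        deficit_prev = level_before - signal[i - 1]
        deficit_next = level_before - signal[i]
        area += 0.5 * (deficit_prev + deficit_next) * dt
    return area


# Hypothetical trace: the signal drops from 10 to 6 while the Ga shutter is
# open and then recovers to its initial level.
times_s = [0.0, 10.0, 20.0, 30.0, 40.0]
trace = [10.0, 6.0, 6.0, 6.0, 10.0]

decrease = steady_state_decrease(10.0, 6.0)
incorporation = total_integrated_incorporation(times_s, trace, 10.0)
```

Because the integration runs until the signal has recovered, the area includes arsenic taken up after the gallium shutter closes, which is why it can keep growing with gallium flux even after the steady-state decrease has saturated.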

The ^^Ga, As2, and As4 TOF and RHEED signals were
recorded for each growth run in real time at a constant substrate
temperature of 757 K and over a gallium flux range of
0.025-0.3 ML/s. As shown in Fig. 4, the shuttering scheme
begins with bathing the surface in a 0.1 ML/s flux of As4,
which is sufficient to maintain an arsenic-stable RHEED
reconstruction. The gallium oven shutter is opened at time=10
s and closed at time=45 s. The surface is then allowed to
“recover,” if necessary, under a flux of As4 until time=85 s,
when the As4 shutter is closed to measure background arsenic
species. Four runs of data under identical flux conditions
are averaged, and every ten data points of the result are then
averaged; these values are used to obtain the steady-state decreases
in arsenic species and integrated arsenic incorporation.
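The two averaging steps described above can be sketched as follows. This is a minimal illustration, assuming the runs are sampled on a common time base and that any incomplete final block is simply dropped; neither detail is specified in the text, and the function names are hypothetical.

```python
def average_runs(runs):
    """Point-by-point mean of several runs sampled on the same time base."""
    return [sum(values) / len(values) for values in zip(*runs)]


def block_average(trace, block=10):
    """Mean of each consecutive block of `block` points; a partial tail is dropped."""
    usable = len(trace) - len(trace) % block
    return [sum(trace[i:i + block]) / block for i in range(0, usable, block)]


# Hypothetical example: two flat runs of 20 points each.
runs = [[1.0] * 20, [3.0] * 20]
smoothed = block_average(average_runs(runs), block=10)
```

Averaging ten consecutive laser pulses at 10 Hz corresponds to one point per second, which trades time resolution for a lower scatter in the plotted signals.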

Figures 6(a)-(c) demonstrate the behavior of the steady-state
decrease in As4 (a), As2 (b), and total arsenic (c) as a
function of velocity-corrected and integrated gallium TOF
signal (which is directly proportional to gallium flux). The
results indicate that the reactive sticking of As4, As2, and
total arsenic increases with increasing gallium flux, but that
the sticking is eventually saturated when enough oversupply
of gallium is added to produce a gallium-stable surface. Panels
(d)-(f) of Fig. 6 show the total integrated incorporation
of As4 (d), As2 (e), and total arsenic (f) at each gallium flux.
These panels display an increasing linear trend in arsenic
incorporation with increasing gallium flux. This linear behavior
is a reflection of the additional incorporation of arsenic
that occurs after the gallium oven shutter is closed and
growth is terminated. The incoming arsenic tetramer flux
eventually consumes the excess surface gallium at the higher
gallium fluxes to produce stoichiometric GaAs.

The ^^Ga, As2, and As4 TOF and RHEED signals were
also recorded in real time over the substrate temperature
range of 620-830 K. The surfaces were initially bathed in a
0.2 ML/s flux of As4, which is sufficient to maintain an
arsenic-stable RHEED reconstruction over the temperature
range studied. The Ga oven shutter was opened at time=20 s,
producing a Ga flux of 0.28 ML/s, and was closed at
time=65 s. The As4 shutter was closed at time=85 s to measure
the background arsenic. At each surface temperature,
two runs of data were collected and averaged together.

Figure 7 is a plot of the initial As2 steady-state signal
level as a function of substrate temperature. This figure
shows that the rate of desorption of As2 increases with increasing
surface temperature. Figures 8(a)-(c) include plots
of the steady-state decreases in As4 (a), As2 (b), and total
arsenic (c) as a function of substrate temperature during
growth. Panels (d)-(f) display the total integrated incorporation
of As4 (d), As2 (e), and total arsenic (f) into the growing
wafer as a function of substrate temperature. Panels (a) and
(d) of Fig. 8 show that reactive sticking and incorporation
rate of As4 dominate the incorporation of arsenic into the
growing wafer at lower surface temperatures, while panels
(b) and (e) of Fig. 8 show that reactive incorporation of
As2 increases to be almost equal to that of As4 at higher surface
temperatures. The observed trends indicate a change in the
mechanisms for growth of GaAs over this temperature range,
and the data reflect a correlation in the incorporation probabilities
of As4 vs. As2. Panels (c) and (f) show that the total
arsenic reactive sticking and incorporation rate has a slight
downward slope with increasing surface temperature, which
suggests that the surface Ga population decreases at higher
surface temperatures. Figure 9 shows the integrated Ga TOF
signal over the temperature range studied at a constant Ga
flux, which is a measure of the incident gallium atoms. The
upward slope of this plot indicates that there may be an increase
in the gas phase gallium atoms at elevated surface
temperatures, also implying that the sticking probability of
gallium decreases slightly at higher surface temperatures.

Fig. 6. Steady-state decreases in (a) As4, (b) As2 and (c) total arsenic, and
total incorporation of (d) As4, (e) As2 and (f) total arsenic as a function of
velocity corrected integrated Ga TOF signal when growing GaAs with As4.

IV.  DISCUSSION 

We now turn to a more detailed discussion of Figs. 6-9.
In Section A, a discussion of the GaAs surface prior to gallium
deposition includes a proposed reaction scheme and
suggests mechanisms to describe the observed trends in the
initial steady-state arsenic signals. The surface during gallium
deposition and growth is covered in Section B. Another
reaction scheme is proposed with a discussion of mechanisms
for each reaction. The effect of Ga/As4 flux ratio is
described in Section B (1) along with a more detailed discussion
of Fig. 6. Section B (2) covers the effect of substrate
temperature on the growth of GaAs. Figures 8 and 9 are
described in more detail in terms of the proposed reactions
and mechanisms.

Fig. 7. The effects of varying the substrate temperature on the initial steady-state
As2 time-of-flight signal under a constant As4 flux.


A.  The  surface  prior  to  gallium  deposition 

In the temperature range of our measurements and in the
absence of the gallium flux, As2 can desorb from the surface
and/or the bulk of the GaAs wafer. The incoming As4 molecules
can then replace that arsenic lost from the wafer. The
following reaction scheme is developed to describe these
events:

As4(in) → As4(g)   or   As4(in) → As4(ph) → As4(g),   (1)

2As4(ph) → 2As2 + As4(g)   or   2As4(ph) → 2As4(g),   (2)

GaAs(b) → Ga(ch) + As2(g),   (3)

As2(ch) → As2(g),   (4)

where in = incoming flux, ch = a chemisorbed state of the
molecule, ph = a physisorbed state of the molecule, b = bulk,
and g = a molecule in the gas phase (either specularly scattered
or desorbed from the surface).

Fig. 8. Steady-state decreases in (a) As4, (b) As2 and (c) total arsenic, and
total incorporation of (d) As4, (e) As2 and (f) total arsenic as a function of
substrate temperature.

First, it is important to discuss which species are measured with the TOFMS. Reaction (1)
describes what the incoming tetramer species can do once in contact with the surface. In these
experiments we cannot distinguish between As4 species that are specularly reflected [in this
equation, denoted by As4(g)] and those which are desorbed from the surface [here, denoted by
As4(ph)→As4(g)]. The differences between these two gaseous As4 molecules involve the geometry
with which they scatter from the surface, their surface residence time, and the temperature
with which they leave the surface. Specularly reflected or directly "scattered" arsenic
tetramers would leave the surface at or near the specular angle from the incoming beam and move
with a velocity similar to that of the incoming tetramer molecules, assuming there is little
loss of energy to the wafer during contact. According to a standard model of surface
desorption, desorbing tetramer molecules that are completely thermally accommodated would leave
the surface in a cos(θ) distribution (where θ is the angle measured from the normal to the
surface) and travel at a velocity determined by the temperature of the surface. The barrier to
desorption of physisorbed As4 has been measured to be 0.38 eV. At the surface temperatures
involved in this study, the surface lifetime of As4 would be <10 ^ s, which is not measurable
experimentally using our technique. Further, the spatial difference between the two types of
molecules cannot be resolved because the laser beam probes a broad angular region in our
system. A decrease in the detection of gas phase tetramer species is assumed to correspond to
an increase in chemisorbed As4. The wafer is therefore bathed in excess As4 so that physisorbed
As4 is in constant supply as a precursor to chemisorption. The As2 molecules detected must all
result from desorption, since As2 is not provided externally.

Fig. 9. Integrated gallium time-of-flight signal as a function of substrate temperature.
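
The short surface lifetime quoted above for physisorbed As4 can be illustrated with a Frenkel-type estimate, tau = (1/nu) exp(E_des / k_B T). The sketch below uses the 0.38 eV barrier from the text; the attempt frequency of 1e13 s^-1 and the three temperatures are assumptions chosen only for illustration and are not taken from the measurements.

```python
import math

K_B_EV = 8.617333262e-5  # Boltzmann constant in eV/K


def residence_time(barrier_ev, temperature_k, attempt_hz=1e13):
    """Frenkel estimate of the mean surface residence time in seconds.

    attempt_hz is an assumed pre-exponential factor, not a measured value.
    """
    return math.exp(barrier_ev / (K_B_EV * temperature_k)) / attempt_hz


# Illustrative temperatures only; the actual substrate temperatures are
# those of the experiment and are not reproduced here.
for t_k in (700.0, 800.0, 900.0):
    tau = residence_time(0.38, t_k)
    print(f"T = {t_k:.0f} K -> tau ~ {tau:.1e} s")
```

Under these assumptions the lifetime is far below a microsecond at any plausible growth temperature, which is consistent with the statement that it cannot be resolved by the technique.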

Reaction (2) describes how physisorbed As4 can chemisorb to the surface. In the absence of
gallium, As4 has a sticking coefficient of zero on an arsenic-stable surface at these surface
temperatures. However, when gallium is present, As4 binds to the surface via precursor
[As4(ph)] mediated chemisorption, which is dissociative and is known to obey second order
kinetics, as demonstrated by Foxon and Joyce. Reaction (3) describes the behavior of a GaAs
wafer that is heated in the absence of an incoming arsenic flux. When heated, arsenic desorbs
from the bulk as dimers, leaving gallium to form islands on the wafer surface, in turn driving
the chemisorption of As4. Therefore, as the surface temperature increases, one would expect a
decrease in the amount of As4(g) signal, corresponding to increased chemisorption of the
tetramer. Initial signal levels of As4 do indeed decrease with increasing surface temperature.

Reaction (3) describes one source of the As2(g) signal. At an elevated surface temperature,
arsenic from bulk GaAs has enough energy to overcome the barrier to desorption. Second, because
the initial surface is kept under sufficient As4 flux to maintain the arsenic-stable 2×4
reconstruction, arsenic is assumed to exist as dimers on the surface. These dimers will also
desorb at an elevated surface temperature [Reaction (4)]. One would therefore expect an
increase in As2 signal as the surface temperature is raised. The data indeed show that the
initial gas phase As2 increases (Fig. 7) with increasing surface temperature.

A fit of the data displayed in Fig. 7 was attempted to elucidate an activation energy for
desorption. Measurement of the amount of As2 desorbing from the surface was assumed to be a
measure of the overall rate of Reaction (4):

$$\mathrm{rate} \propto k \times \theta^{n},$$

where n = the order of the reaction with respect to coverage of arsenic, θ = coverage, and
k = rate constant for As2 desorption. However, an Arrhenius plot using either a reaction order
of one or two showed non-linearity, indicating that the activation energy and, therefore, the
rate constant most likely depend on coverage. Further experiments that measure the absolute
value of the rate constant or coverage are necessary for a complete kinetic analysis. In
summary, the initial TOFMS levels of As2 and As4 increase and decrease, respectively, with
increasing surface temperature.
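
The Arrhenius test described above can be sketched as follows: for an assumed reaction order n, ln(rate / coverage^n) is fitted against 1/T with a straight line, and the residual shows whether that order linearizes the data. The temperatures, coverages, prefactor, and 2.0 eV activation energy below are entirely synthetic placeholders invented for this illustration; they are not the measured data, which did not yield a linear plot for either order.

```python
import math

K_B_EV = 8.617333262e-5  # Boltzmann constant in eV/K


def arrhenius_fit(temps_k, rates, coverages, order):
    """Least-squares line through ln(rate / coverage**order) versus 1/T.

    Returns (activation energy in eV, intercept, largest absolute residual).
    """
    xs = [1.0 / t for t in temps_k]
    ys = [math.log(r / c ** order) for r, c in zip(rates, coverages)]
    n = len(xs)
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    sxx = sum((x - x_mean) ** 2 for x in xs)
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = y_mean - slope * x_mean
    worst = max(abs(y - (slope * x + intercept)) for x, y in zip(xs, ys))
    return -slope * K_B_EV, intercept, worst


# Synthetic first-order data (hypothetical values, for illustration only).
temps = [800.0, 820.0, 840.0, 860.0, 880.0]
coverage = [0.9, 0.8, 0.7, 0.6, 0.5]
rates = [1e13 * math.exp(-2.0 / (K_B_EV * t)) * c
         for t, c in zip(temps, coverage)]

ea1, _, resid1 = arrhenius_fit(temps, rates, coverage, order=1)
ea2, _, resid2 = arrhenius_fit(temps, rates, coverage, order=2)
print(f"order 1: Ea = {ea1:.3f} eV, max residual = {resid1:.2e}")
print(f"order 2: Ea = {ea2:.3f} eV, max residual = {resid2:.2e}")
```

With the correct order the residual vanishes and the assumed activation energy is recovered; with the wrong order the plot curves. When neither order gives a straight line, as reported above, a coverage-dependent activation energy is one natural explanation.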

B.  The  surface  during  gallium  deposition  and  growth 

The  growing  surface  of  GaAs  is  considerably  more  com¬ 
plicated  than  the  initial  wafer  surface.  The  following  reac¬ 
tions  describe  the  interaction  of  incoming  gallium  with  the 
arsenic species present on the surface before growth. Reactions (1)-(4) will still be referred
to in this section, and additionally:


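$$\mathrm{Ga(in)} \rightarrow \mathrm{Ga}_n\mathrm{(ch)} + \mathrm{Ga(g)}, \tag{5}$$

$$\tfrac{1}{2}\,\mathrm{As_2(ch)} + \tfrac{1}{n}\,\mathrm{Ga}_n \rightarrow \mathrm{GaAs(b)}, \tag{6}$$

$$2\,\mathrm{As_4(ph)} + \mathrm{Ga}_n \rightarrow 2\,\mathrm{As_2(ch)} + \mathrm{As_4(g)}. \tag{7}$$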

The term Ga_n(ch) is used here to represent the fact that gallium can form different sized
islands on the surface after deposition. Reaction (5) describes the behavior of the incoming
flux of gallium atoms [Ga(in)]. At the surface temperatures used in this study, we assume that
the incoming gallium chemisorbs on the surface with a sticking coefficient of unity at the
lowest substrate temperatures. The entire Ga TOFMS signal below this temperature is assumed to
result from the incident flux of gallium atoms, while at higher surface temperatures the
scattered flux from the heated surface additionally contributes to the signal. The surface
temperature can also affect the size of these Ga clusters and the mobility of Ga atoms. The
interaction of the surface gallium with As2 and As4 is described by Reactions (6) and (7).

Reaction (6) describes the incorporation of As2 present on the surface with available Ga to
form GaAs. A possible mechanism for this reaction includes a slow step in which the gallium
must come into contact with As2 either by landing on top of, or diffusing to, As2 sites. The
fast step of GaAs formation would occur provided enough energy is available to break As-As
bonds and form Ga-As bonds. This proposed model assumes, however, that As2 is relatively static
on the arsenic-stable surface and not appreciably mobile. In other words, the barrier to
diffusion for As2 is assumed to be approximately equal to or greater than the barrier to
desorption of As2. A bonding argument can support this assumption. Four bonds to underlying Ga
atoms must be broken or lengthened for each arsenic dimer to induce diffusion. It may be
electronically more feasible for a second and third bond to form within the arsenic dimer
species once these surface bonds are broken than for four new surface bonds to form elsewhere
on the arsenic-stabilized GaAs. In terms of Reaction (6), an increase in chemisorbed gallium
would increase the GaAs formation rate and decrease the surface desorption rate of As2(g) via
Reaction (4).

Reaction (7) describes a reaction that is governed by the same factors as Reaction (3).
Available surface gallium still drives the chemisorption of As4, but the arsenic is now
supplied by incoming tetramers rather than by available dimers on the surface. The same
precursor mediated dissociative chemisorption mechanism [As4(ph)] applies in this case. Once
arsenic dimers are chemisorbed, formation of GaAs with underlying Ga_n via Reaction (6) is
rapid. Therefore, an increase in chemisorbed gallium is expected to drive the chemisorption of
As4 and result in a decreased rate of scattered or desorbed As4(g). Both Reactions (6) and (7)
predict that decreases in the As4 or As2 TOFMS signals can be directly related to incorporation
of arsenic into the growing GaAs wafer.

1. Effect of Ga/As4 flux ratio

During deposition, the steady-state decreases in arsenic TOFMS signals are affected by the
incoming gallium flux. Figures 6(a)-6(c) show the change in these values with increasing
gallium flux. As the Ga flux is raised, the steady-state decreases of As4 and As2 also rise.
Available gallium sites increase the sticking probability of As4 on the surface and decrease
the desorption rate of As2 from the surface. Eventually, the steady-state decreases are
saturated by excess available Ga. The second order reaction of As4 on a Ga-covered surface
predicts that the As4 sticking coefficient will become 0.5 on a gallium-rich surface. The
Ga-rich condition occurs when flux(Ga) ≥ 2 × flux(As4) and establishes the break in the linear
increase of the steady-state signals shown in Figs. 6(a)-6(c). The As2 desorption also is
saturated at this Ga flux value, indicating that the usual desorption sites have been blocked
by available gallium, and this also coincides with the As4 turnover point when excess gallium
is being supplied to the surface.
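
The factor of two in the Ga-rich condition follows from simple atom counting: each As4 carries four arsenic atoms, and with a limiting sticking coefficient of 0.5 at most two of them are incorporated per incident tetramer. A minimal sketch of that bookkeeping, with hypothetical function names and flux values in arbitrary but common units:

```python
def max_arsenic_atom_flux(as4_flux, max_sticking=0.5):
    """Largest As atom incorporation rate available from an As4 flux.

    Four atoms per tetramer times the limiting sticking coefficient of 0.5
    gives two incorporated As atoms per incident As4.
    """
    return 4.0 * max_sticking * as4_flux


def is_gallium_rich(ga_flux, as4_flux):
    """True when Ga arrives faster than arsenic can be incorporated."""
    return ga_flux >= max_arsenic_atom_flux(as4_flux)


# Illustrative fluxes only (same arbitrary units for Ga atoms and As4).
for ga in (1.0, 2.0, 3.0):
    print(ga, is_gallium_rich(ga, as4_flux=1.0))
```

Below the threshold every Ga atom can find an arsenic partner and growth is Ga limited; above it the surplus Ga accumulates on the surface.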

The change in the RHEED diffraction pattern at this point shows an alteration in the surface
reconstruction from the arsenic-stable 2×4 to the gallium-stable 4×2 during growth, which
further supports that the surface is now gallium rich. The period of the RHEED oscillations
also indicates that growth of GaAs on a Ga-stable surface is occurring. Once the surface is
gallium rich, formation of GaAs layers is limited by the available incident arsenic flux. The
period of the RHEED oscillation begins to reflect the rate of arsenic atom incorporation rather
than the rate of gallium deposition, as it does at lower Ga/As4 flux ratios. Growth of GaAs can
no longer proceed stoichiometrically during deposition of gallium at these flux ratios.

Once the gallium shutter is closed, any excess surface Ga continues to react with incoming
arsenic since chemisorption of As4 is driven by available Ga sites. After the gallium shutter
is closed, arsenic is still incorporated into the wafer. "Tails" appear in the As2 and As4 mass
signals vs time, indicating that reactions of arsenic species with excess surface gallium are
still occurring. Figures 6(d)-6(f) describe the changes in the total incorporation of arsenic
species as a function of Ga flux. Panel (f) illustrates that the incorporation of total arsenic
is indeed linearly increasing through the entire Ga flux range and indicates that growth will
proceed stoichiometrically if the surface remains under sufficient As4 flux. Because the Ga and
As4 ovens were shuttered at identical times for all data runs, the surfaces under the highest
gallium fluxes were not given enough time to recover before the As4 shutter was closed for
background arsenic readings. Therefore, the two points at the highest gallium fluxes in panels
(d)-(f) lie below the line used to fit the arsenic incorporation as a function of gallium flux.
The behavior of desorbing arsenic mass signals vs Ga flux matches the arsenic incorporation
results obtained by Brennan, Tsao, and Hammons in their REMS studies as well as early studies
by Foxon and Joyce. In both studies, saturation of reactive sticking of arsenic species and
additional sticking on gallium-rich surfaces after termination of growth were observed.

2.  Effect  of  surface  temperature 

The effect of surface temperature on the incorporation of arsenic is less straightforward than
the effect of gallium flux. Figures 8(a) and 8(b) show the steady-state decreases in As4 and
As2 signals during gallium deposition as a function of the surface temperature. With increasing
surface temperature, the steady-state decrease of As4 becomes smaller and the steady-state
decrease of As2 becomes larger, indicating that the rate of reactive sticking and incorporation
of As4 dominates at lower surface temperatures, while incorporation of As2 becomes more
significant at the higher surface temperatures.

This observed trend can be explained in terms of relative rates of diffusion of surface species
and surface Ga cluster size. At low surface temperatures, Ga on the surface is unable to
diffuse appreciably [activation energy for Ga diffusion ≈1.3 eV (Ref. 2)], but the mobility of
the gallium atoms increases with increasing surface temperature. Arsenic dimers are still
assumed to be static on the surface and will desorb rather than diffuse at elevated surface
temperatures. The arsenic tetramer molecules, however, have a very low activation barrier to
diffusion, 0.25 eV, and can diffuse appreciably at all surface temperatures studied.
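
The contrast in mobility described above can be made concrete by comparing Arrhenius hop rates for the two quoted barriers (about 1.3 eV for Ga and 0.25 eV for As4). The sketch below assumes the same attempt frequency of 1e13 s^-1 for both species and uses illustrative temperatures; neither assumption comes from the measurements.

```python
import math

K_B_EV = 8.617333262e-5  # Boltzmann constant in eV/K


def hop_rate(barrier_ev, temperature_k, attempt_hz=1e13):
    """Arrhenius hop rate in s^-1; attempt_hz is an assumed prefactor."""
    return attempt_hz * math.exp(-barrier_ev / (K_B_EV * temperature_k))


# Illustrative temperatures only.
for t_k in (700.0, 800.0, 900.0):
    ga = hop_rate(1.3, t_k)
    as4 = hop_rate(0.25, t_k)
    print(f"T = {t_k:.0f} K: Ga {ga:.1e} /s, As4 {as4:.1e} /s, "
          f"ratio {as4 / ga:.1e}")
```

Because the prefactor cancels in the ratio, the conclusion that As4 hops many orders of magnitude more often than Ga depends only on the barrier difference and the temperature, and the Ga rate rises much more steeply with temperature than the As4 rate.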

The surface temperature can also affect the size of the gallium clusters on the surface. At low
surface temperatures, gallium may be able to diffuse across the surface and find another
gallium atom or cluster, bind there, and form a two-dimensional cluster. The desorption energy
of Ga from a Ga cluster [≈2.7 eV (Ref. 42)] can be assumed to be an upper limit on the energy
of activation for Ga-Ga cluster bond breakage and subsequent Ga diffusion. Because this energy
is still substantially greater than the barrier to diffusion, low surface temperatures may not
allow gallium atoms to break away from the cluster and diffuse again as atoms on the surface.
At higher surface temperatures, the gallium atoms can overcome this barrier and diffuse across
the surface, forming Ga-As bonds.

Monte Carlo simulations and other theoretical studies have postulated that the formation of
two-dimensional gallium deposits on the surface is key to arsenic incorporation. Although the
details of incorporation mechanisms differ, most studies involve As2 as the reactive incident
arsenic species and require formation of a two-dimensional gallium cluster composed of up to
four gallium atoms for reaction with incoming arsenic. Steric factors may lead one to suspect
that gallium cluster size is also an important factor for formation of GaAs using an incident
flux of As4.

The relative diffusion rates of Ga, As2, and As4 could affect the reaction to form GaAs in the
following way. At low temperatures, one might expect that Ga would not only form large
clusters, but would be more accessible to reaction with incoming As4, since Ga and As2 are
relatively static compared to diffusing As4. The next step would be the diffusion of
physisorbed As4 to Ga reaction centers [Reaction (7)], followed by GaAs formation via Reaction
(6), in which As2 is provided by dissociative chemisorption of As4 on gallium. One would then
expect to see a greater contribution to incorporation of arsenic in GaAs from As4 at low
temperatures and a subsequently larger relative decrease in As4(g), as seen in Fig. 8(a).

At higher temperatures, gallium is able to diffuse to the static As2 sites on the surface. The
pathway of Reaction (6) contributes more to the GaAs growth. Gallium atoms diffuse directly to
As2 sites and insert into dimer bonds, forming GaAs and decreasing the desorption of As2(g)
considerably. The lack of availability of Ga reaction centers for As4 chemisorption, in turn,
would result in a smaller sticking coefficient of As4 and a small relative steady-state
decrease in As4(g).

The total steady-state decrease in arsenic described in Fig. 8(c) has a slightly negative slope
(−3.7 ± 0.7 × 10“^ [integrated TOF total arsenic signal/temperature (K)]), indicating that at
the lowest temperatures the reactive sticking coefficient of arsenic on the surface during
growth is increased. The chemisorption of both arsenic dimer and tetramer is driven by the
amount of gallium on the surface [Reactions (6) and (7)]. Figure 9 shows a slight increasing
trend in the amount of gas-phase gallium as the surface temperature is increased, indicating
that there may be some scattered gallium from the heated GaAs surface. This correlates with a
decrease in the sticking probability of incident gallium atoms on GaAs at higher surface
temperatures. Arthur also observed a decrease in the sticking coefficient of gallium on GaAs at
surface temperatures higher than 750. This decrease in available surface gallium may, in turn,
decrease the reactive sticking of both arsenic species.

The total incorporation of arsenic is affected by these proposed changes in reactive sticking.
Figure 8(f) has a slope of −1.3 ± 0.6 × 10”^ [integrated TOF total arsenic signal × time
(s)/temperature (K)], indicating that the total arsenic incorporation decreases with increasing
surface temperature. The total incorporation is limited by the amount of surface gallium
available during the entire growth run. If the sticking coefficient of gallium decreases, then
the total amount available for film growth also decreases.

The changes in the signals observed in Figs. 8(c), 8(f), and 9 reflect very small changes in
the actual fluxes of gallium and arsenic species. The measured change in the growth rate
obtained from the period of the RHEED oscillations, from 0.26 to 0.28 (bilayers GaAs)/s over
this temperature range, is an indication that only small changes in the scattered flux of
gallium and the total incorporated fluxes of arsenic species occur.

The  TOFMS  results  are  shown  to  be  quite  sensitive  to 
changes  in  the  surface  temperature  during  growth,  allowing 
us  to  explore  possible  mechanisms  for  incorporation.  The 
proposed  mechanisms  are  supported  by  arguments  based  on 
previously  measured  activation  energies  of  diffusion  and  for¬ 
merly  obtained  theoretical  results. 

V.  CONCLUSIONS 

We  have  measured  changes  in  the  gas  phase  steady-state 
levels of As2, As4, and Ga species during homoepitaxy of
GaAs  with  simultaneous  SPI-TOFMS  and  RHEED.  Results 
obtained  by  varying  the  Ga/As4  flux  ratio  are  consistent  with 
observed  RHEED  reconstructions  and  intensity  oscillations 
as  well  as  previously  obtained  results  measured  with  modu¬ 
lated  molecular  beam  experiments  and  REMS.  Results  ob¬ 
tained  by  varying  the  substrate  temperature  are  also  consis¬ 
tent  with  RHEED  observations.  A  proposed  model  based  on 
theoretical  results  and  formerly  measured  activation  energies 
for  diffusion  describes  the  incorporation  of  arsenic  into  GaAs 
in  terms  of  gallium  surface  mobility  and  sticking  probability. 

SPI-TOFMS  is  a  non-intrusive  tool  that  complements  the 
macroscopic  information  obtained  by  RHEED  measurements 
by  providing  real  time  species-specific  data  on  surface  mol¬ 
ecules.  This  method  also  permits  measurement  of  growth 
rates  and  incorporation  over  a  larger  range  than  that  of 
RHEED  alone.  Growth  mechanisms  and  kinetic  data  on 
other  MBE  systems  should  also  be  easily  elucidated  by  using 
this technique.

We report the growth and characterization of 1.55 μm wavelength GaInAsP based semiconductor
lasers grown by solid source molecular beam epitaxy. Quaternary compositions were reproducible
over time. Photoluminescence and x-ray diffraction spectra indicate abrupt quantum well
interfaces. Separate confinement heterostructure laser diodes with four quantum wells had
threshold current densities as low as 580 A/cm² and 275 A/cm² for unstrained Ga0.47In0.53As and
strained Ga0.27In0.73As0.8P0.2 wells, respectively. These results are as good as the best
results reported for similar lasers grown by any growth technique. © 1996 American Vacuum
Society.


I. INTRODUCTION

Thin films of GaxIn(1-x)AsyP(1-y) can be grown with lattice constants equal to InP and direct
band gap energies between 0.75 and 1.35 eV. This material system has been studied extensively
for its applications to 1.3 and 1.55 μm wavelength semiconductor lasers. A number of crystal
growth techniques, including gas source molecular beam epitaxy (GSMBE), chemical beam epitaxy
(CBE), and metalorganic vapor phase epitaxy (MOVPE), are successful in producing high quality
unstrained 1.55 μm separate confinement heterostructure (SCH) lasers. Graded index separate
confinement heterostructures (GRINSCH) and compressive strain lead to further improvements in
performance. By optimizing these parameters, threshold current densities as low as 100 A/cm²
have been achieved for single quantum well GRINSCH lasers.

Valved effusion cells containing solid arsenic and phosphorus allow for precise control of
group V fluxes in MBE. These cells replace the toxic gases arsine and phosphine commonly used
for GaxIn(1-x)AsyP(1-y) epitaxy by their less toxic, solid counterparts. By combining two
valved effusion cells, arsenide/phosphide heterostructures with abrupt interfaces and
GaxIn(1-x)AsyP(1-y) films lattice matched to InP have been grown by solid source MBE. These
advances led to the first report of a GaxIn(1-x)AsyP(1-y) based SCH laser diode, operating at
1.35 μm, grown by solid source MBE. In this article, we report GaxIn(1-x)AsyP(1-y) based
1.55 μm SCH laser diodes grown by solid source MBE and compare their performance to laser
diodes grown by GSMBE, CBE, and MOVPE.

The number of quantum wells, amount and sign of strain, and GRINSCH versus SCH core structure
all affect the performance of laser diodes. To compare laser diodes grown by solid source MBE
to those grown using other techniques, we chose structures similar to those reported by other
researchers. We first grew a simple double heterostructure (DH) laser diode with an emission
wavelength of 1.35 μm. Next, we grew two SCH laser diodes designed for emission at 1.55 μm. The
first contained four unstrained Ga0.47In0.53As quantum wells, whereas the second contained four
compressively strained Ga0.27In0.73As0.8P0.2 quantum wells. The threshold current densities
(J_th) for unstrained SCH broad area laser diodes are comparable to reports in the literature.
Broad area lasers made from the SCH with four compressively strained quantum wells had J_th
values as good as or better than reports of similar laser diodes grown by GSMBE, CBE, or MOVPE.

II.  EXPERIMENT 

The material was grown in a VG V80 MBE machine pumped by a 3000 l/s cryopump and a 400 l/s ion
pump. Standard effusion cells containing elemental sources supplied the fluxes of gallium and
indium as well as the n- and p-type dopants, silicon and beryllium. The InP growth rate was 1.3
monolayers/s, and the quaternary growth rates were near 1.8 monolayers/s. Two independent
valved effusion cells (EPI Components, Inc.) were used to supply and regulate group V fluxes of
As4 and P2 with cracker temperatures of 500 and 1000 °C, respectively. One cell contained
elemental arsenic. The other cell was loaded with elemental red phosphorus that was later
converted to white phosphorus before growth.

The laser diode structures were grown on 2 in. sulfur-doped (100) InP substrates misoriented 4°
towards [111]A. Wafers were etched for 1 min in a solution of 4 H2SO4 : 1 H2O : 1 H2O2, rinsed,
and exposed to an ultraviolet-ozone cleaning step for 3 min. In the growth chamber, the
substrate was exposed to a phosphorus flux (P2) with a beam equivalent pressure (BEP) of
1.5×10“^ mbar and heated to 500 °C. At this temperature, the surface reconstruction changed
from a diffuse 1×1 to a 2×4 pattern as seen by reflection high energy electron diffraction
(RHEED). The temperature was increased an additional 25 °C and held for several minutes before
dropping it to the growth temperature of 475 °C, which was constant during growth.

The layer structure for the DH laser diode consisted of 0.5 μm Si-doped (2×10^^) InP, 0.2 μm
unintentionally doped Ga0.29In0.71As0.65P0.35, 2.0 μm Be-doped (2×10^^) InP, and a cap layer of
0.2 μm Be-doped (5×10^^) Ga0.47In0.53As. The unstrained SCH laser diode structure was 1 μm of
Si-doped (5×10^^) InP, a 0.24 μm unintentionally doped core, 1.5 μm of Be-doped (5×10^^) InP,
and a 0.2 μm Be-doped (1×10^^) Ga0.47In0.53As cap. The core consisted of
Ga0.27In0.73As0.57P0.43 (band gap wavelength λg = 1.27 μm) and four lattice matched 65 Å
Ga0.47In0.53As quantum wells separated by 125 Å. The strained SCH laser diode had a similar
structure except for the core, which contained four compressively strained (ε = +0.7%) 105 Å
Ga0.27In0.73As0.8P0.2 quantum wells separated by 100 Å. Interruptions of 10 s were inserted at
each interface during the growth to change the group V fluxes.

III.  RESULTS  AND  DISCUSSION 

The compositions of the layers in the laser diodes were confirmed from the characterization of
Ga0.27In0.73As0.57P0.43 and Ga0.47In0.53As reference layers grown before and after the lasers.
The precise compositions were determined from the lattice mismatch and the band gap. Symmetric
(004) rocking curves acquired by a double-crystal x-ray diffractometer were used to examine the
lattice mismatch of the quaternary reference films. All films were nearly lattice matched with
InP. The full width at half-maximum (FWHM) linewidths of the quaternary peaks (30 arcsec) were
broader than the linewidths of the InP substrate peaks (20 arcsec). Band gaps were measured
using room temperature photoluminescence (PL). The quaternary PL peaks had FWHM linewidths
between 42 and 48 meV, which are similar to the best results reported for MBE and comparable to
other growth techniques.

The reproducibility of the group V composition over time was remarkable. Lattice matched
quaternary films were grown by MBE by simply adjusting the group V fluxes to a calibrated value
as measured by an ion gauge. Three lattice matched quaternary films with compositions near
Ga0.27In0.73As0.57P0.43 were grown, each separated from the others by a period of over a week.
Before each growth, the As4 and P2 fluxes were adjusted to give BEPs, measured by an ion gauge,
of 1.3×10“^ mbar for As4 and 1.7×10"^ mbar for P2. The lattice mismatch (Δa/a) varied within
±0.0005, corresponding to a change in As mole fraction of less than ±0.015. By converting the
BEPs into atomic fluxes, the relative atomic incorporation coefficient of As4 to P2 for these
growth conditions was found to be 9:1 and is similar to the values reported by others.

The laser diode samples were also characterized using photoluminescence and x-ray diffraction.
The cap and the upper cladding layer were removed from the SCH lasers before collecting the PL
spectra. The PL spectrum from the unstrained SCH laser is shown in Fig. 1(a). The peak at
1.57 μm (FWHM 27 meV) corresponds to the electron to heavy-hole transition. The electron to
light-hole transition appears as a higher energy shoulder. Figure 1(b) is the corresponding
spectrum at low temperature (15 K). The peak position and linewidth are 1.45 μm and 6 meV,
respectively. This narrow linewidth is evidence of abrupt interfaces and uniformity of the four
quantum wells. A smaller peak appears at 22 meV lower energy and is likely associated with an
acceptor bound exciton transition. The PL results from the strained SCH are similar. The room
and low temperature PL peaks appear at 1.55 and 1.43 μm, respectively, with FWHM linewidths of
25 and 7 meV.

Fig. 1. Photoluminescence spectra at (a) 300 K and (b) 15 K from a SCH with unstrained quantum
wells.
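
Because the PL peak positions above are quoted as wavelengths while the linewidths are quoted in meV, it can help to convert the peaks to photon energies using E = hc/λ. The sketch below does this for the four peak wavelengths given in the text; the helper name is hypothetical and the only constant used is hc ≈ 1.2398 eV·μm.

```python
HC_EV_UM = 1.239841984  # h*c in eV·μm


def photon_energy_ev(wavelength_um):
    """Photon energy in eV for a vacuum wavelength given in micrometers."""
    return HC_EV_UM / wavelength_um


# Peak wavelengths quoted in the text: (300 K, 15 K) for each structure.
peaks_um = {"unstrained": (1.57, 1.45), "strained": (1.55, 1.43)}

shifts_mev = {}
for name, (room_um, cold_um) in peaks_um.items():
    e_room = photon_energy_ev(room_um)
    e_cold = photon_energy_ev(cold_um)
    shifts_mev[name] = 1000.0 * (e_cold - e_room)
    print(f"{name}: {e_room:.3f} eV at 300 K, {e_cold:.3f} eV at 15 K, "
          f"shift {shifts_mev[name]:.1f} meV")
```

On this scale the low-temperature linewidths of 6 and 7 meV are roughly a tenth of the shift of the peak between 300 K and 15 K.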

Symmetric (004) x-ray rocking curve spectra were analyzed for each laser diode sample. The
spectrum from the strained SCH laser structure is shown in Fig. 2(a). The central peak results
from the InP substrate and cladding layers. The Ga0.47In0.53As cap and Ga0.27In0.73As0.57P0.43
core layers are nearly lattice matched to InP and appear as a weak shoulder on the central
peak. Three superlattice satellite peaks can be resolved to the left of the InP central peak.
These result from the long range periodicity introduced within the core by the four strained
quantum wells. A simulated spectrum was obtained using a commercially available dynamical
simulation and appears in Fig. 2(b). The best fit was found for a model structure with abrupt
interfaces and the quaternary compositions given above. A superlattice periodicity of 220 Å,
which is close to the 205 Å value expected, gave the best fit of simulated to experimental
data.

Fig. 2. Symmetric (004) x-ray diffraction spectra from a SCH with compressively strained wells:
(a) experiment; (b) simulation.

Fig. 3. Plot of J_th vs inverse cavity length (1/cm) for SCH laser diodes with (a) unstrained
and (b) compressively strained quantum wells.
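
The expected 205 Å period is simply the sum of the 105 Å well and 100 Å barrier thicknesses given in the growth description. The sketch below reproduces that sum and, as a rough kinematical estimate, the angular spacing of adjacent satellites, Δθ ≈ λ / (2 Λ cos θ_B). The x-ray wavelength (Cu Kα1, 1.5406 Å) and the InP lattice constant (5.8687 Å) are assumptions introduced here; the source does not state the radiation used.

```python
import math


def superlattice_period(well_a, barrier_a):
    """Superlattice period in angstroms: one well plus one barrier."""
    return well_a + barrier_a


def satellite_spacing_arcsec(period_a, wavelength_a=1.5406,
                             d_spacing_a=5.8687 / 4.0):
    """Approximate spacing of adjacent satellites near a symmetric reflection.

    Defaults assume Cu K-alpha1 radiation and the InP (004) plane spacing;
    both are assumptions, not values taken from the source.
    """
    theta_b = math.asin(wavelength_a / (2.0 * d_spacing_a))  # Bragg angle
    delta_theta = wavelength_a / (2.0 * period_a * math.cos(theta_b))
    return math.degrees(delta_theta) * 3600.0


expected = superlattice_period(105.0, 100.0)
fitted = 220.0
print(f"expected period {expected:.0f} A, fitted {fitted:.0f} A, "
      f"difference {100.0 * (fitted - expected) / expected:.1f}%")
print(f"satellite spacing: {satellite_spacing_arcsec(expected):.0f} arcsec "
      f"(expected), {satellite_spacing_arcsec(fitted):.0f} arcsec (fitted)")
```

Under these assumptions the roughly 7% difference between the fitted and nominal periods corresponds to a change of several tens of arcseconds in satellite spacing.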

Broad area laser diodes were processed from the three laser samples. The n contact was formed
by blanket metalization with AuGe/Ni/Au, and the p contact was defined by 100-μm-wide Ti/Pt/Au
stripes. A selective etch was used to remove the 0.2 μm Ga0.47In0.53As contact layer to obtain
electrical isolation. The facets were formed by cleaving and were uncoated. Threshold current
densities were determined using the dimensions of the p contacts, and no correction was made
for current spreading.

Laser diodes produced from the DH laser structure had performance characteristics typical of high quality material. Diodes with cavity lengths of 1 mm had an emission wavelength of 1.35 μm and a threshold current density J_th of 900 A/cm². The characteristic temperature T_0 was 62 K. Unstrained SCH laser diodes also performed well. The emission wavelength for a cavity length of 1 mm was 1.57 μm, and J_th versus inverse cavity length is plotted in Fig. 3(a). J_th was 580 A/cm² for a cavity length of 2 mm. This value is comparable to values from the literature of 500 to 700 A/cm² for similar devices. The internal quantum efficiency η_i was 0.75 and the internal loss α was 20 cm⁻¹. T_0 was measured for temperatures between 15 and 55 °C and found to be 55 K.
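
To make the meaning of the characteristic temperature concrete, the sketch below assumes the usual empirical relation J_th(T) = J_0 exp(T/T_0), which the text does not state explicitly. The function names are illustrative, and the only inputs taken from the text are the 15 to 55 °C measurement range, T_0 = 55 K, and the 580 A/cm² threshold.

```python
import math


def threshold_ratio(t_low_c, t_high_c, t0_kelvin):
    """Ratio J_th(t_high) / J_th(t_low), assuming J_th(T) = J_0 * exp(T / T_0)."""
    return math.exp((t_high_c - t_low_c) / t0_kelvin)


def characteristic_temperature(t1_c, j1, t2_c, j2):
    """T_0 in kelvin from two (temperature, threshold current density) points."""
    return (t2_c - t1_c) / math.log(j2 / j1)


# Threshold increase expected over the 15-55 degC range for T_0 = 55 K.
ratio = threshold_ratio(15.0, 55.0, 55.0)

# Round trip: two thresholds generated with that ratio give back T_0 = 55 K.
t0_check = characteristic_temperature(15.0, 580.0, 55.0, 580.0 * ratio)
print(f"J_th(55 degC) / J_th(15 degC) = {ratio:.2f}, T_0 = {t0_check:.1f} K")
```

Under this assumption a smaller T_0 means a stronger increase of the threshold with temperature.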

The J_th values for laser diodes processed from the SCH containing four compressively strained (ε=+0.7%) quantum wells are plotted as a function of inverse cavity length in Fig. 3(b). Diodes had J_th values as low as 275 A/cm² for a cavity length of 2.5 mm. The values of T_0 (47 K), η_i (0.67), and α (12 cm⁻¹) are also similar to published results. The spectrum of a 500-μm-long device is shown in Fig. 4.
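
The text does not say how η_i and α were extracted. A common approach relates them to the external differential efficiency through the mirror loss, and the sketch below illustrates that relation under stated assumptions: the facet reflectivity of 0.3 is an assumed value for uncoated cleaved facets, and the function names are illustrative. Only the η_i and α values are taken from the text.

```python
import math


def mirror_loss(length_cm, reflectivity=0.3):
    """Mirror loss (1/cm) of a cavity with two identical facets of given reflectivity."""
    return math.log(1.0 / reflectivity) / length_cm


def external_efficiency(eta_i, alpha_i, length_cm, reflectivity=0.3):
    """External differential efficiency eta_d = eta_i * a_m / (alpha_i + a_m)."""
    a_m = mirror_loss(length_cm, reflectivity)
    return eta_i * a_m / (alpha_i + a_m)


# (eta_i, alpha in 1/cm) as quoted in the text for the two SCH structures.
devices = {"unstrained SCH": (0.75, 20.0), "strained SCH": (0.67, 12.0)}

for name, (eta_i, alpha_i) in devices.items():
    for length_cm in (0.05, 0.1, 0.2):
        eta_d = external_efficiency(eta_i, alpha_i, length_cm)
        print(f"{name}, L = {length_cm * 10:.1f} mm: eta_d = {eta_d:.3f}")
```

In this picture a straight-line fit of 1/η_d against cavity length has intercept 1/η_i and slope α/(η_i ln(1/R)), which is one way such parameters are commonly obtained.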

The J_th values reported above are better than results reported for GSMBE and CBE and as good as the best results reported for four quantum well SCH laser diodes with compressive strain grown by MOVPE. The best J_th values reported for these devices decrease with increasing strain: 324 A/cm² (ε=+0.9%), 286 A/cm² (ε=+1.2%), and 268 A/cm² (ε=+1.8%). We expect further reductions in J_th below the 275 A/cm² (ε=+0.7%) reported in this letter by increasing the strain.

Fig. 4. Emission spectrum of a 500-μm-long SCH laser diode.

IV.  CONCLUSION 

We have shown that solid source MBE can produce high quality GaInAsP based laser diodes on InP. Quaternary compositions can be reproduced by simply adjusting the group V fluxes, as measured by an ion gauge, to a calibrated value before each growth. Quantum well interfaces are abrupt as determined by photoluminescence and x-ray diffraction measurements. The threshold current densities of 1.55 μm SCH laser diodes grown by solid source MBE are comparable with the best results reported to date for any growth technique.

In this work we report on the in situ investigation of MBE grown ZnSe surfaces performed simultaneously by reflectance anisotropy spectroscopy (RAS) and reflection high-energy electron diffraction (RHEED). Reconstructions and anisotropic reflectance of the ZnSe(001) surface are studied in the temperature range from 50 °C to 380 °C. With increasing temperature the ZnSe surface evolves from a Se-rich c(2×2)Se via a Se-rich (2×1) to a Zn-rich c(2×2)Zn surface reconstruction, with the transition temperatures depending on whether the surface is Se stabilized or not. Each surface reconstruction as verified by RHEED is accompanied by a characteristic RAS spectrum. Time resolved measurements of the RAS signal at fixed photon energies allowed the determination of the activation energy (0.7 eV) for Se desorption from the (2×1) reconstructed ZnSe(001) surface. © 1996 American Vacuum Society.


I.  INTRODUCTION 

Since reflectance anisotropy spectroscopy (RAS; also known as reflectance difference spectroscopy, RDS) was introduced as a new surface sensitive optical technique, many studies on III-V semiconductor surfaces have been performed by a number of groups, establishing correlations between the surface reconstructions measured by reflection high-energy electron diffraction (RHEED) and the reflectance anisotropy spectra. In this work we report on a comparative RHEED and RAS investigation of the II-VI semiconductor surface ZnSe(001).

Wide gap ZnSe-based II-VI compounds are of increasing interest due to their application in blue-green laser diodes operating at room temperature. One of the reasons for the rather short lifetime of these laser diodes is dislocations formed at the ZnSe/GaAs interface. It is therefore important to analyze the growth of wide gap II-VI compounds on GaAs substrates in more detail. One of the most widely used in situ characterization methods during ZnSe molecular beam epitaxy (MBE) is reflection high-energy electron diffraction (RHEED). Two dominating ZnSe(001) surface reconstructions have been found, a (2×1) and a c(2×2), assigned to Se-rich and Zn-rich surface reconstructions, respectively. Recently, applying the electron counting rule and performing first principles total-energy calculations, it was stated in Refs. 12 and 13 that the Zn-rich c(2×2) surface reconstruction is characterized by a θ_Zn=0.5 monolayer (ML) Zn coverage that does not form any dimer bonds. If the ZnSe surface is exposed to a sufficiently high Se beam pressure, a Se-rich (2×1) reconstruction is typically formed, characterized by a θ_Se=1.0 ML Se coverage with one Se-Se dimer per surface unit mesh. This is supported by recently published STM results: Se dimers oriented along [110] have been identified on (2×1) reconstructed ZnSe overlayers (2 ML thick) on GaAs(001). However, high resolution STM images of relaxed ZnSe(001) surfaces are, to our knowledge, still lacking.

Recently, the existence of an additional Se-rich c(2×2) reconstruction at low temperatures and under extremely Se-rich conditions was predicted from total energy calculations by Garcia and Northrup. [For distinction and clarity we use the following notation: Zn-rich c(2×2) = c(2×2)Zn and Se-rich c(2×2) = c(2×2)Se.]

Reflectance anisotropy spectroscopy (RAS), which we have applied to the ZnSe(001) surfaces in addition to RHEED, is an optical in situ technique highly sensitive to surface effects on cubic III-V and II-VI semiconductors while suppressing their isotropic bulk contribution. RAS spectra yield information on the type, direction, and concentration of surface dimers with only limited and indirect response to long range order (see, e.g., Ref. 3 and references therein). RAS is therefore complementary to RHEED measurements, but for II-VI semiconductors it has been applied up to now only ex situ for the contactless determination of carrier concentration in doped and oxidized ZnSe(001) layers.

In this work the Se-rich c(2×2)Se reconstruction of ZnSe(001) predicted by Garcia and Northrup is experimentally verified by RHEED. A RAS data base is obtained for the complete set of reconstructions characteristic of the (001) surface of relaxed ZnSe in the 50 °C to 380 °C substrate temperature range. The RHEED patterns, which are well proven as an effective MBE growth monitor, are used here to assign surface reconstructions to the related characteristic RAS features. Because RAS can monitor local changes at the surface before long range order is established, it is also used in this work to study the dynamics of Se desorption from the ZnSe(001) surface in UHV.

II. EXPERIMENT

The experiments were performed in a II-VI growth chamber for the ZnSe material system that is connected via a UHV module track to a separate GaAs growth chamber (within a 6-chamber RIBER 2300 system). Elemental Zn and Se source materials of 6N purity were used. The RHEED system is equipped with a 35 keV electron gun, and the reflection patterns can be monitored with a CCD camera. Substrates were exactly oriented (±0.5° off) GaAs(001) wafers that were chemically etched before being loaded into the system. GaAs buffers (200 nm, undoped) were grown in the III-V chamber at 600 °C to obtain an atomically flat and clean surface. After transfer into the II-VI MBE chamber, a well ordered As-rich c(4×4) reconstruction was verified by RHEED and RAS at a substrate temperature of T≈50 °C. Subsequently, at T=300 °C a ZnSe buffer was grown thick enough (about 1.5 μm) to suppress Fabry-Perot interferences at photon energies above the E_0 gap (about 2.7 eV) of ZnSe. Below 2.7 eV ZnSe is optically transparent, and the RAS spectra are therefore dominated by strong interference patterns due to multiple reflections between the ZnSe/GaAs interface and the sample surface. Because of the GaAs-ZnSe lattice mismatch the ZnSe relaxes when a buffer thickness of about 200 nm is reached. Therefore, in the about 300 nm thick near-substrate region of the ZnSe layer the density of dislocations is very high, yielding a significant optical bulk anisotropy. On the other hand, once the ZnSe buffer is grown thick enough (>1 μm), only a few dislocations reach the surface according to cross-sectional transmission electron microscopy images. This is in line with the observation of RHEED patterns typical for two-dimensional growth.

For photon energies less than 2.7 eV the superposition of dislocation induced bulk anisotropy, surface anisotropy, and interference effects complicates the analysis of the RAS spectra. Therefore we discuss in the following only the 2.7 eV to 5.0 eV RAS features. These are predominantly surface related, i.e., they probe only the uppermost monolayers of the sample. The RAS system, which follows the standard design of Aspnes, was mounted at a strain-reduced pyrometer window of the MBE chamber.

The real part of the reflectance anisotropy

$$\frac{\Delta r}{r} = 2\,\frac{r_{[\bar{1}10]} - r_{[110]}}{r_{[\bar{1}10]} + r_{[110]}}$$

is presented in the following. Below the E_1 bulk critical point, i.e., below about 4.5 eV for ZnSe, the real part of the reflectance anisotropy is directly related to the imaginary part of the surface dielectric anisotropy $(\epsilon_{[\bar{1}10]} - \epsilon_{[110]})$. Therefore it mirrors the anisotropic absorption properties of the reconstructed ZnSe surface. The imaginary part of Δr/r was also measured and used to check the Kramers-Kronig consistency between the real and imaginary parts of the measured data, thus minimizing systematic errors. The influence of residual window strain on the RAS signal was compensated by taking spectra in two perpendicular sample orientations. The window anisotropy cancels out when the two spectra are subtracted from one another.
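
The cancellation of the window contribution can be illustrated with a short sketch. It assumes the conventional normalized-difference definition of the RAS signal and treats the window strain as a small additive offset that does not change when the sample is rotated by 90°. All reflectance values and the offset are hypothetical numbers chosen for illustration.

```python
def reflectance_anisotropy(r_1bar10: complex, r_110: complex) -> complex:
    """Normalized difference of the complex reflectances along the two surface axes."""
    return 2 * (r_1bar10 - r_110) / (r_1bar10 + r_110)


def cancel_window(spectrum_0deg, spectrum_90deg):
    """Half the difference of two spectra taken in perpendicular sample orientations."""
    return [(a - b) / 2 for a, b in zip(spectrum_0deg, spectrum_90deg)]


# Hypothetical complex reflectances (r along [-110], r along [110]) at three energies.
r_pairs = [
    (0.5012 + 0.1004j, 0.5000 + 0.1000j),
    (0.5230 + 0.1210j, 0.5241 + 0.1202j),
    (0.5475 + 0.1530j, 0.5460 + 0.1541j),
]

sample = [reflectance_anisotropy(a, b) for a, b in r_pairs]
# Rotating the sample by 90 degrees swaps the two axes, so the sample signal flips sign.
rotated = [reflectance_anisotropy(b, a) for a, b in r_pairs]

window_offset = 3e-3 + 1e-3j  # assumed orientation-independent window contribution
measured_0deg = [s + window_offset for s in sample]
measured_90deg = [s + window_offset for s in rotated]

recovered = cancel_window(measured_0deg, measured_90deg)
for true_value, value in zip(sample, recovered):
    print(f"true Re = {true_value.real:+.5f}, recovered Re = {value.real:+.5f}")
```

The factor of one half is a choice made here so that the recovered spectrum has the same scale as a single measurement.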

Light induced heating effects and UV-photon activated reactions can cause growth modifications, as is known from ZnSe photo-MBE. The 75 W Xe-arc lamp used in our RAS setup yields a total energy flux of about 0.3 W/cm² to the sample, with only about 0.1 W/cm² in the spectral region above the ZnSe E_0 gap. Therefore, and because no change in the spectra was observed when part of the light was blocked with filters, photo-induced modifications of the ZnSe surface can be excluded.

III.  RESULTS  AND  DISCUSSION 

The by now well studied GaAs(001) surface is known to form a variety of As-rich reconstructions [c(4×4), γ(2×4), β(2×4), α(2×4)] with As coverages decreasing with increasing substrate temperature (1.75, 1.0, 0.75, and 0.5 monolayers, respectively). To check whether several Se-rich surface reconstructions can similarly be found on ZnSe, we studied the ZnSe(001) surface in the 50 °C to 380 °C substrate temperature range. To start from a very Se-rich surface, the sample was initially capped in situ at T=50 °C with an amorphous (a-)Se layer. The results of these experiments are summarized in Fig. 1. The spectra were taken with increasing substrate temperature under two conditions: without Se stabilization [Fig. 1(a)] and under a Se beam pressure of p_Se=2.8×10~^ Torr [Fig. 1(b)]. At about 100 °C the Se cap desorbs and a c(2×2) RHEED pattern appears with clear reconstructions in the [100] and [010] directions. To our knowledge this is the first observation of a c(2×2)Se reconstruction at low temperatures and under extremely Se-rich conditions, which was predicted recently from total energy calculations by Garcia and Northrup (Refs. 12 and 13). The RAS signal is positive in the complete spectral range and features a characteristic minimum (termed B in Fig. 1) at about 3.7 eV. This RAS feature, found in our experiments to be typical for this reconstruction, should be related to the Se trimers which according to Refs. 12 and 13 form an energetically preferred c(2×2)Se reconstruction with 1.5 ML Se surface coverage. The authors of Ref. 13 stated that this c(2×2)Se is unlikely to be prepared from Se capped surfaces. This is apparently incorrect, at least for the in situ capped and decapped samples used in our experiments. However, the RAS feature B is more pronounced under Se flux, indicating that a sufficiently high Se supply is essential for the stabilization of the c(2×2)Se reconstruction.

Further increase in temperature transforms the surface into a (2×1) reconstruction, with the transition temperature depending on whether a stabilizing Se flux is used or not (about 220 °C and 150 °C, respectively). The RAS spectrum of this well known Se-rich (2×1) reconstruction is characterized by a minimum at about 3.0 eV (termed A in Fig. 1) and a positive RAS signal increasing with photon energy to a maximum at about 4.8 eV. We attribute this characteristic RAS minimum A to Se dimers, found to be typical for the (2×1) surface by first principles total-energy calculations and by relative peak intensity analysis in grazing incidence x-ray scattering experiments. In the 4.8 eV to 5.5 eV photon energy range a sharp decrease to negative RAS values is found. This feature was labelled E_1 in Fig. 1 because its energy position is close to the ZnSe E_1 bulk critical point, which might be modified by surface effects or remaining strain of the incompletely relaxed ZnSe layer.

Fig. 1. RAS spectra taken while heating a freshly grown and subsequently Se capped thick ZnSe layer on GaAs(001) (a) without Se stabilization and (b) under a constant Se pressure of p_Se=2.8×10“^ Torr. Horizontal axis: photon energy (eV). The characteristic RAS features are labeled A, B, and E_1. We assume A and B to be dimer and trimer related resonances, respectively. E_1 is close to a bulk critical point. The RAS zero level and the substrate temperature belonging to each spectrum are given in the right part of the figure. The surface reconstructions measured by RHEED synchronously with the RAS spectra are given in the left part of the figure.


Under Se stabilization the (2×1) reconstruction is stable in the complete temperature range from 220 °C to 380 °C [Fig. 1(b)], with the RAS features A and E_1 shifting to lower energies with increasing temperature. Without Se stabilization the surface transforms at about 260 °C to the Zn-rich c(2×2)Zn reconstruction. The related RAS spectrum is quite flat in the complete spectral range except for the 4.8 eV E_1 feature. This is again consistent with the total-energy calculations in Refs. 12 and 13, where it was concluded that the c(2×2)Zn surface reconstruction is characterized by a Zn coverage of θ=0.5 ML that does not form any dimer bonds.

After monitoring the transformations of the ZnSe surface occurring with increasing substrate temperature, the inverse process was studied by slowly cooling the sample down to 50 °C under constant Se flux. Both the c(2×2)Se and the (2×1) reconstruction were found again.

The Fabry-Perot interferences at photon energies below the E_0 bulk critical point of ZnSe at about 2.6 eV could not be interpreted quantitatively, even for known ZnSe thicknesses, due to the superposition of surface anisotropy and bulk anisotropy effects. The contribution of bulk anisotropy is obvious in the a-Se RAS spectrum in Fig. 1(b). The surface of amorphous cap layers is known to be isotropic and in fact gives a zero line signal in the high photon energy part of the spectrum. Below 2.7 eV, where the Se cap is sufficiently transparent, predominantly ZnSe bulk anisotropies originating from the partially relaxed and dislocation rich region close to the ZnSe/GaAs interface should contribute to the interference modified RAS features. Forthcoming experiments with ZnSe layers grown on ZnSe(001) substrates should allow the investigation of surface anisotropy in this low-energy spectral range as well.

In a further set of experiments we studied dynamic changes of the ZnSe(001) surface stoichiometry. For this purpose the Se desorption with increasing substrate temperature was monitored with 100 ms time resolution. The 3.1 eV photon energy was chosen for this experiment because it is sensitive to both the presumably dimer related RAS minimum A and the trimer related feature B [Fig. 1(a)]. The result of the desorption experiments is given in Fig. 2. The surface phase transition from the c(2×2)Se to the (2×1) reconstruction is clearly shifted towards higher temperatures by about 50 K in the case of Se stabilization.

Fig. 2. Se desorption as monitored by RAS at 3.1 eV while gradually heating the ZnSe sample, which is a-Se capped (circles) at the beginning of the experiment. The reconstruction as measured by RHEED is indicated by symbols: c(2×2)Se (triangles), (2×1) (squares), and c(2×2)Zn (diamonds). The filled symbols indicate p_Se=0 Torr and the open symbols are used for p_Se=2.8×10”^ Torr. Above 250 °C the ZnSe surface can be switched from the Se-rich (2×1) reconstruction to the Zn-rich c(2×2)Zn reconstruction simply by closing the Se shutter. The inset gives the substrate temperature dependence of the related Se desorption rate that was determined from RAS transients taken at 3.1 eV during this Se desorption process.

At substrate temperatures above 250 °C the phase transition from the (2×1) to the c(2×2)Zn reconstruction can be driven just by switching off the Se flux, and thus the Se desorption can be studied. The slope of the logarithmically plotted transients from the (2×1) to the c(2×2)Zn RAS level at 3.1 eV yields the Se desorption rate. From the slope of the Arrhenius plot of the desorption rate (inset in Fig. 2) an activation energy of 0.7 eV for the Se desorption from a (2×1) reconstructed ZnSe surface is determined. This result agrees within ±0.1 eV with our recently published RHEED intensity transient experiments in the same temperature range. In general, however, RHEED experiments have yielded activation energies in the relatively wide 0.5 eV to 1.2 eV range, depending on substrate temperature, electron energy, and evidently other experimental details.
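
The Arrhenius analysis described above can be sketched as a straight-line fit of ln(rate) against 1/T. The example below is a minimal illustration, not the authors' procedure: the temperatures, the prefactor, and the desorption rates are synthetic values generated for a 0.7 eV barrier, and the function name is hypothetical.

```python
import math

K_B_EV_PER_K = 8.617333262e-5  # Boltzmann constant in eV/K


def arrhenius_fit(temps_c, rates):
    """Least-squares line through ln(rate) vs 1/T; returns (E_a in eV, prefactor)."""
    xs = [1.0 / (t + 273.15) for t in temps_c]
    ys = [math.log(r) for r in rates]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    slope = (sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
             / sum((x - mean_x) ** 2 for x in xs))
    intercept = mean_y - slope * mean_x
    # rate = A * exp(-E_a / (k_B * T))  =>  slope = -E_a / k_B, intercept = ln(A)
    return -slope * K_B_EV_PER_K, math.exp(intercept)


# Synthetic desorption rates (hypothetical prefactor of 1e6 per second, E_a = 0.7 eV).
temps_c = [260.0, 280.0, 300.0, 320.0, 340.0]
synthetic_rates = [1e6 * math.exp(-0.7 / (K_B_EV_PER_K * (t + 273.15))) for t in temps_c]

activation_energy, prefactor = arrhenius_fit(temps_c, synthetic_rates)
print(f"E_a = {activation_energy:.3f} eV, prefactor = {prefactor:.3g} 1/s")
```

With measured data, each rate would itself come from the slope of a logarithmically plotted RAS transient at fixed temperature, as described in the text.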

Finally, in preliminary experiments RAS and RHEED have been used to monitor the growth of ZnSe on ZnSe. At our standard growth conditions (300 °C substrate temperature, growth rate 500 nm/h) RHEED gave a (2×1) pattern, i.e., the same reconstruction as before growth under Se flux only. RAS, in contrast, sensitively monitors the changing surface status due to growth initiation: the RAS signature A disappears and a fairly flat RAS spectrum is measured during growth, similar to that of the c(2×2)Zn reconstructed ZnSe(001) surface in Fig. 1(b). We assume that the Se dimers causing the RAS signature A of the (2×1) pregrowth reconstruction are mostly broken when growth is started.

IV.  SUMMARY 

In this work RAS reference spectra have been determined for ZnSe(001) surfaces characterized by RHEED under MBE conditions. The Se-rich c(2×2)Se surface predicted recently by Garcia and Northrup in Refs. 12 and 13 was verified. A reflectance anisotropy feature at 3.6 eV characteristic for this surface reconstruction was related to the Se trimers which are energetically preferred according to the total-energy calculations in Refs. 12 and 13. Under Se flux and at substrate temperatures above 200 °C we found the well known Se-rich (2×1) reconstruction featuring a probably Se-dimer related RAS minimum at 3.0 eV and a strong RAS maximum close to the ZnSe E_1 bulk critical point. Without Se stabilization and at substrate temperatures above 260 °C the ZnSe surface is Zn-rich c(2×2)Zn reconstructed. The related RAS spectrum is flat with a minimum close to the E_1 bulk critical point.

Photoluminescence and x-ray photoelectron spectroscopy have been used to study the solvent effect
on  surface  properties  of  GaAs  passivated  in  different  sulfide  solutions.  It  has  been  found  that  sulfur 
passivation  could  be  made  more  efficient  by  decreasing  the  solution  dielectric  constant  through  the 
use  of  various  alcohols  as  solvents.  Specifically,  the  band  edge  photoluminescence  intensity  is  much 
higher,  the  total  amount  of  oxides  is  lower,  and  sulfur  coverage  is  higher  compared  with  a  GaAs 
surface  treated  in  an  aqueous  sulfide  solution.  The  role  of  the  solvent  in  sulfur  passivation  is 
discussed.  ©  1996  American  Vacuum  Society. 


I.  INTRODUCTION 

It is known that the surface treatment of III-V semiconductors in sulfide-containing solutions or gases results in a dramatic decrease of the surface state density in the middle of the band gap. The attendant reduction of the surface recombination velocity allows one to improve the performance and reliability of many devices. However, this improved surface electronic quality is not long lived, so the problems of increasing passivation efficiency and stability remain unsolved.

To improve the properties of passivating coatings, different approaches to sulfur passivation have been developed. Among these are passivation from a gas phase, electrochemical passivation, and photochemical passivation. In addition, different sulfide-containing substances such as sodium sulfide, ammonium sulfide, sulfur chloride, thiols, and other complex organic compounds have been used for passivation.

In recent years, investigations have focused on the interaction of sulfur atoms or ions with the surface of III-V semiconductors and, in particular, on the chemical processes which lead to changes of surface atomic and electronic structures in the course of sulfidizing. It was shown that the presence of protons is necessary to perform a photoelectrochemical sulfidizing reaction in solutions. Therefore, any liquid which contains protons after self-dissociation can be used as a solvent in a passivating sulfide solution. Being a protic liquid, water is often used to prepare solutions of inorganic sulfides. Other protic liquids, including different alcohols, could also be considered as solvents in passivating solutions. For instance, in electrochemical passivation, a sodium sulfide solution in ethylene glycol was used.

The solvent should have a great effect on both the kinetics and the pathway of any chemical reaction involving ions. We have shown that the sulfide treatment in isopropanol-based solutions results in more effective passivation of GaAs compared with aqueous solutions. Moreover, the photoluminescence properties of GaAs treated in isopropanol-based sulfide solutions remain constant even after intensive laser irradiation. Additionally, the sulfur passivation of InGaAs/AlGaAs lasers in isopropanol-based solutions markedly increases their catastrophic optical damage level.

In  this  article  we  investigated  the  role  of  the  solvent  in  the 
course  of  sulfur  passivation  of  III-V  semiconductors.  Using 
photoluminescence  and  x-ray  photoelectron  spectroscopy  we 
studied  the  properties  of  GaAs  surfaces  treated  in  solutions 
of  sodium  sulfide  in  different  protic  solvents. 


II.  EXPERIMENT 

The investigations were performed on n-GaAs(100) (n=4×10^^ cm⁻³) 10 μm thick epilayers grown on semi-insulating substrates by metal-organic chemical-vapor deposition (MOCVD). Immediately before sulfidizing, some specimens were etched in a 1:8:500 solution of H2SO4:H2O2:H2O at room temperature for 30 s.

The sulfide treatment was carried out by immersing the samples in different sulfide solutions under illumination with a 200 W incandescent lamp. The solutions used were saturated solutions of sodium sulfide (Na2S·9H2O) in water or in different alcohols: ethylene glycol, ethanol, isopropanol, butanol, and tert-butanol. The immersion time was about 1 min as a rule. In aqueous solutions, the samples were treated at different temperatures (from 0 to 100 °C). In alcohol-based solutions, the treatment was carried out at room temperature.

The intensity of band-edge photoluminescence of GaAs at room temperature (hν=1.412 eV) was measured. Photoluminescence was excited by a Xe laser at a power density of 3.0 kW/cm².

To investigate the GaAs surface composition, x-ray photoelectron spectroscopy was used. Photoelectron spectra were recorded with a Perkin-Elmer PHI 5400 spectrometer using a Mg Kα (hν=1253.6 eV) x-ray source operated at 300 W. Angles of incidence of x-ray radiation on the crystal surface were 45° and 5°. The samples were put into the vacuum chamber not later than 10 min after completing the sulfidizing process. Data analysis was carried out using Perkin-Elmer software. The identification of the chemical bonds corresponding to different peaks in the spectra was accomplished according to Ref. 23.

Fig. 1. Dependence on the treatment temperature of the photoluminescence intensity of GaAs(100) (n=4×10^^ cm⁻³) treated in saturated aqueous sodium sulfide solution for 1 min. The intensity of the untreated sample was used as a reference.

III.  RESULTS 

A.  Photoluminescence  measurements 

Following the sulfide treatment in any of the solutions, the photoluminescence properties of GaAs improved considerably. The band-edge photoluminescence intensity of GaAs treated in aqueous sulfide solutions became higher when the treatment temperature was increased for the same immersion time (Fig. 1).

The photoluminescence intensity of GaAs sulfidized in any alcohol-based solution is always considerably higher than that of GaAs sulfidized in an aqueous solution (Fig. 2).


Fig. 2. Variation of the photoluminescence intensity of GaAs(100) (n=4×10^^ cm⁻³) with time (min) for samples treated in different saturated sulfide solutions. The intensity of the untreated sample was used as a reference.


The greatest increase of the intensity of sulfide treated GaAs relative to that of the untreated one was achieved after treatment in tert-butanol-based solution. It should be noted that pretreatment of the semiconductor in the 1:8:500 H2SO4:H2O2:H2O etch solution before sulfidizing did not lead to any increase of the photoluminescence intensity of solution-treated GaAs.

The degradation of the properties of the sulfide-treated GaAs surface also depended on the solvent used (Fig. 2). The photoluminescence intensity of GaAs treated in aqueous solutions at any temperature rapidly (in 15-20 min) dropped to intensity values typical of untreated GaAs. The photoluminescence intensity of the GaAs surface treated in ethylene glycol- and ethanol-based solutions also decreased with the time of laser irradiation, but to a lesser extent. On the other hand, the Xe-laser irradiation did not affect the photoluminescence properties of GaAs treated in isopropanol-, butanol-, or tert-butanol-based solutions.

B.  X-ray  photoelectron  spectroscopy 

The survey photoelectron spectra contained Ga, As, C, and O peaks regardless of the angle of incidence. After sulfidizing, S peaks appeared in the spectra. In structures treated in alcohol-based solutions, the Na peak was also seen in the spectrum. Figures 3 and 4 show high resolution spectra of Ga 3d and As 3d peaks for untreated structures and structures treated in different sulfide solutions (namely, aqueous solutions at two temperatures and ethylene glycol-, ethanol-, and isopropanol-based solutions).

Deconvolution of the Ga 3d spectra showed the following (Fig. 3): In the spectrum of untreated GaAs two components are clearly seen, namely, a Ga-As bond (binding energy 19.9 eV) and a Ga-O bond (binding energy 21.2 eV) [Fig. 3(a)]. In the spectra of sulfide treated GaAs, the Ga-O bond component has a smaller chemical shift (binding energy 21.0 eV); moreover, its intensity in spectra of GaAs treated in aqueous and ethylene glycol-based solutions was practically the same as in untreated GaAs [Figs. 3(a)-3(d)]. In the spectra of GaAs treated in ethanol- and isopropanol-based solutions, the intensity of the Ga-O bond component was considerably reduced, but simultaneously one more component was clearly seen (binding energy 20.55 eV), presumably associated with the formation of Ga-S bonds on the surface [Figs. 3(e) and 3(f)].

Deconvolution of the As 3d spectra showed the following (Fig. 4): The spectrum of untreated GaAs can be resolved into three components, i.e., an As-Ga bond (binding energy 40.8 eV), elemental As⁰ (binding energy 41.55 eV), and an As-O bond (binding energy 43.7 eV) [Fig. 4(a)]. After treatment in any solution an additional peak with binding energy 42.5 eV appeared. This peak could be associated with the formation of As-S bonds on the surface [Figs. 4(b)-4(f)].
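The component assignments above can be summarized as chemical shifts relative to the bulk Ga-As and As-Ga components. The following is a minimal sketch that only tabulates the binding energies quoted in the text; the dictionary and function names are illustrative and do not come from the article.

```python
# Binding energies (eV) quoted in the text for the deconvoluted components.
# Ga-O is listed at its untreated value (21.2 eV); the text gives 21.0 eV
# after sulfide treatment.
GA_3D = {"Ga-As": 19.9, "Ga-S": 20.55, "Ga-O": 21.2}
AS_3D = {"As-Ga": 40.8, "As0": 41.55, "As-S": 42.5, "As-O": 43.7}


def chemical_shifts(components, bulk_key):
    """Return each component's shift (eV) relative to the bulk component."""
    bulk = components[bulk_key]
    return {
        name: round(energy - bulk, 2)
        for name, energy in components.items()
        if name != bulk_key
    }


ga_shifts = chemical_shifts(GA_3D, "Ga-As")
as_shifts = chemical_shifts(AS_3D, "As-Ga")
print("Ga 3d shifts (eV):", ga_shifts)
print("As 3d shifts (eV):", as_shifts)
```

In both core levels the sulfide component lies between the bulk and the oxide components, which is what lets the deconvolution separate it.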

According to our analysis of the spectra, the content of As-O decreases after the sulfide treatment and depends on the solution used. Among sulfide-treated structures, it was highest in GaAs treated in the aqueous solution at room temperature and lowest in GaAs treated in the isopropanol-based solution (Table I). The intensity of the peak relating to the As-S bonds also depended on the solution used: it was highest in GaAs treated in the isopropanol-based solution and lowest in the GaAs treated in the aqueous solution at room temperature (Table I).

Fig. 3. Ga 3d x-ray photoelectron spectra of GaAs(100) preliminarily etched in a 1:8:500 solution of H2SO4:H2O2:H2O at room temperature for 30 s after various surface treatments, taken at an electron take-off angle of 45 deg. (a) Untreated surface, (b) treated in saturated aqueous sodium sulfide solution at room temperature, (c) treated in saturated aqueous sodium sulfide solution at 100 °C, (d) treated in saturated sodium sulfide solution in ethylene glycol, (e) treated in saturated sodium sulfide solution in ethanol, and (f) treated in saturated sodium sulfide solution in isopropanol.

The elemental As⁰ content on the surface slightly decreased after the treatment in the aqueous solutions. On the contrary, treatment in the alcohol-based solutions did not lead to any reduction of As⁰ content (Table I).

It  should  be  stressed  that  the  total  amount  of  Ga  and  As 
oxides  decreased  after  the  treatment  in  any  solution.  This 
reduction  was  greatest  after  the  treatment  in  the  isopropanol- 
based  solution. 
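This statement can be checked directly against the As-O and Ga-O columns of Table I. The sketch below assumes that the "total amount of oxides" is simply the sum of those two relative intensities; that definition is an assumption made here for illustration, and the names are hypothetical.

```python
# (As-O, Ga-O) relative intensities transcribed from Table I.
TABLE_I_OXIDES = {
    "Untreated": (0.12, 0.07),
    "Na2S + H2O, 20 C": (0.07, 0.05),
    "Na2S + H2O, 100 C": (0.03, 0.07),
    "Na2S + ethylene glycol, 20 C": (0.03, 0.08),
    "Na2S + ethanol, 20 C": (0.05, 0.02),
    "Na2S + isopropanol, 20 C": (0.02, 0.02),
}


def total_oxides(table):
    """Sum the As-O and Ga-O relative intensities for each treatment."""
    return {name: round(as_o + ga_o, 2) for name, (as_o, ga_o) in table.items()}


totals = total_oxides(TABLE_I_OXIDES)
lowest = min(totals, key=totals.get)
for name, value in totals.items():
    print(f"{name:32s} {value:.2f}")
print("Lowest total oxide signal:", lowest)
```

Under this assumption every treated sample falls below the untreated one, and the isopropanol-based treatment gives the smallest total.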

Spectra obtained at an angle of x-ray incidence of 5° showed that elemental As⁰ and sulfides were concentrated in a near-surface region 10 Å thick (their content increased by a factor of 2 compared with the spectra at a take-off angle of 45°).

The  estimates  carried  out  according  to  Ref.  2  showed  that 
the  sulfur  coverage  of  the  GaAs  surface  depended  on  the 
solvent  used  (Table  I). 

IV.  DISCUSSION 

The experimental data testify that the treatment of the GaAs surface in solutions of sodium sulfide in different alcohols allows one to obtain more effective passivation than the usual treatment in aqueous sulfide solutions. Indeed, the passivation in alcohol-based solutions leads to the greatest enhancement of the photoluminescence intensity. On the other hand, the treatment in alcohol-based solutions leads to more effective removal of oxides from the semiconductor surface and to the formation of sulfides there (Figs. 3 and 4 and Table I).

In  order  to  explain  these  facts,  it  is  necessary  to  consider 
the  role  of  the  solvent  in  electronic  and  chemical  processes  at 
the  interface  between  the  semiconductor  and  the  solution  in 
the  course  of  sulfidizing.  First,  let  us  consider  the  role  of  the 
solvent  in  the  process  of  modification  of  the  surface  states 
spectrum. 

The enhancement of the GaAs photoluminescence in the course of sulfur passivation can be attributed to the decrease of the surface recombination velocity, i.e., to the lowering of the activity of the surface states. From the chemical point of view such states represent so-called soft Lewis acid centers, since they can easily capture electrons from the conduction band of a semiconductor. To passivate these surface states, it is necessary at least to weaken their ability to capture electrons of the semiconductor. This could be achieved, for instance, by covalent bonding of the Lewis acid centers of the surface with appropriate bases. According to the principle of hard and soft acids and bases, a soft acid can form a covalent bond with a soft base. The sulfide ions are exactly such soft bases and are therefore quite suitable for III-V surface passivation. When a covalent bond is formed, the hardness of the cloud of electrons shared by the sulfide ion and the surface state, according to Ref. 25, will be higher than the hardness of the separate entities, resulting in a less active recombination center.

Fig. 4. As 3d x-ray photoelectron spectra of GaAs(100) preliminarily etched in a 1:8:500 solution of H2SO4:H2O2:H2O at room temperature for 30 s after various surface treatments, taken at an electron take-off angle of 45 deg. (a) Untreated surface, (b) treated in a saturated aqueous sodium sulfide solution at room temperature, (c) treated in saturated aqueous sodium sulfide solution at 100 °C, (d) treated in saturated sodium sulfide solution in ethylene glycol, (e) treated in saturated sodium sulfide solution in ethanol, and (f) treated in saturated sodium sulfide solution in isopropanol.

If the covalent bond formation occurs in a liquid polar medium, a solvate shell consisting of oriented dipole molecules of the solvent would be formed around the ion and near the semiconductor surface. The solvate shell should greatly affect the properties of the electron cloud of the sulfur ion, the electronic distribution in the near-surface region of the semiconductor, and the parameters of the resulting covalent bonds. Indeed, it has been shown by computations that the hardness of the covalent bond formed in a liquid medium depends on the solvation strength in such a way that the bond hardness increases with the decrease of the solvent dielectric constant. That is, harder surface states are formed after passivation in solutions with a lower dielectric constant (i.e., in alcohols) than is the case with the usual passivation from aqueous sulfide solutions. Therefore, the efficiency of nonradiative surface recombination of GaAs treated in alcohol-based solutions with a low dielectric constant should be lower.

Indeed, with the decrease of the dielectric constant of the solvent being used, the photoluminescence intensity of the sulfidized GaAs, which can be considered as a measure of the passivation efficiency, increases (Fig. 5).

Table I. Relative intensities of different peaks and sulfur coverage for GaAs treated in different solutions. Each peak intensity is given as a ratio to (Ga 3d + As 3d).

Treatment                  S 2p    As-O    As-S    As⁰     Ga-O    Ga-S    Sulfur coverage (MLs)
Untreated                  0       0.12    0       0.19    0.07    0       0
Na2S+H2O 20 °C             0.04    0.07    0.04    0.14    0.05    0       0.4
Na2S+H2O 100 °C            0.05    0.03    0.05    0.13    0.07    0       0.7
Na2S+C2H4(OH)2 20 °C       0.07    0.03    0.07    0.16    0.08    0       1.1
Na2S+C2H5OH 20 °C          0.10    0.05    0.06    0.19    0.02    0.04    0.8
Na2S+C3H7OH 20 °C          0.12    0.02    0.09    0.19    0.02    0.03    1.3

Fig. 5. Dependence of the photoluminescence intensity of sulfide-treated GaAs(100) (n = 4×10^[?] cm⁻³) on the reciprocal dielectric constant of the solvent used. The intensity of the untreated sample was used as a reference.

Consider now the role of the solvent in chemical reactions in the course of sulfidizing of the semiconductor surface. The photoelectrochemical reaction of the sulfur coat formation on a III-V surface during sulfidizing in solution proceeds with participation of different ions (protons, sulfide ions, etc.). In the course of this reaction, the electrostatic interaction of ion reagents with the semiconductor surface should play an important role. Indeed, on a semiconductor surface there is a constant dipole moment due to band bending and adsorbed foreign atoms. Moreover, because of the partly ionic character of semiconductor chemical bonds, Ga and As atoms in a crystal lattice are in fact ions and contribute to this electrostatic interaction.

In particular, the choice of the solvent affects the dielectric constant of the medium in which the photoelectrochemical reaction of sulfidizing takes place. The replacement of the aqueous solution with an alcohol-based one leads to a significant decrease of the dielectric constant and hence to an increase of the electrostatic interaction. On the one hand, this should lead to more effective association of the sulfide ions with the surface atoms of the semiconductor and, on the other, to an increase of the charge transfer rate between the semiconductor and the passivating solution. Hence the decrease of the solvent dielectric constant should lead to an increase of the equilibrium constant of the photoelectrochemical reaction of sulfidizing. It should be noted that an increase of the temperature of the aqueous solution also leads to a decrease of its dielectric constant.
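For orientation, the scaling invoked in this paragraph is that of the Coulomb energy of two point charges in a continuous dielectric. This is the standard textbook relation, not a formula taken from the article, and the symbols q_1, q_2, and r (the two charges and their separation) are introduced here only for illustration.

```latex
U(r) = \frac{q_1 q_2}{4\pi\varepsilon_0\,\varepsilon\, r}
\qquad\Longrightarrow\qquad
\frac{U_{\mathrm{alcohol}}}{U_{\mathrm{water}}}
  = \frac{\varepsilon_{\mathrm{water}}}{\varepsilon_{\mathrm{alcohol}}}
  \quad \text{at fixed } q_1,\ q_2,\ r .
```

At a fixed separation the interaction energy therefore grows in proportion to 1/ε when water is replaced by a solvent of lower dielectric constant.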

Based on the x-ray photoelectron spectroscopy data, the equilibrium constant of this reaction was estimated as the ratio of the relative intensity of the S 2p peak (Table I, column 1) to the concentration of sulfur ions in the solution. It is seen that with the decrease of the solvent dielectric constant, the equilibrium constant of the reaction increases (Fig. 6). Moreover, the equilibrium constant is an exponential function of the reciprocal dielectric constant of the solvent, and exactly such a dependence should be observed in any reaction of ion association in a solution.

Fig. 6. The estimated value of the equilibrium constant of the reaction of sulfur coat formation on GaAs surface treated in different sulfide solutions vs the reciprocal dielectric constant of the solvent used (horizontal axis: 1/ε, from 0.00 to 0.06).
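An exponential dependence on 1/ε means that the logarithm of the equilibrium constant should be linear in 1/ε. The sketch below illustrates such a fit under two loudly stated assumptions. First, the dielectric constants are rounded, approximate literature values supplied here, not numbers from the article. Second, because the sulfide concentrations of the saturated solutions are not given in this section, the S 2p relative intensity from Table I is used as a stand-in for the equilibrium constant, which would only be proportional to it if the concentrations were equal. The result is therefore illustrative and is not a reproduction of Fig. 6.

```python
import math

# (solvent, ASSUMED approximate dielectric constant, S 2p intensity from Table I)
DATA = [
    ("water, 20 C", 80.0, 0.04),
    ("water, 100 C", 55.0, 0.05),
    ("ethylene glycol, 20 C", 38.0, 0.07),
    ("ethanol, 20 C", 25.0, 0.10),
    ("isopropanol, 20 C", 20.0, 0.12),
]


def fit_log_linear(points):
    """Least-squares fit of ln(signal) = intercept + slope * (1/eps)."""
    xs = [1.0 / eps for _, eps, _ in points]
    ys = [math.log(signal) for _, _, signal in points]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    r_squared = sxy ** 2 / (sxx * syy)
    return slope, intercept, r_squared


slope, intercept, r_squared = fit_log_linear(DATA)
print(f"slope = {slope:.1f}, intercept = {intercept:.2f}, R^2 = {r_squared:.3f}")
```

A positive slope with R² close to 1 is what an exponential dependence on 1/ε would look like in this representation.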

On the other hand, the decrease of the solvent dielectric constant leads to the reduction of the total amount of oxides on the sulfur-treated surface of GaAs (Fig. 7). This fact suggests that the removal of oxides from the surface in the solution occurs as a result of dissociation. During the dissociation, complex oxide ions pass from the surface into the solution, and some charge is formed on the surface. This charge enhances the electrostatic interaction of sulfide ions with the surface atoms of the semiconductor.

Consequently, the decrease of the dielectric constant of the sulfide solution by appropriate choice of the solvent leads to a more effective passivation of the surface, as evidenced by the photoluminescence enhancement of the semiconductor after sulfur treatment and by the increase of efficiency of both oxide removal and sulfide formation.

Fig. 7. The dependence of the total amount of oxides remaining after GaAs surface passivation in solutions on the reciprocal dielectric constant of the solvent used. The point 1/ε = 0 corresponds to the untreated sample.

Thus  the  sulfur  passivation  of  the  GaAs  surface  in 
alcohol-based  solutions  with  low  dielectric  constants  is  more 
effective  than  traditionally  used  passivation  from  aqueous 
sulfide  solutions. 

V.  CONCLUSION 

Properties of GaAs treated in solutions of sodium sulfide in different protic solvents were studied by photoluminescence and x-ray photoelectron spectroscopy. It has been observed that the decrease of the dielectric constant of the solution through the use of alcohols as solvents allows one to obtain a more effective surface passivation. It has been found that with the decrease of the solution dielectric constant, the band-edge photoluminescence intensity of the semiconductor is enhanced, the total amount of oxides on the surface is reduced, and the amount of sulfides is increased as well. These results testify that the sulfur coverage formation during solution passivation occurs by association of sulfide ions with surface atoms of the semiconductor. The formation of covalent bonds between sulfur and surface atoms in solutions having a low dielectric constant leads to the formation of harder surface states, which are less active in nonradiative recombination.
SiH4 and H2O2 have been successfully used for the deposition of silicon dioxide for shallow trench isolation. With this chemistry, it is possible to fill up trenches without voids up to an aspect ratio of 2.3:1. The very good gap filling is due to the presence of SiOH groups in the oxide film. To obtain a density close to that of thermal oxide, the film must be annealed at high temperature. The electrical characteristics are equivalent to those obtained using thermal silicon dioxide. © 1996 American Vacuum Society.


For shallow trench isolation (STI) by silicon dioxide (SiO2), it is important to obtain a homogeneous gap filling (no voids) with a density and stoichiometry close to that of thermal oxide for both physical (etching...) and electrical properties (interface trap and charge densities).

The conventional SiO2 deposition process uses low-pressure chemical vapor deposition (LPCVD) or plasma-enhanced CVD (PECVD). The main problem arising in such methods is the appearance of voids due to nonconformal step coverage. New ways are currently being investigated to minimize this spurious effect and obtain homogeneous oxide layers. For example, subatmospheric CVD (SACVD) operating with a tetraethylorthosilicate (TEOS) and O3 mixture provides very good gap filling, but this technique depends critically on the state of the surface. Indeed, for vertical trenches, it is not possible to use plasma treatment to promote the growth. A silicon-rich buffer layer is necessary to avoid this problem.

In the present article, we propose a new solution for depositing SiO2 to fill up vertical shallow trenches. To obtain homogeneous SiO2 films, the film should be liquid at some step in the deposition process for a complete filling of the trench. Brinker and Scherer reported that Si(OH)4 is a liquid obtained as a byproduct in the reaction of TEOS with H2O. In this case, the main reaction step is controlled by the OH production. Our intention was to obtain Si(OH)4 from the gas phase in a CVD process. We demonstrate below that a mixture of SiH4+H2O2 is the best way to obtain a growth of SiO2 and fill up the trenches. This layer must be annealed, however, in order to recover a quality equivalent to that of thermal oxide.

Patterned 8 in. p-type silicon wafers with a resistivity of 8-15 Ω cm were used to study the gap filling. For electrical and physical characterization, the wafers were unpatterned. The experimental setup is described in detail elsewhere. A cross section of the CVD deposition chamber (Electrotech Planar 200) is presented in Fig. 1. The SiH4 and H2O2 gases are introduced through a duplex shower head arrangement which avoids any gas cross contamination until they enter the reaction chamber. The H2O2 liquid flow is controlled by weight measurement, and the liquid is flash-evaporated before entering the CVD chamber. Hydrogen peroxide is usually available as an industrial 30% solution in water.

All process parameters were optimized in order to obtain a thickness uniformity better than 2% (1σ). The substrate temperature was varied from 0 to 40 °C in different experiments. The total pressure was controlled via a throttle at 1 Torr. The SiH4 flow was 40 sccm and the hydrogen peroxide solution flow was 1 g/min. Annealing after deposition was carried out in one step: a conventional furnace annealing at 1050 °C for 30 min under N2 atmosphere. The deposited films were characterized by Fourier transform infrared (FTIR) spectroscopy on bare wafers. The gap filling properties were checked by scanning electron microscopy (SEM) observation on STI test structures. For the C(V) electrical characteristics, metal-oxide-semiconductor (MOS) structures were made according to the following sequence: silicon substrate, thermal oxide (250 Å), CVD oxide (1000 Å), and aluminum contact.

Fig. 1. Cross section of CVD chamber.

An FTIR spectrum of the oxide layer (4000 Å) deposited at 0 °C substrate temperature is shown in Fig. 2(a). Three main peaks are present: the SiO stretching is located at 1075 cm⁻¹, which compares well with the thermal oxide (1077 cm⁻¹). In addition, we can observe a peak at 3380 cm⁻¹. This 3380 cm⁻¹ peak often results from OH vibration in water. However, if it originates from OH, we should be able to observe a companion peak at 1640 cm⁻¹, which is not the case here. We therefore propose the following explanation for this peak: in solid film, the Si-OH vibration [from silanol (Si(OH)4)] occurs at 3680 cm⁻¹; but for a liquid phase, it is shifted to about 3380 cm⁻¹. Moreover, the peak at 930 cm⁻¹ in the PECVD or LPCVD oxides corresponds to the vibration between a hydrogen atom and an oxygen atom itself linked to a silicon atom. This explanation confirms the presence of SiOH groups in the SiO2 film.

If the substrate temperature is increased to 40 °C during deposition, the 3380 and 930 cm⁻¹ peaks [Fig. 2(b)] shift to lower values, which is consistent with the evolution of a "liquid" vibration versus temperature: the film becomes "dry" when the temperature increases. We can conclude that the film deposited in the temperature range of the experiments is in a liquidlike state just after deposition because of the presence of Si(OH)4 groups.

Fig. 2. CVD oxide IR spectra: (a) substrate temperature equal to 0 °C; (b) substrate temperature equal to 40 °C.

To correlate this FTIR analysis with the gap filling properties, we performed SEM measurements on STI structures. Figure 3 presents SEM micrographs showing the influence of a liquid SiOH content on the gap filling. Clearly, with the substrate at 0 °C during deposition, the gap filling is very good for aspect ratios up to 2.3:1 [the trenches are 0.3 μm wide and 0.7 μm deep (Fig. 4)], while at 40 °C the gap filling becomes poorer.

Fig. 3. SEM view for substrate temperature (a) at 0 °C; (b) at 40 °C.

Fig. 4. SEM view with trenches 0.3 μm wide, 0.7 μm deep.

After annealing, the film has the same Si/O concentration ratio as stoichiometric silicon dioxide (for as-deposited films, Si/O was already 0.5). This is confirmed by the fact that the film becomes more stable because there are no SiOH groups in the liquid state in the SiO2 film. Secondly, the density of the film measured by mass measurement varies from 1.65 g/cm³ as deposited to 2.1 g/cm³ after annealing, which compares favorably with 2.20 g/cm³ for thermal oxide. Finally, the electrical properties after annealing at 1050 °C for 30 min under N2 are closer to those of a thermal oxide. Conventional high-frequency capacitor tests and a quasi-static test have been used for fixed charge and interface trap density measurements. The fixed electrical charge density is about 5×10^[?] cm⁻² and the interface trap density is about 3×10^[?] cm⁻² eV⁻¹ (for as-deposited films, these values were, respectively, 2×10^[?] cm⁻² and 10^[?] cm⁻² eV⁻¹). These values are very low. This SiO2 filling technique can be used for the STI process.

In conclusion, using this low-energy CVD process followed by moderate-temperature annealing, we can obtain parameters close to those of a thermal oxide: the density and the electrical characteristics are similar. Contrary to the standard CVD process, the oxide film in the trenches is homogeneous (no voids) and free of cracks. The silanol plays the key role in gap filling. To fill trenches of higher aspect ratio, we expect that the number of SiOH groups in the oxide layer must be increased. Moreover, annealing leads to a thermal-oxide-like film.
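As a quick numerical check of two figures quoted in this article, the sketch below recomputes the trench aspect ratio from the stated 0.3 μm width and 0.7 μm depth, and expresses the as-deposited and annealed film densities as fractions of the thermal-oxide value. The function names are illustrative; only the input numbers come from the text.

```python
def aspect_ratio(depth_um, width_um):
    """Trench aspect ratio, defined as depth divided by width."""
    return depth_um / width_um


def relative_density(film_g_cm3, reference_g_cm3=2.20):
    """Film density as a fraction of the thermal-oxide density."""
    return film_g_cm3 / reference_g_cm3


ratio = aspect_ratio(depth_um=0.7, width_um=0.3)
as_deposited = relative_density(1.65)
annealed = relative_density(2.1)

print(f"aspect ratio        = {ratio:.1f}:1")
print(f"as-deposited density = {as_deposited:.0%} of thermal oxide")
print(f"annealed density     = {annealed:.0%} of thermal oxide")
```

The aspect ratio rounds to the quoted 2.3:1, and annealing raises the density from about three quarters of the thermal-oxide value to roughly 95% of it.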
To increase the integration of dynamic random access memory (DRAM), the area of the memory cell must be decreased. For 1 or 4 Gbit DRAM, it becomes 0.21 μm² or 0.05 μm², respectively. Despite the shrinkage of the cell area, the capacitance of the memory cell must be kept at 20-25 fF. One method of meeting these requirements is to form a trench capacitor that has a high aspect ratio. Some types of trench capacitors with high aspect ratios have been developed. However, there are two difficulties in fabricating trench capacitors: one is the method of doping the trench side wall, and the other is the quantitative measurement of the uniformity of the trench doping. For the latter problem, we proposed a new method of measuring the uniformity of trench side-wall doping, corresponding to 64 Mbit integration, using secondary ion mass spectrometry (SIMS). The purpose of this article is to show that the measuring method has high resolution because of the polycrystalline Si (poly-Si) that fills up the trench, and that it is therefore effective for analyzing the uniformity of the As concentration on the trench side wall for 1-4 Gbit integration and beyond. First, the uniformity of the trench sputtering obtained by filling up the trench with poly-Si is examined. Second, we discuss whether the method precisely detects the As concentrations at two arbitrary points on the trench side wall whose concentration ratio is approximately two.

Figure 1 shows the schematic cross section of the specimen prepared for analysis by SIMS. A trench array is formed in a p-type Si wafer by reactive ion etching, and each trench is 0.5-0.6 μm wide and 2-4 μm deep. The As planar-type solid-diffusion method is used for the trench side-wall doping in the first experiment. The tilt-angle ion implantation method is used for the trench side-wall doping in the second experiment: double implantation of As ions is carried out under the conditions of 5×10^[?] cm⁻², 150 keV, 7° off the normal incidence to the surface of the sample, and 2×10^[?] cm⁻², 150 keV, 20° off angle, as shown in the inset of Fig. 3. For both experiments, the poly-Si is deposited on the wafer by low-pressure chemical vapor deposition (LPCVD) without forming SiO2 on the surface of the trench. The reason for this is described in Ref. 2. The wafer surface is leveled by sputtering the poly-Si with oxygen radicals. To examine the effect of the poly-Si that fills up the trench, a specimen without poly-Si filling is also prepared. By sputtering the specimen, with or without poly-Si in the trench, with Cs⁺ from the wafer surface to the level of the trench bottom, the uniformity of the As concentration along the trench side wall is measured. The detected ions are either As⁻ or AsSi⁻. The relationship between the resolution of the measuring method and the sampling time of these ions by the SIMS detector is also examined.

Figure 2 shows the profiles of As concentration along the trench depth. Sputtering of the As ions by Cs⁺ occurs parallel to the substrate surface for the specimen with poly-Si in the trench, and it occurs nonuniformly for the specimen without poly-Si. It is found that a precise measurement cannot be carried out unless the trench is filled up with poly-Si. For A, the peak of As concentration is observed at a depth of 3.5 μm. This peak is due to the bottom As diffusion layer, which is 0.1 μm thick. The sampling time of the As ions for A is larger than that for B. It is found that the measuring method detects the thin As diffusion layer when the sampling time is fixed at the optimum value.

*Published without author corrections.

Fig. 1. Schematic cross section of the specimen prepared for analysis by SIMS.

Fig. 2. Profiles of As concentration along the trench depth. The As planar-type solid-diffusion method is used for the trench side-wall doping.

Next, we discuss whether the present method precisely detects the As concentration at two arbitrary points whose concentration ratio is approximately two, and whether the method is feasible for analyzing trench doping for 1-4 Gbit integration and beyond. Figure 3 shows the profile of the As concentration along the trench depth obtained by the present SIMS measurement. The SIMS intensity is weak from the surface to a depth of approximately 0.5 μm because of residual poly-Si on the wafer surface that was not removed by the leveling with oxygen radicals. The opening of the trench corresponds to the peak of the first spike, approximately 0.65 μm from the surface. The two spikes of the SIMS profile correspond to the concentrations of the diffusion layers at the wafer surface and at the trench bottom, respectively.

The relationship between the ion currents and the As concentration of the trench side wall is now discussed. From the relationship between the ion implantation conditions and the geometric shape of the trench, the ratio C1/C2 of the As concentration of the side wall from the opening of the trench to a depth of 1.6 μm (C1) to that from a depth of 1.6 μm to the bottom of the trench (C2) is 2.1. The ion current at a depth of 0.8 μm from the opening of the trench (I1) corresponds to C1, and that at a depth of 2.2 μm (I2) corresponds to C2. The measured ratio I1/I2 of the ion currents is 2.0, almost the same as the calculated value of C1/C2. At a depth of approximately 1.5 μm from the opening, a broad peak is observed. This peak is considered to be due to As ions reflected from one trench side wall to the opposite one. The ratio of the ion current at the peak and becomes 1.1. It is found that the measuring method precisely detects the As concentrations at two arbitrary points on the trench side wall whose concentration ratio is 1.1 to 2.0.

If the total quantity of As ions reaching the SIMS detector from the trench side wall in the detection area exceeds the detection limit, this method could be applied to analyzing the uniformity of the As concentration on trench side walls smaller than the present one. Because the trench depth becomes larger and the trench width becomes smaller for higher integration, there will not be much difference in the total number of As ions at the trench side wall between the present and higher integrations. Therefore, even if the integration of the device increases to 1-4 Gbit and beyond, it should be possible for the method to detect As ions as accurately as the measured value shown in Fig. 3.

Fig. 3. Profiles of As concentration along the trench depth. Double implantation of As ions to the trench side wall is carried out by the tilt-angle ion implantation method.
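The article does not spell out how the calculated ratio C1/C2 = 2.1 follows from the implantation conditions and the trench geometry. One simple geometric model that reproduces both the 1.6 μm boundary and the ratio is sketched below; it is a reconstruction offered as an assumption, not the authors' stated calculation. It assumes that a beam tilted by θ from the normal reaches a side wall only down to a depth w/tan θ (line-of-sight shadowing by the opposite trench edge), that the dose received per unit side-wall area scales as sin θ, that both implants strike the same side wall, that the trench width is 0.6 μm, and that the two doses share the same (illegible) power of ten so that only the prefactors 5 and 2 matter.

```python
import math

WIDTH_UM = 0.6  # assumed: upper end of the 0.5-0.6 um width range in the text

# (relative dose prefactor, tilt angle in degrees from the surface normal)
IMPLANT_7_DEG = (5.0, 7.0)
IMPLANT_20_DEG = (2.0, 20.0)


def shadow_depth(width_um, tilt_deg):
    """Deepest side-wall point reached by a line-of-sight beam at this tilt."""
    return width_um / math.tan(math.radians(tilt_deg))


def sidewall_dose(dose, tilt_deg):
    """Dose per unit side-wall area; the wall sees the beam at grazing angle."""
    return dose * math.sin(math.radians(tilt_deg))


# The 20-degree implant only reaches the upper part of the wall,
# whereas the 7-degree implant reaches the whole wall of a 2-4 um deep trench.
boundary_um = shadow_depth(WIDTH_UM, IMPLANT_20_DEG[1])
reach_7_deg_um = shadow_depth(WIDTH_UM, IMPLANT_7_DEG[1])

c_upper = sidewall_dose(*IMPLANT_7_DEG) + sidewall_dose(*IMPLANT_20_DEG)
c_lower = sidewall_dose(*IMPLANT_7_DEG)
ratio = c_upper / c_lower

print(f"20-degree implant reaches down to {boundary_um:.2f} um")
print(f"7-degree implant reaches down to  {reach_7_deg_um:.2f} um")
print(f"C1/C2 = {ratio:.2f}")
```

Under these assumptions the boundary falls near 1.6 μm and the ratio near 2.1, consistent with the values quoted in the text, although the agreement does not prove that this is the model the authors used.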

In conclusion, it is found that the present method of measuring As ions at the trench side wall using SIMS precisely detects the concentrations at two arbitrary points whose concentration ratio is 1.1 to 2.0. This method is thought to be effective for analyzing the uniformity of trench doping for 1-4 Gbit integration and beyond.
The exponent in Eq. (13) in the Appendix is incorrect.
The  correct  equation  is 


1 


67ry\ 

(«(n+Tjcl^ 


2/(2n-l) 


.  (13) 


It  is  then  clearly  seen  that  Eq.  (13)  correctly  reduces  to  Eq. 
(6)  for  the  case  n  =  2. 
PREFACE


This  volume  contains  papers  presented  at  the  23rd  Annual  Conference  on  the  Physics  and 
Chemistry  of  Semiconductor  Interfaces  (PCSI-23).  The  Conference  was  held  at  the  Sumner  Au¬ 
ditorium  of  the  Scripps  Institute  of  Oceanography  and  the  Sea  Lodge  on  La  Jolla  Shores  Beach, 
from  21  to  25  January  1996.  La  Jolla  featured  outstanding  winter  weather  and  views  of  the  Pacific, 
with  opportunities  to  carry  out  discussions  while  walking  the  beach. 

The  Conference  continued  to  emphasize  fundamental  understanding  of  interface  problems 
relevant  to  emerging  areas  of  technology.  The  size  of  the  meeting,  as  measured  by  the  number  of 
attendees  and  abstracts  submitted,  increased  about  10%  relative  to  1995.  A  total  of  81  papers  were 
presented in 8 sessions of invited talks intermixed with oral and poster presentations. Silicon-related
topics, including SiGe and Si/SiO2 interfaces, accounted for about 40% of the Conference
Program, III-V semiconductor topics represented about 30%, and the rest was distributed among
a variety of topics including wide-bandgap semiconductors, metal-semiconductor interfaces, new
techniques, etc. Silicon-related topics continued to gain attention, as did wide-bandgap materials.
A particularly stimulating session was held on the atomic structure of the Si/SiO2 interface, where
Reed McFeely and Alfredo Pasquarello provided experimental and theoretical perspectives,
respectively, that fueled a spirited debate concerning the interpretation of experiments that probe the
microscopic  structure  of  the  interface.  A  Tuesday  evening  rump  session  focused  on  atomic  struc¬ 
ture  at  buried  interfaces,  one  of  the  themes  to  be  highlighted  in  1997.  Also  continuing  a  trend  was 
the  increase  in  theoretical  contributions.  Decreasing  activity  was  evident  in  metal-semiconductor 
interfaces  and  in  III-V  epitaxy,  although  heteroepitaxy  continues  to  receive  substantial  attention. 

The Conference succeeded through the efforts of many individuals. We would like to recognize
in particular the many anonymous reviewers who performed their work carefully, thoroughly,
and efficiently, ensuring that the high standards of the Journal of Vacuum Science and Technology
are maintained; Jack Dow for conference management; Carole Dow for registration management;
and Becky York, who provided a smooth interface between the Editor and the Journal of Vacuum
Science and Technology. The commitment of Gary McGuire for making this collection a part of
JVST  is  also  appreciated,  along  with  that  of  Galen  Fisher,  who  provided  an  effective  link  with  the 
American  Vacuum  Society.  Financial  support  from  Larry  Cooper  of  the  Office  of  Naval  Research 
is  gratefully  acknowledged.  The  Conference  was  held  under  the  sponsorship  of  the  American 
Vacuum  Society  through  the  Electronic  Materials  and  Processing  Division,  and  the  Office  of  Naval 
In this article the presence of threading and misfit dislocations in strained modulation-doped Si/SiGe
heterostructures  and  their  effect  on  electron  mobility  are  investigated.  It  is  found  that  the 
low-temperature  electron  mobility  is  sensitive  to  threading  dislocations  when  their  density  exceeds 
3X10^  cm”^,  and  decreases  by  two  orders  of  magnitude  when  the  threading  dislocation  density  is 
lx  10^^  cm^^.  The  room-temperature  mobility  is  reduced  under  the  same  conditions  by  10%  and 
50%, respectively. Misfit segments in the graded Ge-content buffer limit the mobility when the Si
channel is 0.4 μm or less away from that buffer. Misfit dislocations at the Si-channel bottom
interface  very  strongly  scatter  the  electrons  in  the  channel  once  a  continuous  network  of  misfit 
segments  is  created.  ©  1996  American  Vacuum  Society. 


I.  INTRODUCTION 

In  the  past  decade  there  has  been  increasing  interest  in  the 
growth  of  strained  Si/SiGe  layers  for  studying  the  material 
properties  as  well  as  for  potential  high-speed  device 
applications.^’^  Due  to  the  large  lattice  mismatch  between  Si 
and  Ge,  a  relaxed  SiGe  buffer  layer  can  only  be  achieved  at 
the  expense  of  creating  misfit  dislocations  (md).  A  certain 
fraction  of  these  md  results  in  dislocations  threading  to  the 
surface  of  the  sample.  In  the  case  of  growing  a  fully  relaxed 
Si0.7Ge0.3 buffer directly on Si, the resulting density of threading
dislocations (td) is typically in the range of 1X10^® and
1X10^^  cm~^.  The  use  of  a  graded  Ge-content  buffer  layer^ 
grown  on  a  Si  substrate  has  allowed  the  growth  of  strained  Si 
imbedded  between  relaxed  SiGe  layers  with  a  low  density  of 
threading  dislocations  (IXIO^’-IXIO^  cm“^). 

Extremely  high  electron  mobilities  in  modulation-doped 
Si/SiGe  both  at  room  temperature"^  and  at  low  temperature^"^ 
have  been  reported  by  various  groups.  Figure  1  shows  the 
evolution  of  low-temperature  electron  mobility  since  1985.  It 
is  important  to  stress  that  there  are  two  major  advancements, 
one  in  1990  and  the  other  in  1994,  both  due  to  the  reduction 
in  dislocation  scattering,  as  will  be  described  in  this  article. 

II.  LAYER  DESIGN 

The  layers  under  study  were  grown  by  ultrahigh  vacuum 
chemical  vapor  deposition  (UHV-CVD)  in  the  temperature 
range  of  500-560  °C.  A  schematic  of  the  layer  structure  is 
shown in Fig. 2. First, a step-graded Si1-xGex layer is
grown, reaching a 30%-40% Ge composition. This is followed
by the growth of a relaxed Si1-xGex buffer layer,
where x is in the range of 0.25-0.35. A strained Si channel is
then grown to a thickness in the range of 8-15 nm, followed
by an undoped 15 nm Si1-xGex spacer layer, an n-type
doped Si1-xGex supply layer, and a 4-nm-thick Si cap layer.
The  Ge  fraction  and  the  degree  of  relaxation  of  the  buffer 
layer  were  determined  by  Auger  analysis  and  high-resolution 
x-ray  diffraction,  respectively.  The  measurement  of  layer 
thickness and the identification of defects were achieved by
cross-sectional and planar-view transmission electron microscopy
(TEM), and the mobility measurements were performed
in the range of 0.4-300 K using standard van der
Pauw and Hall bar geometries.

In  order  to  investigate  the  effect  of  scattering  by  the  dis¬ 
locations  we  have  systematically  varied  the  density  of 
threading  and  misfit  dislocations.  The  former  can  be  con¬ 
trolled  by  changing  the  grading  rate  of  the  Ge  content  in  the 
buffer.  In  the  extreme  limit  of  an  abrupt  SiGe  buffer  (i.e., 
infinite  grading  rate)  the  density  of  threading  dislocations  is 
about  lx  10^^  cm“^.  A  slow  grading  rate  of  30%  Ge  per 
micron,  on  the  other  hand,  results  in  a  density  of  threading 
dislocations  in  the  Si  channel  of  approximately  1 X 10^  cm~^. 

III.  EFFECT  OF  THREADING  DISLOCATIONS 

In  the  original  work  of  Abstreiter  et  al.^  the  measured 
electron mobility at 4.2 K was about 2000 cm^2/V s. In that
case, a relaxed Si0.75Ge0.25 was grown on a Si substrate.
From  our  own  experiments,  layers  of  similar  design  have  a 
density  of  td  of  about  1X10^^  cm"^.  At  low  temperature, 
where the thermal effects and scattering by phonons are dramatically
reduced, the electron mobility becomes very sensitive
to residual scattering mechanisms due to background
charge  impurities,  roughness,  and  dislocations. 

A  threading  dislocation  forms  a  strong  barrier  to  electron 
transport  and,  hence,  depletes  the  electrons  around  it.  When 
the  distance  between  dislocations  is  significantly  longer  than 
the  electron  mean  free  path,  the  electrons  can  move  freely  in 
the  regions  between  dislocations.  Once  the  density  increases 
to  the  extent  that  there  is  an  overlap  between  the  depletion 
regions  of  adjacent  dislocations,  the  electrons  are  forced  to 
scatter  off  the  dislocations  and  the  mobility  decreases  dra¬ 
matically.  If  the  distance  between  dislocations  approaches 
the  electron  Fermi  wavelength  (about  30-50  nm  in  our  case) 
localization  effects^  become  clear  and  modify  the  electron 
transport  properties. 

More quantitatively, the depletion region, which is on the
order of the Debye length (and, hence, depends on the electron
density), is estimated to be in the range of 0.2-0.4 μm.
Thus the overlap is expected to take place when the density
of threading dislocations is about 0.6-2.3X10^ cm“^.

Fig. 1. Evolution of low-temperature electron mobility in modulation-doped
Si/SiGe heterostructures.
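
The overlap estimate in the text can be sketched numerically. The short Python example below is only an order-of-magnitude illustration and is not the author's calculation: it assumes the dislocations form a square array, so that the mean spacing is d = 1/sqrt(N), and that overlap (or localization) sets in when d equals the relevant length scale. The function names are hypothetical, and a different geometric assumption would change the prefactor.

```python
import math


def density_for_spacing(spacing_um: float) -> float:
    """Areal density (cm^-2) of a square array with mean spacing given in micrometers.

    Assumption: N = 1 / d^2 (square array); this is an illustrative choice.
    """
    spacing_cm = spacing_um * 1e-4  # 1 um = 1e-4 cm
    return 1.0 / spacing_cm ** 2


def mean_spacing_um(density_cm2: float) -> float:
    """Mean spacing (micrometers) between dislocations for an areal density in cm^-2."""
    return 1e4 / math.sqrt(density_cm2)


if __name__ == "__main__":
    # Length scales quoted in the text: depletion region 0.2-0.4 um,
    # Fermi wavelength about 30-50 nm.
    cases = [
        ("depletion length 0.4 um", 0.4),
        ("depletion length 0.2 um", 0.2),
        ("Fermi wavelength 50 nm", 0.05),
        ("Fermi wavelength 30 nm", 0.03),
    ]
    for label, d_um in cases:
        print(f"{label}: N ~ {density_for_spacing(d_um):.2e} cm^-2")
```

Under these assumptions the depletion-overlap density scales as the inverse square of the depletion length, so halving the depletion length raises the threshold density fourfold.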

In order to change the density of dislocations, the grading
rate of the buffer was changed (keeping the relaxed SiGe
buffer layer thickness at 1 μm). Figure 3 shows that the
dislocation density can be reduced in a controlled way from
lx 10^^ cm“^ down to 1X10^ cm“^. The corresponding increase
in low-temperature electron mobility is also shown.
Extrapolating the mobility at thin graded Ge-content buffers
(dotted line) shows that the threading dislocations start limiting
the mobility when their density exceeds 1X10^ cm“^,
in agreement with our simple calculation above. When the
density of dislocations is below this limit, the mobility is no
longer sensitive to any further reduction and is limited by
other scattering mechanisms.

At  room  temperature,  on  the  other  hand,  thermal  diffusion 
and  phonon  scattering  obscure  the  effect  of  scattering  by  td. 
For  a  td  density  in  the  range  of  1 X  10^-1  X 10^^  cm~^  the 
channel electron mobility varied from 2700 to 1300 cm^2/V s.


Fig.  2.  Schematic  of  the  layer  structure  grown  by  UHV-CVD. 
Fig. 3. Effect of grading rate of the Ge content in the buffer on the density
of  threading  dislocations  and  the  electron  mobility  measured  at  0.4  K. 

IV.  EFFECT  OF  MISFIT  DISLOCATIONS 

During  the  relaxation  of  the  graded  Ge-content  buffer, 
misfit  dislocations  are  generated  and  are  responsible  for  the 
strain  relaxation.  Each  misfit  segment  is  associated  with  a 
strain  field,  which  locally  perturbs  the  band  structure  around 
it  and,  thus,  forms  a  strong  scattering  potential.  Once  the 
graded  Ge-content  buffer  has  reached  its  final  Ge  content  and 
relaxes  to  the  lattice  constant  thereof,  no  additional  misfit 
dislocations  are  created.  However,  the  strain  field  decays 
over  a  length  scale  comparable  to  the  thickness  of  the  layer  it 
is  trying  to  relax.  Hence,  if  the  electron  channel  is  grown 
within  that  range,  the  electron  mobility  senses  the  presence 
of  the  dislocations.^ 

In  order  to  systematically  study  this  effect,  we  have  fixed 
the  graded  buffer  layer  thickness  and  changed  the  thickness 
of the following relaxed SiGe buffer, separating the Si channel
from the misfit dislocations. Figure 4 shows the dependence
of the mobility on this separation. There is a very fast
increase in mobility for relaxed buffer layer thickness up to
about 0.5 μm, which agrees with the expected range of the
strain field perturbation (the graded buffer is 0.7 μm in this
case). Any further increase in buffer layer thickness beyond 1
μm does not seem to change the mobility noticeably. Some
reduction in mobility in the case of even thicker buffer may
be due to the increase in surface roughness, which is not the
subject of this study. At room temperature, the mobility does
not change (within the experimental error) for buffer layers
thicker than 0.1 μm, indicating that for device applications
the thickness of the buffer (and the corresponding growth
time) can be safely reduced.

Fig. 4. Electron mobility as a function of the relaxed Si0.7Ge0.3 buffer thickness
grown on top of a 0.7-μm-thick graded Ge-content buffer.

Fig. 5. Electron mobility as a function of Si-channel thickness grown on a
relaxed Si0.7Ge0.3 buffer. The arrow indicates the Matthews and Blakeslee
critical thickness.

JVST B - Microelectronics and Nanometer Structures

K. Ismail: Effect of dislocations in strained Si/SiGe

Another  source  of  misfit  dislocations,  which  has  been  ig¬ 
nored  so  far,  is  due  to  the  strain  in  the  active  Si  channel. 
Since  the  buffer  layers  have  threading  dislocation  densities  in 
the  order  of  10^  to  10^  cm“^,  each  of  these  will  glide  as  soon 
as  the  Si-channel  thickness  exceeds  the  Matthews-Blakeslee 
critical  thickness  resulting  in  a  misfit  dislocation  at  the 
bottom  interface  of  the  Si  channel. 

Due to the proximity of these dislocations and their attendant
strain field to the centroid of the electron gas, they induce
a very strong scattering potential. Thus the electron mobility
becomes a very sensitive tool to detect the presence of such
dislocations. Figure 5 shows the mobility as a function of
Si-channel layer thickness at 0.4 K. A dramatic reduction in
mobility starts at a channel layer thickness which is in good
agreement with the Matthews-Blakeslee critical thickness
(shown by the arrow). It is interesting to
note  that  there  are  two  competing  effects  which  influence  the 
mobility.  The  first  is  the  increase  in  dislocation  density  with 
the  Si-channel  thickness.  At  the  same  time,  the  dislocations 
are  separated  more  from  the  centroid  of  the  electron  gas. 
This  competition  may  explain  the  slight  increase  in  mobility 
at  a  channel  thickness  of  20  nm  compared  to  15  nm. 

In  order  to  confirm  the  above  results,  two  samples  corre¬ 
sponding  to  a  Si-channel  thickness  of  10  and  15  nm  were 
backgated,  which  allowed  varying  the  charge  density  in  the 
channel  as  well  as  modifying  the  position  of  the  charge  cen¬ 
troid  within  the  quantum  well.^^  When  a  positive  backgate 
voltage  is  applied,  the  electron  density  is  increased  and  the 
centroid  of  the  electron  wave  function  is  moved  away  from 
the top towards the bottom interface (Fig. 6). In the sample
with no misfit segments at the bottom interface, the electron
mobility increases as a function of positive gate bias. In the
sample with misfit dislocations, on the other hand, there is a
clear drop in mobility at positive gate voltage associated with
the reduction in the spacing between the charge centroid and
the dislocations. Although we expect a shift of only 2 nm in
the position of the centroid, there is a reduction by more than
a factor of 2 in mobility.

Fig. 6. Effect of backgating on electron mobility in a high mobility sample
with no misfit dislocations at the Si-channel interface (A) and a low mobility
sample with 1 X 10^ cm”^ misfits at the bottom of the Si channel (•). The
inset shows the electron wave function at -10 and +10 V on the backgate.

In  summary,  we  have  systematically  investigated  the  ef¬ 
fect  of  dislocations  on  the  electron  mobility  in  modulation- 
doped  Si/SiGe  layers.  It  is  found  that  a  density  of  threading 
dislocations  higher  than  1X10^  cm“^  starts  affecting  the  low- 
temperature  mobility  and  that  a  density  of  IX 10^^  cm~^  can 
reduce  it  by  more  than  two  orders  of  magnitude.  For  the 
same  range  of  threading  dislocation  density,  the  room- 
temperature  mobility  is  reduced  from  around  2700  to  1300 
cm^2/V s. Misfit dislocations in the graded Ge-content buffer
have a negligible effect at room (low) temperature when the
electron channel is more than 0.1 (0.5) μm away from the
graded buffer. As it gets closer than this limit, a dramatic
reduction  in  mobility  is  observed.  Misfit  segments  at  the  bot¬ 
tom  of  the  Si-channel  interface  can  be  avoided  when  the  Si 
channel  is  grown  thinner  than  the  critical  thickness.  Once 
that  thickness  is  exceeded  the  mobility  degrades  very  quickly 
due  to  the  proximity  of  the  induced  strain  field. 
Using a real-time Green’s functions formalism, we investigate the influence of depletion charge
scattering on the room temperature mobility in scaled silicon metal-oxide-semiconductor field-effect
transistors and low-temperature transport in In0.4Al0.6As-In0.4Ga0.6As modulation doped
heterostructures. Our simulation results suggest that depletion charge scattering, which is usually
ignored,  has  considerable  impact  on  the  electron  transport  in  silicon  inversion  layers  near  the 
threshold  gate  voltage,  even  at  room  temperature.  We  also  find  that  the  weighting  coefficients  a  and 
b  (for  the  inversion  and  depletion  charge  densities)  strongly  depend  on  the  substrate  doping  and 
deviate from those reported in the literature. In the case of modulation doped heterostructures, the
low-temperature  mobility  is  limited  by  alloy  and  Coulomb  scattering.  Intersubband  scattering 
considerably  affects  the  broadening  of  the  states  which,  in  turn,  leads  to  mobility  reduction. 
©  1996  American  Vacuum  Society. 


I.  INTRODUCTION 

Electron  transport  properties  of  quasi-two-dimensional 
(Q2D)  systems  have  been  studied  quite  extensively,  experi¬ 
mentally  as  well  as  theoretically.  The  inversion  layer  at  a 
silicon  interface^  represents  the  most  widely  employed 
physical  realization  of  the  Q2D  concepts  and  the  electron 
mobility  of  the  inversion  layer  electrons  is  one  of  the  most 
important  parameters  for  device  characterization.  It  has  been 
confirmed  experimentally^’^  that  for  substrates  with  low  dop¬ 
ing  densities,  the  electron  mobility  follows  a  universal  curve 
when  plotted  as  a  function  of  the  effective  electric  field. 
However,  pushing  the  devices  towards  scaling  limits'^  neces¬ 
sarily  leads  to  thinner  oxide  films  (<10  nm)  and  higher  sub¬ 
strate  dopings  (>10^^  cm"^).  For  these  devices,  it  was  re¬ 
cently  demonstrated^  that  experimental  mobilities 
significantly  deviate  from  the  universal  curves  near  the 
threshold  voltage  as  a  consequence  of  significant  depletion 
charge  scattering. 

Since  the  invention  of  the  modulation  doping  technique, 
transport  properties  of  two-dimensional  electron  gases  in 
high-mobility  modulation  doped  AlGaAs-GaAs  heterojunc¬ 
tions  (MDHs)  have  also  been  a  subject  of  great  interest  from 
both  the  technological  and  fundamental  point  of  view.^"^^ 
The  enhanced  mobilities  in  these  structures  are  attributed  to 
the  spatial  separation  of  the  electrons  from  their  parent  do¬ 
nors,  which  significantly  reduces  the  scattering  rate  from  the 
remote  impurities.^ This  has  stimulated  much  research 
toward  their  application  in  low-power,  high-speed  devices. 
Recently,  there  has  been  an  increased  interest  devoted  to  the 
growth  of  InAlAs-InGaAs  lattice  matched 
heterojunctions^^”^^  which  have  better  carrier  confinement 
due  to  a  larger  conduction-band  discontinuity  and  a  lighter 
conduction-band  effective  mass  (compared  to  AlGaAs- 
GaAs). However, since the electron gas in the InAlAs-InGaAs
MDH is mostly localized within a ternary random
alloy (InGaAs), alloy scattering^^ is much more effective so
that it is hard to imagine that the low-temperature mobility of
InGaAs based modulation doped heterostructures may exceed
the very high mobilities already reported in AlGaAs-GaAs
heterostructures.

^^Permanent address: McDonnell Douglas Helicopter Systems, 5000 E. McDowell
Rd., Mesa, AZ 85215-9797.

The  theoretical  model  for  the  calculation  of  the  subband 
structure  and  the  electron  mobilities  of  the  Q2D  electron  gas, 
taking  into  consideration  all  major  scattering  mechanisms 
and  both  intrasubband  and  intersubband  scattering,  is  briefly 
described in Sec. II. In Sec. III, devoted to transport in silicon
inversion layers, we describe the influence of depletion
charge  scattering  on  the  mobility  results  near  the  threshold 
gate  voltage.  We  also  show  how  the  weighting  coefficients 
for  the  inversion  and  depletion  charge  densities,  which  ap¬ 
pear  in  the  definition  of  the  average  electric  field,  vary  with 
doping  density.  Low-temperature  transport  properties  of  the 
modulation doped In0.4Al0.6As-In0.4Ga0.6As heterostructures
are  discussed  in  Sec.  IV.  Some  concluding  comments  related 
to  the  present  work  are  given  in  Sec.  V. 


II.  THEORETICAL  MODEL 

The  subband  structure  is  obtained  from  a  self-consistent 
solution  of  the  Schrodinger  and  Poisson  equations,  coupled 
with  the  Dyson  equation  for  the  retarded  Green’s  function. 
Exchange-correlation  corrections  to  the  chemical  potential 
are  calculated  within  the  local  density  approximation.  The 
self-energy  contributions  from  the  various  scattering  mecha¬ 
nisms  are  evaluated  by  employing  the  self-consistent  Born 
approximation.  Screening  is  treated  within  the  random  phase 
approximation.  In  the  calculation  of  the  conductivity,  we 
have  taken  account  of  a  correction  term  due  to  the  particle- 
hole  ladder  diagrams.  More  details  on  the  calculational 
model  and  the  analytical  expressions  for  the  broadening  of 
the  states  and  conductivity  can  be  found  elsewhere. 

Fig. 1. Room temperature mobility vs inversion charge density for two
different  devices  with  high  doping  densities.  For  both  devices,  interface-trap 
density  is  1.75X10^^  cm~^.  We  use  an  exponential  model  for  the 
surface-roughness  autocorrelation  function.  We  find  that  the  rms  height  of 
the  bumps  is  0.26  nm  (0.25  nm)  for  the  device  with  lower  (higher)  doping. 
For  both  devices,  roughness  correlation  length  is  estimated  to  be  1.4  nm. 


Fig. 2. Variation of the weighting coefficients a and b=1-⟨z⟩/W with
substrate  doping.  We  find  that  the  penetration  of  the  wave  functions  into  the 
oxide  is  about  1  nm  for  very  high  values  of  .  Slightly  higher  values  for 
the  coefficient  b  were  obtained  for  V^>1X10^^  cm“^  (not  shown  in  this 
figure)  due  to  the  closer  confinement  of  the  carriers  to  the  interface  and 
increased  population  of  the  lower-lying  subbands. 


III.  SILICON  INVERSION  LAYERS  WITH  HIGH 
DOPING  PROFILES 

The  physical  model  employed  in  these  calculations  in¬ 
cludes  scattering  with  all  bulk  phonons  (acoustic  and  nonpo¬ 
lar  optical),  surface-roughness  scattering,  and  Coulomb  scat¬ 
tering  from  both  ionized  impurities  in  the  depletion  region 
and  interface/oxide  charges.  For  intervalley  scattering  (zero- 
order  and  first-order  interactions),  we  use  the  same  phonon 
energies  and  coupling  constants  as  those  relevant  for  bulk 
silicon.^^  The  anisotropy  of  the  deformation  potential  inter¬ 
action  is  also  accounted  for.^^  More  details  about  the  model 
can  be  found  in  the  Appendix. 

In  Fig.  1,  we  compare  our  simulation  results  for  the  mo¬ 
bility  with  the  experimental  effective  mobility  data  for 
n-channel metal-oxide-semiconductor field-effect transistors
(MOSFETs)  fabricated  on  (100)  Si  wafers.  Filled  circles  and 
triangles  represent  the  experimental  effective  mobility  data 
derived  from  drain  conductance  measurements.^  The  inver¬ 
sion  charge  density  has  been  determined  from  the  gate- 
channel  capacitance  measurements.  Solid  (dashed)  lines  are 
the  corresponding  simulation  results  when  depletion  charge 
scattering  is  included  (omitted)  in  the  calculations.  A  total  of 
four  subbands  (three  from  the  unprimed  and  one  from  the 
primed  ladder  of  subbands)  are  taken  into  account.  We  use 
the  real  density  of  states  function  (DOS)  when  evaluating  the 
self-consistent  potential-energy  profile.  In  contrast  to  some 
previous  studies, we  do  not  ignore  the  real  self-energy 
corrections  (renormalizations  of  the  spectrum)  when  evaluat¬ 
ing  the  broadening  of  the  states  and  conductivity.  We  find 
that  depletion  charge  scattering  leads  to  mobility  reduction  of 
about  9%  near  the  gate  voltage  thresholds  (small  values  of 
N^)  for  the  device  with  cm“^.  For  the  device 

with higher substrate doping (A^ =7.7X10^^ cm“^), the mobility
reduction is about 12.5%. For both devices, the importance
of surface-roughness scattering is shifted towards
lower gate voltages. In addition, scattering due to acoustic
and nonpolar optical phonons is found to be of lesser importance
in these structures.

It has already been reported^^’^^ that room temperature
electron mobilities show universal behavior independent of
the substrate doping when plotted as a function of the average
electric field $E_{\mathrm{eff}}=(e/\epsilon_{sc})(aN_s+bN_{\mathrm{depl}})$, where a and b
are the weighting coefficients for the inversion ($N_s$) and depletion
($N_{\mathrm{depl}}$) charge densities. In Fig. 2 we show the variation of
these two coefficients with substrate doping. We find that a
significantly deviates from 0.5 for substrates with very high
doping densities. We also observe that the weighting coefficient
b decreases with increasing substrate doping due to a
decrease of the depletion region width. Results shown in Fig.
2 differ from those obtained by Matsumoto and Uemura^"^
(a=0.5 and b=1), for example, because in their analytical
result for the effective electric field, the terms of the order of
⟨z⟩/W (suggested by Krutsick and White^^), where ⟨z⟩ is the
centroid of the electron density distribution and W is the
depletion region width, were neglected.
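
To make the role of the weighting coefficients concrete, the following Python sketch evaluates the average field defined above for given sheet densities. It is an illustration only: the relative permittivity of silicon (11.7), the function name, and the example densities are assumptions introduced here and are not values taken from the paper.

```python
E_CHARGE = 1.602176634e-19   # elementary charge (C)
EPS0 = 8.8541878128e-12      # vacuum permittivity (F/m)
EPS_R_SI = 11.7              # assumed relative permittivity of Si


def effective_field(n_s_cm2: float, n_depl_cm2: float,
                    a: float = 0.5, b: float = 1.0,
                    eps_r: float = EPS_R_SI) -> float:
    """Average field E_eff = (e/eps)(a*N_s + b*N_depl), returned in V/cm.

    n_s_cm2 and n_depl_cm2 are sheet densities in cm^-2.
    The defaults a = 0.5, b = 1 are the conventional values quoted in the text.
    """
    sheet_m2 = (a * n_s_cm2 + b * n_depl_cm2) * 1e4   # cm^-2 -> m^-2
    field_v_per_m = E_CHARGE * sheet_m2 / (eps_r * EPS0)
    return field_v_per_m / 100.0                      # V/m -> V/cm


if __name__ == "__main__":
    # Hypothetical example: N_s = N_depl = 1e12 cm^-2.
    conventional = effective_field(1e12, 1e12)
    # Hypothetical reduced b, mimicking a b = 1 - <z>/W correction.
    corrected = effective_field(1e12, 1e12, a=0.5, b=0.9)
    print(f"a=0.5, b=1.0: {conventional:.3e} V/cm")
    print(f"a=0.5, b=0.9: {corrected:.3e} V/cm")
```

Because the field is linear in a and b, any doping dependence of the coefficients shifts the abscissa of a mobility versus field plot in direct proportion.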

IV. In0.4Al0.6As-In0.4Ga0.6As MODULATION DOPED
HETEROSTRUCTURES

At low temperatures, the dominant scattering mechanisms
in the In0.4Al0.6As-In0.4Ga0.6As MDH shown in Fig. 3 are:
remote  impurity  (from  the  ^depletion  layer),  interface-trap, 
alloy  disorder,  interface-roughness  and  deformation  poten¬ 
tial,  and  piezoelectric-coupled  acoustic  mode  scattering. 
Polar  optical  phonon  scattering  is  found  not  to  play  any  sig¬ 
nificant  role  at  1.6  K.  Screening  of  the  alloy  disorder^^  and 
deformation  potential  scattering  potentials  by  the  free  carri¬ 
ers  is  also  neglected  because  of  their  short-range  nature.  This 
effect  should  certainly  have  been  included  for  the  long-range 
electron-piezoelectric  acoustic  phonon  interaction,  but  since 
this  scattering  process  is  extremely  weak  at  1.6  K,  its  matrix 
element is also left unscreened. Assuming that deformation
potential scattering is an elastic process, one has
$\hbar\omega_q/k_BT \sim [m^*c_l/(\rho k_BT)]^{1/2} \sim (T_e/T)^{1/2}$,
where $\omega_q$ is the phonon frequency, $k_B$ is the Boltzmann
constant, $c_l$ is the longitudinal elastic constant, $m^*$ is the
effective mass, and $\rho$ is the crystal density. Using the
material parameters summarized in Table I, we find that the
equivalent temperature is K.

Fig. 3. Cross-sectional view of the δ-doped In0.4Al0.6As-In0.4Ga0.6As heterostructure
on (100)-oriented semi-insulating GaAs substrate with intermediate,
undoped, and compositionally step-graded buffer layers.
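
As a rough numerical check of the equivalent temperature discussed above, the sketch below evaluates T_e = C m* c_l/(ρ k_B) from the Table I parameters at x = 0.4. The prefactor C = 8 is an assumption made here, not a value taken from the paper: it follows from taking the maximum phonon wave vector q = 2k, a thermal electron energy ħ²k²/2m* = k_B T, and a sound velocity v_s = (c_l/ρ)^{1/2}. A different convention for the thermal energy changes C and hence T_e.

```python
M0 = 9.1093837015e-31   # free-electron mass (kg)
K_B = 1.380649e-23      # Boltzmann constant (J/K)


def equivalent_temperature(prefactor: float = 8.0) -> float:
    """Equivalent temperature T_e (K) for In(x)Ga(1-x)As at x = 0.4.

    T_e = prefactor * m* * c_l / (rho * k_B); the prefactor is an assumption
    (8 corresponds to q = 2k and a thermal energy of k_B*T).
    """
    x = 0.4
    m_eff = 0.058 * M0                    # Table I: relative mass 0.058 for x = 0.4
    c_l = (13.97 - 3.99 * x) * 1e10       # Table I: longitudinal elastic constant (N/m^2)
    rho = 5.504e3                         # Table I: 5.504 g/cm^3 converted to kg/m^3
    return prefactor * m_eff * c_l / (rho * K_B)


if __name__ == "__main__":
    print(f"T_e ~ {equivalent_temperature():.2f} K (prefactor 8 assumed)")
```

With this assumed prefactor the result is below 1 K, which is at least consistent with the statement that equipartition can be assumed above 1 K.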

Thus,  the  equipartition  approximation  can  be  assumed  to 
hold  for  temperatures  above  1  K  for  the  acoustic  mode  scat¬ 
tering  of  thermal  electrons.  The  matrix  elements  for  surface- 
roughness,  acoustic  phonon,  and  alloy  disorder  scattering  can 
be  found  elsewhere.^^  For  the  wavevector  dependent  matrix 
element for scattering between subband n and m due to Coulomb
charges located at a distance z_i<0 from the interface
(in  the  InAlAs  layer),  we  use  the  following  expression: 


Table I. Values of the InxGa1-xAs material parameters used in the calculation
of the broadening of the states, density of states function, and mobility.
For the electron effective mass and static dielectric constant of
In0.4Al0.6As, we use m*=0.99 m0 and ε=11.5 ε0, respectively. The energy
gaps of strained and unstrained InxGa1-xAs layers can be found in Ref. 25.

Parameter                              Notation and units     Value
Relative electron effective mass       m*                     0.058 for x=0.4
Relative static dielectric constant    ε                      13.1+1.5x
Deformation potential constant         D (eV)                 8.6-2.8x
Piezoelectric constant                 P                      0.052-0.0079x
Longitudinal elastic constant          c_l (×10^10 N/m^2)     13.97-3.99x
Lattice constant                       a (Å)                  5.65325+0.40515x
Alloy disorder parameter               ⟨V⟩ (eV)               0.5-0.55
Crystal density                        ρ (g/cm^3)             5.504

Fig.  4.  Gate-voltage  dependence  of  the  total  electron  density  for  the  MDH 
shown  in  Fig.  3.  Also  shown  in  this  figure  are  the  experimental  data  derived 
from  the  low-field  Hall  and  resistivity  measurements,  Shubnikov-de  Haas 
magnetoresistance  measurements  as  well  as  the  theoretical  simulations  re¬ 
ported  in  Ref.  16.  The  effective  mass  enhancement  due  to  conduction  band 
nonparabolicity  for  the  carriers  residing  in  the  ground  and  first  excited  sub¬ 
band  is  shown  in  the  inset. 


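$$\langle n|U(q,z_i)|m\rangle = \frac{e^2}{2q\epsilon_b}\left[\int_{-\infty}^{\infty} dz\,\psi_n(z)\,e^{-q|z-z_i|}\,\psi_m(z) + \frac{\epsilon_b-\epsilon_w}{\epsilon_b+\epsilon_w}\,e^{-q|z_i|}\int_{-\infty}^{\infty} dz\,\psi_n(z)\,e^{-q|z|}\,\psi_m(z)\right],$$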


in which the image term due to the difference of the static dielectric constants in the
well ($\epsilon_w$) and in the barrier layer ($\epsilon_b$) is properly taken into account.
The effective mass enhancement with increasing gate voltage due to conduction band
nonparabolicity is also taken into account.

When solving for the subband structure, using the analytical solutions of the Poisson
equation considerably decreased the number of self-consistent field iterations. In
equilibrium, the electron transfer from the δ-doped region to the well (see Fig. 3) is
governed by

e'^Nj^L  e'^  ^  , 

6^  6/,  i  J -00 


(W+L)^ 


N't 


-L^ 


(z)dz 


+  Nf 


'tf]{z)dz 


(2) 


where L=30 nm and W=10 nm are the widths of the barrier and spacer layers, $E_F$ is the
Fermi level, $\Phi_B$=0.8 eV is the Schottky barrier height, $V_G$ is the gate voltage,
$\Delta E_c$=0.52 eV
is  the  conduction  band  offset,  Np—^XlO^^  cm~^  is  the  Si 
δ-doping sheet density, and $N_i^b$ ($N_i^w$) is the sheet electron density of
the ith subband in the barrier (quantum well).

Fig. 5. Cumulative and component theoretical mobilities vs gate voltage. Filled circles are
the experimental data determined from Hall effect and resistivity measurements (Ref. 16).
The material parameters employed in the simulations are summarized in Table I. Also shown in
the figure are the mobilities of the carriers in the ground ($\mu_1$) and first excited
subband.


Self-consistent results for the gate-voltage dependent electron densities are shown in
Fig. 4. We observe good agreement between the experimental data derived from the Hall
effect and Shubnikov-de Haas (SdH) measurements and the theoretical calculations for all
gate voltages. Our simulation results also agree with the theoretical calculations reported
by Wieder et al. We find that the significant effective mass enhancement (see the inset of
Fig. 4) due to conduction band nonparabolicity prevents the population of the third subband
until $V_G$>0.4 V. We also find that, in contrast to silicon inversion layers where
exchange-correlation effects are significant even at room temperature, many-body
corrections do not considerably affect the subband structure in the MDH because of the
small effective mass.

The gate-voltage dependence of the low-temperature theoretical mobilities is shown in
Fig. 5. Although the average mobility is $\bar{\mu}=\sum_i N_i\mu_i/N_s$, the measured
mobilities shown in Fig. 5 are Hall mobilities, so that for a multisubband case the
expression $\mu_H=\sum_i N_i\mu_i^2/\sum_i N_i\mu_i$ is more suitable for comparing the
experimental results with the theoretical calculations. To take into account the diffusion
effects, the δ-doping sheet is modeled as a 4 nm wide layer centered 8 nm away from the
interface with a Si doping density of 7.52×10^18 cm^-3. Interface-roughness scattering is
assumed to exist at both heterointerfaces of the quantum well. We assume that the rms
height of the roughness is one monolayer (ML) (Δ=2.83 Å). The lateral spread of the
interface roughness is taken to be 100 Å. For this particular heterostructure, we find that
surface-roughness  and  acoustic  phonon  limited  component 
mobilities  (not  shown  on  the  figure)  are  of  the  order  of 
(0.5-l)X10^  and  10^  cmVV  s,  respectively.  By  fitting  the 
calculated mobilities at high gate voltages to the experimental data, we were able to
determine the alloy-disorder scattering parameter in the system to be in the range
0.5-0.55 eV. Component mobility results presented in Fig. 5 suggest that the inclusion of
alloy scattering, in addition to Coulomb scattering (from the ionized donors in the δ-doped
region and interface traps), leads to good agreement between the theoretical calculations
and the experimental data. The dip in the theoretical mobility at -0.85 V signals that a
new channel for scattering (the second subband) opens in the MDH, which leads to a sudden
increase in the phase space available for scattering between the two subbands. Since the
carriers residing in the first excited subband are at about four times larger average
distance from the heterointerface than those in the ground subband, they experience weaker
Coulomb scattering potentials and exhibit about two times higher mobility.
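
The difference between the density-weighted average mobility and the Hall mobility in a two-subband system can be made concrete with a short sketch. It assumes the standard low-field expressions, average = Σ N_i μ_i / Σ N_i and Hall = Σ N_i μ_i² / Σ N_i μ_i; the subband densities and mobilities used below are hypothetical round numbers, not values from this work.

```python
def average_mobility(densities, mobilities):
    """Density-weighted drift mobility: sum(N_i * mu_i) / sum(N_i)."""
    return sum(n * mu for n, mu in zip(densities, mobilities)) / sum(densities)


def hall_mobility(densities, mobilities):
    """Low-field multisubband Hall mobility: sum(N_i * mu_i**2) / sum(N_i * mu_i)."""
    numerator = sum(n * mu ** 2 for n, mu in zip(densities, mobilities))
    denominator = sum(n * mu for n, mu in zip(densities, mobilities))
    return numerator / denominator


# Hypothetical two-subband example (cm^-2 and cm^2/V s), with the excited
# subband carrying fewer carriers but twice the mobility of the ground subband.
densities = [2.0e12, 0.5e12]
mobilities = [1.0e4, 2.0e4]

mu_avg = average_mobility(densities, mobilities)
mu_hall = hall_mobility(densities, mobilities)
print(f"average mobility: {mu_avg:.0f} cm^2/V s")
print(f"Hall mobility:    {mu_hall:.0f} cm^2/V s")
```

Because the Hall expression weights each subband by an extra factor of its mobility, it is never smaller than the average mobility, which is why the choice of expression matters once the second subband is populated.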

V.  CONCLUSIONS 

Our results suggest that depletion charge scattering, which has been ignored in some
previous theoretical studies, is responsible for the deviation of the mobility from the
universal curves near the threshold voltage. At high gate voltages (high values of $N_s$),
because the carriers are more closely confined to the interface, the magnitude of the
mobility in these highly doped samples is considerably reduced by surface-roughness
scattering, in addition to the still significant intervalley scattering. We also find that
the weighting coefficients a and b that appear in the definition of the average electric
field show a pronounced doping dependence. For the In0.4Al0.6As/In0.4Ga0.6As MDH, we find
that at T=1.6 K the magnitude of the mobility is limited by Coulomb and alloy scattering.
From the subband structure (gate-voltage dependence of the electron density) and mobility
simulations, we calculate that the relevant effective mass at the conduction band minimum
of the In0.4Ga0.6As ternary alloy is m*=0.058 m0. The observed effective mass enhancement
suggests that nonparabolicity effects are considerable in this MDH and must be taken into
account if accurate mobility results are desired. The effect of the many-body interactions,
calculated in the local density-functional approximation, is found not to be important for
the MDH.

APPENDIX

The expression for the matrix elements for scattering between subbands n and m due to a
Coulomb charge in the depletion region, in the oxide, or right at the interface, in the
presence of a different dielectric medium, can be found in the review article by Ando,
Fowler, and Stern. For the surface-roughness matrix element, we use the result obtained by
Ando, who corrected the result obtained by Matsumoto and Uemura by taking into account the
change in the electron density distribution and the effective dipole moment of the deformed
Si-SiO2 surface in addition to the usual local potential energy term that equals the linear
term of the Taylor expansion of the surface potential.

For an anisotropic elastic continuum such as silicon, for which the deformation potential
constant $\Xi$ is a tensor, the squared matrix element for scattering between subbands n
and m due to acoustic phonons, in the usual equipartition approximation, equals

|(«|t/r(q)h)l^=^,[Af„Jq)]X„,,  (Al) 


In  this  case,  one  has  to  consider  the  first-order  term  of  the 
interaction.  The  squared  matrix  element  for  scattering  be¬ 
tween  subbands  n  (a  valley)  and  m  {/3  valley)  due  to  this 
first-order  process  is  given  by^^ 

fiD^ 

where 

Cnm=  •  (A8) 


where 

^nm=  f  dz  (A2) 

Jo 

is  the  sound  velocity,  and  p  is  the  Si  mass  density.  The 
effective  deformation  potential  constant  that  appears  in  ex¬ 
pression  (Al)  is  calculated  from 

1 

=  —  dq,^l{eQ)\.rM?,  (A3) 

^  nm  0 

where 


•^nm(?z)=  f  dz  (A4) 

Jo 

and $\theta_Q$ is the angle between the wave vector Q of the emitted (absorbed) phonon and
the longitudinal axis of the valley. According to Herring and Vogt, the effective
deformation potentials $\Xi_\lambda(\theta_Q)$ ($\lambda$=LA or TA) that appear in Eq. (A3)
are given by

$$\Xi_{LA}(\theta_Q) = \Xi_d + \Xi_u\cos^2(\theta_Q) \qquad \text{(A5a)}$$

and

$$\Xi_{TA}(\theta_Q) = \Xi_u\cos(\theta_Q)\sin(\theta_Q). \qquad \text{(A5b)}$$

In Eqs. (A5a) and (A5b), $\Xi_u$ is the uniaxial shear potential and $\Xi_d$ is the
dilatation potential, which are believed to have values of approximately 9.0 and -11.7 eV,
respectively.

It  is  important  to  point  out  that  Eq.  (A5b)  accounts  for  the 
contribution  of  both  TA  branches. 
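
To give a feel for the angular dependence of the effective deformation potentials, the following sketch evaluates them at a few angles. It assumes the standard Herring-Vogt forms, Ξ_LA = Ξ_d + Ξ_u cos²θ and Ξ_TA = Ξ_u cosθ sinθ, together with the values of 9.0 and -11.7 eV quoted above; the function and constant names are illustrative only.

```python
import math

XI_U_EV = 9.0     # uniaxial shear potential (eV), value quoted in the text
XI_D_EV = -11.7   # dilatation potential (eV), value quoted in the text


def xi_la(theta):
    """Effective LA deformation potential (eV) at angle theta (radians)."""
    return XI_D_EV + XI_U_EV * math.cos(theta) ** 2


def xi_ta(theta):
    """Effective TA deformation potential (eV), both TA branches combined."""
    return XI_U_EV * math.cos(theta) * math.sin(theta)


for degrees in (0, 30, 45, 60, 90):
    theta = math.radians(degrees)
    print(f"theta = {degrees:2d} deg: "
          f"Xi_LA = {xi_la(theta):7.3f} eV, Xi_TA = {xi_ta(theta):6.3f} eV")
```

With these values the TA coupling vanishes for phonons travelling along or perpendicular to the valley axis and is largest at 45 degrees, while the LA coupling is largest in magnitude perpendicular to the axis.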

The matrix element for nonpolar optical phonon scattering is generally found from a
deformable ion model. The zero-order process, which describes a perturbation associated
with the local dilatation or compression of the lattice that produces a local fluctuation
in the electron energy, dominates at moderately high electron energies. The squared matrix
element for scattering between subbands n and m (that belong to the α and β valleys) due to
this zero-order interaction is given by


Kn|<°V>l^  = 


2pV(o, 


where $D_0$ is the deformation field and $\omega_0$ is the frequency of the phonon mode.

When the zero-order matrix element for the optical or intervalley interaction vanishes,
$D_0$ is identically zero.


In the scattering among the equivalent valleys, there are two types of phonons that might
be involved in the process. The first type, the so-called g phonon, couples the two valleys
along opposite ends of the same axis, i.e., $\langle 100\rangle$ to
$\langle\bar{1}00\rangle$. This is an umklapp process and has a net phonon wavevector of
0.3π/a. The so-called f phonons couple the $\langle 100\rangle$ valley with
$\langle 010\rangle$, $\langle 001\rangle$, etc. Hence, the reciprocal lattice vector
involved in the g process is $G_{100}$ and that for the f process is $G_{111}$.

Within a four subband approximation, scattering between the two valleys in the
$\epsilon_0$, $\epsilon_1$, and $\epsilon_2$ subbands involves only g-type phonons. The
scattering between these three minima is treated by using a high-energy phonon of 750 K
activation temperature (treated as a zero-order interaction) and a 134 K phonon treated via
a first-order interaction. Scattering between the $\epsilon_0$, $\epsilon_1$, and
$\epsilon_2$ subbands and the four $\epsilon_0'$ subbands involves f phonons with
activation temperatures of 630 and 230 K treated via zero-order and first-order
interaction, respectively. Scattering between the $\epsilon_0'$ subbands involves both g
and f phonons with activation temperatures of 630 K (zero-order interaction) and 190 K
(first-order interaction). All of the high-energy phonons are assumed to be coupled with a
value of $D_0$=9×10^8 eV/cm, and all of the first-order coupled phonons are assumed to be
coupled with $D_1$=5.6 eV. These values are consistent with the results given in Refs. 20
and 32.
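
The phonon activation temperatures quoted above translate into phonon energies through E = k_B T. The short sketch below performs that conversion; the helper name is hypothetical and the Boltzmann constant is the usual value in eV/K.

```python
K_B_EV_PER_K = 8.617333e-5  # Boltzmann constant in eV/K


def phonon_energy_mev(activation_temperature_k):
    """Convert a phonon activation temperature (K) to an energy in meV."""
    return K_B_EV_PER_K * activation_temperature_k * 1.0e3


# Activation temperatures of the intervalley phonons listed in the text.
for temperature_k in (750, 630, 230, 190, 134):
    energy = phonon_energy_mev(temperature_k)
    print(f"{temperature_k:3d} K -> {energy:5.1f} meV")
```

The 750 and 630 K modes correspond to energies well above the thermal energy at room temperature (about 26 meV), which is consistent with their description as high-energy phonons.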
We report an investigation of barrier formation and electron transport across InAs and
GaAs interfaces by ballistic electron emission microscopy. The barrier height was found to
decrease with the InAs thickness and the detailed variation correlates with the relaxation
of the InAs layer. When the thickness is below one monolayer, three thresholds are observed
and they are attributed to electron transmission into the Γ, L, and X valleys of the GaAs,
respectively. As the thickness increases to three monolayers or beyond, only two thresholds
are observed and the reasons are explained in the context of the band structure of the
system. © 1996 American Vacuum Society.


I. INTRODUCTION

The growth of InAs on GaAs (001) has recently attracted a large degree of attention since
it is possible to realize zero-dimensional quantum dots through strain relaxation. It is
well known that the InAs layer undergoes a number of transitions as its thickness
increases. Initially, the InAs layer grows pseudomorphically strained onto the GaAs
substrate; above ~1.6 monolayers, due to the large mismatch (~7%), it starts to island
coherently; finally, misfit dislocations are generated and the islands coalesce as the
thickness increases further. The aim of this article is to study how the various stages of
InAs film development affect the barrier formation between Au and GaAs, with particular
attention being paid to the barrier height, electron transport, and the homogeneity of the
interface.

Ballistic electron emission microscopy (BEEM) has been used for this study. The technique
is a three-terminal extension of scanning tunneling microscopy (STM). It uses the tip of
the STM to inject hot electrons into the metal surface and collects the fraction of them
transmitted over the barrier into the semiconductor substrate. It can measure the barrier
height with very high lateral resolution. Furthermore, the semiconductor band structure
near the interface can also be probed, since transmission into the different conduction
bands (valleys) can be measured.

This  article  is  organized  as  follows:  the  sample  growth  is 
presented  in  Sec.  II,  and  is  followed  by  the  results  of  BEEM 
measurements  with  discussions  (Sec.  III).  Finally  our  conclu¬ 
sions  are  presented  in  Sec.  IV. 

II. GROWTH DETAILS

All  the  samples  studied  were  prepared  in  a  Vacuum  Gen¬ 
erators  ultrahigh  vacuum  (UHV)  system  consisting  of  a  con¬ 
ventional  solid  source  VG  V80H  molecular  beam  epitaxy 
(MBE)  machine  attached,  via  an  UHV  gate  valve,  to  a  home 
designed  metallization  chamber.  Both  the  MBE  and  the  met¬ 
allization  chamber  have  been  described  previously.^ 


^^Electronic  mail:  matthai@cardiff.ac.uk 


The  substrate  used  were  highly  doped  (n  =  lXlO^^  cm“^) 
on-axis  GaAs  (100)  wafers  which,  following  a  standard  etch, 
were  In  mounted  onto  Mo  Blocks.  Oxide  desorption  was 
performed  in  the  growth  chamber  of  the  MBE  machine  under 
an  As4  flux  of  F(As4)~5X10^'^  molecules  cm"^  s“^  and  fol¬ 
lowed  by  growth,  at  a  sample  temperature  of  600  °C  and  a 
growth  rate  of  1.0  /^m/h,  of  a  thick  (0.5-0.7  /urn)  low  doped 
GaAs  layer  (n  =  5X10^^  cm“^).  Following  this  the  sample 
temperature  was  lowered  to  500  °C  before  growth  of  the  fi¬ 
nal  doped  {n  =  l0^^  cm”^)  InAs  layer  at  a  rate  of  0.5  /xm/h. 
Islands formed at low coverages (equivalent to a few monolayers of InAs) undergo a complex
development with time, temperature, and coverage once growth has been terminated. At a
temperature of 500 °C this development is reported to take a few seconds; therefore, in
this study, all the InAs surfaces were kept at 500 °C for 1 min following cessation of
growth for equilibrium to be reached. The sample was then allowed to cool to ~300 °C before
removal from the growth chamber and transfer to the metallization chamber, where a very
thin gold film (50 Å) was deposited. The predeposition
pressure  was  ~3X  10"^^  mb  and  the  pressure  during  the  met¬ 
allization  was  controlled  to  around  5X10~^  mb.  The  deposi¬ 
tion  rate  was  —0.33  A  per  second.  The  BEEM  measurements 
were  performed  using  a  WA  STM  system  in  air  at  room 
temperature. 

III.  MEASUREMENTS  AND  DISCUSSIONS 

A set of ten samples was investigated, with InAs thickness ranging from 0 to 40 monolayers.
Figure 1 shows three typical BEEM spectra for the cases of 1, 5, and 27 monolayers of InAs.
Fitting the data with a 5/2 power law, it was found that there were three thresholds for
the case of 0-1 monolayer, with the energy separation between the two lower thresholds
(~300 meV) in agreement with that between the Γ and the L valleys of GaAs. The separation
between the first and third thresholds (~470 meV) is consistent with the Γ-X valley
separation. For the cases of 3-27 monolayers of InAs, however, only two thresholds were
observed, with the energy separations varying in the range from 340 to 410 meV. These
results are discussed in detail later.

Fig. 1. Three representative BEEM current curves, plotted against tip voltage (V), for the
cases of 1, 5, and 27 monolayers of InAs, respectively. Three thresholds were obtained from
fitting for the case of one monolayer of InAs (a) while only two were seen in the other
cases (b) and (c).
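
The multi-threshold fitting described above can be pictured with a simple model in which each conduction-band valley contributes a 5/2 power-law onset above its own threshold. The sketch below is only an illustration of that functional form: the threshold positions follow the values quoted for one monolayer of InAs (0.80 eV plus separations of about 300 and 470 meV), whereas the amplitudes and all names are hypothetical.

```python
def beem_current(v_tip, thresholds, amplitudes, exponent=2.5):
    """Model BEEM current as a sum of power-law onsets.

    Each channel contributes a * (V - V_th)**exponent once the tip voltage
    exceeds its threshold V_th, and nothing below it.
    """
    total = 0.0
    for v_th, amplitude in zip(thresholds, amplitudes):
        if v_tip > v_th:
            total += amplitude * (v_tip - v_th) ** exponent
    return total


# Thresholds (V) taken from the separations quoted in the text for one
# monolayer of InAs; amplitudes are arbitrary illustrative weights.
thresholds = (0.80, 1.10, 1.27)
amplitudes = (1.0, 0.5, 0.25)

for v_tip in (0.7, 0.9, 1.2, 1.5):
    current = beem_current(v_tip, thresholds, amplitudes)
    print(f"V = {v_tip:.1f} V -> I_c = {current:.4f} (arb. units)")
```

In an actual analysis the thresholds and amplitudes would be free parameters adjusted by a least-squares fit to the measured spectrum; here they are fixed to show how additional channels change the slope of the curve.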


Figure 2 shows the variation of the barrier heights (lowest thresholds) as a function of
the InAs layer thickness. The "zero monolayer" sample is simply a Au/GaAs reference system.
The data show that the barrier decreases with the thickness of the InAs interlayer. One
monolayer of InAs lowers the barrier from 0.90 to 0.80 eV, a value which is uniform across
the interface. No barriers as high as that of Au/GaAs were measured in this structure,
which indicates that a uniform InAs film has been produced. As the thickness of the InAs
layer is increased to three monolayers, the barrier was found to decrease to ~0.74 eV. The
barrier height remains approximately constant for up to 27 monolayers, after which it
reduces to ~0.63 eV for the case of 33 monolayers (100 Å) and beyond.

Fig. 2. The variation of barrier height as a function of the InAs layer thickness.

It is well known that the contact between Au and n-InAs material is ohmic; thus the
measured barrier for the thick InAs layer can be seen as the effective band offset between
the InAs and GaAs. It is interesting to note that the measured barriers fall into three
groups (Fig. 2), which coincide with the three transition stages of the InAs film. A thin
uniformly strained InAs film (one monolayer) has a barrier height of 0.80 eV; relaxation of
strain through islanding (3-27 monolayers) resulted in a smaller height of ~0.74 eV;
heavily dislocated films (≥33 monolayers) have the lowest barrier of ~0.63 eV. The barrier
height at each stage does not vary considerably, with the largest fluctuation of about
±50 meV. Most of the observed barriers are very close to the average value. Figure 3 shows
a histogram for the case of 27 monolayers. The measured barriers over different positions
are largely concentrated around 0.74 eV, which indicates that the interface is
predominantly a uniform one.

Fig. 3. The histogram of the barrier heights obtained from the sample with 27 monolayers of
InAs.

Oloumi and Matthai calculated the band alignment at the InAs/GaAs interface and found that
the band offsets were crucially dependent on the interlayer separation at the interface.
For InAs pseudomorphically strained on a GaAs substrate, they found that if the interlayer
separation is fully energy minimized, which was akin to the As layer being pulled towards
the In atom and away from the Ga atom, a conduction band offset of 1.0 eV (higher than the
Schottky barrier between Au and GaAs) can be achieved. However, when the interlayer
separation was the average of the bond lengths, the conduction band offset was found to
reduce to half of the previous value. This latter configuration could be thought to
represent a fully relaxed InAs film, whereas in the former case the first layer InAs atoms
are strained onto the substrate. The conclusion would therefore be that when the strain is
relieved at the interface (e.g., through misfit dislocations) there is a corresponding
decrease in the conduction band offset. Our results of 0.63 eV (≥33 monolayers) and 0.74 eV
(≤27 monolayers), although not in agreement with the calculated values, are consistent with
the calculated trend, assuming that ≥33 monolayers represents the structure in which the
strain in the InAs film is relieved by the onset of dislocations.

The idea of strain relaxation through islanding is consistent with the above argument as
well, since the islands are said to have an average size of around 300 Å, which is much
larger than the expected BEEM resolution. The barrier heights measured on top of the
islands therefore represent the band offset between the InAs island and the GaAs substrate.
The different average value at each transition stage is probably a result of a different
degree of strain relaxation. For thick InAs layers (≥33 monolayers), the dislocations may
play a dominant role in the barrier formation, while in the region from 3-27 monolayers the
measured barrier height is probably the consequence of the band offset between the strained
InAs islands and the GaAs substrate. For even thinner InAs layers (below one monolayer), it
is inappropriate to talk about the band structure of the InAs; the measured barrier height
is essentially the Schottky barrier between Au and GaAs, but modified in the interface
region by the dipoles from the InAs interlayer.

Fig. 4. The band alignment at the InAs/GaAs interface. The relative positions of the Γ, L,
and X valleys for Fig. 5 are obtained from Ref. 17. Other work (such as Ref. 15) suggests a
similar configuration.

Another important feature in the BEEM spectra is that there are three thresholds for the
case of the Au/GaAs contact (zero monolayers of InAs) and for the samples with a very thin
layer of InAs (one monolayer or less), but only two [Figs. 1(b) and 1(c)] can be found when
the InAs interlayer is increased to three monolayers or beyond. For systems with three or
five monolayers of InAs, the two observed thresholds have energy separations of 340 and
370 meV, respectively. When the InAs layer thickness is increased to 10 monolayers, the
energy separation of the two thresholds becomes 410 meV and remains at that value for up to
27 monolayers. For samples with 33 monolayers of InAs or thicker, it was difficult to
ascertain exactly how many upper thresholds exist because the BEEM current is noisy, which
is not surprising since the barrier height of 0.63 eV is about the lower limit for room
temperature operation.

As mentioned earlier, the three thresholds observed for the case of a very thin layer of
InAs (below one monolayer) are identified as electron transmission into the Γ, L, and X
valleys of the GaAs, respectively. To understand why only two thresholds are observed for
3-27 monolayers of InAs, it is necessary to examine the band structure, or band alignment,
of the system in more detail. The first point to note is that when considering transport
along the (001) direction, the X point can be projected onto the Γ point whereas the L
point cannot. Thus Γ-L valley transmission can only occur when the condition for lateral
momentum conservation is relaxed. This is more likely to happen at the Au-InAs interface
than at the semiconductor heterojunction (InAs/GaAs). Second, the X and L valleys of the
InAs are higher in energy than those in GaAs (see Fig. 4). Finally, the relative positions
of the L and X valleys in InAs do change with strain. Thus of the four possible transport
routes into the higher valleys (L, X) of GaAs, Au-Γ(InAs)-L(GaAs), Au-Γ(InAs)-X(GaAs),
Au-L(InAs)-L(GaAs), and Au-X(InAs)-X(GaAs), only the first two could show up as a threshold
at the corresponding valley position of GaAs, and of these only the second satisfies the
conservation rule. The last two routes, if in operation, would manifest as a threshold at
the corresponding InAs valley position (rather than GaAs). Since some theoretical
calculations predict that the L valley of InAs is about 1.2 eV higher than the Γ point, we
therefore believe that the second threshold [at 1.13 eV in Fig. 1(c)] for 10-27 monolayers
of InAs is a measure of the X valley energy in GaAs. The separation between the two
thresholds (410 meV) is in broad agreement with the Γ-X spacing. The second threshold
observed for the thinner (three and five monolayers) InAs interlayers (340 and 370 meV)
could be a result of direct electron scattering from Au into the L "valley" of GaAs. As
mentioned earlier, when the InAs interlayer is only one monolayer or even thinner, the
direct electron scattering from Au into the L valley minimum (300 meV from the Γ point) of
GaAs is strong and clearly visible [Fig. 1(a)]. As the interlayer thickness increases, the
scattering strength from Au into the L valley minimum of GaAs is expected to decrease and
become invisible. However, since the scattering strength is believed to be energy
dependent, an increase in energy would help to enhance the electron scattering rate.
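
As a rough numerical companion to this discussion, the observed threshold separations can be compared with the Γ-L (~300 meV) and Γ-X (~470 meV) separations of GaAs quoted earlier. The sketch below uses a nearest-energy comparison, which is only a crude heuristic and not the momentum-conservation argument given in the text; the function and variable names are hypothetical.

```python
# GaAs valley separations from the Gamma minimum (meV), as quoted in the text.
GAAS_VALLEY_SEPARATION_MEV = {"L": 300.0, "X": 470.0}


def nearest_valley(separation_mev):
    """Return the GaAs valley whose separation from Gamma is closest in energy."""
    return min(
        GAAS_VALLEY_SEPARATION_MEV,
        key=lambda valley: abs(GAAS_VALLEY_SEPARATION_MEV[valley] - separation_mev),
    )


# Observed separations between the two thresholds (meV) for given InAs
# thicknesses in monolayers, taken from the text.
observed = {3: 340.0, 5: 370.0, 10: 410.0}

assignments = {ml: nearest_valley(sep) for ml, sep in observed.items()}
for monolayers, separation in observed.items():
    print(f"{monolayers:2d} ML: {separation:.0f} meV -> closest to Gamma-"
          f"{assignments[monolayers]}")
```

The separations for three and five monolayers fall closer to the Γ-L value and the 410 meV separation falls closer to the Γ-X value, in line with the assignments proposed above, although the 370 meV case sits nearly midway between the two.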

Finally, we would like to point out that we have not seen the current contribution from the
X valley of the GaAs for these two samples (three and five monolayers). The precise reason
is unknown. One possible explanation is that the X valley contribution is close in energy
to the observed second threshold, which would make it very difficult to resolve in the
fitting procedure. It is therefore possible that the second threshold is in fact due to a
combination of two processes. Further attention is clearly needed for samples with such a
thin interlayer (~5 monolayers) of InAs.

IV.  CONCLUSIONS 

Ballistic electron emission microscopy has been used to probe the barrier variation of the
Au/InAs/GaAs system as the thickness of the InAs was varied between 0 and 40 monolayers.
The barrier height generally decreases with the InAs thickness and its variation has been
found to correlate with the strain relaxation of the InAs layer. The transport properties
were investigated as well, and the absence of electron transmission into the L valley of
the GaAs substrate for samples with a thicker InAs layer (10-27 monolayers) was explained
by the requirement of lateral momentum conservation across the InAs/GaAs interface, a
requirement that is relaxed for the less perfect Au/InAs and Au/GaAs interfaces. For
systems with intermediate thickness of the InAs layer (three or five monolayers), the
second threshold in BEEM spectra is believed to be a result of direct electron scattering
from Au into the GaAs.


Direct simulations of interface roughness effects on transport properties of tunnel
structures are performed using the planar supercell stack method. The method allows for the
inclusion of realistic three-dimensional rough interfacial geometries in transport
calculations. For double barrier resonant tunneling structures, we used our method to
analyze the effect of roughness at each of the four interfaces, and to test for sensitivity
of transport properties to island size and height. Our simulations yield the following
conclusions: (1) We find that scattering of off-resonance states into on-resonance states
provides the dominant contribution to interface roughness assisted tunneling. Analysis of
scattering strength sensitivity to interface layer configurations reveals preferential
scattering into $\Delta k_\parallel \approx 2\pi/\lambda$ states, where $\lambda$ is the
island size. (2) We find that roughness at interfaces adjacent to the quantum well can
cause lateral localization of wave functions, which increases with island size and depth.
Lateral localization can result in the broadening and shifting of transmission resonances,
and the introduction of preferential transmission paths. In structures with wide and tall
islands, it is possible to find localization over "islands" as well as localization over
"oceans." (3) The leading rough interface is the strongest off-resonance scatterer, while
rough interfaces adjacent to the quantum well are the strongest on-resonance scatterers.
The trailing interface is the weakest scatterer. © 1996 American Vacuum Society.


I.  INTRODUCTION 

While simple one-dimensional models can explain the qualitative behavior of double barrier
resonant tunneling diodes, they are less successful at reaching quantitative agreement with
experimental results, particularly in predicting peak-to-valley current ratios. The
discrepancy has been attributed to electron-phonon scattering, electron-electron
interactions, band structure effects, impurity scattering, and interface roughness. In this
work, we focus on the effect of interface roughness which, to some extent, can be
manipulated by controlling the growth process. Several theoretical works have analyzed
interface roughness effects previously. Liu and Coon were among the first to approach the
subject, modeling rough interfaces with regularly corrugated patterns. Chevoir and Vinter
and Rudberg studied the topic with Fermi's golden rule. Leo and MacDonald also treated
interface roughness to leading order, but with a technique which preserved unitarity. These
perturbative approaches have been successful in dealing with small island sizes. More
recently, Johansson applied the coherent potential approximation, which included multiple
scattering events. One assumption frequently invoked in the previous works is that
roughness scattering from the four interfaces in a double barrier structure can be treated
independently. Several authors treated interface roughness only at a single interface, and
there is some controversy as to which one of the four interfaces plays the more important
role. In this work we formulate the problem in real space with a tight-binding supercell
model which allows for arbitrary spatial variations within laterally repeating supercells.
Exact solutions are obtained in our calculations. Our model is well suited for studying
effects such as lateral spatial localization, providing a good complement to previous
studies.

II.  METHOD 

In the planar supercell stack method (PSSM), we define the active region of a structure as a stack of layers perpendicular to the z direction, with each layer containing a periodic array of rectangular planar supercells of $N_x \times N_y$ sites. A one-band nearest-neighbor tight-binding Hamiltonian is used to describe the potential over this volume of interest. Our model is formally equivalent to the one-band effective mass equation^

$$ -\frac{\hbar^2}{2}\,\nabla\cdot\left[\frac{1}{m(\mathbf{x})}\,\nabla\psi(\mathbf{x})\right] + V(\mathbf{x})\,\psi(\mathbf{x}) = E\,\psi(\mathbf{x}) $$

discretized over a Cartesian grid, and subject to periodic boundary conditions (with supercell periodicity) in the x and y directions, and open boundary conditions in the z direction.
Since the planar supercell model may be considered as a multiband model with $N_x N_y$ bands, transmission coefficients for structures described by the planar supercell stack can be determined exactly by the direct application of the multiband quantum transmitting boundary method.^ Our method requires accurate and efficient solutions of large sparse linear systems, which are obtained using the quasi-minimal residual method.

Fig. 1. Transmission coefficients for a set of GaAs/AlAs double barrier structures each containing two rough interfaces (between the leading electrode and the first barrier, and between the quantum well and the second barrier). The different structures have different island sizes, as indicated in the legend. Also included for comparison is a structure where each of the two interface layers is replaced by a VCA Al$_{0.5}$Ga$_{0.5}$As alloy layer. 32×32 planar supercells are used for this calculation.
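As a rough illustration of the one-band nearest-neighbor tight-binding discretization with open boundary conditions along z, the sketch below treats the one-dimensional limit only (no lateral supercell, so no roughness scattering). It is not the PSSM itself: the transmission is obtained here from a lead self-energy (Green's function) formulation rather than the quantum transmitting boundary method, a uniform effective mass is assumed, and the barrier height, barrier width, monolayer spacing, and effective mass in the demo are illustrative assumptions rather than values taken from the text.

```python
import numpy as np

HBAR = 1.054571817e-34   # J s
M0 = 9.1093837015e-31    # kg
EV = 1.602176634e-19     # J per eV


def hopping_energy(m_eff, a):
    """Nearest-neighbor hopping t = hbar^2 / (2 m a^2), returned in eV."""
    return HBAR**2 / (2.0 * m_eff * M0 * a**2) / EV


def transmission(energy, potential, t):
    """Transmission through a 1D tight-binding chain.

    `potential` holds the on-site potential (eV) of the device sites.
    Both leads are flat (V = 0) semi-infinite chains with the same hopping t,
    so `energy` must lie inside the lead band, 0 < E < 4t.
    """
    v = np.asarray(potential, dtype=float)
    n = v.size
    ka = np.arccos(1.0 - energy / (2.0 * t))   # lead dispersion E = 2t(1 - cos ka)
    sigma = -t * np.exp(1j * ka)               # retarded self-energy of one lead
    gamma = 2.0 * t * np.sin(ka)               # lead broadening, -2 Im(sigma)

    h = np.diag(2.0 * t + v).astype(complex)
    h += np.diag(-t * np.ones(n - 1), 1) + np.diag(-t * np.ones(n - 1), -1)
    h[0, 0] += sigma                           # open boundary, incident side
    h[-1, -1] += sigma                         # open boundary, transmitted side

    rhs = np.zeros(n, dtype=complex)
    rhs[0] = 1.0
    g_col = np.linalg.solve(energy * np.eye(n) - h, rhs)  # first column of G
    return float(gamma**2 * abs(g_col[-1])**2)


if __name__ == "__main__":
    a = 2.83e-10                     # assumed monolayer spacing (m)
    t = hopping_energy(0.067, a)     # assumed GaAs-like effective mass
    v = np.zeros(24)
    v[2:6] = 1.0                     # hypothetical 4-monolayer, 1 eV barrier
    v[18:22] = 1.0                   # second barrier; 12-monolayer well between
    energies = np.linspace(0.01, 0.6, 600)
    spectrum = [transmission(e, v, t) for e in energies]
    print("strongest transmission near E =", energies[int(np.argmax(spectrum))], "eV")
```

In the full method each site index would be replaced by an $N_x \times N_y$ supercell layer, which is what turns the small dense solve above into the large sparse linear system mentioned in the text.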

III.  RESULTS  AND  DISCUSSION 

We examine how the transmission properties of GaAs/AlAs double barrier heterostructures are modified by the presence of interface roughness. We consider a set of structures with $L_W = 12$ (monolayers) GaAs wells and AlAs barriers. For each GaAs-AlAs interface on the left (incident) side of an AlAs barrier, a 50% random coverage interface layer is inserted between the pure GaAs and AlAs layers. This configuration corresponds to having rough inverted interfaces and smooth normal interfaces, i.e., if we number the four interfaces 1 through 4, starting from the incident side, then interfaces 1 and 3 are rough, and interfaces 2 and 4 are smooth. A simulated annealing algorithm^ can be used to generate random configurations of interfacial layers with different island sizes. Note that the actual island size, denoted by $\lambda$, depends on the supercell configuration as well as the lateral discretization size. In this article, 32×32 planar supercells are used in all calculations.

Figure 1 shows transmission spectra near the $n = 1$ resonance for the structures discussed above. These structures have average island sizes ($\lambda$) of 5.3 Å, 119 Å, and 232 Å. For comparison we also show the result for a reference structure in which each of the two rough interfacial layers is replaced by a smooth virtual crystal approximation (VCA) Al$_{0.5}$Ga$_{0.5}$As alloy layer. The incident plane waves are chosen to have no in-plane momentum ($k_\parallel = 0$). Our results are similar to those we obtained in a calculation using a smaller (20×20) planar supercell.^ We describe the main points below and refer readers to a more detailed discussion in Reference 12.

Figure 1(a) demonstrates the dependence of interface roughness scattering on island size. Evidently, if the length scale of roughness is small compared to the de Broglie wavelength, as in the case of the $\lambda = 5.3$ Å structure, the interface roughness potential is felt by the electron only in an averaged sense, and thus replacing the rough interfacial layer with a smooth VCA layer is a good approximation. For the structures with larger island sizes of $\lambda = 119$ Å and $\lambda = 232$ Å, the transmission spectra show a series of satellite peaks not found in the reference (VCA) spectrum. The presence of satellite peaks is due to interface roughness assisted resonant tunneling, which can be pictured as follows: A nominally off-resonance state is elastically scattered by interface roughness into a state with a different $k_\parallel$. If it so happens that the electron energy with respect to the scattered $k_\parallel$ is on-resonance, then resonant tunneling occurs, leading to enhanced transmission. Since coupling between different $k_\parallel$ states is determined by the interface roughness configuration, we would expect to find preferential scattering into $\Delta k_\parallel \approx 2\pi/\lambda$ states. Arrows in Figure 1(a) indicate positions where preferential scattering is expected to occur for the $\lambda = 119$ Å and $\lambda = 232$ Å structures based on the simple estimate using island sizes; the results seem to confirm our expectations.
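The simple estimate based on island size can be made concrete as follows. For an incident state with $k_\parallel = 0$, elastic scattering into $\Delta k_\parallel \approx 2\pi/\lambda$ diverts an energy $\hbar^2 \Delta k_\parallel^2 / 2m^*$ into in-plane motion, so a satellite would be expected at roughly that energy above the main resonance. The sketch below evaluates this offset under two assumptions that are not stated in the text: a parabolic in-plane dispersion and a GaAs-like effective mass of $0.067\,m_0$. The function name is hypothetical.

```python
import math

HBAR = 1.054571817e-34   # J s
M0 = 9.1093837015e-31    # kg
EV = 1.602176634e-19     # J per eV


def satellite_offset_mev(island_size_angstrom, m_eff=0.067):
    """Energy offset (meV) above the main resonance for scattering into
    Delta k = 2*pi/lambda, assuming a parabolic band with mass m_eff * m0."""
    lam = island_size_angstrom * 1e-10           # island size in meters
    dk = 2.0 * math.pi / lam                     # preferred in-plane wave vector
    return HBAR**2 * dk**2 / (2.0 * m_eff * M0) / EV * 1e3


if __name__ == "__main__":
    for lam in (119.0, 232.0):
        print(f"lambda = {lam:5.1f} A -> offset = {satellite_offset_mev(lam):.1f} meV")
```

Because the offset scales as $1/\lambda^2$, the larger islands place their preferred satellite much closer to the main resonance. In a tight-binding model the in-plane dispersion is not exactly parabolic, so these numbers should be read as an order-of-magnitude guide only.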

In principle, interface roughness scattering can result in a continuous range of $\Delta k_\parallel$. In our model, an incident wave can only be scattered into states with $\Delta k_\parallel$ equal to a reciprocal lattice vector g of the (finite) planar supercell. Thus, the appearance of discrete satellite peaks in our calculated transmission spectra is an artifact of the finite supercell size. This artifact can be mitigated by the use of larger supercells. The results of the present 32×32 supercell calculation show closer spacing between satellite peaks than the previous 20×20 supercell calculation.^ In the limit of very large supercell sizes, the spacings between the satellite peaks would become smaller than the resonance peak width, and the satellite peaks would then coalesce, giving a much smoother appearance.
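The finite-supercell argument can be sketched numerically. For an $N \times N$ supercell with lateral site spacing $a$, the allowed $\Delta k_\parallel$ values are $(2\pi/Na)(n_x, n_y)$, so the satellite offsets $\hbar^2 |g|^2 / 2m^*$ form a discrete ladder whose spacing shrinks as $1/N^2$. The lateral spacing is not given in the text, so `a_lat` below is a hypothetical parameter, and the parabolic dispersion and effective mass are assumptions as well.

```python
import numpy as np

HBAR = 1.054571817e-34   # J s
M0 = 9.1093837015e-31    # kg
EV = 1.602176634e-19     # J per eV


def satellite_ladder_mev(n_sites, a_lat, m_eff=0.067, n_max=3):
    """Distinct nonzero energy offsets (meV) for the supercell reciprocal
    lattice vectors g = (2*pi / (n_sites * a_lat)) * (nx, ny), |nx|,|ny| <= n_max."""
    n = np.arange(-n_max, n_max + 1)
    nx, ny = np.meshgrid(n, n)
    n2 = np.unique(nx**2 + ny**2)        # distinct values of nx^2 + ny^2
    n2 = n2[n2 > 0]                      # drop the unscattered (g = 0) channel
    g0 = 2.0 * np.pi / (n_sites * a_lat)
    return HBAR**2 * g0**2 * n2 / (2.0 * m_eff * M0) / EV * 1e3


if __name__ == "__main__":
    a_lat = 1.0e-9                       # hypothetical lateral site spacing (m)
    for n_sites in (20, 32):
        ladder = satellite_ladder_mev(n_sites, a_lat)
        print(n_sites, "x", n_sites, "first offsets (meV):", np.round(ladder[:4], 2))
```

Whatever the actual site spacing, the ratio between the 20×20 and 32×32 ladders is fixed at $(32/20)^2 = 2.56$, which is the sense in which the larger supercell gives more closely spaced satellite peaks.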

Figure 1(b) shows that the main resonance peak broadens and shifts to lower energy as island size increases. We attribute this to lateral wave function localization. The rough interfacial layer between the quantum well and the second barrier introduces well width fluctuation. If the island sizes are sufficiently large, we could consider the quantum well as consisting of wide-well ($L_W = 13$, in this example) and narrow-well ($L_W = 12$) regions. If a quasibound level is localized (laterally) in the wide-well region, its resonance energy would be lowered. Evidently, the degree of lateral localization increases with island size. Reference 12 examines wave function localization directly; it also demonstrates preferential transmission paths resulting from wave function localization. Note also that since the second barrier is thinner where the quantum well is wider, an electron localized in the wide-well/thin-barrier region can escape with more ease through the thinner barrier, leading to a shorter quasibound state lifetime, or broader resonance, as seen in Figure 1(b).

Fig. 2. Transmission coefficients for a set of GaAs/AlAs double barrier structures with interface roughness only in one of the four interfacial layers. The total transmission coefficient is shown in the top panel, while the transmission coefficient for only the scattered components (i.e., $k_\parallel^{\mathrm{incident}} \neq k_\parallel^{\mathrm{transmitted}}$) is shown in the bottom panel.

Fig. 3. Transmission coefficients for two GaAs/AlAs double barrier structures. The rough interfacial layer between the quantum well and the barrier for one structure (dashed line) is one monolayer thick, and for the other (solid line) is three monolayers.

In Figure 2 we explore the role of the individual interfaces. Strictly speaking, with wave function coherence across the entire device structure, encompassing all four interfaces, we cannot separate out the effects due to the different rough interfacial layers. Nevertheless, it is instructive to study the following. We examine four different configurations. Configuration 1 is similar to the configurations discussed earlier, except that the third interface layer (between the quantum well and the second barrier) is now replaced by a smooth VCA layer, leaving only the leading interface (1) rough. We use "-r-B-W-v-B-" to denote the layer sequence of this structure. Configurations 2, 3, and 4 are similarly constructed, each with one rough interface and one VCA interface; their layer sequences are shown in the inset of Figure 2. We note that the main resonance peak is down-shifted only in configurations 2 and 3, since lateral localization of the wave function only occurs if one of the interfaces adjacent to the quantum well is rough. We also note that interface roughness scattering is the least important in configuration 4, where the rough interfacial layer is located away from the incident side. This is because unless the incident electron is on-resonance, and the transmission coefficient is reasonably large, the wave function is largely located on the incident side, with extremely small probability of penetrating across the device structure to the transmitted side to sense the roughness at the far interface. The same argument explains the fact that off-resonance scattering is the largest in configuration 1, where the leading interface is rough.

Figure 3 shows the transmission spectra of two structures with similar island sizes but different island heights. The first structure is identical to the $\lambda = 232$ Å structure from Figure 1. The second structure is similar to the first one, except that the random-alloy interfacial region between the quantum well and the second barrier is now expanded into three monolayers (at the expense of the surrounding well and barrier layers, so as to keep the effective well width the same). We note that the main resonance peak of the second structure (with taller islands) is lower than that of the first structure; its wave function also shows a higher degree of lateral confinement over the wide-well region. It is also interesting to note that the wave function for the transmission peak at 218 meV shows the opposite type of lateral confinement, i.e., the wave function is localized over the islands themselves (the narrow-well regions) rather than over the "ocean." This indicates a splitting of the $n = 1$ quasibound state into orthogonal states laterally localized in different regions of the quantum well.

IV.  SUMMARY 

We studied the effect of interface roughness on resonant tunneling in double barrier structures using a tight-binding supercell model. We find that interface roughness assisted scattering of off-resonance states into on-resonance states provides the dominant scattering channel. We showed that scattering strength is sensitive to the configuration of the roughness layer, and in particular, we find preferential scattering into $\Delta k_\parallel \approx 2\pi/\lambda$ states ($\lambda$ = island size). We showed that lateral wave function localization increases with island size and depth, and is responsible for broadening and shifting of transmission resonances. Both types of lateral localization (over "islands" or over "ocean") are possible. We also explored the roles of the individual interfaces, and showed that the leading rough interface is the strongest off-resonance scatterer, while the rough interfaces adjacent to the quantum well are the strongest on-resonance scatterers.

Electron transport at Au/InP interface with nanoscopic exclusions

We present an investigation of electron transport at the Au/InP metal semiconductor (MS) interface
in the presence of nanoscopic barrier inhomogeneities. In particular, we focus on the transport
regime wherein the low barrier inhomogeneous regions are pinched off by the depletion potential of
the surrounding higher barrier region. To realize this, we have fabricated a composite MS structure
comprising a known density of nanometer sized Ag aerosol particles on InP overlaid by a
uniform Au film. Temperature dependent current-voltage (I-V) measurements were performed to
characterize  the  electron  transport  at  the  composite  MS  interface.  The  experimental  observations  are 
explained  using  a  simple  model  for  the  MS  junction  current  in  which  the  Ag/InP  regions  are 
assumed  to  be  identical  and  noninteracting.  Our  results  clearly  demonstrate  that  the  electron 
transport  at  the  MS  interface  is  significantly  affected  by  low  barrier  regions  even  in  the  pinch-off 
regime.  In  addition,  the  influence  of  the  particle  density  and  the  Ag/InP  barrier  heights  on  the  diode 
characteristics  is  also  investigated.  It  is  suggested  that  Schottky  barrier  inhomogeneities  could  be  the 
main  source  of  the  usually  observed  larger-than-unity  ideality  factors  in  diodes.  Furthermore,  our 
results  indicate  possibilities  of  using  such  composites  to  explore  the  physics  and  applications  of 
nanoinjecting  contacts.  ©  1996  American  Vacuum  Society. 


I. INTRODUCTION

The formation of the Schottky barrier at the metal semiconductor (MS) interface has been a subject of extensive investigation for several years.^ Pinning of the Fermi level by electronic states at the MS interface has been frequently invoked to explain the formation of Schottky barriers. More recently, experimental investigations of epitaxial MS interfaces suggest that the barrier heights depend on the structure of the MS interface.^ It is therefore plausible, though not certain, that nonepitaxial MS interfaces could be inhomogeneous, resulting in a distribution of barrier heights at the interface. Some investigations have recognized the presence of inhomogeneities and their effect on diode characteristics.^ Theoretical modeling of the effect of such inhomogeneities on the electron transport appears to be rather successful in accounting for a wide variety of properties, including the temperature dependence of electron transport properties at the interface as well as the variation of the ideality factor of the Schottky barriers.^ However, it is very difficult to verify the above model in nonepitaxial MS interfaces since the exact nature and extent of the inhomogeneities are usually unknown. In cases where the inhomogeneous region is not pinched off by the surrounding barrier, a parallel conduction model wherein the current is assumed to be a sum of currents flowing in all the individual patches can be used. This model has been employed with reasonable success to explain some of the observed anomalous characteristics of diodes.^ However, as pointed out in Ref. 8, the parallel conduction model is in serious error if the barrier height varies spatially on a scale less than or comparable to the depletion width. MS interfaces characterized by the above conditions present an interesting situation, and electron transport at such MS junctions is relatively unexplored.^

In this work we address the above questions by using Au/Ag composite barriers formed on weakly n-doped epitaxial InP. The main purpose of using a composite structure is to deliberately introduce inhomogeneities such that the spatial variation of the barrier height is on a scale comparable to the depletion width. The composites were formed by first depositing a known density of Ag aerosol particles (radius ≈ 17 nm) on the InP surface and subsequently evaporating Au onto the sample. This results in nanoscopic and well controlled barrier inhomogeneities at the MS interface produced by the embedded Ag particles. Temperature dependent current-voltage (I-V) measurements under forward bias were performed to investigate the effect of the inhomogeneities introduced by the Ag particles on the electron transport at the composite MS interface. The experimental results are explained using a simple model for the junction current wherein the Ag/InP regions are assumed to be identical and noninteracting. In this work, we demonstrate that the electron transport at the MS interface is significantly affected by the low barrier regions even in the pinch-off regime.

II.  EXPERIMENT 

The inhomogeneous Schottky barriers were produced by forming a composite MS structure on 5-μm-thick nominally undoped InP. The InP samples were grown by metal organic chemical vapor deposition (MOCVD) on InP substrates. The composite MS interface was realized by first depositing a known density of nanometer sized Ag aerosol particles on InP. Subsequently, 340-μm-diam, 100-nm-thick Au dots were made by evaporation of Au through a metal contact mask. A schematic sketch of the composite diode structure is shown in Fig. 1(a). A contact mode atomic force microscopy (AFM) image of the Ag particles is shown in Fig. 1(b). Prior to deposition of the Ag aerosol particles, the InP samples were cleaned in standard organic solvents, etched in HF:H2O (1:10) for 1 min, and rinsed thoroughly in deionized water. After deposition of the aerosols, without additional processing, the samples were loaded into the evaporation chamber, keeping the time of exposure of the sample surface to ambient air as short as possible. The physical dimensions of the deposited Ag aerosols (around 17 nm in radius) and the doping of InP (≈1×10^^ cm^-3) are such that the nanoscopic inhomogeneities produced by Ag/InP regions are pinched off by the depletion potential of the surrounding Au/InP regions. In the above context, a region at the MS interface is said to be pinched off if the majority carriers (originating from outside the space charge region) reach the MS interface by surmounting a potential barrier that is higher than the band edge position at the region.^ Due to this peculiar physical situation, we refer to the nanoscopic inhomogeneities as nanoscopic exclusions.

Fig. 1. (a) A schematic diagram of the composite MS diode structure showing the embedded Ag aerosol particles. (b) A (976 nm)$^2$ AFM image of the Ag aerosol particles for a representative sample with a nominal particle density of about 10^ cm^-2. The image was obtained using a Topometrix Explorer 1000 operated in ambient air. The Ag particles are uniform and typically 17 nm in radius.

Two sets of composite samples were fabricated, one with an Ag aerosol particle density of 5×10^ cm^-2, and the other with a much lower density of 10^ cm^-2. Further, prior to Au evaporation, some of the samples were also subjected to a moderate heat treatment of 350 °C for 5 min in nitrogen atmosphere to reduce the barrier height of the Ag/InP regions. Control samples consisting only of Au dots were also made for comparison. For a meaningful comparison, the InP surface preparation conditions for the control samples were kept as identical as possible to those of the composites. The back ohmic contact for all the samples was made by alloying Au-Sn to the n InP substrates. The diodes were characterized by I-V measurements in the temperature range 290-77 K. The I-V measurements were performed using a Keithley 236 source-voltage current meter configured by LabVIEW software.

The aerosol generator setup, which is described in detail elsewhere,^ consists of a furnace tube with an Ag-containing ceramic boat. This first furnace for the aerosol generation was kept at a temperature of 1240 °C, resulting in a sufficiently high number of particles with diameters below 100 nm. After a size selection procedure, Ag particles were sent through a second tube furnace. This furnace was operated at 500 °C, leading to a reshaping of the aerosol particles. After this sintering step, the particles exhibit an almost spherical appearance. Then, to keep the size distribution of the particles narrow after reshaping in the second furnace, a second size selection was carried out. The resulting Ag particles were deposited onto the InP surface in a deposition chamber by a perpendicular electric field. The whole process was run under nitrogen carrier flow to avoid oxidation of the particles. The physical density of the deposited particles can be varied by choosing appropriate deposition times. Contact mode AFM investigations of the Ag aerosol particles show that the particles are quite uniform, and are about 17 nm in radius [Fig. 1(b)].

III.  RESULTS  AND  DISCUSSION 

Variation of the Schottky barrier height (SBH) at the MS interface implies the variation of the potential from region to region. The composite MS structure described above can be thought of as an inhomogeneous Schottky barrier with a certain density of low barrier regions (due to the Ag particles) in an otherwise uniform high SBH (Au/InP). The Ag particle-InP interface region can be approximated by a circular patch of some effective radius, and due to the almost spherical shape of the particles the effective patch radius will be less than or at best equal to the geometrical radius of the particle. For a doping of 1×10^^ cm^-3, the depletion width due to Au/InP (barrier height of 0.5 V) is of the order of 250 nm, which is significantly larger than the radius of the Ag particles. Under these physical conditions, the potential $V(0,z)$ along the z axis and at the center of the low barrier Ag/InP circular patch of effective radius $R_{Ag}$ can be expressed as^

$$ V(0,z) = V_{bb}\left(1 - \frac{z}{W}\right)^2 + V_n + V_a - \Delta\left[1 - \frac{z}{\left(z^2 + R_{Ag}^2\right)^{1/2}}\right] \qquad (1) $$

where $V_{bb} = \Phi_b - V_n - V_a$ is the band bending due to the MS junction with a uniform SBH of $\Phi_b$, $W$ is the depletion width defined as $W = (2\epsilon_s V_{bb}/qN_D)^{1/2}$, where $\epsilon_s$ is the dielectric constant and $q$ the electronic charge, $V_n$ is the difference between the Fermi level and the conduction band minimum, $V_a$ is the applied bias, and $\Delta$ is the difference in the barrier potential between the uniform enveloping barrier and the patch. Figure 2 shows the simulated conduction band potential using Eq. (1). The radius of the patch is varied to illustrate the pinch-off situation. Clearly, the smaller the radius of the low barrier patch, the easier it is to obtain pinch-off conditions. In addition, the results of Fig. 2 also point to interesting possibilities for obtaining injecting nano-ohmic contacts by suitable choice of the patch parameters and of the sample structure in terms of doping, etc. Detailed simulations and discussions presented in Ref. 8 demonstrate that the dependence of the potential on the applied bias [Eq. (1)] significantly affects the transport characteristics of inhomogeneous MS interfaces.

Fig. 2. Conduction band potentials along the z axis for different patch radii, calculated using Eq. (1) for zero bias. The inset shows a schematic diagram of an isolated low barrier circular patch indicating the relevant coordinates. The Ag/InP regions are approximated by a circular patch of effective radius $R_{Ag}$.
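The pinch-off behavior can be explored with a short numerical sketch of the on-axis potential, written here in the form $V(0,z) = V_{bb}(1 - z/W)^2 + V_n + V_a - \Delta[1 - z/(z^2 + R_{Ag}^2)^{1/2}]$ with $W = (2\epsilon_s V_{bb}/qN_D)^{1/2}$. Several inputs are assumptions rather than values confirmed by the text: the doping is taken as $10^{16}$ cm$^{-3}$ (the exponent is illegible in the source, but this value gives a depletion width close to the quoted 250 nm), the relative dielectric constant of InP is taken as 12.5, and $V_n$ is taken as 0.1 V. The barrier height of 0.5 V and $\Delta = 0.45$ V follow the values quoted for the simulations. Function names are hypothetical.

```python
import numpy as np

Q = 1.602176634e-19       # C
EPS0 = 8.8541878128e-12   # F/m


def depletion_width(v_bb, n_d, eps_r=12.5):
    """Depletion width W = sqrt(2 eps_s V_bb / (q N_D)) in meters (SI inputs)."""
    return np.sqrt(2.0 * eps_r * EPS0 * v_bb / (Q * n_d))


def potential_on_axis(z, radius, phi_b=0.5, delta=0.45, v_n=0.1, v_a=0.0,
                      n_d=1e22, eps_r=12.5):
    """On-axis potential (V) above a low barrier circular patch of given radius.

    n_d is in m^-3; the default 1e22 m^-3 (1e16 cm^-3) is an assumed value.
    Only meaningful for 0 <= z <= W.
    """
    v_bb = phi_b - v_n - v_a
    w = depletion_width(v_bb, n_d, eps_r)
    z = np.asarray(z, dtype=float)
    patch = delta * (1.0 - z / np.sqrt(z**2 + radius**2))
    return v_bb * (1.0 - z / w)**2 + v_n + v_a - patch


def is_pinched_off(radius, phi_b=0.5, delta=0.45, v_n=0.1, v_a=0.0,
                   n_d=1e22, eps_r=12.5):
    """True if the potential rises above its interface value somewhere in the
    depletion region, i.e., a saddle point screens the low barrier patch."""
    w = depletion_width(phi_b - v_n - v_a, n_d, eps_r)
    z = np.linspace(0.0, w, 2001)
    v = potential_on_axis(z, radius, phi_b, delta, v_n, v_a, n_d, eps_r)
    return bool(v.max() > v[0] + 1e-9)


if __name__ == "__main__":
    print("W =", depletion_width(0.4, 1e22) * 1e9, "nm")
    for r_nm in (7, 17, 100, 500):
        print(r_nm, "nm patch pinched off:", is_pinched_off(r_nm * 1e-9))
```

Differentiating the expression at $z = 0$ shows that the potential initially rises whenever $\Delta/R_{Ag} > 2V_{bb}/W$, which is why small patches are pinched off more easily and why forward bias (smaller $V_{bb}$, hence smaller $2V_{bb}/W$ since $W \propto V_{bb}^{1/2}$) changes the saddle point height.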
Turning to the evaluation of the total current flow in such MS interfaces, the exact expression for the current is rather complicated and depends on the detailed nature of the inhomogeneities and their distributions.^ However, under certain reasonable approximations, analytical expressions can be obtained for the total current at the MS junction. In particular, for a MS interface containing a low density of nearly identical low barrier circular patches, the total forward bias current ($I_{tot}$) can be expressed as^

$$ I_{tot} = A A^{*} T^2 \exp(-\beta\Phi_b)\left[\exp(\beta V_a) - 1\right] \times \left\{ 1 + \frac{4\pi d\,\gamma\,\eta^{2/3}}{9\beta V_{bb}^{2/3}} \exp\left[\beta\gamma\left(\frac{V_{bb}}{\eta}\right)^{1/3}\right] \right\} \qquad (2) $$

where $A$ is the area of the diode, $A^{*}$ is the Richardson constant, $T$ is the temperature, $d$ is the areal density of patches, $\beta = q/kT$ is the Boltzmann factor, $\eta = \epsilon_s/(qN_D)$, and $\gamma = 3(\Delta R_{Ag}^2/4)^{1/3}$. The parameters $\gamma$ and $d$ are characteristic of the low barrier patches.
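To see how the patch parameters $\gamma$ and $d$ enter the current, the sketch below evaluates a two-term expression of the kind described above: a homogeneous thermionic term $A A^{*} T^2 \exp(-\beta\Phi_b)[\exp(\beta V_a) - 1]$ multiplied by $1 + \rho$, where the patch-to-background ratio is assumed to take the form $\rho = [4\pi d\gamma\eta^{2/3}/(9\beta V_{bb}^{2/3})]\exp[\beta\gamma(V_{bb}/\eta)^{1/3}]$. This functional form for $\rho$, the Richardson constant of 9.4 A cm$^{-2}$ K$^{-2}$, the doping of $10^{16}$ cm$^{-3}$, the patch density of $5\times10^{9}$ cm$^{-2}$, $V_n = 0.1$ V, and the dielectric constant are all assumptions made for illustration (the exponents of the doping and density are illegible in the source). Series resistance is ignored.

```python
import numpy as np

Q = 1.602176634e-19       # C
KB = 1.380649e-23         # J/K
EPS0 = 8.8541878128e-12   # F/m


def total_current(v_a, temp, area=9e-8, richardson=9.4e4, phi_b=0.5,
                  delta=0.45, radius=7e-9, density=5e13, n_d=1e22,
                  v_n=0.1, eps_r=12.5):
    """Return (I_tot in A, patch-to-background ratio) for forward bias v_a (V).

    SI units throughout: area in m^2, richardson in A m^-2 K^-2,
    density in m^-2, n_d in m^-3. Requires v_a < phi_b - v_n so that V_bb > 0.
    """
    beta = Q / (KB * temp)                            # 1/V
    v_bb = phi_b - v_n - v_a                          # band bending (V)
    eta = eps_r * EPS0 / (Q * n_d)                    # m^2/V
    gamma = 3.0 * (delta * radius**2 / 4.0)**(1.0 / 3.0)

    background = (area * richardson * temp**2
                  * np.exp(-beta * phi_b) * (np.exp(beta * v_a) - 1.0))
    ratio = (4.0 * np.pi * density * gamma * eta**(2.0 / 3.0)
             / (9.0 * beta * v_bb**(2.0 / 3.0))
             * np.exp(beta * gamma * (v_bb / eta)**(1.0 / 3.0)))
    return background * (1.0 + ratio), ratio


if __name__ == "__main__":
    for temp in (290.0, 200.0, 120.0, 77.0):
        current, ratio = total_current(0.1, temp)
        print(f"T = {temp:5.1f} K  I = {current:.3e} A  patch/background = {ratio:.3g}")
```

Under these assumptions the exponential factor $\exp[\beta\gamma(V_{bb}/\eta)^{1/3}]$ grows rapidly as the temperature is lowered, so the patch term overtakes the homogeneous Au/InP term at low temperature, in line with the qualitative behavior described in the text.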

Within some reasonable limits, our composite MS structure can be assumed to correspond to the above situation, and the model for the electron transport [Eq. (2)] can be used to qualitatively illustrate the I-V characteristics of the composite MS diodes. Figure 3 shows the simulated forward bias I-V characteristics using Eq. (2) for our composite diode for some representative parameters. At higher temperatures, the electron transport is dominated by thermionic emission over the Au/InP barrier. At low temperatures and low biases, however, the junction current is dominated by the Ag/InP regions. In fact, one also sees the development of anomalous plateaulike sections in the I-V curves, which are due to an ohmic effect. Since the current densities near the patches may be much higher than the average current density, the current through the patches tends to be limited by series resistance effects, resulting in plateaulike features in the I-V curves. If the pinch-off is very effective (a very small patch radius), we find that variation of the local SBH at the Ag/InP interface does not have a very significant effect on the appearance of the I-V curves. However, in simulations for larger patch radii, variation of $\Delta$ results in strong changes in the I-V curves, resulting in pronounced plateaulike features. To investigate the influence of the patch (Ag particle) density on the transport properties, we varied the patch density from 5×10^ cm^-2 down to 10^ cm^-2 while keeping the other parameters the same as those used for the I-V curves of Fig. 3. The I-V curves (not shown here) exhibit a systematic dependence on particle density, especially at low temperatures. The higher the density, the larger the influence on the transport properties. We also note here that in simulations using larger effective patch radii, we observe that the influence of the patch density on the I-V curves is even more pronounced. Below, we discuss our experimental results and show that the observed characteristics of the composite diodes are qualitatively consistent with the simulations presented above.

Fig. 3. Simulated current-voltage characteristics at the inhomogeneous MS (composite) interface, calculated using Eq. (2). The values of the parameters used are $A = 9\times10^{-4}$ cm$^2$, $\Phi_b = 0.5$ V, patch radius = 7 nm, $\Delta = 0.45$ V, $N_D$ = 1×10^^ cm^-3, and patch density $d$ = 5×10^ cm^-2. The effects of series resistance are also included in the calculations.

Fig. 4. Measured I-V curves for a representative Au/InP control sample at
selected temperatures.

Fig. 5. Typical set of measured I-V curves for the Au/Ag/InP composite
samples at different temperatures. The Ag particles are as-deposited and
their density is 5X10^ cm^-2. The departure from the semilogarithmic behavior
of the I-V characteristics is indicated by the arrow.

Fig. 6. (a) Typical set of measured I-V curves for the Au/Ag/InP composite
samples at different selected temperatures. The Ag particles have a density of
5X10^ cm^-2 and are alloyed to the InP surface. (b) I-V scans at selected
temperatures for a low particle density (10^ cm^-2) composite sample. Here,
also, the Ag particles were alloyed to the InP surface. The arrows indicate
the departure from the semilogarithmic behavior of the I-V characteristics.

Figure 4 shows the measured I-V characteristics at selected
temperatures for a representative Au/InP diode (control
sample). Several diodes were measured and their characteristics
are similar to those shown in Fig. 4. The measured
I-V curves show the normal semilogarithmic behavior, and
there is no perceptible departure from linearity with temperature.
The barrier height obtained from an activation plot
(I-V-T) is 0.5 V and is in good agreement with values
reported in the literature. As discussed in the preceding paragraph,
in the composite samples, depending on parameters
such as particle density and A, one can expect to see a variety
of behaviors in the temperature dependent I-V curves.
The measured I-V characteristics for a representative composite
diode with Ag particle density of 5X10^ cm^-2 are
shown in Fig. 5. At high temperatures the I-V curves are
similar to those of the control samples. However, at about 120 K,
there is a clear plateaulike structure at a bias of 0.3 V, which
persists at lower temperatures. Since this feature is clearly
absent  in  the  control  samples,  we  can  attribute  this  signature 
to  electron  transport  at  the  low  barrier  Ag/InP  regions.  Fur¬ 
ther  proof  that  this  is  so  is  obtained  by  looking  at  the  char¬ 
acteristics  of  composites  wherein  the  deposited  Ag  particles 
were  alloyed  on  the  InP  surface  to  further  reduce  their  barri¬ 
ers.  The  I-V  characteristics  of  these  diodes  are  shown  in 
Fig.  6.  From  Eq.  (2),  we  can  see  that  the  currents  are  sensi¬ 
tive  to  the  value  of  A.  For  a  given  particle  density,  if  A  is 
made  large,  the  I-V  curves  can  be  expected  to  show  the 
influence  of  the  low  barrier  patches  at  a  higher  temperature. 
And,  for  a  given  A,  if  the  density  is  significantly  reduced,  the 
junction  current  will  be  dominated  by  the  low  barrier  patches 
only  at  relatively  lower  temperatures  [Eq.  (2)].  That  this  is 
indeed  the  case  is  seen  in  Fig.  6.  For  the  high  density  sample 
[Fig.  6(a)],  the  ohmic  effect  starts  to  show  up  at  as  high  a 
temperature  as  230  K  while  that  in  the  low  density  sample 
appears at a lower temperature of about 100 K [Fig. 6(b)].
Preliminary simulations using Eq. (2) of the experimental
I-V curves (Figs. 5 and 6) indicate that the effective patch
radius is around 7 nm, and the barrier heights are about 0.3
and 0.05 V for the as-deposited and alloyed particles, respectively.

All  of  the  aforementioned  results  are  qualitatively  consis¬ 
tent  with  Eq.  (2)  and  demonstrate  the  influence  of  nano- 
scopic  SBH  inhomogeneities  (nanoscopic  exclusions)  on  the 
electron  transport.  In  this  work,  by  deliberately  introducing 
the  inhomogeneities  with  good  size  control,  we  have  also 
been  able  to  demonstrate  the  influence  of  parameters  such  as 
the  particle  density  and  local  barrier  heights  on  transport 
properties  at  the  MS  junction.  Further  experiments  exploring 
other  possible  composite  structures  using  different  particle 
sizes  and  semiconductor  structures  are  underway. 

Finally, we comment briefly on the ideality factors of the
diodes. The ideality factor (n) for a Schottky diode is an
empirical quantity and is defined as $n = \beta\,\partial V_a/\partial(\ln I)$ for
$V_a > 3/\beta$, where $\beta = q/k_BT$. An ideal Schottky diode is
characterized by a unity ideality factor. However, most practical
diodes have ideality factors in excess of unity. Physical mechanisms
such as image force lowering, generation-recombination,
interface states, and SBH inhomogeneities have been
discussed as possible sources for the observed larger-than-unity
ideality factor in diodes. For a meaningful comparison
of the ideality factors for the different diodes used in this
work, the ideality factors were determined in the temperature
range from 300 to 250 K. In this temperature range the I-V
curves for all the diodes are semilogarithmic (see Figs. 4-6).
The control samples (Au/InP) show nearly ideal behavior,
with ideality factors in the range 1.02-1.05. However,
for all the composite samples, the ideality factor is invariably
larger than that of the control samples. For the low density
sample with alloyed Ag particles, the ideality factors were
typically 1.08-1.11. Somewhat larger values of 1.12-1.15
were found for the high density samples with as-deposited Ag
particles. The worst case is for the composites formed from
high density alloyed Ag particles. Here, both A and the density
are large, thereby significantly increasing the role of the
Ag/InP regions in electron transport across the MS interface.
In these samples, the ideality factors were typically 1.4-1.55,
which is significantly larger than the values determined for the
control samples. These observations suggest that the
larger-than-unity ideality factors usually observed in diodes
are largely due to the fact that most MS interfaces are, in
general, inhomogeneous. Similar observations have been made
in epitaxial MS interfaces which were deliberately made
inhomogeneous.
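
The extraction of an ideality factor from the semilogarithmic region can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names `ideality_factor` and `synthetic_iv`, the purely exponential test data, and all parameter values are hypothetical and are not taken from the measurements reported here.

```python
import math

K_B_OVER_Q = 8.617333e-5  # Boltzmann constant / elementary charge, in V/K


def ideality_factor(voltages, currents, temperature):
    """Estimate n = beta * dV/d(ln I) from a least-squares line of ln(I) vs V."""
    beta = 1.0 / (K_B_OVER_Q * temperature)  # q / (k_B T), in 1/V
    # Keep only the forward-bias region V > 3/beta, where ln(I) is linear in V.
    points = [(v, math.log(i)) for v, i in zip(voltages, currents)
              if v > 3.0 / beta]
    if len(points) < 2:
        raise ValueError("need at least two points with V > 3/beta")
    mean_v = sum(v for v, _ in points) / len(points)
    mean_y = sum(y for _, y in points) / len(points)
    slope = (sum((v - mean_v) * (y - mean_y) for v, y in points)
             / sum((v - mean_v) ** 2 for v, _ in points))
    return beta / slope


def synthetic_iv(n, temperature, saturation_current=1e-9, points=20, step=0.02):
    """Hypothetical test data (assumption): I = I_s * exp(beta * V / n)."""
    beta = 1.0 / (K_B_OVER_Q * temperature)
    voltages = [step * k for k in range(1, points + 1)]
    currents = [saturation_current * math.exp(beta * v / n) for v in voltages]
    return voltages, currents


volts, amps = synthetic_iv(n=1.1, temperature=300.0)
n_fit = ideality_factor(volts, amps, temperature=300.0)
```

For real data the fit window would have to be restricted further to the bias range where the curves are actually semilogarithmic, which is why the comparison above is limited to 250-300 K.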

IV.  CONCLUSIONS 

In summary, we have investigated the problem of electron
transport at the Au/InP MS interface in the presence of
nanoscopic barrier inhomogeneities. The investigated transport
regime pertains to an interesting situation wherein the
inhomogeneous regions are pinched off by the depletion potential
of the surrounding higher barrier region. To study this problem,
we have used a composite MS structure comprising a
known density of nanometer sized uniform Ag aerosol particles
on InP overlaid by a uniform Au film. Temperature
dependent I-V measurements have been performed to characterize
the electron transport at the composite MS interface.
The experimental results are explained using a simple model
for the current across the inhomogeneous MS junction. All
the Ag/InP regions have been assumed to be nearly identical
and noninteracting. Our results clearly demonstrate that the
electron transport at the MS interface is significantly affected
by low barrier regions even in the pinch-off regime. It is
suggested that the commonly observed larger-than-unity ideality
factors in MS diodes are largely due to an inhomogeneous
MS interface. Furthermore, our results indicate possibilities
of using such composite MS structures to explore the
physics and applications of nanoinjecting MS contacts.

Theoretical and experimental studies of the rate of decay of metastable structures are compared
quantitatively. The effect of decay mechanism, size, and periodicity of the structure on the rate of
decay is evaluated within both a coarse-grained step-based model and a continuum model. For
high-amplitude structures, the decay scales with size (N) and time as $(t/N^{\alpha})^{-\beta}$. The exponents α
and β depend on the mass transport mechanism. The size scaling is α=4 for locally conserved
diffusive flux and α=2 for locally nonconserved flux. The time scaling exponent is β=1/5 for
diffusion limited mass transport and 1/4 for step attachment limited mass transport. Experiments
were performed on metastable structures of controlled sizes 3-5 nm in height, prepared by direct
current heating on Si(111). Quantitative agreement with theoretical predictions of both scaling
(α=4.3±0.5, β=0.2-0.3) and absolute rate of decay was obtained. © 1996 American Vacuum
Society.


I.  INTRODUCTION 

As  the  size  scale  of  artificially  structured  devices  shrinks 
into  the  submicrometer  regime,  it  becomes  increasingly  im¬ 
portant  to  understand  their  structural  properties  on  an  atomic 
level.  In  this  work,  we  present  the  results  of  a  combined 
theoretical  and  experimental  study  designed  to  test  our  un¬ 
derstanding  of  the  stability  of  nanoscale  structures.  This 
work  was  based  on  an  understanding  of  surface  mass  trans¬ 
port  in  terms  of  the  addition  and  removal  of  material  from 
steps  on  surfaces.  In  the  following  sections  we  present  first  a 
description  of  the  theoretical  formalism  used  to  describe  the 
decay of nanostructures. We then present results on the fabrication
and decay of structures of a similar geometry on a
Si(111) surface.

II.  THERMODYNAMICS  OF  THE  STEP  MODEL 

Perturbations  of  a  thermodynamically  stable  surface  will 
eventually  relax  back  toward  equilibrium.  It  is  natural  to  as¬ 
sume  that  at  least  for  small  disturbances  the  relaxation  rate  is 
proportional  to  the  difference  in  free  energy  between  the  per¬ 
turbed  and  equilibrium  surface.  The  simplest  example  is  a 
system  above  the  roughening  temperature  with  isotropic  sur¬ 
face  tension  where  the  surface  free  energy  is  proportional  to 
the total area. In several classic papers, Mullins derived
the equations of motion for one-dimensional (1D) surface
profiles under different mass transport mechanisms. Because
of its simplicity, Mullins' isotropic continuum model has
been frequently used to compare with experiment, and the
relevant physical quantities such as surface tension and surface
diffusion rates are often determined from experimental
data, even when the underlying physical situation is rather
different from that considered by Mullins. One such example
is that of surfaces below the roughening transition temperature.
In this case, there is a singular orientation at which the
orientational dependence of surface free energy per unit area
has a cusp. This causes problems in a continuum treatment.
Moreover, the system in this case is certainly not isotropic.
In this article, we further develop some models proposed
previously by several authors for the relaxation of vicinal
surfaces below the roughening transition. The common features
of these models are the use of steps as the fundamental
entities and the change from partial differential equations for
a continuum model to a set of coupled ordinary differential
equations describing step positions.

For vicinal surfaces below the roughening temperature,
the terrace-step-kink (TSK) model provides an adequate description.
In this model, there is a creation energy for an
isolated step, and steps have repulsive interactions with each
other due to the no-crossing rule and an elastic or dipole
interaction if present. When steps basically remain
straight, a 1D model may be sufficient. This can be conveniently
described using the reduced free-energy density, defined
as the surface free energy per unit area projected onto
the low index facet plane. For a small miscut angle (and
therefore low step density), the reduced free-energy density
assumes the form

$f(s) = f^0 + \frac{\beta}{h}\,s + g s^3$, (1)

where $s = |\tan\phi|$ is the slope of the surface, β the creation
energy of an isolated step, h the height of the step, and g the
step interaction parameter, which is a function of the step
stiffness, the temperature, and any energetic interaction between
the steps. We make a mean field approximation by
assuming that the effective Hamiltonian or the free-energy
functional for a nonuniform system has the form of Eq. (1)
and depends only on the local surface slope. The simplest
way to proceed is to define the local slope on each terrace as
$h/l_n$, where $l_n = x_{n+1} - x_n$ is the width of the nth terrace. See
Fig. 1 for the labeling of steps and terraces. Hence we can
write

$H = L_y \sum_n f(l_n)\,l_n = L_y \sum_n F(l_n)$, (2)

where $F(l) = f(l)\,l = f^0 l + \beta + g h^3/l^2$. Following Rettori and
Villain, we define the chemical potential $\mu_n$ at a step edge
as the free energy change arising when the average step position
moves due to attachment of an adatom. Therefore,
from Eq. (2), we find

$\mu_n = \mu_0 + 2 g h^3 a^2 \left(\frac{1}{l_{n-1}^3} - \frac{1}{l_n^3}\right)$ (3)

by assuming a square lattice for the adatoms, where a is the
lattice constant, and $\mu_0$ is the chemical potential of the atom
in the solid. Note that the step creation energy β drops out of
the expression for the step chemical potential. This is because
Eq. (2) is based on the assumption that the number of
steps remains constant and thus only makes sense when the
steps are in the same direction (i.e., there is no annihilation
of steps of opposite signs).

Fig. 1. Illustration of the labeling of steps and terraces and kinetics coefficients.
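
The step chemical potential can be checked numerically against the Hamiltonian. The Python sketch below is a hedged illustration, not the authors' code. It assumes terrace widths $l_n = x_{n+1} - x_n$, steps that ascend toward larger x (so that attaching an adatom moves a step toward smaller x), and keeps only the interaction part $g h^3/l^2$ of F, since the other terms do not contribute. The function names are hypothetical.

```python
def terrace_widths(x):
    """Widths l_n = x[n+1] - x[n] for an ordered list of step positions."""
    return [x[n + 1] - x[n] for n in range(len(x) - 1)]


def interaction_energy(x, g, h):
    """Interaction part of H per unit step length: sum over terraces of g h^3 / l^2."""
    return sum(g * h**3 / l**2 for l in terrace_widths(x))


def excess_chemical_potential(x, g, h, a):
    """mu_n - mu_0 for the interior steps n = 1 .. N-2.

    Assumed convention: attachment moves a step toward smaller x, so
    mu_n - mu_0 = -a^2 dH/dx_n = 2 g h^3 a^2 (1/l_{n-1}^3 - 1/l_n^3).
    """
    l = terrace_widths(x)
    c = 2.0 * g * h**3 * a**2
    return [c * (1.0 / l[n - 1]**3 - 1.0 / l[n]**3)
            for n in range(1, len(x) - 1)]


# A step pushed toward its lower neighbor has a raised chemical potential.
positions = [0.0, 1.0, 1.9, 3.0, 4.0]
mu_excess = excess_chemical_potential(positions, g=2.0, h=0.5, a=0.3)
```

A uniform step train gives zero excess chemical potential for every interior step, which is the discrete statement that a uniformly tilted vicinal surface is stationary.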

Other  cases  are  more  complicated  and  introduce  new 
physical  considerations.  For  example,  at  the  tops  or  bottoms 
of  a  sinusoidal  profile  where  the  average  direction  coincides 
with  the  facet  direction  (and  hence  both  up  and  down  steps 
are  present),  two-dimensional  (2D)  islands  usually  form  on 
terraces. As the average positions of the step and antistep
move closer, they have a greater chance to annihilate one
another, therefore reducing the total length of the steps. Using
h/l as the step density is certainly not correct for this
case. Ozdemir and Zangwill studied a model for this case
which ignored the interaction term (the g term) between steps and
antisteps but still did not explicitly take into account any
changes due to the creation energy of the steps. That might
explain why they did not see the formation of true facets in
their model. A more general 2D step model that explicitly
takes into account this creation energy is currently being
investigated. However, here we concentrate both theoretically
and experimentally on the simpler, but still challenging,
problem of profiles consisting of steps of the same sign, and
therefore Eq. (2) is sufficient. Except for the orientation
along the facet direction for long wavelength perturbations at
small amplitude, we expect that different discretization methods
should yield results similar to those of the continuum
model.


III.  KINETICS  OF  THE  STEP  MODEL 

Having  defined  the  step  chemical  potential,  we  can  pro¬ 
ceed  to  study  the  relaxation  process  arising  from  different 
kinetic  assumptions  in  the  step  model.  Many  different  kinet¬ 
ics  have  been  studied  previously  by  different  authors,  but  no 
standard  terminology  has  emerged.  In  this  article,  we  divide 
the  possible  kinetic  processes  into  two  families,  depending 
on  whether  or  not  the  adatoms  on  each  terrace  obey  a  local 
conservation  condition. 

A. Kinetics with local adatom conservation:
Attachment/detachment limited and diffusion limited
cases


One standard treatment going back to the Burton-Cabrera-Frank
(BCF) model assumes that the two neighboring
step edges serve as boundary conditions for a diffusion
equation describing adatom diffusion on a given terrace.
The mass transport on each terrace obeys a local mass conservation
law. Originally, it was assumed that steps acted as
perfect sinks for adatoms. A more general treatment assuming
a finite attachment probability of the adatoms to the step
edges was given by Ozdemir and Zangwill. This problem is
usually solved under the quasistatic approximation. We discuss
here two special limiting cases: (a) the attachment/detachment
rate limited case, corresponding to a very small
attachment rate at the step edges compared with the adatom
diffusion rate, and (b) a diffusion limited mechanism (originally
considered by BCF), corresponding to fast attachment
rates and relatively slow diffusion.

For the attachment/detachment limited process, the rapid
diffusion ensures that adatoms have a well-defined and spatially
uniform chemical potential on each particular terrace.
The first order kinetics theory predicts that

$\frac{dx_n}{dt} = \kappa^{+}(\mu_n - \mu_n^{t}) + \kappa^{-}(\mu_n - \mu_{n-1}^{t})$, (4)

where $\mu_n^{t}$ is the chemical potential of an adatom on the nth
terrace, and $\kappa^{+}$ ($\kappa^{-}$) is the kinetic coefficient for adatom
attachment from the terrace above (below) the step. In the
quasistatic approximation, time derivatives in the adatom
concentration field are neglected, so we require that the net flux
onto any terrace from the two neighboring steps is zero. This
fixes the chemical potential on the terraces as
$\mu_n^{t} = (\kappa^{+}\mu_n + \kappa^{-}\mu_{n+1})/(\kappa^{+} + \kappa^{-})$. Thus from Eq. (3) and
Eq. (4), we find the equation of motion of the steps under an
attachment/detachment limited mechanism:

$\frac{dx_n}{dt} = \frac{2 g h^3 a^2 \kappa^{+}\kappa^{-}}{\kappa^{+} + \kappa^{-}} \left(\frac{3}{l_{n-1}^3} - \frac{3}{l_n^3} + \frac{1}{l_{n+1}^3} - \frac{1}{l_{n-2}^3}\right)$. (5)


In the opposite diffusion limited case, rapid adatom exchange
at the step edges causes the adatom concentration to
assume the local equilibrium value $c_n = c_{eq}\exp[(\mu_n - \mu_0)/k_BT]
\approx c_{eq}[1 + (\mu_n - \mu_0)/k_BT]$, where $c_{eq}$ and $\mu_0$ are
the equilibrium values for the completely relaxed surface. In
the absence of evaporation and deposition, and under the
quasistatic approximation, the adatom concentration on each
terrace varies linearly with position across the terrace. Thus
one can relate the adatom flux or current to the known values
at the steps. From local mass conservation we have

$\frac{dx_n}{dt} = \frac{D_s c_{eq} a^2}{k_B T} \left(\frac{\mu_n - \mu_{n+1}}{l_n} + \frac{\mu_n - \mu_{n-1}}{l_{n-1}}\right)$ (6)

as the equation of the step motion under the diffusion limited
mechanism.


B.  Kinetics  without  local  conservation:  Attachment/ 
detachment  limited  case 


A second general class of mass transport mechanisms
arises when adatoms can hop over the step edges quite easily.
In such cases, local conservation is not important since
contributions from many other terraces influence the adatom
density on a given terrace. Moreover, the adatom density
cannot have a singularity at the step edges. This picture is
physically relevant to the attachment/detachment limited
case with rapid and efficient surface diffusion. Because adatoms
can freely diffuse to any place on the surface, their
concentration and, therefore, the adatom chemical potential
assume a constant value for all terraces. In the absence
of evaporation and deposition, this constant equals $\mu_0$.
Therefore, from Eq. (4) we have

$\frac{dx_n}{dt} = 2 g h^3 a^2 (\kappa^{+} + \kappa^{-}) \left(\frac{1}{l_{n-1}^3} - \frac{1}{l_n^3}\right)$ (7)

as the equation of step motion under the attachment/detachment
limited mechanism without local conservation.
In the literature, the term attachment/detachment limited
does not consistently refer to cases either with or without
local conservation, so it is necessary to specify the conservation
condition to avoid confusion. Note that the formula for
this case is the same as for the evaporation-condensation
mechanism, where the constant chemical potential is determined
by that of the vapor phase.
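
The three kinetic limits can be compared side by side in a short Python sketch. It is an illustration under explicit assumptions rather than the authors' implementation: a periodic step train, widths $l_n = x_{n+1} - x_n$, step chemical potential $\mu_n - \mu_0 = 2gh^3a^2(1/l_{n-1}^3 - 1/l_n^3)$, and velocity laws of the forms $\kappa_{\rm eff}(2\mu_n - \mu_{n+1} - \mu_{n-1})$, $(D_s c_{eq} a^2/k_BT)[(\mu_n-\mu_{n+1})/l_n + (\mu_n-\mu_{n-1})/l_{n-1}]$, and $(\kappa^{+}+\kappa^{-})(\mu_n - \mu_0)$. All function and parameter names are hypothetical.

```python
def excess_mu(widths, g, h, a):
    """mu_n - mu_0 for a periodic step train; widths[n] = x[n+1] - x[n]."""
    c = 2.0 * g * h**3 * a**2
    # widths[n - 1] wraps around for n = 0 (periodic boundary conditions).
    return [c * (1.0 / widths[n - 1]**3 - 1.0 / widths[n]**3)
            for n in range(len(widths))]


def velocity_conserved_attachment(widths, g, h, a, k_plus, k_minus):
    """Attachment/detachment limited step velocities with local conservation."""
    mu = excess_mu(widths, g, h, a)
    m = len(mu)
    k_eff = k_plus * k_minus / (k_plus + k_minus)
    return [k_eff * (2.0 * mu[n] - mu[(n + 1) % m] - mu[n - 1])
            for n in range(m)]


def velocity_diffusion_limited(widths, g, h, a, ds_ceq_over_kt):
    """Diffusion limited step velocities; ds_ceq_over_kt = D_s c_eq / (k_B T)."""
    mu = excess_mu(widths, g, h, a)
    m = len(mu)
    pref = ds_ceq_over_kt * a**2
    return [pref * ((mu[n] - mu[(n + 1) % m]) / widths[n]
                    + (mu[n] - mu[n - 1]) / widths[n - 1])
            for n in range(m)]


def velocity_nonconserved_attachment(widths, g, h, a, k_plus, k_minus):
    """Attachment/detachment limited step velocities without local conservation."""
    return [(k_plus + k_minus) * value for value in excess_mu(widths, g, h, a)]


# One narrow terrace (index 0) in an otherwise regular train.
widths = [0.8, 1.2, 1.0, 1.0]
v_ad = velocity_conserved_attachment(widths, 1.0, 1.0, 1.0, 1.0, 1.0)
v_dl = velocity_diffusion_limited(widths, 1.0, 1.0, 1.0, 1.0)
v_nc = velocity_nonconserved_attachment(widths, 1.0, 1.0, 1.0, 1.0, 1.0)
```

In each case the narrow terrace widens, since the velocity of step 1 exceeds that of step 0, and a uniform train does not move at all. The mechanisms differ in how far the disturbance couples to neighboring steps, which is what produces the different scaling laws discussed in the next section.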


IV. DECAY OF A STEP BUNCH FOR THE STEP
MODEL

Although standard lithographic techniques provide a
method of preparing a sinusoidal perturbation involving
steps of both signs, this is not the simplest or cleanest physical
example with which to study step models because of the
problems noted above with the theoretical understanding of
step annihilation. Therefore in this article, we restrict our
study to systems with steps in the same direction, specifically,
the decay of a step bunch under different kinetic
mechanisms.

Because of the step-step repulsion [the g term in Eq. (2)],
a step bunch will spontaneously spread out, thus decreasing
its slope, to reduce the free energy. Since no steps are created
or destroyed in the process, the step creation energy β is
irrelevant here (note its absence in the various equations of
motion). Step movement is associated with mass transport,
and different transport mechanisms will yield different dynamical
behaviors. The purpose of this section is to derive
certain qualitative differences, specifically the scaling laws
implied by the different mechanisms, and compare these
with experiments.

First we study the motion of an isolated step bunch consisting
of N steps [(N − 1) terraces] bounded by infinite flat
terraces. In the attachment/detachment limited case, we assume
that the adatom chemical potential on the two flat terraces
equals the equilibrium value. The geometry is similar
to that considered by Ozdemir and Zangwill, except at the
two ends. Most of their conclusions for sinusoidal profiles
have direct analogs here. Most importantly, we obtain the
scaling behavior through a separation of variables. Consider
a "shape preserving" solution

$l_n(t) = A_n \tau^{\beta}$, (8)

where $\tau = t + t_0$, and the $A_n$ are constants depending only on
the system size and kinetic coefficients. It is easy to show
that β=1/4 for the attachment/detachment limited mechanism
(with and without local conservation) and β=1/5 in the
diffusion limited case. The meaning of "shape preserving"
here is that after the surface has acquired the shape preserving
profile, the profile of the system at any later time can be
obtained by multiplying each terrace width by a constant.

It is instructive to look at the continuum limit for this
problem. As an example, we present the conserved
attachment/detachment case, but the reasoning for the rest of
the cases is similar. We obtain the continuum equation from
our step equations by replacing differences involving the step
indices by derivatives. In many cases, the resulting equations
reduce to those derived in other ways by earlier workers, but
this derivation may put them on firmer theoretical ground. As
an example, for the attachment/detachment limited mechanism
with local conservation, the equation of motion for the
continuum model is

$\frac{\partial z(x,t)}{\partial t} = -B_1 \frac{\partial}{\partial x}\left[\frac{1}{|z_x|}\frac{\partial}{\partial x}\left(|z_x|\,z_{xx}\right)\right]$, (9)

where z(x,t) is the height of the surface, and $z_x$ is the partial
derivative of z with respect to the x variable. Using the scaling
form $z(x,t) = Z(x/t^{1/4}) = Z(u)$, from Eq. (9) we have

$uZ' = 4B_1 \frac{d}{du}\left[\frac{1}{Z'}\frac{d}{du}\left(Z'Z''\right)\right]$. (10)

The exponent 1/4 is consistent with the step model. We will
solve a similar scaling equation for the isotropic continuum
model later in this section. Here we just derive some scaling
rules from general considerations. The symmetry of the
problem requires $Z(-u) = -Z(u)$. Observe that if $Z_0(x/t^{1/4})$
is a solution of the problem, $\lambda Z_0(x/t^{1/4})$ is also a solution.
Thus we can use the same scaling function $Z_0$ to describe
bunches of different sizes, with the scale factor λ proportional
to N [i.e., $2\lambda Z_0(\infty) = Nh$]. Then we have the following
scaling law for the time evolution of the slope $z_x$ at the
center of the bunch:

$s_0 = \frac{\partial}{\partial x}\left[\lambda Z_0(x/t^{1/4})\right]_{x=0} = \frac{N h Z_0'(0)}{2 Z_0(\infty)\,t^{1/4}}$. (11)

JVST B - Microelectronics and Nanometer Structures

2802 Liu et al.: Relaxation of the step profile 2802

Fig. 2. Solutions of $A_n$, the prefactor governing the width of the terrace [Eq.
(8)], at the center of the bunch (n=N/2) for the conserved attachment/detachment
limited case as a function of bunch size N. The solid
line is a fit to $N^{-1}$.

Similar relationships hold for the discrete step models. By
numerically determining the $A_n$'s for different bunch sizes N,
we can now calculate the terrace width at the center of the
bunch as a function of time from Eq. (8). Figure 2 plots
$R_N = A_{N/2}$ (or the corresponding central prefactor if N is an
odd number) versus the bunch size N. Notice that $1/R_N$ is just
the prefactor for the power law decay of the slope of an
isolated step bunch. The result shows $R_N \propto 1/N$, as would be
expected; the agreement is better for large bunch sizes.
Therefore for the attachment/detachment limited mechanism
with local conservation, both the continuum model and the
step model predict that the slope at the center of the step
bunch scales as

$s_0(N,t) \propto N\,t^{-1/4}$. (12)

By similar approaches, we find that for all three cases of
step motion, aside from a short initial transition, the decay of
the slope at the center of an isolated step bunch obeys the
scaling law

$s_0(N,t) = m\,(t/N^{\alpha})^{-\beta}$, (13)

where m is a constant independent of the bunch size. This
scaling law is exact for the continuum model (or the step
models in the large bunch size limit) but is only an approximation
for the step models with finite bunch sizes. For the
attachment/detachment rate limited mechanism with local
conservation, we find β=1/4 and α=4. For the diffusion
limited mechanism, we find β=1/5 and α=4. For the
attachment/detachment rate limited mechanism without local
conservation, we find β=1/4 and α=2.
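
These exponent pairs can be cross-checked by power counting on the continuum limits. The Python sketch below rests on an assumption that is not stated in the text: each continuum equation is taken to have the form $z_t = O[z]$, with O containing m spatial derivatives in total and being homogeneous of degree d in z. A solution $\lambda Z_0(x/(\lambda^{d-1}t)^{1/m})$ then gives β = 1/m and α = m − d + 1 when λ is proportional to N. The (m, d) pairs listed are this sketch's reading of the three mechanisms, and the names are hypothetical.

```python
from fractions import Fraction


def scaling_exponents(n_derivatives, degree):
    """Return (alpha, beta) for z_t = O[z] with the given derivative count and degree.

    beta = 1/m follows from requiring x^m ~ t; alpha = m - d + 1 follows from
    the central slope lambda^(1 - (d - 1)/m) * t^(-1/m) with lambda ~ N.
    """
    beta = Fraction(1, n_derivatives)
    alpha = n_derivatives - degree + 1
    return alpha, beta


# (total number of x-derivatives m, degree of homogeneity d); assumed forms:
#   conserved attachment/detachment: d/dx[(1/z_x) d/dx(z_x z_xx)]  -> (4, 1)
#   diffusion limited:               d^2/dx^2 (z_x z_xx)           -> (5, 2)
#   nonconserved attachment:         z_x^2 z_xx                     -> (4, 3)
CONTINUUM_LIMITS = {
    "attachment/detachment, conserved": (4, 1),
    "diffusion limited": (5, 2),
    "attachment/detachment, nonconserved": (4, 3),
}

EXPONENTS = {name: scaling_exponents(m, d)
             for name, (m, d) in CONTINUUM_LIMITS.items()}
```

Under these assumptions the table reproduces the values quoted above: α = 4 for both locally conserved mechanisms and α = 2 without local conservation, with β = 1/5 only in the diffusion limited case.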

In the experiments described below, the surfaces always
have a finite miscut angle and step bunches are separated by
finite distances. The complete analysis is only possible numerically,
but for small modulations, we can linearize the
step motion equations. Recently, Bonzel and Mullins analyzed
the continuum equations for a surface with small sinusoidal
perturbation. As in this work, the systems they
considered do not involve annihilation of steps. As their
results have shown, the step bunch should decay exponentially
when the surface profile is in the linear region, with the
time constant dependent on both the average miscut angle
and periodicity of the modulation. Therefore in the decay of
an initially steep structure, the initial power law decay of the
step bunch should break down and cross over to exponential
decay at long times. We can obtain the time constants of the
decay from the linear expansion of the step motion equations
[Eqs. (5)-(7)]. For a modulation with periodicity L and average
terrace width l, the relaxation rate can be written as
[using appropriate prefactors from Eqs. (5)-(7)]

$1/\tau \sim l^{-4}[18 - 24\cos(kl) + 6\cos(2kl)] \to 3k^4$ (14)

for the conserved attachment/detachment case,

$1/\tau \sim l^{-5}[18 - 24\cos(kl) + 6\cos(2kl)] \to 3k^4/l$ (15)

for the diffusion limited case, and

$1/\tau \sim l^{-4}[6 - 6\cos(kl)] \to 3k^2/l^2$ (16)

for the nonconserved attachment/detachment case, where
$k = 2\pi/L$ is the wave number. In the long wavelength limit
(i.e., when the separation of the initial step bunches is much
larger than the average terrace width), these results produce
the familiar $k^4$ scaling for the two conserved cases and $k^2$
scaling for the nonconserved case. A full numerical solution
of the decay of periodic step bunches as shown in Fig. 3
confirms the crossover from a power law decay to an exponential
decay when the slope of the bunch has decreased to
about five times the average slope of the surface.

Fig. 3. Illustration of the decay of periodic step bunches from the numerical
calculation using the locally conserved attachment/detachment limited
mechanism. The number of steps in each bunch is 15.
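
The long wavelength limits of the lattice factors are easy to verify numerically. The Python sketch below is a hedged check, not part of the original analysis. It assumes that the conserved cases share the factor 18 − 24 cos(kl) + 6 cos(2kl), and it takes the nonconserved factor to be 6 − 6 cos(kl), which is what linearizing a velocity proportional to $1/l_{n-1}^3 - 1/l_n^3$ gives. The function names are hypothetical.

```python
import math


def conserved_lattice_factor(kl):
    """Lattice factor assumed for both locally conserved mechanisms."""
    return 18.0 - 24.0 * math.cos(kl) + 6.0 * math.cos(2.0 * kl)


def nonconserved_lattice_factor(kl):
    """Lattice factor assumed for the nonconserved attachment/detachment case."""
    return 6.0 - 6.0 * math.cos(kl)


def conserved_closed_form(kl):
    """Equivalent form 3 (2 - 2 cos kl)^2, which is manifestly non-negative."""
    return 3.0 * (2.0 - 2.0 * math.cos(kl)) ** 2


# Ratios to the leading small-kl behavior, 3 (kl)^4 and 3 (kl)^2 respectively.
kl_small = 0.05
ratio_conserved = conserved_lattice_factor(kl_small) / (3.0 * kl_small**4)
ratio_nonconserved = nonconserved_lattice_factor(kl_small) / (3.0 * kl_small**2)
```

Both ratios approach unity as kl tends to zero, and the closed form shows that every Fourier mode of a small modulation decays in the conserved cases.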

As a final note to the step models, we compare the step
motion equations obtained above with equilibrium fluctuation
results. Bartelt et al. analyzed the fluctuations of steps
on Si(111) which had been measured using reflection electron
microscopy (REM). It was shown that the dependence
of relaxation (or creation) time on the wave number of the
fluctuations satisfies an approximate $q^{-2}$ relation, with measured
mobility Γ≈5.5X10^ Å^3/s. This suggests that adatom
attachment/detachment is the main mechanism for step
motions. However, as was shown by Pimpinelli et al., this
result alone cannot distinguish whether the adatom diffusion
or the attachment/detachment is the limiting process. Under
the diffusion limited assumption, Pimpinelli et al. estimated
that $D_s c_{eq} \sim$ 10^ s^-1. For the attachment/detachment
limited assumption with local conservation, care must be
taken in comparing the kinetic coefficients $\kappa^{\pm}$ with the
mobility Γ as measured from the equilibrium fluctuation. Following
Bales and Zangwill and Pimpinelli et al., a formal
derivation of the mobility Γ can be obtained by extending the
BCF model to two dimensions and doing the linear perturbation
analysis under the quasistatic approximation. The
calculation is rather complicated and not particularly instructive.
Here we just quote the results. If the step motions are
mainly caused by the exchanges of atoms between neighboring
steps, then $2\kappa^{+}\kappa^{-}/(\kappa^{+} + \kappa^{-}) = \Gamma/(a^2 k_B T)$. If exchanges
of atoms between different positions on the same
step (e.g., due to the diffusion on the terrace roughly parallel
with the step edges) make important contributions to the
fluctuations of the step position, then we have
$\kappa^{+} + \kappa^{-} = \Gamma/(a^2 k_B T)$. The first scenario is suitable for extremely
long wavelength fluctuations, and the second one is
suitable at shorter wavelengths. The crossover length can be
determined by the ratio between the diffusion rate and the
attachment rate, which is still an outstanding question. In the
analysis of the experiments below, we assume there is no
asymmetry in the adatom attachment rate and that the equilibrium
fluctuation is caused by the second scenario
[$\kappa^{+} = \kappa^{-} = \Gamma/(2a^2 k_B T)$].

To put matters into perspective, we apply Mullins' isotropic continuum model
to the step bunch relaxation problem. Consider a 1D surface profile described
by $z(x,t)$. Mullins predicts that the evolution of the surface profile obeys
the following partial differential equation if the mass transport is through
surface diffusion:


$\frac{\partial z}{\partial t} = -B\,\frac{\partial^{4} z}{\partial x^{4}}$,  (17)

where

$B = \frac{D_s c_{eq}\, a^{4} h^{2} \gamma}{k_B T}$,  (18)


and $D_s$ and $c_{eq}$ have the same definition as in the diffusion limited
case. $\gamma$ is the surface tension, which is assumed orientationally
isotropic in the continuum model. Below the roughening transition, $\gamma$
cannot be derived from the Gruber-Mullins form of free energy. Nonetheless,
it is instructive to study this model and compare it with the experiments.

We study the evolution of a surface profile that connects two large terraces
at different heights; therefore, the profile satisfies the boundary condition
$z(\pm\infty, t) = \pm H/2$. We assume the following scaling form:

$z(x,t) = \lambda Z[x/(Bt)^{\beta}]$.  (19)


Fig.  4.  Shape  preserving  profile  determined  from  the  solution  of  the  scaling 
equation  for  the  isotropic  continuum  model  [Eq.  (20)]. 


Denote $u = x/(Bt)^{\beta}$; then from Eq. (17) we have

$\beta\,u\,Z'(u) = (Bt)^{1-4\beta}\,Z''''(u)$,  (20)

where $Z'(u)$ denotes the first order derivative of $Z$ with respect to $u$.
Thus a shape preserving profile must have $\beta = 1/4$. To determine $Z(u)$,
it is more convenient to consider the derivative of $Z(u)$ first because it
will vanish at plus and minus infinity. Writing $W(u) = Z'(u)$ leads to

$W''''(u) - uW'(u)/4 - W(u)/4 = 0$.  (21)

The assumption of a power series solution

$W(u) = \sum_{n=0}^{\infty} a_n u^{n}$  (22)

to Eq. (21) leads to the recursion relation

$a_{n+4} = \frac{a_n}{4(n+2)(n+3)(n+4)}$.  (23)

We can use the symmetry and boundary condition of the problem to find the
linearly independent solutions. The final result is

$W(u) = \sum_{n=0}^{\infty} \frac{(-1)^{n}\,\Gamma(3/4)\,(u^{2}/8)^{n}}{n!\,\Gamma(n/2+3/4)}$.  (24)

The coefficients are chosen so that $W(0) = 1$. $Z(u)$ can also be obtained,

$Z(u) = \sum_{n=0}^{\infty} \frac{(-1)^{n}\sqrt{8}\,\Gamma(3/4)\,(u/\sqrt{8})^{2n+1}}{n!\,\Gamma(n/2+3/4)\,(2n+1)}$.  (25)


Figure 4 shows the form of $Z(u)$. Equation (19) predicts that
$z_x(x,t) = \lambda (Bt)^{-1/4} W[x(Bt)^{-1/4}]$. $W(u)$ reaches its maximum
at $x = 0$ and since $W(0) = 1$, we have $z_x(0,t) = \lambda (Bt)^{-1/4}$.
Since $\lambda$ is proportional to the height difference between the two
terraces [$2\lambda Z(\infty) = H$] and $H$ is proportional to the number of
steps $N$ in the step bunch, we have the following scaling relation for the
maximum slope of the step bunch:

$z_x(0,t) = \frac{N h}{2 Z(\infty)}\,(Bt)^{-1/4}$,  (26)

where $h$ is the single step height and $Z(\infty) \approx 1.733$. Comparing
with Eq. (13), we see that the isotropic continuum model yields the same
scaling exponents as the discrete step model using the attachment/detachment
limited mechanism with local conservation.

Fig. 5. The time evolution of the steepest slope of the surface for a system
of H/L = 0.001, where H is the height of the step bunch and L is the system
size or periodicity, from numerically integrating the continuum equation.
The two solid lines are for comparison with the power law $t^{-1/4}$ for the
initial decay and exponential decay for long time limit.
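The series of Eqs. (24) and (25) can be evaluated directly. The following is a minimal Python sketch, not the authors' code: the function names `W` and `Z` and the truncation at 100 terms are assumptions made here for illustration, and the truncated alternating series is only reliable for moderate |u| (roughly |u| < 15 in double precision).

```python
import math

GAMMA_34 = math.gamma(0.75)  # Gamma(3/4), the normalization in Eqs. (24)-(25)

def W(u, terms=100):
    """Scaled slope profile, Eq. (24): a power series in u**2 / 8."""
    x = u * u / 8.0
    total = 0.0
    for n in range(terms):
        denom = math.factorial(n) * math.gamma(n / 2 + 0.75)
        total += (-1) ** n * GAMMA_34 * x ** n / denom
    return total

def Z(u, terms=100):
    """Scaled height profile, Eq. (25): term-by-term integral of W from 0 to u."""
    s8 = math.sqrt(8.0)
    total = 0.0
    for n in range(terms):
        denom = math.factorial(n) * math.gamma(n / 2 + 0.75) * (2 * n + 1)
        total += (-1) ** n * s8 * GAMMA_34 * (u / s8) ** (2 * n + 1) / denom
    return total

if __name__ == "__main__":
    # W(0) is normalized to 1; Z at large u approaches the plateau Z(infinity).
    print(W(0.0), Z(12.0))
```

Evaluating `Z` at a moderately large argument such as u = 12 gives a value close to the plateau $Z(\infty) \approx 1.733$ quoted in the text, and `W` changes sign away from the origin, which is the region of opposite slope visible in Fig. 4.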

On  the  other  hand,  predictions  of  other  characteristics  of 
the  decay  of  the  surface  profile  for  the  isotropic  continuum 
model  are  quite  different  from  the  step  model.  As  can  be  seen 
from  Fig.  4,  the  regions  that  connect  the  flat  terraces  with  the 
step  bunch  can  have  slopes  opposite  to  that  of  the  step 
bunch.  Physically,  if  the  direction  of  two  large  terraces  coin¬ 
cides  with  the  low  index  facet,  slopes  of  opposite  sign  cor¬ 
respond  to  steps  of  opposite  sign  (steps  and  antisteps).  There¬ 
fore  in  this  case,  the  isotropic  continuum  model  predicts  the 
creation  of  step  and  antistep  pairs.  In  contrast,  in  the  step 
model  it  is  assumed  that  there  are  no  extra  steps  or  antisteps 
created;  i.e.,  the  step  creation  energy  is  prohibitively  high. 
Thus the total number of steps is conserved. This is a more realistic picture
for surfaces well below the roughening temperature, as the experiment on
Si(111) will show.

To complete the study of the continuum model, instead of an isolated step
bunch, we study a periodic system with step bunches separated by a distance
$L$. This is of interest because in this case the final relaxed surface will
have a finite slope $H/L$. We perform a numerical integration of the partial
differential equation, Eq. (17). Figure 5 shows the evolution of the slope at
the center of the bunch (which corresponds to the maximum slope of the
surface) for a system with $H/L = 0.001$. The calculation shows the surface
assumes the shape preserving profile in Eq. (25) very quickly and retains
this profile until the slope decreases to within an order of magnitude of the
final slope. The decay rate obeys a power law approximately with exponent
−0.25 as in Eq. (26). As the slope decreases and step bunches widen and
overlap with one another, the power law breaks down. When the slope of the
step bunch becomes comparable to the average slope, the profile of the
surface is approximately sinusoidal relative to the average slope. The decay
of the slope of the bunch can then be better fit to
$H/L + C\exp(-t/\tau)$. This is consistent with the exponential decay rate of
a sinusoid in Bonzel and Mullins's continuum model.
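Because Eq. (17) is linear, the two regimes described above can be reproduced without a grid-based integration. The sketch below is an illustration under stated assumptions, not the numerical scheme used for Fig. 5 (which the text does not specify): it assumes an initially sharp bunch (all of the height $H$ concentrated at one point per period), sums the Fourier modes of the slope, each of which decays as $\exp(-Bk^4t)$, and uses arbitrary units $B = L = 1$. The names `center_slope` and `decay_time` are hypothetical.

```python
import math

def center_slope(t, B=1.0, L=1.0, avg_slope=0.001, modes=4000):
    """Slope at the bunch center at time t > 0 for a periodic, initially sharp bunch.

    The slope is the average slope H/L times (1 + 2 * sum over modes m >= 1),
    where mode m has wave number 2*pi*m/L and decays as exp(-B * k**4 * t).
    """
    k1 = 2.0 * math.pi / L
    s = 1.0  # m = 0 mode: the average slope, which never decays
    for m in range(1, modes + 1):
        s += 2.0 * math.exp(-B * (m * k1) ** 4 * t)
    return avg_slope * s

def decay_time(B=1.0, L=1.0):
    """Relaxation time of the longest-wavelength (sinusoidal) mode."""
    return 1.0 / (B * (2.0 * math.pi / L) ** 4)

if __name__ == "__main__":
    # Early times: effective power-law exponent from two widely spaced times.
    s1, s2 = center_slope(1e-10), center_slope(1e-8)
    print(math.log(s2 / s1) / math.log(1e-8 / 1e-10))
    # Late times: the excess over H/L follows a single exponential.
    print(center_slope(5e-3) - 0.001)
```

At early times the effective exponent is close to −1/4, and at late times the excess slope above $H/L$ is dominated by the single mode with relaxation time `decay_time()`, which corresponds to the fitted form $H/L + C\exp(-t/\tau)$.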

V.  EXPERIMENTS  ON  THE  RELAXATION  OF  STEP 
BUNCHES  ON  Si(111) 

Recently, experiments have been performed on the particular surface geometry
discussed before. In these experiments, we have investigated the thermal
relaxation of step bunched surfaces. These surfaces consist of bunches of
very closely spaced steps separated by terraces much wider than the
equilibrium step separation. We use direct current heating to produce these
metastable structures of a controlled size. The dramatic changes in surface
morphology produced by the use of direct current heating on the Si(111)
surface have been well documented, first by Latyshev et al. and since then by
many others. This step bunching effect on Si(111) is dependent on both the
direction of the applied current relative to the step edges and the
temperature of the surface. A uniformly stepped surface will become bunched
when direct current is applied in the step up direction at temperatures
around 1190 °C. The surface remains uniformly stepped, however, for step up
current for temperatures near 945 and 1245 °C. For direct current applied in
the step down direction, the opposite occurs. For the conditions we use to
obtain step bunching, the number of steps within the bunches and the
distances between bunches increase with increasing time of applied current.
Thus the length of time during which direct current is applied to the sample
is used to determine the size of the resulting structures produced on the
surface.

The experiments were performed in ultrahigh vacuum (base pressure of
$4\times10^{-[?]}$ Torr) using a homemade scanning tunneling microscope
(STM). We use two methods of heating the sample, resistive heating and
electron bombardment heating from a tungsten filament positioned behind the
sample. In the case of resistive heating, an optical pyrometer was used to
measure the sample temperature. For electron bombardment heating, the
resistance of the sample was monitored to measure the sample temperature. We
used a sample, 15×3×0.4 mm$^3$ in size, of nominally flat, n-type Si(111)
with a measured equilibrium step spacing of 1300 Å. The sample was placed
into vacuum without any chemical precleaning and degassed. This was followed
by a 1 min flash to 1275 °C using direct current applied in the step up
direction.

We studied the thermal decay of three different starting surfaces. To prepare
the initial step bunched surfaces, we used direct current, applied in the
step down direction, at a temperature of ~1260 °C, to bunch samples for 2, 1,
and 0.5 min. For these heating times, we obtained surfaces with structures
from ~50 to 28 Å in height. The voltage drop across the sample was ~8 V and
the current used ~6 A. The next step was then to relax these starting
surfaces. To avoid the effects of a direct current on the decay of the
surface structures, we used electron bombardment heating at a temperature of
930 °C (only a small current of 0.068 A was passed through the sample for
temperature measurement). For each starting surface, relaxations were done
for two different times. For the starting surfaces which were direct current
heated for 2, 1, and 0.5 min, relaxations were done for 30 and 120, 15 and
60, and 11 and 24 min, respectively. Times were chosen to produce a similar
amount of relaxation on the different initial surfaces. After each
relaxation, the sample was quenched and imaged at room temperature. The
pressure during heating was less than $6\times10^{-[?]}$ Torr.

Figures 6(a)-6(c) show 3D STM images of the three initial step bunched
surfaces (with structures created by heating with direct current for 2, 1,
and 0.5 min, respectively). Figures 7(a) and 7(b) show the initial surface of
Fig. 6(a) after relaxations for 30 and 120 min, respectively. There are two
things to note in these images. First, even after 120 min, the bunches have
only spread out a small amount. We are still in an early stage of the
relaxation. The second thing to note is the presence of the crossing steps on
the surface, whose numbers increase with increasing relaxation time. Both of
these factors had an effect on our choice of the relaxation parameter used to
measure the extent of decay. We considered several possibilities including
the largest terrace width and the root-mean-square (rms) height of the
surface. The problem with the largest terrace width as a relaxation measure
was the initial presence of crossing steps between the bunches. Because the
crossing steps are nearly parallel to the bunches, it is impossible to
distinguish between crossing steps which are located close to a bunch and
steps which are actually part of a widening bunch. The root-mean-square
roughness of the surface was found to be not very sensitive to the decay of
the surface structures in this early stage of the relaxation.

The  relaxation  measure  which  we  found  to  be  the  most 
useful  is  the  maximum  slope  of  the  bunches.  This  measure  is 
more  sensitive  to  the  decay  than  rms  roughness  in  this  early 
stage  of  the  relaxation,  but  insensitive  to  the  effects  of  cross¬ 
ing  steps.  For  each  heating  preparation,  the  maximum  slope 
of  the  bunches  was  measured  for  11  to  22  bunches  on  the 
surfaces.  The  slopes  were  then  averaged  and  plotted  as  a 
function  of  time  as  shown  in  Fig.  8.  The  error  bars  on  each 
point  represent  the  standard  deviation  of  the  mean.  As  is 
apparent  from  the  plot,  the  initial  slope  (and  thus  the  step- 
step  distance)  of  the  bunches  is  different  for  each  of  the 
starting  surfaces.  This  step-step  spacing  within  the  bunches 
decreases  as  the  number  of  steps  within  the  bunches  in¬ 
creases,  which  is  important  in  the  analysis  of  the  data.  As 
shown  above,  the  thermal  relaxation  of  the  surfaces  is  driven 
by  the  repulsive  step-step  interaction.  Since  the  initial  step- 
step  separation  determines  the  initial  driving  force  on  the 
surface,  comparison  of  the  relaxation  curves  for  the  different 
starting  surfaces  requires  setting  the  zero  of  time  for  the  ex¬ 
perimental  data  so  that  all  curves  have  the  same  slope  at 
t=0. 

In analyzing the experimental data, we considered predictions from the three
different relaxation mechanisms discussed earlier in this article. The first
two mechanisms assume a local conservation condition, in the two extremes of
attachment/detachment limited (fast diffusion but slow

Fig. 6. 50 000 Å × 50 000 Å STM images of the three different initial
surfaces with (a) a bunch size of 16 steps, a bunch-bunch separation of
22 300 Å, and an initial step-step separation within the bunches of 124 Å
(prepared by heating with dc in the step down direction for 2 min), (b) a
bunch size of 12 steps and a bunch-bunch separation of 10 900 Å (prepared by
heating with dc for 1 min), and (c) a bunch size of 9 steps and a bunch-bunch
separation of 7300 Å (prepared by heating with dc for 0.5 min).

attachment/detachment of adatoms to/from the step edges) and diffusion
limited (fast attachment/detachment rate but relatively slow diffusion). The
third mechanism does not require local adatom conservation and assumes that
the attachment/detachment of adatoms to/from the step edges is the limiting
factor in adatom motion. Since the numerical solutions [see Eq. (13)] predict
that the decay of the slope of


JVST  B  -  Microelectronics  and  Nanometer  Structures 




Liu  et  al.:  Relaxation  of  the  step  profile 




Fig. 7. 50 000 Å × 50 000 Å STM images of surfaces with an average bunch size
of 16 steps. Individual steps are a single interplanar spacing, 3.14 Å, high.
The starting surface, shown in Fig. 6(a), was prepared by heating with dc in
the step down direction for 2 min. This was followed by relaxations under
indirect heating for (a) 30 and (b) 120 min.

the bunches follows a scaling relation with respect to the number of steps in
the bunches, we first investigate if such a scaling relation exists in the
experimental data. We offset the time axis of the relaxation curves so that
all start out subject to the same initial driving force. We performed a three
parameter fit to obtain the best values of $\alpha$ and the time shifts for
data sets two and three. We find that the experimental data do indeed scale
with the number of steps in the bunches. A conservative error bar on $\alpha$
of ±0.5 was obtained by varying $\alpha$ about the best value of 4.3, and
finding the values at which two experimental data points moved outside their
error bars from the scaled curve. The measured value of $\alpha$ is
consistent with a conserved model of step motion for which $\alpha$ is
predicted to be 4.

To make quantitative comparisons between the three theoretical mechanisms and
the experimental results, we need values of the step-step interaction
coefficient $g$, the step mobility $\Gamma$, and the diffusion coefficient
for atoms on the terraces $D_s c_{eq}$. In general, the step-step interaction
coefficient can be determined by direct observation of the equilibrium
distribution of step-step spacings. REM observations of steps on Si(111) at
900 °C show that the steps interact with a repulsive interaction which decays
as the inverse square of separation, and a magnitude consistent with a
stress-mediated interaction. The value of the step interaction coefficient at
900 °C has been found to be 0.015 eV/Å$^2$, with an uncertainty of
approximately 30%. The kinetic parameters can be determined by direct
observations of the equilibrium fluctuations of the steps. REM observations
of step fluctuations at 900 °C show that fluctuations decay with a time
constant that depends on the square of their wavelength. The result has been
alternatively interpreted as due to step attachment/detachment limited
kinetics with a step mobility of $\Gamma = 5.5\times10^{[?]}$ Å$^3$/s, and as
due to diffusion-limited kinetics with a diffusion constant of
$D_s c_{eq} = 1\times10^{8}$ s$^{-1}$. These parameters were used in
numerical solutions of Eqs. (5)-(7), with $\Gamma = f(k^{+}, k^{-})$ as
discussed above, to make predictions on the characteristics of the decay of
the slope (see Fig. 3 for the illustration of the change of profile). The
numerical calculations were done for starting surfaces with bunch sizes
matching those of the experimental system, $N$ = 16, 12, and 9, and an
initial slope of 0.0253, the initial slope of all experimental data sets
after the time axis was reset. The resulting calculated relaxation curves
were then scaled to produce the curves shown in Fig. 9. The results show
quantitative agreement between the measured rates and the predicted rates,
well within the experimental uncertainties.

Fig. 8. Plot of maximum slope of the bunches vs time for the three
experimental data sets. Each data point is an average of approximately 11-22
measurements, and the error bars represent the standard deviation of the
mean.

Next we look to the rate of decay of the slope to try to distinguish between
the two mechanisms which require local adatom conservation. The relaxation of
the slope shown in Fig. 10 follows a power law with an exponent close to
−0.23 (−0.2 to −0.3 give reasonable fits). The solutions described above
yield time-scaling exponents of −1/4 and −1/5 for the attachment/detachment
limited and diffusion limited cases, respectively, and both lie within this
range. As in the case of the analysis of the step fluctuations, the data
therefore do not allow these two conserved mechanisms to be distinguished.

As a final test of the utility of the mesoscale theory, we compare these
experimental results to the previously discussed predictions of Mullins'
original isotropic continuum theory applied to our surface geometry. The
range of slopes investigated experimentally in this study was from 12 to 5
times the initial slope. In this range, we are in agreement with the
continuum prediction of $t^{-1/4}$ (Fig. 5). However, as mentioned earlier,
the surface profile which evolves in the continuum analysis contains the
unphysical prediction of the creation of antisteps on the surface, which was
never observed experimentally. In addition, we used our fit of the
experimental data to calculate the prefactor
$D_s c_{eq} a^{4} h^{2} \gamma_{isotropic}/k_B T$
($D_s c_{eq} = 10^{8}$ s$^{-1}$), obtaining an estimate of the effective
surface tension $\gamma_{isotropic}$ of 0.015 eV/Å$^2$. This result
effectively has no meaning since it cannot be identified with any physical
parameter in our system, which is below the roughening temperature.

Fig. 9. Plot of scaled slope vs time for the experimental data and results of
the numerical solutions. The three experimental data sets are plotted using
different symbols, solid circles for N=16, solid squares for N=12, and solid
triangles for N=9, and the error bars again represent the standard deviation
of the mean. The closely spaced points show the results of the numerical
solutions for the two conserved relaxation mechanisms [see Eqs. (5) and (6)].
For each mechanism, relaxation results were obtained for the three initial
surface configurations matching the experiment (initial slope of 0.0253, the
starting slope of all the data after appropriate time shifts) and then scaled
using the experimental best $\alpha$ of 4.3. The hollow squares with dots in
the centers denote the results for the conserved attachment/detachment
limited case, and the hollow circles with dots in the centers denote results
for the diffusion limited case. The error bar shown for each mechanism is the
error bar for the specific time for which it is plotted and was obtained from
the numerical solution results assuming an error in $g$ of 30%, an error in
$\Gamma$ of 50%, and an order of magnitude variation in $D_s c_{eq}$. The
error bar in each case is representative of the larger percentage variations
in slope observed for the time shown in the figure. The experimental results
are consistent with either of the two mechanisms which require adatom
conservation, and inconsistent with the mechanism which does not require
adatom conservation (not shown).

VI.  SUMMARY 

We have presented theoretical and experimental results which show the
applicability of coarse-grained models of step motion to the prediction of
the rate of decay of metastable structures of arbitrary shape. We have
considered the specific case of a structure consisting of step bunches with
steps of the same sign separated by wide flat terraces. Three distinct
mechanisms of step motion were considered. Numerical solutions, as well as
analytical solutions in the continuum limit, of the equations of step motion
for this geometry show that in the large amplitude (or large slope) regime
the decay has a power law dependence on time. A crossover to exponential
decay occurs when the slope of the structures becomes comparable to (within
about a factor of 5) the average slope of the surface, in agreement with
previous studies of the small amplitude regime. The decay of the structures
is shown to obey a size-scaling relationship that depends on the initial size
of the structure (number of steps in the bunch) independent of the spacing of
the bunches. Numerical solutions of an isotropic continuum model (i.e., one
in which there are no steps) give similar time and size scaling results.
However, the prediction of the shape evolution of the profile in the
isotropic continuum model is inconsistent with experimental observations, and
the prefactor for the rate of decay cannot be predicted on any physical
basis.

Fig. 10. Plot of the scaled experimental data with a power law fit (exponent
close to −0.23). We cannot distinguish between the two mechanisms which
require local conservation (attachment/detachment limited and diffusion
limited).

We have compared the theoretical results with the measured decay of
metastable step bunches of three different sizes fabricated using direct
current heating. The metastable structures consist of step bunches from 3 to
5 nm in height (step-step separations on the order of a couple hundred
angstroms) separated from one another by terraces 7300 to 22 000 Å wide.
Their decay demonstrates size scaling ($\alpha = 4.3 \pm 0.5$) as predicted
by the theory for step motion governed by local conservation of mass (i.e.,
atomic motion via rapid exchange across steps is not the governing component
of mass transport). The time scaling observed ($\beta \approx -0.2$ to
$-0.3$) is consistent with either a mechanism in which the rate limiting
process in step motion is diffusion on the terraces ($\beta = -0.2$) or
attachment and detachment of atoms at the step edges ($\beta = -0.25$). The
experimentally measured decay rate can be predicted quantitatively using
values for the parameters in the theory which were determined from
measurements of the equilibrium properties of the surface. The approach taken
in this work, which considers appropriate coarse-graining of kinetic
processes to remove the system-dependent details of atomic processes,
provides a useful and successful method of predicting surface mass transport.
The rather simple system studied here, decay of a single-component
quasi-one-dimensional metastable structure in the absence of external
perturbations, provided an excellent test of this approach. Extension of this
work to more complex systems including growth, chemical reaction, and full 2D
structures is feasible, and will present interesting theoretical and
experimental challenges.

Using a first-principles approach, we study Si 2p core-level shifts at water
and hydrogen covered Si(001)2×1 surfaces. After allowing for full relaxation
of the surface structures, core-level shifts are calculated including
core-hole relaxation effects. We find that dissociated water on the
Si(001)2×1 surface induces a core-level shift of 1.1 eV to higher binding
energies, in good agreement with experiment. In the case of the hydrogen
terminated Si(001)2×1 surface, calculated surface shifts are small (about
0.2 eV) and comparable to shifts of subsurface Si atoms. © 1996 American
Vacuum Society.


The characterization of silicon surfaces is an essential prerequisite to
understanding the early stages of the silicon oxidation process. In
particular, a considerable amount of effort has been focused on the water
covered Si(001) surface, because of the importance of wet oxidation in
silicon-based device technology. It is by now well established that H2O is
adsorbed dissociatively on Si(001)2×1 by saturating the free dangling bonds
of the clean surface with -OH and -H groups.

Core-level (Si 2p) photoelectron (PE) spectroscopy is used routinely on Si
surfaces and interfaces in an attempt to determine the microscopic
structures. This technique is sensitive to the local potential, providing a
measure of the chemical environment (or oxidation state) of the silicon
atoms. When H2O is adsorbed on Si(001), the Si 2p spectra show the appearance
of a peak shifted by about 0.8-1.0 eV to higher binding energies with respect
to the bulk peak. This peak is generally attributed to the surface silicon
atoms to which OH groups are bonded. Because of the well defined nature of
water covered Si(001) at saturation coverage, it is possible to calibrate the
intensity of this peak. In this way, estimates of the number of partially
oxidized silicon atoms at the Si(001)/SiO2 interface can be extracted from PE
spectra.

In previous work, we addressed Si 2p core-level shifts at the Si(001)-SiO2
interface using a first-principles approach. By making use of interface
models, we showed that the suboxide peaks, which are about 1 eV apart in PE
spectra, could be attributed to silicon atoms with a different number of
oxygen neighbors. The effect of further neighbors turned out to be
negligible. Calculated shifts were in good quantitative agreement with
experiment provided core-hole relaxation effects, which were found to account
for more than 50% of the shifts, were properly included.

Electronic mail: pasquarello@eldp.epfl.ch

In this article, we apply our approach to calculate Si 2p core-level shifts
at water and hydrogen covered Si(001) surfaces. In the case of water covered
Si(001), surface silicon atoms occur in an oxidation state Si$^{+1}$.
According to the first-neighbor dependence that resulted from our study on
the Si(001)-SiO2 interface, the core levels of these atoms are expected to
shift by about 1 eV. Agreement between theory and experiment in this case, in
which the structure is well known, would strengthen the picture which emerged
from the study of Si-SiO2 interface models. We complement this study by
calculating core-level shifts at the hydrogen terminated Si(001)2×1 surface.

The results in this work are obtained within the local density approximation
to density functional theory (DF-LDA). The surface geometries were obtained
by fully relaxing the atomic coordinates to minimize the total energy. This
was done by using the Car-Parrinello method, which provides the electronic
structure as well as the forces that act on the ions. Only valence electrons
are explicitly considered using pseudopotentials (PPs) to account for the
core-valence interactions. A norm-conserving PP is used for Si, whereas the O
atoms are described by an ultrasoft Vanderbilt PP. The Coulomb potential is
used for H. The electronic states were expanded on a plane-wave basis set.
Exchange and correlation were included using Perdew and Zunger's
interpolation formulas. A description of the method is given in Ref. 17. In
order to achieve good convergence in the electronic properties, we have used
a plane-wave cutoff of 16 Ry for the wave functions and of 150 Ry for the
augmented electron density. Our system contains a 2×2 surface unit of side
L = 7.65 Å (based on the theoretical equilibrium lattice constant of Si). The
dimension of the cell in the direction orthogonal to the surface is 31.7 Å,
containing ten layers of Si (~13 Å), the lower extremities being saturated
with hydrogen atoms. The Brillouin zone of our simulation cell was sampled
using only the Γ point. In the minimization process, all the surface atoms as
well as the first six Si layers were allowed to relax.

Fig. 1. Ball and stick model of the relaxed positions resulting from
dissociative water adsorption on Si(001). The labels for the distinct Si
positions correspond to Table 1.

The Si 2p core-level shifts were calculated both within
the initial-state approximation and including core-hole relaxation
effects. The difficulty of using a PP approach in which
core levels are not explicitly considered is overcome as follows.
Initial-state shifts are obtained in first-order perturbation
theory, by evaluating the expectation value of the self-consistent
local potential on the Si 2p atomic orbital.
Final-state effects are included by taking differences of
total energies according to the procedure of Ref. 20.
Two separate calculations are performed. In a first step,
the electronic ground state is determined. Then, the PP of a
given atom is replaced by another PP which simulates the
presence of a screened 2p hole in its core. Following these
procedures, relative core-level shifts can be determined. Because
of the use of a PP approach, the absolute values of the
core-level binding energies are not obtained. A more detailed
description of our procedure is given in Ref. 9. By comparing
DF-LDA shifts with experiment for a series of molecules,
an overall accuracy of 0.5 eV with respect to experiment
has been established. These tests span a wide range
of core-level shifts (several eV). Relative shifts for chemical
environments that are more similar are expected to be more
accurate, as in recent work on the clean Si(001) surface and
on the H-terminated Si(111) surface.

The surface geometry resulting from dissociative water
adsorption on Si(001) is shown in Fig. 1. At the end of the
relaxation process, we found a Si-Si dimer bond of 2.39 Å,
a Si-H bond of 1.54 Å, and a Si-O bond of 1.66 Å. The
Si-H and Si-O bonds form angles of 75° and 73° with the
surface plane, respectively. The Si dimer is tilted by only
1.5°, with the OH-bonded Si atom higher. The O-H bond is
roughly perpendicular to the dimer orientation and nearly
parallel to the surface. Overall, the structural parameters
we found are very close to those obtained in a previous LDA
calculation.

Table I. Calculated Si 2p core-level shifts at the water-adsorbed Si(001)
surface. Shifts are calculated in the initial-state approximation and including
core-hole relaxation (full shift). The shifts are given with respect to the bulk
value, obtained by averaging the shifts between the sixth and the eighth Si
monolayer. Shifts are given in eV. See Fig. 1 for the geometry.

                           Initial state    Full     Expt.
Si(1) (bonded to -OH)         -0.52        -1.09     -1.00^a, -0.9^b
Si(2) (bonded to -H)          -0.02        -0.08     -0.25^a, -0.3^b
Si(3) (second layer)           0.07         0.11
Si(4) (second layer)           0.02         0.03
Si(5) (third layer)            0.06         0.09
Si(6) (third layer)           -0.18        -0.18

^a Reference 4.
^b Reference 6.

Calculated Si 2p core-level shifts for this surface structure
are given in Table I. The shifts are given with respect to
a reference value which corresponds to the average shift of
atoms in the Si slab that are sufficiently distant (sixth to eighth
monolayer) from the surface and also far enough from the
hydrogen-terminated back surface. The Si atoms that present the
most significant shifts are those that are bonded to an OH
group. In the initial-state approximation, we found a shift of
-0.52 eV, which increased in magnitude to -1.09 eV when core-hole
relaxation was taken into account. For the other surface Si
atoms, the ones bonded to a H atom, we calculated a full
shift of -0.08 eV. These calculated values are in good agreement
with experimental data, which give shifts ranging from
-0.8 to -1.0 eV (Refs. 2-6) for the OH-bonded Si atom
and of -0.25 eV (Ref. 4) [-0.3 eV (Ref. 6)] for the H-bonded
one. The experimental values reported in Table I are taken
from Refs. 4 and 6, which are the only ones providing shifts
for both surface Si atoms.

In  Table  I,  we  also  give  calculated  shifts  for  atoms  in 
lower  lying  layers.  We  found  scattered  values  ranging  from 
-0.2 to +0.1 eV. Since all these values fall rather close to
each  other  as  well  as  to  the  bulk  peak,  it  is  rather  difficult  to 
distinguish  their  contributions  separately  in  the  PE  spectra. 
The  same  remark  holds  for  the  surface  Si  atom  bonded  to  H 
atoms.  The  experimental  value  reported  in  Table  I  is  the  re¬ 
sult  of  a  fitting  procedure  with  a  limited  number  of  degrees 
of  freedom;  in  our  opinion,  it  should  be  taken  as  the  effective 
contribution  due  to  several  Si  atoms  rather  than  to  the  single 
contribution  due  to  the  H-bonded  Si  atoms. 

We complement this study by considering the hydrogen-terminated
Si(001)2×1 surface. Figure 2 shows the relaxed
structure of this surface. We found an equilibrium geometry
defined by structural parameters in close agreement
with other LDA calculations in the literature. We found
Si dimer and Si-H bonds of 2.38 and 1.54 Å, respectively.
The Si-H bond forms an angle of 74° with the plane of the
surface.

Calculated core-level shifts for the hydrogen-terminated
surface are given in Table II. Overall, the shifts are rather
small. The H-coordinated Si shows a shift close to that found
for the dissociated water case. The shifts for subsurface Si
atoms are also similar. The experimental value is taken from
Ref. 25 and is again the result of a fit. Thus, the experimental
value reflects the effective contribution of all Si atoms
which present shifts differing from the bulk value. The
overall comparison between theory and experiment is satisfactory.

Fig. 2. Ball and stick model of the relaxed positions of the monohydride
Si(001)2×1 surface. The labels for the distinct Si positions correspond to
Table II.

Table II. Calculated Si 2p core-level shifts at the monohydride Si(001)2×1
surface. Shifts are calculated in the initial-state approximation and including
core-hole relaxation (full shift). The shifts are given with respect to the bulk
value, obtained by averaging the shifts between the sixth and the eighth Si
monolayer. Shifts are given in eV. See Fig. 2 for the geometry.

                           Initial state    Full     Expt.
Si(1) (dimer)                 -0.08        -0.16     -0.3^a
Si(2) (second layer)           0.00         0.00
Si(3) (third layer)           -0.22        -0.24
Si(4) (third layer)            0.07         0.08

^a Reference 25.

In conclusion, we have studied Si 2p core-level shifts at
water- and hydrogen-covered Si(001)2×1 surfaces. In the
case of the water-adsorbed surface, the surface Si atom
bonded to the OH group, nominally in a Si^1+ oxidation
state, presented a shift of -1.1 eV, in very good agreement
with experiment. Core-hole relaxation was found to account
for about 50% of this shift. These results support the
interpretation given to Si 2p core-level shifts at the
Si(001)-SiO2 interface, which assigned a ~1 eV shift for every
nearest-neighbor oxygen atom. In the case of the monohydride
Si(001)2×1 surface, calculated shifts are small and
agree with experiment.
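
The roughly 50% core-hole relaxation contribution quoted for the OH-bonded Si atom can be checked directly against the Table I entries. The short sketch below is only an illustration of that arithmetic: the dictionary and the function name `relaxation_contribution` are hypothetical helpers introduced here, and the numbers are copied from Table I rather than recomputed.

```python
# Si 2p shifts in eV copied from Table I: (initial-state shift, full shift).
TABLE_I = {
    "Si(1) bonded to -OH": (-0.52, -1.09),
    "Si(2) bonded to -H": (-0.02, -0.08),
    "Si(3) second layer": (0.07, 0.11),
    "Si(4) second layer": (0.02, 0.03),
    "Si(5) third layer": (0.06, 0.09),
    "Si(6) third layer": (-0.18, -0.18),
}


def relaxation_contribution(initial, full):
    """Split a full shift into its core-hole relaxation part.

    Returns (relaxation part in eV, relaxation part as a fraction of
    the full shift). The fraction is NaN when the full shift is zero.
    """
    relaxation = full - initial
    fraction = relaxation / full if full != 0 else float("nan")
    return relaxation, fraction


if __name__ == "__main__":
    for site, (initial, full) in TABLE_I.items():
        relaxation, fraction = relaxation_contribution(initial, full)
        print(f"{site:22s} relaxation = {relaxation:+.2f} eV "
              f"({fraction:.0%} of full shift)")
```

For the small subsurface shifts the fraction is not very meaningful, since the full shift itself is within the stated 0.5 eV accuracy of the method.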

New approach to preparing smooth Si(100) surfaces: Characterization
by spectroellipsometry and validation of Si/SiO2 interface properties
in metal-oxide-semiconductor devices

Wet chemical removal of thermally grown SiO2 layers on Si(100) substrates has been studied as a
function of the pH of the etching solutions in the range of -0.32 to 1.6 by adding controlled amounts
of H2SO4 to a 1:30 HF:H2O mixture. Characterization of the stripped Si(100) surfaces by
spectroellipsometry showed that the smoothest surfaces were obtained with a 1:0.50:30 HF
(49 wt %):H2SO4 (98 wt %):H2O etch with a pH of approximately 0.5. Electrical characterization of
metal-oxide-semiconductor (MOS) capacitors fabricated on these surfaces with oxide layers
prepared by remote plasma enhanced chemical vapor deposition showed that (i) the lowest density of
interface traps, D_it, (ii) the lowest tunneling currents, Jq, and (iii) the highest breakdown fields,
E_BD, occurred at the same pH value that produced the smoothest surfaces. In contrast, MOS
capacitors fabricated with high-temperature thermally grown oxides were not significantly affected.

©  1996  American  Vacuum  Society. 


I.  INTRODUCTION 

In devices with submicron feature sizes scaled down for
ultralarge scale integration (ULSI) circuit applications, the
complementary scaling of gate oxide thicknesses to ~6 nm
and less significantly reduces the influence of bulk oxide
defects and defect precursors on electrical performance and
reliability. This means that defect and defect-precursor bonding
arrangements at the Si-SiO2 interface will then play a
dominant role in determining device performance and reliability.
For devices with gate oxides less than ~6 nm thick,
the roughness of the Si surface prior to either thermal oxidation
or oxide deposition by remote plasma enhanced chemical
vapor deposition (RPECVD) is an important factor that
must be considered in relating performance and/or reliability
to the specific interface formation process. Considerable
attention must therefore be paid to all aspects of processing
that occur prior to the sequence of steps that form the
Si-SiO2 interface.

Solutions containing HF, NH4F, and H2O are commonly
used to remove sacrificial oxides and for other aspects of Si
device processing, including interprocess cleaning. For
Si(100) wafers that are used in ULSI devices, the two most
often applied steps in preoxidation processing are (i) an RCA
clean and (ii) a final etch in dilute HF to remove the sacrificial
chemical oxides generated in the RCA clean. It has been
demonstrated that these steps combine to roughen the
Si(100) surface. In marked contrast, Si(111) surfaces can
be  etched  to  a  nearly  atomically  flat  condition  using 
HF:NH4F:H20  solutions  with  a  pH^l.  Solutions  with  this 
pH  also  remove  thin  oxide  layers  and  leave  the  Si  surface 
hydrophobic. However, improvements in Si(100) surface
quality using similar pH values are at best marginal.

The accessible range of pH values for HF:NH4F:H2O
mixtures is from approximately 1.6 for 1:30 HF(49%):H2O
to 8 for NH4F. However, by controlling the pH through the
addition of strong acids (such as HCl), one can control the
relative amounts of species (H2F2, HF, HF2-, F-) in solution.
At low values of pH, the H2F2 and HF species predominate,
which may affect the etching process. In one such study,
HCl (0.5 mol/l) was added to various HF solutions to describe
the kinetics of the SiO2 etch rate as a function of solution
species and the net effect on hydrogen passivation of the
surface. However, the effect on surface smoothness and the
use of other acids with moderate oxidizing potentials were
not studied over a range of pH values. As seen in other
studies, the variation of pH has significant effects on surface
smoothness in Si(111). The influence of the concentrations
of these solution species with respect to etching different
Si surfaces clearly needs additional study, in particular to
determine which species or combination of species promotes
the smoothest Si(100) surfaces.

a) Electronic mail: aspnes@unity.ncsu.edu

This article presents a study of the influence of
HF:H2SO4:H2O mixtures with pH values between -0.33 and
1.6 on the surface roughness of Si(100) and on the electrical
properties of Si-SiO2 interfaces formed on these surfaces.
This work thereby extends etching studies on Si(100) to
significantly lower pH values than could be obtained for the
more conventional HF:NH4F:H2O mixtures, as well as
providing an oxidizing anionic species. The relative surface
roughness for different etching solutions has been determined
by measuring the pseudodielectric function
⟨ε⟩ = ⟨ε1⟩ + i⟨ε2⟩ by spectroscopic ellipsometry (SE) and
comparing these results with reference data (Ref. 8) obtained on
atomically flat Si(111) surfaces prepared by an RCA clean
followed by immersion in a 40% solution of NH4F in H2O.
The electrical characteristics of metal-oxide-semiconductor
(MOS) structures fabricated on these surfaces by thermal
oxidation and RPECVD were determined and compared with
the SE results. We found a good correlation between
SE-determined smoothness and MOS performance for the
RPECVD oxides, but none for the thermally grown oxides.

II.  EXPERIMENTAL  PROCEDURES 

Two inch p-type Si(100) wafers with a resistivity of 2-6
Ω cm were initially subjected to a conventional hot RCA
clean in which the sacrificial chemical oxides were removed
in 1:10 dilute HF:H2O. This was followed by a conventional
wet thermal oxidation at 950 °C during which 100 nm of
SiO2 was grown. The thermally grown oxide was then etched
off in different HF:H2SO4:H2O mixtures with pH values
between about -0.33 and 1.6 immediately before (i) optical
characterization by SE, and (ii) either (a) RPECVD or (b)
thermal oxidation to generate an oxide layer for a MOS device
structure. For the oxide removal step, etching solutions
with different pH values were obtained by adding 1-20 ml
H2SO4 (~98%) to a 5 ml:150 ml HF(49%):H2O mixture. pH
values are calculated from the system of equilibrium equations
presented by Warren (Ref. 9), using K_a2(H2SO4) = 0.012 and
the appropriate mass balance. Direct measurement of pH
with standard electrodes is not feasible due to their reaction
with HF. The samples were taken out of the etching solution
(at room temperature) after complete removal of the thermal
oxide, as evidenced by their surfaces becoming hydrophobic.
Ellipsometric data were obtained in the spectral range of
206-450 nm. Care was taken to maintain each etched surface
under a flowing dry N2 atmosphere before and during
SE measurement. These data were converted to ⟨ε⟩ form. In
another set of experiments, water and methanol rinses were
performed after the etching step to remove any residual fluoride
and/or carbon contamination. The etched Si(100) surfaces
prepared in this manner were also studied by SE.
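
The pH calculation mentioned above can be illustrated with a much cruder estimate than the full equilibrium system of Ref. 9. The sketch below is not the authors' procedure: it assumes that the first dissociation of H2SO4 is complete, that the second follows K_a2 = 0.012 as quoted in the text, that HF contributes negligibly to [H+] once H2SO4 is present, that volumes are additive, and that activities equal concentrations. The acid density of 1.84 g/ml and molar mass of 98.08 g/mol are assumed handbook values, and all function names are hypothetical.

```python
import math

# Values taken from the text.
KA2_H2SO4 = 0.012            # second dissociation constant of H2SO4
BASE_VOLUME_ML = 155.0       # 5 ml HF (49%) + 150 ml H2O
H2SO4_WEIGHT_FRACTION = 0.98

# Assumed handbook values (not given in the article).
H2SO4_DENSITY_G_PER_ML = 1.84
H2SO4_MOLAR_MASS_G_PER_MOL = 98.08


def sulfuric_molarity(added_ml):
    """Analytical H2SO4 concentration (mol/l), assuming additive volumes."""
    moles = (added_ml * H2SO4_DENSITY_G_PER_ML * H2SO4_WEIGHT_FRACTION
             / H2SO4_MOLAR_MASS_G_PER_MOL)
    total_litres = (BASE_VOLUME_ML + added_ml) / 1000.0
    return moles / total_litres


def estimate_ph(added_ml):
    """Rough pH of the etch after adding `added_ml` of concentrated H2SO4.

    First dissociation is treated as complete; the second is solved from
    (c + x) * x / (c - x) = Ka2, where x = [SO4^2-]. HF is ignored as a
    proton source, so the estimate is meaningless for added_ml = 0.
    """
    if added_ml <= 0:
        raise ValueError("estimate only applies when H2SO4 has been added")
    c = sulfuric_molarity(added_ml)
    b = c + KA2_H2SO4
    x = (-b + math.sqrt(b * b + 4.0 * KA2_H2SO4 * c)) / 2.0
    return -math.log10(c + x)


if __name__ == "__main__":
    for volume in (1.0, 2.5, 20.0):
        print(f"{volume:5.1f} ml H2SO4 -> pH ~ {estimate_ph(volume):.2f}")
```

Under these assumptions, 2.5 ml of added acid (the 1:0.50:30 mixture) gives a pH close to 0.5 and 20 ml gives a pH close to -0.3, in line with the range quoted in the text. At the highest acid concentrations the neglect of activity coefficients is a poor approximation, so the agreement should not be over-interpreted.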

Electrical characterization was performed by
capacitance-voltage (C-V) and current-voltage (I-V)
measurements on MOS capacitors fabricated on these surfaces.
For the first set of devices the wafers were transferred
to an ultrahigh vacuum compatible multichamber system less
than one minute after removing the thermally grown oxide
layer as described above. Thin oxide layers (~10 nm) were
prepared by a 300 °C two-step RPECVD process followed
by a 30 s rapid thermal anneal at 900 °C in a 0.1 Torr oxygen
ambient. For the second set of devices, 7.5 nm thermal
oxides were grown at 900 °C. The time between etching and
loading of the wafers into the thermal oxidation system was
approximately five minutes. As an additional control, ⟨ε⟩
spectra were obtained between 412 and 755 nm from Si
samples etched in the same solution as the plasma-processed
wafers. The gate oxide layer thicknesses were also determined
using ellipsometry, and cross checked using the data
obtained from C-V measurements. For the MOS capacitors,
a  350-nm-thick  metal  electrode  was  formed  by  vapor  depos¬ 
iting  Al.  Individual  capacitors  with  areas  of  6.3X10“"^  cm^ 
and  2.3X10”^  cm^  were  prepared  for  the  C~V  and  I-V 
studies,  respectively.  Areas  were  defined  by  standard  photo¬ 
lithography  and  wet-chemical-etching  techniques.  A  150-nm- 
thick  Al  film  formed  the  backside  contact  to  the  Si  wafer. 


Fig. 1. Spectral dependence of the real and imaginary parts of the
pseudodielectric function ⟨ε⟩ = ⟨ε1⟩ + i⟨ε2⟩ for three Si(100) surfaces after removing a
thermally grown oxide in HF:H2SO4:H2O etches with pH values of -0.32,
0.52, and 1.59. These are compared with the ⟨ε⟩ spectrum of atomically flat
H-terminated Si(111) (Ref. 8).

The Al depositions were followed by a postmetallization anneal
at 400 °C for 30 min in N2. Additional control capacitors
were made on the same type of Si(100) wafers without
any preoxidation processing; i.e., the wafers were not subjected
to the RCA clean, the rinse in the 1:10 dilute HF:H2O
etch, or the thermal oxidation and oxide removal in
HF:H2SO4:H2O solutions.

III.  EXPERIMENTAL  RESULTS 

Figure 1 shows ⟨ε⟩ spectra of three Si(100) samples obtained
after removing the 100 nm thermal oxide by etching
in solutions with pH values of -0.32, 0.2, and 1.6. A reference
spectrum (Ref. 8) obtained from an atomically flat
H-terminated Si(111) sample prepared by an RCA clean followed
by etching in a 40% NH4F solution is also shown for
comparison. Differences between the Si(100) and Si(111)
spectra are minimized for a pH of ~0.5. A simple measure
of effectiveness is the height of the 4.25 eV E2 peak in ⟨ε2⟩,
which for thin oxide layers decreases by 6.4 per nm of oxide from its
atomically flat Si(111) value of 48.3. For rough surfaces, the
corresponding decrease in peak height is 9.2 per nm. The peak
value of ⟨ε2⟩ is shown as a function of pH in Fig. 2. The data
reveal that the highest value of ⟨ε2⟩ occurs for a pH of about
0.5. Decreases in ⟨ε2⟩ at the E2 peak have generally been
attributed to (i) surface contamination by hydrocarbons, etc.,
and/or (ii) surface roughness. Surface contamination resulting
from HF and H2SO4 solutions has been studied by
Auger electron spectroscopy (AES) and x-ray photoelectron
spectroscopy. These studies showed the presence of relatively
small amounts of carbon and/or fluoride contamination
(in the percent range), which could be removed by a rinse
with methanol and water. To establish the extent to which
this contamination could contribute to these data, we also
show in Fig. 2 the difference between as-etched and
methanol/water-rinsed surfaces. The difference is consistently
about 1, indicating that the dominant reduction is due
to roughness. If ⟨ε2⟩ values were affected by organic impurities,
there would be an expected increase in ⟨ε2⟩ with
H2SO4 concentration due to oxidation of organic molecules.
However, ⟨ε2⟩ decreased at pH<0.5 (higher [H2SO4]), further
supporting a surface roughness explanation for the observed
⟨ε2⟩ values. We conclude that the smoothest Si(100) surfaces
are obtained for a pH of 0.5±0.1, and that surfaces prepared in
both lower and higher pH mixtures are rougher. We note that
the ⟨ε2⟩ peak of 41.1 obtained by Utani and Adachi with a
60:1 dilution of HF in H2O (pH≈1.8) is entirely consistent
with the trend established in Fig. 2.

Fig. 2. Peak values of ⟨ε2⟩ at 4.25 eV as a function of pH of etching solution
with (•) and without (O) a methanol-water rinse.

Figure 3 shows the calculated concentration of reactive
species in the etching solution, where the points a, b, c, d, and
e correspond to pH values of 1.6, 0.9, 0.52, 0.2, and -0.32,
respectively, for the data shown in Fig. 2. As can be seen, the
concentrations of HF and H2F2 remain nearly constant while
HSO4- and SO4^2- increase. At pH values below 0.9, only
HSO4- increases. Since sulfate anions are known to be oxidizers,
a possible mechanism for the observed increase in
smoothness is the selective oxidation of Si facets to SiO2,
which is then removed by HF. There are two important
points that support a selective surface oxidation/etching
model for surface smoothing. The first is that the same
experiments using HCl instead of H2SO4 to decrease pH did
not result in any improvement in surface roughness. Cl- has
significant etching power. Second, H2SO4 was seen not to
affect the etching rate dramatically, and thus not to be an
intimate part of the HF/SiO2 etching process. Figure 4 shows the
SiO2 etching rate as a function of added H2SO4 compared to
calculated etching rates based on concentrations of [HF] and
[H2F2]. As can be seen, pH values as low as that which
resulted in the smoothest Si(100) surface did not change the
SiO2 etch rate. This strongly suggests that the mechanism for
Si surface smoothing is not the effect of the anion on the
process of SiO2 etching by HF. It should be noted that at very
low pH values there is a moderate increase in the etch rate;
however, the ionic strength of this solution is very high and
deviations are not surprising. The effect of ionic strength is
difficult to study since conventional methods of adding
concentrated spectator ions (such as KCl) would have a detrimental
effect on the cleaning process.

Fig. 3. Neutral and ionic species in solution calculated from Ref. 9 for
etching solutions used in Fig. 2 as a function of added H2SO4. Points a, b, c,
d, and e correspond to solution pH values of 1.6, 0.9, 0.52, 0.2, and -0.32,
respectively, with point c resulting in the maximum ⟨ε2⟩ of Fig. 2.

Fig. 4. Measured etch rates of sacrificial oxides in 155 ml 1:30 HF (49%)
solutions as a function of added H2SO4 compared to calculated rates from
Ref. 5 (dashed line) based on the concentrations of [HF] and [H2F2].

As further evidence of the trends previously discussed,
Fig. 5(a) displays ⟨ε2⟩ values at 2.81 eV for the control
samples described above, i.e., for the Si(100) samples
stripped in the solutions used to prepare these surfaces for
plasma-deposited oxides. At 2.81 eV, ⟨ε2⟩ values for the
smoothest surfaces are lowest, providing a confirming
perspective on the results shown in Fig. 2.

The smoother surfaces as characterized by SE would be
expected to translate to increased performance of MOS
capacitors; this is explored for both PECVD and thermally
grown oxide gate dielectrics. The concentration of interface
defects (D_it) at midgap, determined by conventional
high-frequency and quasistatic C-V measurements averaged over at
least four devices, is shown in Fig. 5(b). For the
plasma-deposited oxides the values show a minimum for pH
values of about 0.5. For the thermal oxides the values are
lower by a factor of about 0.5 with respect to the
plasma-deposited oxides, and these values display a different
dependence on pH. For example, for the thermal oxides
there is very little difference in D_it for pH values of 0.5 and
1.8, whereas D_it is increased slightly for a pH value of
-0.32. Note further that the average value for the reference
thermal oxide devices without preoxidation processing
falls in the middle of the range of values for the thermal
oxides that were etched. The breakdown field (E_BD) data in
Fig. 5(c) show that MOS devices with plasma oxides display
lower E_BD values than those for MOS devices with thermal
oxides, and in addition display a pH dependence paralleling
that of D_it in the sense that the best performance (highest
E_BD) occurs at a pH of ~0.5. The devices with thermal
oxides display a much weaker dependence on pH; however,
paralleling their behavior, devices subjected to pH values
of 0.45-1.6 display improved performance
with respect to devices formed on wafers with no
preoxidation processing. Finally, as shown in Fig. 5(d), the
lowest tunneling current density (Jq) for a bias of 5 V in the
Fowler-Nordheim regime for devices with both thermal and
plasma oxides is obtained at a pH of ~0.5. Note further that
the dependences of D_it and Jq on pH for the
plasma-deposited oxides are essentially the same. Finally, the value
of Jq for the reference wafer is higher than those of the
thermal oxides that had been subjected to preoxidation
processing.

Fig. 5. Comparison of various surface, interface, and oxide data for etched surfaces and devices as a function of pH: (a) ⟨ε2⟩ values at 2.81 eV for wafers
etched in the same solution as the substrates for the thermal and plasma oxides; (b), (c), and (d): midgap D_it, breakdown field E_BD, and tunneling current Jq
values, respectively, of MOS capacitors fabricated on these surfaces with plasma and thermal oxides. The lines in (b), (c), and (d) labeled reference are MOS
capacitors with thermal oxides fabricated on surfaces not subjected to preoxidation etching.

IV.  DISCUSSION 

Our ellipsometric studies of Si(100) surfaces etched in
H2SO4:HF:H2O solutions with pH values between -0.32
and 1.7 indicate that the smoothest surfaces are obtained
with a 1:0.50:30 HF(49 wt %):H2SO4(98 wt %):H2O etch
with a pH of 0.52. However, the values of ⟨ε2⟩ at the E2 peak
for these surfaces were significantly smaller than that obtained
from atomically flat H-terminated Si(111) surfaces
prepared with an RCA clean followed by a final etch in a
40% NH4F:H2O mixture (Ref. 8). The best Si(100) surface had a
peak ⟨ε2⟩ value of 44.3, which can be modeled to correspond
to either 0.64 nm of oxide or 0.45 nm of surface roughness.
These smaller peak values of ⟨ε2⟩ are here interpreted as
showing that the Si(100) surfaces are rougher on an atomic
scale than optimized Si(111) surfaces. C-V and I-V
characterizations of MOS structures formed on these surfaces
were consistent with the SE measurements in the sense that
the best electrical performance was obtained when sacrificial
oxides were removed before device fabrication using
H2SO4:HF:H2O mixtures with a pH≈0.5, the same pH that
gave the highest values for the E2 peak of ⟨ε2⟩. As noted
above, the pH value of 0.5 falls outside the range accessible
with HF:NH4F:H2O solutions that are conventionally used to
prepare Si surfaces. Preliminary SE studies on Si(100)
samples with doping concentrations other than those used
above show similar trends in surface roughness with the pH
of the etching solution.
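
The conversion from an E2 peak height to an equivalent overlayer thickness can be approximated from the linear rates quoted in the results (a drop of 6.4 per nm of oxide and 9.2 per nm of roughness from the flat-Si(111) value of 48.3). The sketch below is only a first-order check, assuming those rates stay linear over the range of interest; the function name `equivalent_overlayer_nm` is a hypothetical helper, and the modeled values quoted in the text come from the authors' own fits rather than from this formula.

```python
# Values quoted in the text for the 4.25 eV E2 peak of <e2>.
E2_PEAK_FLAT_SI111 = 48.3       # atomically flat H-terminated Si(111)
DROP_PER_NM_OXIDE = 6.4         # peak decrease per nm of thin oxide
DROP_PER_NM_ROUGHNESS = 9.2     # peak decrease per nm of surface roughness


def equivalent_overlayer_nm(peak_value):
    """Interpret a measured E2 peak height as an equivalent overlayer.

    Returns the thickness (nm) that would explain the peak deficit if it
    were due entirely to oxide, or entirely to roughness, assuming the
    linear rates above. The two readings are alternatives, not a sum.
    """
    deficit = E2_PEAK_FLAT_SI111 - peak_value
    if deficit < 0:
        raise ValueError("peak exceeds the flat Si(111) reference value")
    return {
        "oxide_nm": deficit / DROP_PER_NM_OXIDE,
        "roughness_nm": deficit / DROP_PER_NM_ROUGHNESS,
    }


if __name__ == "__main__":
    # 44.3: best Si(100) surface; 41.1: 60:1 HF:H2O value cited in the text.
    for peak in (44.3, 41.1):
        result = equivalent_overlayer_nm(peak)
        print(f"peak {peak}: oxide ~ {result['oxide_nm']:.2f} nm, "
              f"roughness ~ {result['roughness_nm']:.2f} nm")
```

For the best Si(100) surface this linear estimate lands within a few hundredths of a nanometer of the modeled 0.64 nm (oxide) and 0.45 nm (roughness) figures given above.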

More generally, we found no improvement relative to
conventional etching in dilute 1:30 HF:H2O when the pH
was lowered by adding HCl instead of H2SO4. This result
establishes that pH is not the sole factor in either promoting
or reducing surface roughness on Si(100), but that specific
anions can also play a role. The SiO2 etching rate was not
affected by H2SO4 concentration, indicating that the sulfate
anion is not prevalent in the SiO2/HF etching mechanism.
Since H2SO4 is an oxidizer, there is the possible mechanism
of selective Si surface oxidation to SiO2 with subsequent
removal by HF. Importantly, there is a report of the oxidizer
H2O2 being added to the final step of a NH4OH buffered HF
etch (high pH), resulting in markedly smoother films as
detected by scanning tunneling microscopy. Strong oxidizing
agents such as HNO3 and HClO4 would be expected not to
be selective. The role of the sulfate anion and other mildly
oxidizing or kinetically hindered oxo-acids, as well as
electrically biasing the potential of the substrate, are currently
being investigated.

Finally, the results presented for thermally grown oxides
also demonstrate that even if Si(100) surface roughening
does occur in predeposition processing, the effects of this
roughening are largely removed during thermal oxidation. In
contrast, correlations between preoxidation wafer processing
and the performance of MOS devices involving thermally
grown oxides have been observed for Si(111), indicating a
quantitatively different relationship for surface smoothing
effects occurring during thermal oxidation of this surface. On
the other hand, the results for Si(100) plasma-deposited oxides
parallel those for Si(111), i.e., the pH of the etch used to remove
sacrificial oxides prior to plasma processing does have a
significant effect on MOS device performance. These
differences in behavior for devices with thermally grown and
plasma-deposited oxides for Si(100) and Si(111) substrates
indicate that considerable care must be exercised in comparing
the electrical performance of devices with thermally
grown and/or deposited oxides, in particular when attempting
to correlate trends in device performance with surface
roughness arising from predeposition surface treatments.
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Schwoebel  barriers  on  Si(111)  steps  and  kinks 
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Motivated  by  our  previous  work  using  the  Stillinger- Weber  potential,  which  shows  that  the  [211] 
step  on  the  relaxed  1X1  Si(lll)  has  a  Schwoebel  barrier  of  0.61  ±0.07  eV,  we  calculate  here  the 
same  barrier  corresponding  to  two  types  of  kinks  on  this  step — one  with  rebonding  between  upper 
and  lower  terrace  atoms  (type  B)  and  the  other  without  (type  A).  From  the  binding  energy  of  an 
adatom,  without  additional  relaxation  of  other  atoms,  we  find  that  the  Schwoebel  barrier  must  be 
less  than  0.39  eV  (0.62  eV)  for  the  kink  of  type  A  (type  B).  Such  a  bound  is  argued  to  be  a  robust 
feature  following  from  the  presence  of  rebonding  at  the  step  edge  or  kink  site.  From  the  true  adatom 
binding  energy  we  determine  the  Schwoebel  barrier  to  be  0.15 ±0.07  eV  (0.50±0.07  eV)  for  the 
kink  of  Type  A  (B).  The  decrease  in  the  Schwoebel  barrier  is  roughly  consistent  with  our  previous 
estimates  of  its  upper  bound  of  0.05  eV.  However,  as  the  true  binding  energy  plots  show 
discontinuities  due  to  significant  movement  of  atoms  at  the  kink  site,  we  speculate  on  the  possibility 
of  multi-atom  processes  having  smaller  Schwoebel  barriers.  ©  1996  American  Vacuum  Society. 


I.  INTRODUCTION 

The Schwoebel barrier, which was introduced as the additional barrier for adatom diffusion over a step edge from the upper to the lower terrace, has been a subject of current interest for its influence on the growth of a singular (flat) surface. It was pointed out by Villain that in the presence of such a barrier, growth by step flow is stable only if the surface is sufficiently vicinal, with possible instabilities setting in during the growth of a flat surface. It is now accepted that the barrier leads to a coarsening of the evolving surface morphology. The present study is motivated by recent observations of another kind of instability on the high temperature 1×1 phase of Si(111): the reversible step bunching instability during sublimation. As these experiments can be reinterpreted in the presence of the Schwoebel barrier, we summarize here our previous calculations of this barrier for straight (high symmetry) steps and present results for kinked steps, both of which use the empirical Stillinger-Weber (SW) potential. The use of this potential here (and in our previous study) has been motivated by the fact that features that follow from changes in coordination number (of the adatom probing the potential energy topography) are expected to survive even if the details of the empirical potential change. We attempt to identify such features here.

Table I summarizes our previous results. The straight (high symmetry) [211] and [112] steps and other step orientations are shown in Fig. 1. Note that the [211] step shows a large Schwoebel barrier of 0.61±0.07 eV. However, an analysis of experimental data on the electromigration of steps using a diffusion equation showed that the upper bound on the Schwoebel barrier (in a particular limit of the equation parameters) was very small (0.05 eV). Therefore, we calculate here the Schwoebel barrier for unit depth kinks on the [211] step to determine whether it continues to be large.

II.  MOLECULAR  DYNAMICS  METHOD 

Using the SW potential, diffusion barriers are determined by mapping the adatom potential energy as a function of the (x,y) position of the adatom [in the (111) plane] for two types of kinks on the [211] step: type A [see Fig. 2(a)], which has rebonding of atoms along the kink site, and type B [see Fig. 2(b)], which has, at the kink site, an upper terrace atom rebonded to the lower terrace. Here, the adatom is simply an additional atom (not belonging to the bare 1×1 Si(111) surface) that probes the potential energy topography. It must be noted that the rebonding in these configurations (and in our previous study on straight steps) is similar to that on the Si(001) surface, whose reconstruction energy predicted by the SW potential (0.85 eV) is consistent with ab initio calculations. The energies (per step edge atom) of the straight [211] and [112] steps (configurations of type A) are reproduced by the SW potential in the same order (0.72 eV and 0.62 eV, respectively) as in tight binding calculations (0.70 eV and 0.38 eV, respectively). Moreover, the kink energy of the type A configuration studied here (SW prediction: 0.22 eV) is roughly consistent with that derived from experiments (0.23 eV) on fluctuating steps on 1×1 Si(111). The type B configuration, however, has a very small kink energy (SW prediction: approximately -1 meV). In our previous study on straight steps, configurations of type B were neglected since the step ([112]) that allowed for this structure would have very large step-step interactions, inconsistent with experimental estimates. However, the large interactions were due to the presence of one rebonding atom per lattice constant along the step edge. The configuration of type B is nevertheless considered here since, besides being energetically favorable according to the SW potential, at low kink densities, i.e., for a nearly straight [211] step, the number of such rebonding atoms per unit length along the step edge would be correspondingly low and therefore the expected step-step interactions would be smaller. The adatom potential energy V has been computed as the minimum potential energy of the system with the interacting adatom minus that with the adatom at infinity (noninteracting), so that bound configurations have negative V.

Table I. V is the adatom potential energy. Shown here is its value at the relevant minimum and saddle points. Global minimum in V = -3.31±0.05 eV.

Configuration      V at saddle point (eV)   ⇒ Diffusion barrier (eV)   ⇒ Schwoebel barrier (eV)
Si(111) surface    -2.34±0.05               0.97±0.07
[211] step         -1.73±0.05               1.58±0.07                  0.61±0.07
[112] step         -2.18±0.05               1.13±0.07                  0.16±0.07
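The barrier columns of Table I follow from the tabulated saddle-point energies by simple subtraction. The short Python sketch below reproduces that arithmetic; the function and variable names are illustrative choices and not part of the original work.

```python
# Saddle-point and minimum energies transcribed from Table I (eV).
V_GLOBAL_MIN = -3.31
V_SADDLE = {
    "Si(111) surface": -2.34,
    "[211] step": -1.73,
    "[112] step": -2.18,
}


def diffusion_barrier(v_saddle, v_min=V_GLOBAL_MIN):
    """Energy needed to climb from the global minimum to the saddle point."""
    return v_saddle - v_min


def schwoebel_barrier(v_saddle_step, v_saddle_flat=V_SADDLE["Si(111) surface"]):
    """Extra barrier at a step relative to diffusion on the flat surface."""
    return v_saddle_step - v_saddle_flat


# (diffusion barrier, Schwoebel barrier) for each configuration, rounded to 0.01 eV.
barriers = {
    name: (round(diffusion_barrier(v), 2), round(schwoebel_barrier(v), 2))
    for name, v in V_SADDLE.items()
}

for name, (e_diff, e_schwoebel) in barriers.items():
    print(f"{name:16s} diffusion = {e_diff:.2f} eV, Schwoebel = {e_schwoebel:.2f} eV")
```

By construction the flat surface has a Schwoebel barrier of zero, which is why that cell of the table is empty.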

Fig. 1. One bilayer of the Si(111) surface consisting of the upper monolayer (gray) and lower monolayer (black). The figure shows the threefold and reflection symmetry of this surface: steps running along directions with equal θ are identical; the directions θ = 0° and θ = 30° are marked. (Borrowed from Ref. 13.)

Standard molecular dynamics (MD) procedures of integrating Newton's law (with dissipation to reduce temperature) and the steepest descent equations have been used to determine the minimum potential energy of the system. These routines determined the adatom potential energy to a precision of 10“^ eV. The (x,y) coordinates of the adatom are fixed during the integration process. The system consisted [as before (Ref. 10)] of six bilayers of Si(111) in an MD cell, the bottom three layers of which are fixed at bulk lattice coordinates throughout the calculation. The system size along the x axis (l_x) (which was parallel to the [211] step edge) was 5\ai and along the y axis (l_y) was 3|a2. Periodic boundary conditions along the x axis identified the points (0,0,h) (h is the step height) and (0,l_y,0), and those along the y axis identified the points (0,k,0) (k is the kink depth) and (l_x,0,0). These boundary conditions made it possible to have exactly one vicinal step with one kink (and no antikinks) in the MD cell. As before, however, the atoms on the (x,y) boundaries were held fixed during the computation of V to prevent the entire configuration from shearing, particularly when the adatom is moved away from a deep minimum. The kinks were roughly in the middle of the cell consisting of movable atoms [see Figs. 3(a) and 4(a)]. As the (x-y) size of this cell is larger than that used previously in studying the system size dependence of V on the Si(111) surface, and since it is more square in shape, we expect the error in V due to finite size effects to be smaller than before (0.01 eV). It must be additionally noted that with the system size used here (with the adatom absent), the kink energy was within 6 meV (4 meV) of that calculated previously for the kink of type A (type B).
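The idea of minimizing the energy while the adatom's (x,y) position is held fixed can be illustrated with a deliberately simplified sketch. The Python code below relaxes only the adatom height by steepest descent on a hypothetical Morse-like toy energy; this toy function, its parameters, and the step size are assumptions made for illustration and are not the SW potential or the authors' MD routines, in which the substrate atoms also relax.

```python
import math


def toy_adatom_energy(x, y, z, depth=3.0, stiffness=1.5):
    """Hypothetical corrugated Morse-like energy (eV); NOT the SW potential."""
    # Equilibrium height varies with lateral position to mimic surface corrugation.
    z_eq = 1.0 + 0.2 * math.cos(2.0 * math.pi * x) * math.cos(2.0 * math.pi * y)
    return depth * ((1.0 - math.exp(-stiffness * (z - z_eq))) ** 2 - 1.0)


def relax_adatom_height(x, y, z_start, rate=0.05, tol=1e-8, max_steps=10_000):
    """Steepest descent in z only; (x, y) stay fixed throughout."""
    z = z_start
    h = 1e-6  # step for the central-difference estimate of dV/dz
    for _ in range(max_steps):
        slope = (toy_adatom_energy(x, y, z + h) - toy_adatom_energy(x, y, z - h)) / (2.0 * h)
        if abs(slope) < tol:
            break
        z -= rate * slope  # move downhill along z
    return z, toy_adatom_energy(x, y, z)


# Relax the height at one fixed lateral position of the toy surface.
z_min, v_min = relax_adatom_height(0.25, 0.0, z_start=1.4)
print(f"relaxed height = {z_min:.4f}, energy = {v_min:.4f} eV")
```

Repeating such a constrained relaxation over a grid of (x,y) positions yields an energy map of the kind from which saddle points and barriers are read off.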

Fig. 2. The two kinks studied here: type A has atoms rebonded along the kink site whereas type B has, at the kink site, an upper terrace atom rebonded to the lower terrace. On both terraces the upper monolayer is shown in grey and the lower monolayer in black. (Borrowed from Ref. 13.)

Fig. 3. [211] kink of type A. (a) Top view of the kink with rebonding along the kink site, with the upper terrace atoms drawn larger than those on the lower terrace. (b) The corresponding adatom potential energy derived from the local geometry (V_lg). (c) The true adatom potential energy (V). In both plots, contours are separated by 0.2 eV, with the minima, saddle points, and maxima marked (and sometimes labeled) by +(m), *(s), and ×(M), respectively. Figures in parentheses are the corresponding values in eV. Contours ≥ -1.9 eV in (b) and ≥ -2.0 eV in (c) are marked with dashed lines.

Fig. 4. (a) Top view of the kink with an atom of the upper terrace (larger atoms) rebonded to the lower terrace. (b) The corresponding adatom potential energy derived from the local geometry (V_lg). (c) The true adatom potential energy (V). In both plots, contours are separated by 0.2 eV, with the minima, saddle points, and maxima marked (and sometimes labeled) by +(m), *(s), and ×(M), respectively. Figures in parentheses are the corresponding values in eV. Contours ≥ -1.9 eV in (b) and ≥ -2.0 eV in (c) are marked with dashed lines.

The MD procedures began with the initial configurations for each (x,y) position of the adatom corresponding to the relaxed adatom-free structures. The z coordinate of the adatom was then varied in small steps over a wide range to roughly determine the point (z0) at which its potential energy is the smallest. With this z coordinate initialized at z0, the integration procedures were followed, first with only the adatom relaxing while the other atoms remained fixed. The minimum of the potential energy reached this way (V_lg) depends only on the local geometry of atoms around the adatom. The atoms that were held fixed are then allowed to relax together with the adatom to recompute the minimum, which is now the true adatom potential energy (V). With the kink roughly in the middle of the region explored [see Figs. 3(b), 3(c), 4(b), 4(c)], V and V_lg are computed on a rectangular grid with the spacing between points being ai/16 (^2/^^) along the x (y) axis for a total length of 2^ai {I la 2). An interpolation scheme applying periodic boundary conditions along the x and y axes, similar to that used in the simulations, was used to construct the contour plots of Fig. 3. From our previous study (Ref. 10) the error in V due to (the same) finite grid size and a similar interpolation scheme was estimated to be ±0.05 eV. We therefore assume here that this error remains the same. Being much larger than the errors due to finite size effects, it is taken to be the error bar in V. Barrier values, being differences, are therefore estimated to have an error bar of ±0.07 eV.
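The ±0.07 eV figure is consistent with combining two equal, independent errors in quadrature. That is an assumption about how the estimate was formed, since the text states only that barriers are differences of two values of V:

```latex
\sigma_{\text{barrier}}
  = \sqrt{\sigma_V^{2} + \sigma_V^{2}}
  = \sqrt{2}\,\times 0.05\ \text{eV}
  \approx 0.07\ \text{eV}
```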


J.  Vac.  Sci.  Technol.  B,  Vol.  14,  No.  4,  Jul/Aug  1996 


2821 


Kodiyalam,  Khor,  and  Das  Sarma:  Schwoebel  barriers  on  Si(111) 




III.  DISCUSSION 

It must be noted that the SW potential was constructed to reproduce only the bulk diamond and liquid properties of silicon. No additional tuning to the (111) surface/step properties has been carried out. Specifically, this potential does not reproduce the correct energetics for the (111) surface configurations with adatoms as compared to ab initio calculations. However, as mentioned previously, it satisfactorily predicts energy differences when rebonding occurs, as was the case with the Si(001) surface, the relative energies of the straight [211] and [112] steps, and the energy of the [211] kink of type A. These observations suggest that the energetics predicted by this potential are reliable when atoms undergo a change in their coordination number. We use this idea in an attempt to identify the robust features of this study: those that follow from changes in adatom coordination number. As the functional form of the SW potential does not provide an independent measure of atomic coordination, we assume here [as before (Ref. 10)] that the potential energy (V_lg) obtained from relaxing only the adatom, over the relaxed (but fixed) adatom-free configurations, is a good measure of the coordination number of the adatom. Although very similar to the true adatom potential energy (V), it helps to identify features that follow directly from the local geometry of atoms around the adatom. Hence, assuming (for barrier calculations on the Si(111) surface/step/kink configurations) that the SW potential is more reliable when it predicts an absence of additional relaxation (due to the presence of the adatom), which if present changes the potential energy topography significantly, it follows [as argued previously (Ref. 10)] that features that follow from a strong correlation between V_lg and V are robust, i.e., would survive changes in details of the empirical potential used and are expected to be reproduced in more satisfactory ab initio or tight binding calculations. Specifically, we had assumed previously (Ref. 10) that if the saddle point determining the diffusion barrier in V_lg was nearly at the same position as that in V, then the diffusion/Schwoebel barrier is a robust feature. Even if the positions of the relevant saddle points differ, there is nevertheless a bound on the true Schwoebel barrier that follows purely from V_lg [this is the difference between V_lg at the barrier-determining saddle point on the step or kink configuration and V at the diffusion-barrier-determining saddle point on the free Si(111) surface]. It is a strict upper bound on the Schwoebel barrier (that follows from V on the step or kink configuration) since any relaxation that occurs during the computation of V can only reduce the relevant saddle point energy. As this bound is completely independent of the details in V, we now argue that it must be a robust feature.
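The bound argued for above can be written compactly. The symbols E_S, s_step, s_flat, and the superscript "bound" are notation introduced here for illustration only and do not appear in the original text:

```latex
\begin{aligned}
E_S &= V(s_{\text{step}}) - V(s_{\text{flat}}), \\
E_S^{\text{bound}} &= V_{lg}(s^{lg}_{\text{step}}) - V(s_{\text{flat}}), \\
V(x,y) &\le V_{lg}(x,y)\ \text{for all } (x,y)
  \;\Longrightarrow\; E_S \le E_S^{\text{bound}} .
\end{aligned}
```

Here s_step and s^{lg}_step denote the barrier-determining saddle points of V and V_lg on the step or kink configuration, and s_flat is the corresponding saddle point on the free Si(111) surface, for which Table I gives V = -2.34 eV. The inequality in the last line expresses the statement that additional relaxation can only lower the energy.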

From the contour plots of V_lg for the kinks studied here [Figs. 3(b) and 4(b)], as well as those for the straight steps studied previously [Figs. 5(b) and 5(d)], a common trend emerges: the bound on the Schwoebel barrier (or equivalently the energy of the barrier-determining saddle point in V_lg) is smaller (more negative) when atoms at the step edge or kink site are rebonded. It can be seen (from Figs. 3, 4, and 5) that this saddle point [s1 in Fig. 3(b), s1 in Fig. 4(b), s3 in Fig. 5(b), and s6 in Fig. 5(d)] is in the neighborhood of an upper terrace atom that has moved from its bulk terminated position in a direction away from the saddle point due to the presence of rebonding at the step edge or kink site. The magnitude of this displacement (in the x-y plane) is found to be correlated with the saddle point energy, which is more negative if the displacement is large. The displacement (and the relevant saddle point energy) for the [211] kinks of types A and B, the [211] step, and the [112] step is 0.99 Å (-1.95 eV), 0.21 Å (-1.72 eV), 0.00 Å (-1.63 eV), and 0.72 Å (-1.89 eV), respectively. Therefore the bound on the corresponding Schwoebel barrier is 0.39 eV, 0.62 eV, 0.71 eV, and 0.45 eV, respectively.
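The four bounds quoted above are the V_lg saddle-point energies measured relative to the flat-surface saddle point of Table I (-2.34 eV). The Python sketch below reproduces them and checks the stated trend that a larger displacement goes with a smaller bound; the names used are illustrative only.

```python
V_SADDLE_FLAT = -2.34  # eV, diffusion saddle point on the free Si(111) surface (Table I)

# configuration: (displacement of the nearby upper terrace atom in angstrom,
#                 V_lg at the barrier-determining saddle point in eV)
CONFIGS = {
    "[211] kink, type A": (0.99, -1.95),
    "[211] kink, type B": (0.21, -1.72),
    "[211] step": (0.00, -1.63),
    "[112] step": (0.72, -1.89),
}


def schwoebel_upper_bound(v_lg_saddle, v_flat=V_SADDLE_FLAT):
    """Upper bound on the Schwoebel barrier obtained from V_lg alone."""
    return v_lg_saddle - v_flat


bounds = {name: round(schwoebel_upper_bound(v), 2) for name, (_, v) in CONFIGS.items()}

# Rank configurations from largest to smallest displacement.
ranked = sorted(CONFIGS, key=lambda name: CONFIGS[name][0], reverse=True)
for name in ranked:
    print(f"{name:20s} displacement = {CONFIGS[name][0]:.2f} A, bound = {bounds[name]:.2f} eV")
```

Listed in order of decreasing displacement, the bounds increase monotonically, which is the correlation described in the text.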

The true adatom potential energy V for the kink configurations studied here is shown in Figs. 3(c) and 4(c). These plots show discontinuities near the kink site, indicated by the presence of minima, some of which are labeled in the figures. Studying the final atomic configuration when the adatom is in these regions shows that the discontinuities are due to large scale rearrangements of atoms leading to a loss in the identity of the adatom. In other words, in these configurations, the adatom seems to occupy a lattice position after dislodging another atom, which now appears to be the adatom. Diffusion in these regions cannot therefore be viewed as a single atom process, which makes the contour plots in these regions less meaningful. The saddle point determining the Schwoebel barrier in the case of the [211] kink of type A [s1 in Fig. 3(c)] is, however, not in the proximity of such regions. Therefore the corresponding Schwoebel barrier of 0.15±0.07 eV is assumed to be relevant for processes involving only a single adatom. In the case of the kink of type B, it appears that s0 [in Fig. 4(c)] is the relevant saddle point that determines the Schwoebel barrier. However, this point is close to a minimum that indicates a discontinuity in V. The Schwoebel barrier determined by this point (0.28±0.07 eV) is therefore considered as corresponding to a multi-atom, and not a single atom, process. Hence, discounting this point, the Schwoebel barrier [determined by s1 in Fig. 4(c)] is 0.50±0.07 eV.
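As a consistency check, the single-atom barriers obtained from V can be compared with the V_lg bounds quoted earlier and with the straight [211] step value of Table I. The Python sketch below does this bookkeeping; the dictionary layout and names are illustrative assumptions, while the numbers are those given in the text.

```python
STRAIGHT_211_BARRIER = 0.61  # eV, straight [211] step (Table I)

# Barriers in eV for the two [211] kinks, as quoted in the text.
KINKS = {
    "type A": {"bound": 0.39, "single_atom": 0.15},
    "type B": {"bound": 0.62, "single_atom": 0.50, "multi_atom": 0.28},
}


def respects_bound(barrier, bound):
    """A barrier from V should never exceed the bound derived from V_lg."""
    return barrier <= bound


summary = {}
for name, kink in KINKS.items():
    summary[name] = {
        "within_bound": respects_bound(kink["single_atom"], kink["bound"]),
        "reduction_vs_straight_step": round(STRAIGHT_211_BARRIER - kink["single_atom"], 2),
    }

for name, row in summary.items():
    print(name, row)

# The multi-atom process at the type B kink lies below the single-atom value.
print("type B multi-atom below single-atom:",
      KINKS["type B"]["multi_atom"] < KINKS["type B"]["single_atom"])
```

Both kinks respect their bounds, and both lower the barrier relative to the straight step, by 0.46 eV for type A and 0.11 eV for type B.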

In our previous study of barriers over straight steps, discontinuities in V of the kind seen here were not observed. This was because the temperature used during the simulation was very small: ^2X lO”"^ eV. Here we used a larger temperature (^3X10*^ eV), which resulted, in the presence of the adatom, in the large scale movement of atoms around the kink site. The experimentally relevant temperature, however, continues to be much larger: approximately 0.1 eV. At these temperatures we expect multi-atom processes to have smaller Schwoebel barriers; this is supported by the observation here that such a process occurs near the [211] kink of type B with the Schwoebel barrier being smaller than that for the single atom process. Carrying out simulations at high temperatures with a single probe atom cannot yield useful information, since such potential energy plots are expected to suffer from severe discontinuities due to such multi-atom processes. The general decrease in the barrier values is, however, consistent with the small upper bound on the Schwoebel barrier (0.05 eV) developed previously from an analysis of experimental data on the electromigration of steps.

Fig. 5. In the step configurations [(a), the [211] step, and (c)] the upper terrace atoms are shown larger than those on the lower terrace, and within each terrace the upper monolayer is in grey and the lower in black. (b) and (d) show the adatom potential energy derived from the local geometry (V_lg) corresponding to (a) and (c), respectively. In these plots contours are 0.2 eV apart, with those ≥ -1.9 eV shown by dashed lines. The minima, saddle points, and maxima are marked (labeled) by +(m), *(s), and ×(M), respectively, with the figure in parentheses being the corresponding value in eV.


IV.  CONCLUSION 

Schwoebel barriers, calculated using the empirical Stillinger-Weber potential, for unit depth kinks along the [211] step are smaller in magnitude than that calculated previously for the straight [211] step. This decrease can be expected directly from the adatom potential energy plots (V_lg) that follow purely from the local geometry of atoms around the adatom in these atomic configurations. These plots, as well as similar plots for the straight [211] and [112] steps calculated previously, show that the upper bound on the true Schwoebel barrier calculated using V_lg is correlated with the displacement of an atom on the upper terrace of these configurations that is near the relevant saddle point in V_lg. The true adatom potential energy plots (V), however, show discontinuities due to large scale movement of atoms near the kink sites, resulting sometimes in smaller barriers than when such movements do not occur. We therefore speculate that multi-atom processes occurring in the configurations studied here and previously could have smaller Schwoebel barriers.

It has long been held that high resolution x-ray photoemission spectra of the Si 2p core level at Si/SiO2 interfaces provide important structural information, and that any credible interfacial model must be able to account for these data. To this end it has traditionally been assumed that there exists a one-to-one relationship between chemically shifted spectral features and interfacial silicon atoms in specific formal oxidation states. A series of new measurements of Si 2p core level binding energies for cluster-derived Si/Si oxide interfaces appears to stand in direct contradiction to this critical assumption. In this article we present a critique of various responses to the challenge posed by the new observations. Particular attention is given to the logical consequences of either maintaining or rejecting the one-to-one relationship between shifted features and formal silicon oxidation states and to the challenges each of these responses must meet if it is to prevail.
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I.  INTRODUCTION 

The structure and reactivity of the Si(100)/SiO2 interface have been a subject of continuing interest and controversy over the past few years. While a variety of experimental methods have been brought to bear on these problems, high resolution x-ray photoemission studies of the 2p core level binding energies of the interfacial atoms have played a particularly important role. The technique offered the promise of simple-to-interpret, reliable information concerning the structure of the silicon-oxygen network at the interface. Prior to 1993 it was generally held that in the spectra of Si/SiO2 interfaces, chemically shifted Si 2p core level peaks of 0.9, 1.8, and 2.5 eV were the result of interfacial Si atoms bound to one, two, and three oxygen atoms, respectively. The binding energy shift exhibited by fully formed SiO2 shows a small dependence on overlayer thickness, which is generally held to originate from varying image charge stabilization of the core hole state. Aside from this effect, no details of the structure other than the number of oxygen atoms to which the Si atom in question is directly bound were generally held to have any measurable effect on the spectra. This set of empirical assumptions has been heavily relied upon for the construction of structural models. In addition, the assignment scheme, and the structural models derived from it, are routinely used when attempting to optimize processes for semiconductor device development. For these reasons, a reliable, clearly delineated assignment scheme is critically important.

Recently, however, both the simplicity and reliability of the assignment rules have been called into question. New measurements have been made of Si 2p core level shifts for a class of cluster-derived Si/SiO2 interfaces. These interfaces, formed by the low temperature attachment of cluster molecules with pre-formed Si-O bonds, constituted the first class of silicon/silicon oxide interfaces for which aspects of the local structure could legitimately be claimed to be known a priori. This permitted the first examination of the assignment rules for the Si 2p core level chemical shifts which was not intimately entangled with the problem of structure determination. From the standpoint of the simple, clear-cut assignment scheme described above, the results were, to say the least, rather disturbing. In this article we will review the key observations made on the cluster interfaces, along with some new supporting evidence from a new class of molecules, the silatranes. We will then attempt to systematize the various positions which have, implicitly or explicitly, been taken with respect to the need to modify the rules and principles by which the photoemission spectra are assigned and models of Si/SiO2 interfaces are constructed.

II. THE DISQUIETING CLUSTER INTERFACE SPECTRA

In 1993 Banaszak Holl and McFeely published a paper containing the Si 2p photoemission spectrum of H8Si8O12 spherosiloxane clusters chemisorbed on a Si(100)-2×1 surface (Fig. 1). Analogous spectra from the chemisorbed clusters H12Si12O18 and H14Si14O21 soon followed. Each of these spectra contained no fewer than three features which were to varying degrees incompatible with the then universally accepted rules for the assignment of Si 2p spectra from silicon/silicon oxide interfaces. For the sake of convenience in the forthcoming discussion, we enumerate these formally.

Fig. 1. Panel A: cluster derived silicon/silicon oxide interface. Panel B: thermal silicon/silicon oxide interface with formal oxidation states assigned to features using the orthodox assignment model.

C1: The binding energy of a surface Si to which a cluster vertex is bound, i.e., O3Si-Si, is shifted by 1.0 eV with respect to the bulk, despite the fact that it is a formal Si^0 species without any direct bond to an oxygen atom.

C2: The cluster vertex Si effecting the attachment to the surface, i.e., O3Si-Si, is shifted by 2.1 eV, a value almost exactly midway between those held to be characteristic of Si^2+ (1.8 eV) and Si^3+ (2.5 eV) at Si(100)/SiO2 interfaces.

C3: HSiO3 moieties are chemically shifted by 3.6-3.7 eV, far removed from the 2.8-2.9 eV predicted on the basis of the simple additivity of chemical shifts.
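The size of the three discrepancies can be made explicit with a little arithmetic. The Python sketch below compares each observation with the traditional assignment values of 0.9, 1.8, and 2.5 eV for one, two, and three oxygen neighbors; the zero shift assigned to a Si atom with no oxygen neighbor is the bulk reference implied by that scheme, and all names in the code are illustrative.

```python
# Traditional shifts (eV) keyed by the number of oxygen atoms bound to the Si atom.
ORTHODOX_SHIFT_EV = {0: 0.0, 1: 0.9, 2: 1.8, 3: 2.5}

# Observations C1 and C2 from the cluster-derived interfaces (eV).
CLUSTER_OBSERVATIONS = {
    "C1": {"n_oxygen": 0, "observed_ev": 1.0},  # surface Si bound to a cluster vertex
    "C2": {"n_oxygen": 3, "observed_ev": 2.1},  # cluster vertex Si bound to the surface
}


def deviation_from_orthodox(n_oxygen, observed_ev):
    """Observed shift minus the shift expected from the traditional scheme."""
    return observed_ev - ORTHODOX_SHIFT_EV[n_oxygen]


deviations = {
    label: round(deviation_from_orthodox(**obs), 2)
    for label, obs in CLUSTER_OBSERVATIONS.items()
}

# C3 is quoted as ranges: observed 3.6-3.7 eV versus 2.8-2.9 eV from simple additivity.
C3_OBSERVED_EV = (3.6, 3.7)
C3_ADDITIVE_EV = (2.8, 2.9)
c3_gap = (
    round(C3_OBSERVED_EV[0] - C3_ADDITIVE_EV[1], 2),  # smallest possible gap
    round(C3_OBSERVED_EV[1] - C3_ADDITIVE_EV[0], 2),  # largest possible gap
)

print(deviations)
print("C3 gap range (eV):", c3_gap)
```

The deviations are +1.0 eV for C1, -0.4 eV for C2, and between 0.7 and 0.9 eV for C3, each comparable to or larger than the 0.9 eV increment attributed to a single oxygen neighbor in one case and a substantial fraction of it in the others.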

III.  RESPONSES  TO  THE  NEW  DATA 

As the results presented above do not fit neatly into the traditional assignment scheme, they naturally suggest that some modification of the assignment principles may be in order. The experimental observations, however, are, to stress a point, observations only. They do not transparently reveal the underlying physics responsible for the anomalous shifts. As such, they themselves allow considerable latitude in the assessment of the nature and extent of such modifications. In this section, we consider some of the positions which have been taken explicitly or implicitly in the literature with regard to the appropriate modifications of the traditional assignment procedures. In so doing we will review a series of positions ranging from the orthodox to the most aggressively revisionist, orthodoxy in this case referring to positions substantially upholding the pre-1993 assignment scheme. Emphasis will be given to a discussion of the logical and evidentiary foundations of the positions in question and to any conclusions necessitated with regard to the cluster data. As in all matters not firmly settled by an overwhelming preponderance of experimental evidence, the reader will doubtless note the possibility of fully self-consistent positions lying in between those we have explicitly set out. Those specifically elucidated (orthodoxy, revisionism, and accommodationism) should most properly be considered as somewhat arbitrary guideposts along a continuum. In the spirit of forthrightness, we alert the reader to the fact that we are not disinterested parties in this debate. We are on record as advocates of revisionism, and continue (variously) to hold positions on this end of the spectrum. We will, however, attempt to draw the reader's attention to the weak points in our own position as well as those in opposing views.

A.  Orthodoxy 

As orthodoxy implies subscribing to a well established set of ideas, we
designate as orthodox those asserting that the pre-1993 assignment
principles are in no need of revision in light of the evidence cited
above, and apply universally to the silicon/oxygen system. While we are
aware of no one who has made this exact affirmation in print, authors
who assign their spectra by means of the pre-1993 standard principles
without explicit acknowledgment of divergent possibilities suggested by
the cluster results are by clear implication upholders of the orthodox
position. To summarize, the tenets of the orthodox assignment scheme are

O1: The binding of an oxygen atom to a Si atom at the interface
produces a shift of approximately 0.9 eV in the Si 2p binding energy.
The effect of oxygen binding is additive, each additional oxygen
inducing a similar incremental shift, so that Si2SiO2 and SiSiO3
configurations should give rise to shifts at 1.8 and 2.5 eV on the
Si(100) surface, respectively.

O2: No configurations other than those stated above give rise to a
spectroscopically measurable chemical shift. The Si 2p binding energy is
determined completely by the identity of the four nearest neighbors of
the central atom, with the exception of O3 below.

O3: A small decrease in the dielectric screening of the core hole final
state occurs as the emitting Si atom moves away from the bulk Si into
the oxide. The primary observational consequence of this is a gradual
increase in the SiO2 chemical shift (to a maximum added shift of 0.4 eV)
with overlayer thickness in the 0-10 Å regime.
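
As an illustration only, tenets O1 and O2 can be written as a lookup on the first coordination sphere. The sketch below is not taken from any of the cited works: the function and variable names are hypothetical, the table holds just the shift values quoted in O1, and the zero entry reflects the orthodox expectation that a Si atom with no direct bond to oxygen shows a negligible shift. The final-state correction of O3 is left out.

```python
# Shift values (eV) quoted in tenet O1, keyed by the number of oxygen atoms
# among the four nearest neighbors. Names here are illustrative assumptions.
ORTHODOX_SHIFT_EV = {0: 0.0, 1: 0.9, 2: 1.8, 3: 2.5}


def orthodox_shift(first_neighbors, second_neighbors=None):
    """Return the Si 2p shift (eV) an orthodox assignment would predict."""
    if len(first_neighbors) != 4:
        raise ValueError("expected exactly four nearest neighbors")
    n_oxygen = sum(1 for atom in first_neighbors if atom == "O")
    if n_oxygen not in ORTHODOX_SHIFT_EV:
        raise ValueError("tenet O1 as quoted gives no value for this case")
    # Tenet O2: the second coordination sphere is deliberately ignored.
    return ORTHODOX_SHIFT_EV[n_oxygen]


# Si bound to two Si and two O atoms.
print(orthodox_shift(["Si", "Si", "O", "O"]))
# Si bound to four Si atoms, one of which carries three oxygens.
print(orthodox_shift(["Si", "Si", "Si", "Si"], second_neighbors=["O", "O", "O"]))
```

The second call returns zero whatever is passed as `second_neighbors`, which is exactly the feature of the orthodox scheme that the cluster observations call into question.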

Some comment on this particular choice of propositions as
representative of the orthodox position is in order. Proposition O3 is
clearly the least central to orthodoxy, and there are indeed some
workers who have (at least in the past) assigned spectra using generally
orthodox methods without adopting this principle. We have included it
because it provides an apparently simple means to account for
observation C2. This attempt raises a number of subtle points which we
wish to address below, hence its inclusion. Absent the O3 principle, the
orthodox assignment principles seem simply at a loss to account for this
discrepancy. Principle O2, it may be noted, would be regarded as a
rather extreme statement by some workers who have in the past
interpreted the photoemission data by methods which we would
characterize, on balance, as orthodox. This is particularly true of
those who have argued for the importance of bond angle variations in
inducing small chemical shifts. These bond angle effect arguments,
however, have largely been applied to the "Si+4" peak, which is not our
primary subject in this article. Furthermore, as we have discussed
elsewhere, the spherosiloxane cluster studies directly contradict the
claims that these effects are important, as do recent theoretical
calculations upholding the orthodox viewpoint. Moreover, it is our view,
based upon a survey of recent publications, that the notion of bond
angle fluctuations as a significant effect to be incorporated into the
analysis of the photoemission data has not found widespread acceptance.

The orthodox assignment scheme rests on several foundations. First, the
assignment of a 1 eV shift to a surface silicon bound to three silicons
and one oxygen atom has firm underpinnings in studies of water
chemisorption. We know of no systems containing known Si2SiO2 and SiSiO3
moieties yielding Si 2p chemical shifts in the 1.8 eV and 2.5 eV range.
(Clearly, the thermal oxide interfaces themselves cannot be used for
this purpose.) In fact, the only cases we know of in which a SiSiO3
moiety is known to be present are the various cluster interfaces, where
shifts of 2.0-2.1 eV, not 2.5 eV, are measured. Nevertheless, the fairly
regular spacing of the interfacial oxide peaks at 0.9, 1.8, and 2.5 eV
is clearly reminiscent of the regular spacings observed for halide
chemisorption on Si(111) surfaces. As these monolayer systems are
considerably simpler from the standpoint of structural chemistry, the
di- and tri-halide origin of the peaks shifted by multiples of a basic
increment is quite generally accepted. The ascription of the 1.8 and
2.5 eV peaks to Si2SiO2 and SiSiO3 moieties has therefore been based
upon the principle of the regular additivity of shifts, in analogy with
the halide systems. This idea has recently been bolstered by theoretical
calculations from Pasquarello, Hybertsen, and Car. In this calculation,
the binding energy shifts are insensitive to the details of the second
nearest neighbors, but no explanation for the discrepancy posed by the
cluster interface spectra was given.

The challenge to the orthodox position is to account for the results of
the cluster studies. As this assignment scheme asserts that
spectroscopically detectable binding energy shifts are dependent upon
the first coordination sphere only, the key structural moieties in the
cluster interfaces should clearly fall within its purview. Observation
C1, that a formally Si0 species should give rise to a 1 eV shifted core
level, is absolutely incompatible with the orthodox picture. Therefore,
to espouse orthodoxy requires, for consistency's sake, the simultaneous
assertion that this experimental observation is either some sort of
misleading artifact or that the assignment is wrong in some fundamental
way. This prospect will be considered in conjunction with our discussion
of revisionism below.

Orthodoxy must also face the challenge posed by observation C2. This is
more troubling than it appears at first sight. Orthodoxy is placed in a
curious position here. The cluster spectra provide a model system which
is apparently ideal for determining the binding energy of a SiSiO3
moiety experimentally. Moreover, it is, to the best of our knowledge,
the only such model. Nevertheless, the 2.1 eV shift exhibited by this
moiety is rejected in favor of the value 2.5 eV. This latter value is
evidently to be preferred based upon its appearance in the thermal oxide
interface spectra, and the presumed linear variation of binding energy
shift with formal oxidation state, a condition which, if this latter
value does not actually quite satisfy, it at least approximates more
closely. The difference between the 2.5 eV peak observed in the thermal
oxide and the 2.1 eV shift observed for the SiSiO3 moiety in the cluster
interface spectra is not a trivial one. A shift of 2.1 eV is actually
closer to the 1.8 eV shift proposed for Si2SiO2 fragments than to the
2.5 eV shift.

The orthodox assignment scheme offers one means by which this
observation could be rationalized, principle O3. In the cluster
interface spectra, the O3Si- moiety is bound directly to the substrate,
while in every plausible model of the interface these fragments are on
the SiO2 side of the interfacial region, removed by up to 4 Å or so from
the silicon substrate. According to O3 this could in principle be
accompanied by some upward shift in binding energy. Invoking O3,
however, to rationalize this discrepancy creates new problems of its
own. The O3Si-Si feature of the cluster interfaces and the Si+1 feature
of the thermal interface spectra (or the HO-Si feature from water
chemisorption, for that matter) both occur at the bulk Si side of the
interfacial region. Most interface models, whether based upon orthodox
or upon radically revised interpretations of photoemission data, and
tridymite-based models, agree in placing the Si2SiO2 moieties there. If
the difference between the 2.5 eV of the thermal oxide Si+3 and the
2.1 eV in the clusters is attributed to a 0.4 eV "O3 effect," and the
Si+1 and Si+2 are so situated geometrically as to have the same O3
contribution as the cluster Si+3, then we must select the cluster Si+3
as the best species to use in obtaining an undistorted picture of the
Si+1-Si+3 binding energy progression. The resulting series, 0.9-1.0,
1.8, 2.0-2.1, does not seem very regular at all, and appears at clear
variance with the theoretical assertion that the initial state
contributions to the shifts should vary linearly. Suddenly, from a
self-evident regularity, simply accounted for, one seems forced to
conclude that if one invokes O3 as the sole cause of the discrepancy in
binding energy between Si+3 species in the thermal and cluster
interfaces, the approximately regular spacing of peaks in the oxide
interface obtains only because a large incremental decrease in initial
state shift in adding a third oxygen atom is fortuitously compensated by
a decrease in core hole relaxation. In so doing, however, proponents of
orthodoxy would be compelled to relinquish at least some of their claim
that their position is bolstered by theory. Pasquarello, Hybertsen, and
Car state unequivocally that the initial state contributions to the Si
2p binding energies at the Si/SiO2 interface vary linearly with formal
oxidation state. Thus, experiment and theory currently seem at an
impasse regarding this point. In conclusion, orthodoxy, which, recall,
is defined as asserting universal applicability to the Si/O interface
system, can possibly survive observation C2, but not without some damage
to the plausibility of its foundations.
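
The loss of regularity can be made concrete by comparing successive increments in the two series. The following is a minimal sketch using only the shift values quoted above; the helper name is hypothetical, and for the cluster-corrected series the lower bound of 0.9-1.0 eV and the upper bound of 2.0-2.1 eV are assumed, which is the most favorable choice for regularity.

```python
def increments(shifts_ev):
    """Successive differences (eV) between neighboring entries of a series."""
    return [round(b - a, 2) for a, b in zip(shifts_ev, shifts_ev[1:])]


# Shifts for formal Si0, Si+1, Si+2, Si+3 in the thermal oxide interface.
thermal = [0.0, 0.9, 1.8, 2.5]
# Same progression with the cluster value substituted for Si+3.
cluster_corrected = [0.0, 0.9, 1.8, 2.1]

print("thermal increments:          ", increments(thermal))
print("cluster-corrected increments:", increments(cluster_corrected))
```

Under these assumptions the third oxygen adds only about 0.3 eV in the cluster-corrected series, against 0.7 eV in the thermal series and 0.9 eV for each of the first two oxygens.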




B.  Revisionism 

Revisionism, as the name implies, refers to the doctrine that the
orthodox assignment rules O1-O3 must be modified in light of
experimental observations C1-C3 to produce a new set of assignment
rules, of equally universal application within the Si/Si oxide interface
systems. Implicit in any revisionism are two fundamental assertions.

R1: The cluster interfaces are not pathological or any sort of special
case. They are typical examples of Si-O bonding and as such their
spectra are assignable on the basis of the same rules as any other Si-O
system.

R2: The experiments on the cluster derived interfaces have been
performed correctly, and the assignments of the spectral features are as
given by the authors.

Point R1, which serves to distinguish revisionism from the
accommodationist position discussed below, is straightforward and in
essential agreement with the orthodox view. In examining the structural
moieties giving rise to C1 and C2, one must go out to at least 3rd
nearest neighbors before the structure reaches any anomaly (a hydrogen
atom). As the orthodox position is that, to all intents and purposes,
only the first coordination sphere matters, the clusters should not be
anomalous. While not endorsing precisely this reasoning, advocates of
revisionism would agree that absent some specific, convincing physical
reason why the cluster interfaces should be anomalous, they should not
be so regarded.

Principle R2 is of course the foundation of revisionism. It is clear
from the preceding discussion that if orthodoxy is to prevail, the
assignments given in the cluster interface spectra must be fundamentally
wrong ones, and it must be possible to construct new assignments based
upon orthodox principles which successfully explain the data. We examine
here what we consider the two most pertinent challenges to the
revisionist interpretation of the cluster interface spectra, the
contamination hypothesis and the partial dissociation hypothesis.

Fig. 2. Cluster molecule H8Si8O12 and a schematic view of the cluster
bound to a silicon surface.

The contamination hypothesis deals with C1 by asserting that the 1.0 eV
shifted feature in the cluster and silatrane spectra is in every case an
artifact caused by the contamination of the surface by water. The
spectra are thus held to really be those of clusters and co-adsorbed
water, with the latter responsible for the 1.0 eV feature. To be sure,
the Si(100)-2×1 surface is highly reactive towards water, and careful
procedures had to be adopted to obtain the spectra reported. We do not,
however, believe that the water hypothesis can be sustained. To begin,
note that the water hypothesis must assert that ALL of the intensity in
Fig. 1, peak p1, is due to water. This makes for difficulties in
understanding
the intensity relationships among peaks p3, p2, p1, and H in the
H8Si8O12 spectrum of Fig. 1 and in the analogous intensity relationships
for spectra of the other clusters. Banaszak Holl and McFeely noted that
peaks p3-H should stand in the ratio of 7:1:1:1 for single vertex
attachment to the surface. This single vertex intensity relationship is
approximately satisfied by all the clusters. We have repeated the
H8Si8O12 result in particular many times over the past three years,
using different synthetic lots of source cluster, always with this same
result. It strains our credulity to believe we would not have seen
considerably more variability in this spectrum if it were the result of
contamination. Even if this point is waived, however, problems remain.
If peak p1 is ascribed to water, then its intensity with respect to
peaks p3 and p2 implies that water adsorbs in approximate 1:1 proportion
to all the clusters, as well as showing an identical 1:1 behavior in the
chemically very different silatrane case. If true, this would be an
exceedingly curious phenomenon. The clusters and H2O each efficiently
inhibit the chemisorption of the other. A water saturated surface is
inert towards clusters, and a cluster saturated surface is inert towards
water chemisorption. One supposes therefore that in the water
contamination scenario, what must be going on is a race for surface
sites between the two species. The vapor pressure difference between
H8Si8O12 and H14Si14O21 appears to be on the order of a factor of 100 at
the very least, which during the exposure process leads to a similar
reduction in the flux of cluster arriving at the surface. If the
H8Si8O12 is adsorbed in 1:1 proportion, one must conclude that the race
for sites between background water and the H14Si14O21 clusters should be
overwhelmingly in favor of the former, and that not more than a percent
or so of surface sites should wind up being occupied by the clusters.
This expectation is clearly belied by the experiments.
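
The "race for sites" estimate can be sketched numerically. The model below is an assumption introduced only for illustration: each species is taken to occupy sites in proportion to its effective arrival rate (flux times sticking probability), the water rate is calibrated so that H8Si8O12 and water adsorb 1:1, and the factor of 100 is the vapor pressure ratio quoted above. The function name is hypothetical.

```python
def cluster_site_fraction(cluster_rate, water_rate):
    """Fraction of occupied sites taken by clusters when two species compete
    and each fills sites in proportion to its effective arrival rate."""
    return cluster_rate / (cluster_rate + water_rate)


# Calibration: H8Si8O12 and background water adsorb in 1:1 proportion.
water_rate = 1.0
h8_rate = 1.0
# H14Si14O21 flux is assumed lower by the quoted factor of 100.
h14_rate = h8_rate / 100.0

h8_fraction = cluster_site_fraction(h8_rate, water_rate)
h14_fraction = cluster_site_fraction(h14_rate, water_rate)
print(f"H8Si8O12 share of sites:   {h8_fraction:.3f}")
print(f"H14Si14O21 share of sites: {h14_fraction:.3f}")
```

With this simple proportional model the larger cluster would be expected to claim about one site in a hundred, which is the "percent or so" figure in the argument above.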

In spite of these difficulties with the water hypothesis, we have
nevertheless undertaken further experiments to ascertain its likelihood.
To do this we have prepared surfaces containing only about
half-saturation coverage of H8Si8O12, a representative spectrum of which
is shown in Fig. 3. The half-saturated surface still clearly shows
surface dimer peaks indicative of reactive, clean surface sites. The
fact that we can measure these spectra, which require substantial
additional sample manipulations, is strong evidence that we can adsorb
the clusters cleanly. To verify that the surface shown in Fig. 3 is
still reactive towards water, the sample was removed to the reaction
chamber and deliberately dosed with water. As shown by the dotted curve
in Fig. 3, the expected result was obtained. The surface dimer site peak
is completely removed and additional water induced intensity grows in at
about 1 eV.

As a final piece of evidence, we have for the silatrane case been able
to demonstrate that for a closely related isoelectronic molecule,
water-free depositions can be effected. To this end we employed
azasilatrane, HSi[(NH)CH2CH2]3N, which is identical to silatrane except
that each oxygen is replaced by an NH group. The absence of oxygen
within this cluster made possible the use of the O 1s photoemission
intensity to rule out significant contamination by water.

In conclusion, we find the hypothesis of water contamination
unsustainable. Even if one allows for the strange constancy of the
magnitude of the putative contamination from cluster to cluster, it
offers no insight into why azasilatrane, alone amongst these clusters,
should be immune from contamination effects. Finally, we note that the
water hypothesis does not by itself solve all of the difficulties faced
by the orthodox position, as it offers no explanation for observation
C2, which, as we have seen, is reconcilable with the orthodox doctrine
only awkwardly, if at all.

The second possibility we shall consider is the hypothesis of partial
decomposition. The partial decomposition structure most consistent with
the orthodox assignment scheme involves breaking one silicon-oxygen bond
at the surface attachment vertex and rebonding it to another surface
silicon. This rebonding would provide an Si-O moiety at the surface with
the ~1.0 eV Si 2p binding energy shift necessary to account for C1. This
sort of partial decomposition scenario is difficult to subscribe to on a
number of grounds. First of all, this process involves the spontaneous
breakup of a chelated site to form two radical sites separated in space,
one at the cluster vertex and one resulting from the breaking of the
dimer bond accompanying the rebonding of the oxygen atom at the surface.
While surface reconstruction might account (somehow) for the quenching
of the surface radical, the cluster radical seems unavoidable. This
process would therefore be expected to be extremely endothermic. Further
compensating dissociations are virtually impossible to square either
with the preservation of high symmetry evidenced by the sharpness of the
valence band structure, or with the absence of additional resolvable
core level features. The partial dissociation theory also requires that
five different cluster complexes, which have both subtle and substantial
differences in the chelate formed at the vertex Si, all partially
dissociate in the exact same manner. This is difficult to accept. Note
that upon heating, which we believe does cause the clusters to react
further with the surface, the sharp valence band features rapidly give
way to a spectrum reminiscent of amorphous SiO2 and new, resolvable
features are apparent in the Si 2p core-level spectrum. Even if these
difficulties are waived, the partial decomposition model must account
for the observed chemical behavior of the cluster interfaces. The
interfaces produced by H8Si8O12 show no spectral change when exposed to
thousands of langmuirs of water. Even short exposure to atmosphere
produces no discernible changes. This observation is incompatible with
the existence of highly reactive free radicals on the surface.

Fig. 3. The solid line shows a Si(100)-2×1 surface that has been
partially dosed with H8Si8O12. Note the presence of a shoulder to lower
binding energy than the bulk, indicative of surface dimers. Subsequent
dosing with water causes the disappearance of the dimer features and the
growth of an Si-OH feature at 1.0 eV as expected [Ref. 2(c)].

Another unfortunate consequence of the partial decomposition hypothesis
is that it makes observation C2 even more problematic for the orthodox
assignment model than it was before. The disruption of the cluster
vertex upon partial decomposition changes this silicon atom from a
formal Si+3 to a Si+2. This leaves one with the predicament that somehow
a single oxygen atom induces 0.9 eV of chemical shift while the addition
of a second can induce an additional 1.2 eV. If the silicon cluster
vertex site is a free radical, one would also expect some multiplet
splitting in the core level spectrum, which is not in evidence. We could
go on, considering the effects of shifting hydrogen atoms, but to the
best of our understanding these shifts only serve to patch up (poorly)
one problem, leaving worse ones in their wake. In summary, neither we
nor anyone else has succeeded in constructing any alternative structural
model or contamination hypothesis which plausibly explains the cluster
interface spectra. We hope shortly to offer infrared evidence to further
bolster this conclusion.

Having upheld, at least for now, the experimental foundations of
revisionism, we proceed to the problem of exactly how it is to be
implemented. This is a difficult problem, as in our opinion the
fundamental lesson of the cluster studies is that reliable model systems
of known structure are absolutely required to make unimpeachable
assignments of interface core level features. Simple rules of thumb can
take one only so far, and subtleties can be missed. An excellent case in
point is provided by observation C3, that HSiO3 species are shifted by
3.6-3.7 eV. The principle of regular additivity of chemical shifts,
which is at least consistent with overall shift trends, would take the
2.5 eV shift of interfacial SiO3, add to it the 0.3 eV shift induced by
H on bare Si, and predict this feature to come in the 2.7-2.9 eV range.
Numerous assignments have been made on this basis in the literature.
This is moreover a case in which the germaneness of the cluster studies
to the interface cannot be challenged, as recently it has been shown
that hydrogen atom attachment to ultrathin SiO2 interfaces produces a
feature at exactly the 3.6 eV predicted by the cluster studies.
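
The size of the failure of simple additivity for C3 follows from the numbers just quoted. The short sketch below only restates that arithmetic; the variable names are illustrative and the inputs are the 2.5 eV, 0.3 eV, and 3.6-3.7 eV values given in the text.

```python
# Values quoted in the text (eV); names are illustrative.
SIO3_INTERFACE_SHIFT_EV = 2.5   # interfacial SiO3 in the orthodox scheme
H_ON_BARE_SI_SHIFT_EV = 0.3     # shift induced by H on bare Si
OBSERVED_HSIO3_EV = (3.6, 3.7)  # observation C3

# Simple additivity: sum the two contributions.
predicted = round(SIO3_INTERFACE_SHIFT_EV + H_ON_BARE_SI_SHIFT_EV, 2)
# Shortfall of the additive prediction against each end of the observed range.
shortfall = [round(obs - predicted, 2) for obs in OBSERVED_HSIO3_EV]

print("additive prediction (eV):", predicted)
print("shortfall vs. observation (eV):", shortfall)
```

The additive estimate lands inside the 2.7-2.9 eV window and falls short of the measured position by nearly a full eV, comparable to the entire shift attributed to one oxygen atom.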

The most resolutely conservative position for a revisionist to take is
that only the binding energies associated with O3Si-Si, with the second
Si being a bulk Si atom, have been measured, and that this and only this
should be incorporated into the analysis of interface spectra. While
certainly safe, such a diffident attitude is not the sort which is
conducive to progress in understanding Si/SiO2 interfaces. For the sake
of intellectual consistency, would one not also be obliged to show
equivalent reticence in attributing the 1.8 eV peak to Si2SiO2 or the
2.5 eV peak to oxide matrix SiSiO3, as these assignments are based
merely upon plausible analogies? Clearly, this is carrying rigor to the
point of intellectual rigor mortis. It is also inconsistent with
revisionist principle R1, that the cluster systems are typical ones. As
such, the same latitude should be granted to the revisionist as to the
orthodox in the use of carefully chosen plausibility arguments to
construct physical models of the interface worthy of scrutiny.

With this perspective, while as revisionists we disagree with orthodox
principle O2, we accept the idea that the principle of regular
additivity of shifts, an implicit underpinning of orthodoxy, is a useful
rule of thumb to which recourse can legitimately be made, although not
regarded in itself as a source for definitive conclusions or a
substitute for the direct evidence of appropriate model systems.
Adopting this stance necessitates a revision of the orthodox assignments
on the grounds that they are insufficiently specified by not including
the effects of second nearest neighbors, which the cluster results show
can be significant. Orthodoxy requires that any silicon atom without a
direct bond to oxygen exhibit a negligible Si 2p chemical shift. Cluster
results show that if such an atom is bound to an -SiO3 group, so as to
place 3 oxygen atoms in the second coordination sphere, a 1 eV shift
results. Similarly, we can consider the 2.5 eV peak in the thermal
interface spectra. All parties to this debate, whether rigorously
orthodox or radically revisionist, associate this peak with SiO3 groups,
more specifically with O3Si-SiO3 structures. As the Si3Si-SiO3 binding
energy falls at 2.1 eV, it is reasonable to ascribe the bulk of the
binding energy difference to the effect of additional oxygen atoms in
the second coordination sphere, afforded by the mutual linking of the
SiO3 moieties. (As orthodox principle O3 is fully compatible with
revisionism, one can allow some small final state contribution to this
difference as well.)

Given these two cases, and a toleration for the additivity principle as
a guideline, does it make sense to maintain that there should be no
difference in binding energy between the two formal Si+1 fragments
Si3SiO and Si2Si(O)(SiO3)? We think not. Taking the same sort of
approach as was originally used to construct the orthodox assignment
rules, one could propose a linear scaling of the effect of -SiO3 group
attachment between the two above cases. Thus the addition of an SiO3
group to an otherwise Si+1 species would produce an extra shift of about
0.8 eV and to a Si+2 an additional 0.6 eV. With the adoption of this set
of auxiliary assignment rules, and retaining the remainder of the
orthodox ones, one would arrive at a species of "minimal revision" which
would be able to address most of the configurations likely to be found
in a model of the Si(100)/SiO2 interface. Even this Minimal Revisionist
Model has dramatic consequences for photoemission based models of the
Si(100)/SiO2 interface. If it is correct, then many proposed models
cannot be right, as they attribute all of the observed core level
photoemission intensity at 0.9-1.0 eV to Si2Si(O)(SiO3) species, while
the considerations above would indicate that 1.7 eV or so is a more
appropriate position for the Si 2p core level peak arising from this
moiety.
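
The linear scaling described above can be written out explicitly. In this sketch the two endpoints are assumptions read off the preceding discussion: an extra shift of 1.0 eV when an -SiO3 group is attached to a formal Si0 atom (observation C1), and 0.4 eV for a formal Si+3 atom (the difference between the 2.5 eV thermal peak and the 2.1 eV cluster value). The function names are hypothetical, and the orthodox first-neighbor value of 0.9 eV is used for the Si+1 base shift.

```python
def sio3_extra_shift(formal_oxidation_state):
    """Extra Si 2p shift (eV) from one -SiO3 second-neighbor group, linearly
    interpolated between the assumed Si0 and Si+3 endpoints."""
    at_si0, at_si3 = 1.0, 0.4
    slope = (at_si3 - at_si0) / 3.0
    return round(at_si0 + slope * formal_oxidation_state, 2)


def minimal_revision_shift(first_neighbor_shift_ev, formal_oxidation_state):
    """Orthodox first-neighbor shift plus the -SiO3 second-neighbor term."""
    extra = sio3_extra_shift(formal_oxidation_state)
    return round(first_neighbor_shift_ev + extra, 2)


for state in range(4):
    print(f"Si+{state}: extra shift {sio3_extra_shift(state)} eV")

# Si2Si(O)(SiO3): formal Si+1 with a 0.9 eV first-neighbor shift.
print("Si2Si(O)(SiO3):", minimal_revision_shift(0.9, 1), "eV")
```

Under these assumed endpoints the interpolation reproduces the 0.8 eV and 0.6 eV increments quoted for Si+1 and Si+2, and places the Si2Si(O)(SiO3) peak near 1.7 eV rather than 0.9-1.0 eV.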

As the explanation of the cluster results constitutes a challenge for
the orthodox position, an accounting for the thermal oxide interfaces is
a challenge for revisionism. At first thought, it might be expected that
significant second nearest neighbor effects would be incompatible with
the discrete peak structure which is experimentally observed. To address
this challenge, Banaszak Holl, Lee, and McFeely put forth a more
comprehensive and more radical revisionist scheme by adopting an
effective electronegativity model. This scheme predicts qualitatively
the same revisionist assignment principles as given above, and also
gives a systematic means of evaluating chemical shifts of other sorts of
configurations, such as the formal Si0 fragment Si2Si(SiO2)2 (to which
it assigns a shift of approximately 1 eV), employing a methodology which
has had past successes. It must be emphasized clearly that the further
such a scheme strays from moieties which have been directly observed in
well characterized model systems, the more it becomes speculative.
However, applying revisionist photoemission assignment principles, they
were able to construct a model of the Si(100)/SiO2 interface that
clearly reproduced the photoemission data. Interested readers are
referred to this paper for details. Given that we have no cluster data
for the other crystallographic faces of Si (the clusters do not even
adsorb on the (111) surface to a significant extent), we have been
hesitant to pursue further model building at this time. The structural
model proposed based upon revisionist assignment principles for the
Si(100)/SiO2 interface has been strenuously challenged by Pasquarello,
Hybertsen, and Car on the basis of theoretical calculations; however, as
these authors apparently offer no explanation for the cluster
observations, it is difficult to regard this matter as settled.

C.  Accommodationism 

Between the diametrically opposing views of Orthodoxy and Revisionism
lies the doctrine of Accommodationism. This position denies the catholic
claims made by both of the foregoing, and asserts that while the cluster
interface spectra have been correctly interpreted, they constitute a
special case of some sort, that inferences from the cluster derived
interfaces to the thermal interfaces are unwarranted, and that for the
latter case the orthodox assignment doctrine holds good.

Accommodationism  currently  has  a  peculiar  status.  There 
is  a  chance,  in  our  opinion,  that  this  position  may  ultimately 
prove  to  be  substantially  correct.  However,  to  date,  no  one 
has elaborated an argument along these lines. The necessary
elaboration,  which  is  crucial,  is  a  plausible  reason  grounded 
in  fundamental  physical  terms  to  regard  the  two  cases  as 
different.  At  this  time,  it  is  not  clear  to  us  what  form  such  an 
argument  would  take. 

IV.  CONCLUSIONS 

In  this  article  we  have  argued  that  based  upon  the  most 
straightforward  interpretation  of  the  various  cluster  interface 
spectra,  the  orthodox  scheme  for  the  assignment  of  Si  2p 
XPS spectra from the Si/SiO2 interface must be revised in
some  way.  Furthermore,  we  have  shown  that  it  is  no  simple 
matter  to  devise  an  alternative  explanation  for  the  cluster 
interface  spectra. 

The most resolute and uncompromising defense of orthodoxy
is provided by the theoretical work of Pasquarello, Hybertsen,
and Car. These calculations represent pioneering
efforts  to  apply  modern  local  density  functional  theory  to 
these  problems,  and  they  find  no  evidence  for  any  of  the 
second  nearest  neighbor  effects  we  claim  to  have  measured. 
In  our  opinion,  these  calculations  constitute  at  this  time  the 
best  foundation  for  the  orthodox  position,  far  better  than  the 
small  body  of  direct  experimental  observations,  or  an  appeal 
to  simplicity  which  can  no  longer  be  easily  sustained.  Since 
we  advocate  revisionism,  we  are  logically  committed  to  the 
position that these calculations are insufficiently accurate
in their treatment of the effects of second nearest neighbor
atoms. It is an important challenge to the revisionist
community  to  provide  cogent  reasons  why  this  should  be  so. 
While  aware  of  this  problem,  we  frankly  admit  lacking  the 
necessary  theoretical  expertise  to  address  it.  In  any  case, 
whether  one  presumptively  accepts  their  results  as  true  or 
not, we believe it is clear at the present time that these sorts of
calculations  have  not  been  subjected  to  the  exhaustive  testing 
necessary  to  render  their  conclusions  unimpeachable.  There 
are  thus  insufficient  grounds  to  warrant  the  wholesale  relega¬ 
tion  of  the  cluster  experiments  to  the  status  of  certainly 
wrong  results  unworthy  of  serious  consideration.  The  chal¬ 
lenge  posed  by  the  cluster  interface  spectra  must  be  met,  and 
not merely dismissed, by the supporters of the orthodox doctrine.
The calculations by themselves, however, do not shed
any  obvious  light  on  the  nature  of  any  alternative  explanation 
of  the  cluster  spectra. 

Unfortunately  the  theoretical  and  experimental  ap¬ 
proaches  have  thus  far  tended  to  talk  past  one  another  some¬ 
what.  Where  one  concludes  the  balance  of  evidence  lies  at 
this  time  depends  upon  whether  one  believes  the  calculations 
or  the  cluster  experiments  are  more  trustworthy.  The  one 
concrete  suggestion  we  can  make  to  the  theorists  is  that  they 
employ their molecular dynamics approach to investigate the
stability of the clusters on the surface. Perhaps such an approach
could reveal a type of partial decomposition which
has  not  occurred  to  us,  or  add  weight  to  our  assertion  that 
they  remain  intact.  For  our  part,  we  accept  the  challenge  of 
bringing  additional  experimental  evidence  to  bear  on  this 
fascinating problem.

A chemical bonding model for defects and defect precursors at Si-SiO2 interfaces is presented.
Bonding geometries of neutral and charged Si-, O-, N-, and H-atoms are described in terms of the
valence shell electron pair repulsion model, thereby identifying a qualitative distinction between
charged (a) Si-atoms, and charged (b) O-, N-, or H-atoms. Threefold coordinated Si-atoms are
stable in three charge states (+, 0, and -), and can be active as trapping and/or recombination
centers depending on their charge state. In contrast, there is a direct relationship between charge
state and bonding coordination for O-, N-, and H-atoms, and as such their roles in defect generation
processes are qualitatively different. Reaction mechanisms based on these differences in local
bonding are (i) discussed for the generation of defects comprised of threefold coordinated Si-atoms,
and/or positively charged H-, O-, and N-atoms, with coordinations of two, three, and four,
respectively, and (ii) compared with experiments. © 1996 American Vacuum Society.


I.  INTRODUCTION 

As device dimensions shrink into the deep submicron regime for
ultra large scale integrated (ULSI) circuit applications, proportional
decreases in gate dielectric thicknesses of field effect
transistors (FETs) to about 3-3.5 nm are required. In
this regime of ultra thin dielectrics, interfacial defects and
defect precursors play dominant roles in device performance
and reliability. For example, there is a large body of experimental
data for thin oxide dielectrics which has demonstrated
that (i) H-atom transport from polycrystalline-Si gate
electrodes to the Si-SiO2 interface under stress-bias conditions
generates interfacial traps and positively charged
defects, and (ii) N-atom incorporation at Si-SiO2
interfaces improves device reliability, while interfacial
N-H bonding leads to significantly higher defect densities
(each of these comparisons is with respect to non-nitrided
interfaces).

This article presents a model based on bonding geometries
of neutral and charged Si-, O-, N-, and H-atoms at
Si-SiO2 interfaces that is derived from an extension of the
valence shell electron pair repulsion (VSEPR) model to
amorphous solids and their interfaces, providing a framework
for understanding the experimental observations. Section II
reviews bonding arrangements of neutral and charged Si-,
O-, N-, and H-atoms in terms of the VSEPR model. Section
III discusses electronically active defects at Si-SiO2 interfaces.
Section IV presents reaction equations for defect generation
at Si-SiO2 interfaces in terms of the chemical bonding
geometries of the neutral and charged Si-, O-, N-, and
H-atoms discussed above. Section V summarizes the main
points of the paper.

II. CHEMICAL BONDING OF Si-, O-, N-, AND H-ATOMS

The VSEPR model of chemical bonding describes the
bonding geometries of inorganic molecules. The model is
based on a set of empirical rules that describe the way that
atoms combine to form molecules, and has been successful
in predicting the stereo-chemistry of different classes of inorganic
molecules. It is readily extendable to molecular ions
through a consideration of isoelectronic analogs; e.g., a positively
charged O-atom has the same number of electrons available
for bonding as a neutral N-atom, and this is used as a
basis for defining the local bonding geometry of the charged
O-atom with respect to its nearest-neighbor atoms in forming
molecular ions.

Following  the  language  of  Ref.  6,  the  VSEPR  rules  are: 

(1)  The  preferred  arrangement  of  a  given  number  of  electron 
pairs  in  the  valence  shell  of  an  atom  is  that  which  maxi¬ 
mizes  their  distance  apart. 

(2)  A  non-bonding  pair  of  electrons  takes  up  more  room  on 
the  surface  of  an  atom  than  a  bonding  pair. 

(3)  The  size  of  a  bonding  electron  pair  decreases  with  in¬ 
creasing  electronegativity  of  the  ligand. 

(4) The two electron pairs of a double bond (or the three
electron pairs of a triple bond) take up more room than
does the one electron pair of a single bond.
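
As a rough illustration of how Rules (1) and (2) translate into a local geometry, the following Python sketch maps a count of bonding and non-bonding electron pairs to the arrangement that keeps the pairs farthest apart. The lookup tables and the function name `vsepr_geometry` are assumptions introduced here for illustration; they are not taken from the paper or from Ref. 6, and the sketch ignores the bond-angle distortions described by Rules (2) to (4).

```python
# Illustrative sketch only: the tables and the function name are assumptions
# introduced here, not part of the original paper.

# Rule (1): arrangement that maximizes the separation of n electron pairs.
PAIR_ARRANGEMENT = {2: "linear", 3: "trigonal planar", 4: "tetrahedral"}

# Shape traced by the bonded neighbors when four pairs surround the atom,
# keyed by the number of non-bonding pairs.
SHAPE_WITH_FOUR_PAIRS = {0: "tetrahedral", 1: "pyramidal", 2: "bent"}


def vsepr_geometry(bonding_pairs, nonbonding_pairs):
    """Return (electron-pair arrangement, shape of the bonded neighbors)."""
    total = bonding_pairs + nonbonding_pairs
    if total not in PAIR_ARRANGEMENT:
        raise ValueError(f"no entry for {total} electron pairs")
    arrangement = PAIR_ARRANGEMENT[total]
    if nonbonding_pairs == 0:
        # With no non-bonding pairs the neighbors sit on the arrangement itself.
        shape = arrangement
    elif total == 4 and nonbonding_pairs in SHAPE_WITH_FOUR_PAIRS:
        shape = SHAPE_WITH_FOUR_PAIRS[nonbonding_pairs]
    else:
        shape = "not tabulated in this sketch"
    return arrangement, shape


if __name__ == "__main__":
    print("four bonding pairs:", vsepr_geometry(4, 0))
    print("two bonding + two non-bonding pairs:", vsepr_geometry(2, 2))
    print("three bonding + one non-bonding pair:", vsepr_geometry(3, 1))
```

The sketch treats only the idealized arrangement; the unequal space taken up by non-bonding pairs (Rule 2) appears in a real structure as a distortion of the tabulated shape.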

The VSEPR rules are not generally applicable to crystalline
solids due to long range interactions and constraints imposed
by periodicity. However, based on a large experimental
data base for good glass-forming materials like SiO2,
As2S3, GeS2, etc., these rules also provide a qualitative
guideline for describing local bonding environments in non-crystalline
solids. In particular, the local bonding arrangements
of the constituent elements in melt-quenched glasses
such as SiO2, GeO2, As2S3, As2Se3, GeS2, and GeSe2, and
their alloys, as determined from infrared and Raman spectroscopies,
and/or direct structural probes such as x-ray diffraction
and EXAFS, are consistent with the bonding arrangements
predicted from this application of the VSEPR
rules to network structures. The idealized structures of these
glasses are chemically ordered random covalent networks,
and the disorder that distinguishes these materials from crystalline
solids is mostly at the bond-angle and dihedral-angle
level, and not at the coordination geometry level. For example,
bond- and dihedral-angles are fixed and repeated periodically
in a crystal lattice, whereas in network glasses
there are statistical distributions of these angles centered
around preferred angles predicted from application of
VSEPR rules. The concept of a chemically ordered random
covalent network can also be applied to amorphous materials
prepared by thin film deposition, including silicon dioxide,
SiO2, silicon nitride, Si3N4, and alloys of SiO2 and Si3N4
(silicon oxynitrides). The materials system of interest
here is the interface between a crystalline silicon substrate
and a non-crystalline SiO2 dielectric, and as such the VSEPR
bonding rules are assumed to apply on the SiO2 side of the
interface; i.e., at the metallurgical interface and into the
SiO2 layer. In addition to the bonding of Si- and O-atoms at
such an interface, the local bonding arrangements of H- and
N-atoms must also be addressed to maintain contact with
device processing technologies.

The bonding geometries of neutral and charged Si-, O-,
N-, and H-atoms discussed in Sections II A and II B are restricted
to environments that are expected to occur at (i) as-grown
or as-deposited interfaces, and (ii) interfaces subjected
to conventional levels of electrical-stress bias. Qualitatively
different defects can be produced at interfaces subjected to
intense x-ray, γ-ray, and particle fluxes which dislodge atoms
from their normal bonding positions. Examples include
the O-atom hole center in SiO2 glasses, and two-fold coordinated
neutral N-atoms in Si3N4 thin films.

A. Bonding geometries of neutral Si-, O-, N-, and H-atoms

The bonding geometries of neutral Si-, O-, N-, and
H-atoms in σ-bonds in thin film oxide and nitride dielectrics
are consistent with VSEPR Rules (1) and (2) (see Fig. 1).
Consider first SiO2: each Si-atom makes four bonds with its
four oxygen neighbors, which are arranged in a tetrahedral
geometry to comply with Rule (1). Each oxygen atom has
two silicon neighbors and two non-bonding pairs, which are
arranged in a distorted tetrahedral geometry, also as required
by Rule (1). This distortion derives from (i) the two
O-atom non-bonding pairs taking up more space than the two
O-atom bonding pairs, and (ii) a back-donation of electrons
from the non-bonding oxygen orbitals into anti-bonding orbitals
on the Si-atoms, the so-called pπ-dπ bonding
interaction. Studies of SiO2 have indicated a large distribution
of Si-O-Si bond angles (±30°) around an average
bond angle of ~145°, consistent with the proposed extension
of VSEPR Rules (1), (2), and (4). The average angle of 145°
is indicative of some degree of pπ-dπ bonding, which is also
evident from the magnitude of the average Si-O bond
length, which is about 0.15 Å less than anticipated on the
basis of the sum of the appropriate Si- and O-atom bonding
radii.
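
To put the quoted Si-O-Si angle in context, the short Python sketch below compares the average angle of about 145° given in the text with the ideal tetrahedral angle, arccos(-1/3), that four equivalent electron pairs would adopt under Rule (1). The variable and function names are illustrative assumptions; only the 145° average and the ±30° spread come from the text.

```python
import math


def ideal_tetrahedral_angle_deg():
    """Angle between two vertices of a regular tetrahedron, seen from its center."""
    return math.degrees(math.acos(-1.0 / 3.0))


# Values quoted in the text for the Si-O-Si bridge in SiO2.
AVERAGE_SI_O_SI_DEG = 145.0
SPREAD_DEG = 30.0

ideal = ideal_tetrahedral_angle_deg()
opening = AVERAGE_SI_O_SI_DEG - ideal  # how far the bridge opens beyond tetrahedral

if __name__ == "__main__":
    print(f"ideal tetrahedral angle: {ideal:.2f} deg")
    print(f"opening of the average Si-O-Si angle: {opening:.2f} deg")
    print(f"quoted range: {AVERAGE_SI_O_SI_DEG - SPREAD_DEG:.0f} to "
          f"{AVERAGE_SI_O_SI_DEG + SPREAD_DEG:.0f} deg")
```

The opening of roughly 35° beyond the tetrahedral value is the size of the distortion that the text attributes to non-bonding pair repulsion and pπ-dπ back-donation.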

The bonding in Si3N4 parallels that of SiO2, with
π-bonding interactions also being evident in a shorter (by 0.15-0.20 Å)
than expected Si-N bond length. The bonding geometry
of Si-atoms is tetrahedral in accord with the VSEPR
model, whereas the bonding geometry of the four orbitals on
the N-atoms is markedly non-tetrahedral, indicating significant
π-bonding, and requiring an extension of VSEPR rules.
In particular, back donation of electron density from the non-bonding
p-orbital of nitrogen into anti-bonding orbitals of
silicon with a sd^ character is the pπ-dπ bonding interaction
that forces the N-atom into a planar bonding arrangement
with three σ-bonds to Si-neighbors directed at vertices
of an equilateral triangle, and the non-bonding p-orbital perpendicular
to that bonding plane. This planar bonding geometry
for nitrogen and its three silicon neighbors has been
verified by Raman scattering in non-crystalline Si3N4 (Ref.
13), and also occurs in both crystalline forms of Si3N4. The
formation of relatively strong π-bonds between N-atoms and
three Si-atom neighbors is quantitatively different from
the bonding of O-atoms to two Si-atom neighbors, and will
be shown to be a significant factor in accounting for the improved
device reliability obtained by N-atom incorporation at Si-SiO2
interfaces.

[Figure 1 panel title: Bonding of neutral Si-, N-, O- and H-atoms. Labeled groups: H-Si(3), H-N-Si(2), H-O-Si(1). Legend: Si, N, O, H.]

Fig. 1. Bonding arrangements of neutral Si-, O-, N-, and H-atoms in
Si-SiO2 heterostructures.

Hydrogen atoms can also be incorporated into amorphous
SiO2 and Si3N4 networks, bonding to both atomic constituents
in arrangements consistent with their neutral atom valence
of one. The bonding arrangements deduced from
infrared (ir) studies are consistent with the VSEPR model:
Si-H, O-H, N-H, and N-H2 (Ref. 14) (see Fig. 2). One
aspect of the incorporation of H-atoms in O-H and N-H
groups in SiO2 and Si3N4, respectively, is H-bond
formation, which shows up in ir line-shapes in the bond-stretching
modes. H-bonding interactions fall outside of the
VSEPR model, but represent an example of three-center
bonding for neutral atoms. This serves to provide a framework
for understanding other three-center bonds involving
positively charged H-atoms (see Sec. II B), e.g.,
$\equiv\mathrm{Si{-}H^{+}{-}Si}\equiv$. The "$\equiv\mathrm{Si{-}H^{+}{-}Si}\equiv$" notation
indicates that the Si-atom is bonded to the H$^{+}$-atom as well as
three other atoms of the network structure.

[Figure 2 panel title: Bonding of charged Si-, N-, O- and H-atoms.]

Fig. 2. Bonding arrangements of charged Si-, O-, N-, and H-atoms in
Si-SiO2 heterostructures.

B. Bonding geometries of charged Si-, O-, N-, and H-atoms

The bonding geometries of charged O- and N-atoms can
be addressed in the context of isoelectronic relationships: for
example, O$^{+}$ is isoelectronic with N$^{0}$; i.e., both atomic species
have five electrons available for bonding. In a parallel
manner, N$^{+}$ is isoelectronic with C$^{0}$, with both atoms having
four electrons available for bonding. Depending upon the competition
between repulsive interactions between non-bonding
and bonding orbitals, and π-bonding interactions, the local
bonding of O$^{+}$ to three neighbors will be a distorted tetrahedron,
or a planar geometry for three bonds with the fourth
bonding direction occupied by a non-bonding pair perpendicular
to the bonding plane. The local bonding of N$^{+}$ to four
atoms will be tetrahedral; however, if the four neighbors to
the N-atom are markedly different (e.g., 2 Si- and 2
H-atoms), it will be a distorted tetrahedron. Threefold coordinated,
positively charged O-groups are designated as oxonium,
and fourfold coordinated, positively charged N-groups
are designated as ammonium. H$^{+}$ cannot be discussed in
terms of isoelectronic species simply because the H-atom
contains only one electron, and after it is removed to form
H$^{+}$ there are no electrons present to mimic the electronic
structure of another atom whose atomic number is reduced
by one. However, H$^{+}$ can bond to two Si-neighbors in a two-atom,
three-center bonding arrangement that parallels the
three-center bonds between B- and the bridging H-atoms in
B2H6; hence the isoelectronic relationship is between
two bonding groups, Si-H$^{+}$-Si and B-H-B, rather than
between two atoms.

Si$^{+}$ has three electrons available for bonding in a pyramidal
geometry to three neighbors. However, in contrast to H-,
O-, and N-atoms, where coordination and charge state are
linked, a threefold coordinated Si-atom can support three
charge states (+, 0, or -) according to the electron occupancy
(0, 1, or 2, respectively) of the so-called lone-pair
orbital, and therefore will be treated separately after the
discussion of negative charge states for H-, O-, and N-atoms.

The bonding geometries of H$^{-}$, O$^{-}$, and N$^{-}$ can also be
described in terms of isoelectronic relationships. H$^{-}$ is isoelectronic
to the rare gas atom He$^{0}$, and does not form a
stable bond with Si, but may play a role as an intermediate
and transient excited state in solid state reactions involving
muon capture. In contrast, O$^{-}$ and N$^{-}$ can form stable
bonding arrangements with coordinations reduced by one
with respect to the neutral bonding. O$^{-}$ is isoelectronic with
F$^{0}$, and can be singly bonded with the six remaining electrons
occupying three non-bonding pair orbitals. If O$^{-}$ coordinates
in this way, the bond will be weak and highly reactive
due to the fact that the six electrons contained in three
non-bonding orbitals produce a significant destabilizing repulsive
effect. Neutral Si- and negatively charged O-atoms
are not known to form a stable terminal bond, either in molecules
or in the solid state. The bonding of N$^{-}$ is expected to
parallel that of O$^{0}$, and be twofold coordinated with two
non-bonding pairs. This type of bonding has not as yet been
identified in combination with Si-atom neighbors.

Threefold coordinated group IV atoms such as silicon
must be treated separately. Because of their neutral valence
band occupation of four, they are stable in three charge states
with the same number of nearest-neighbor bonds. This is
because the two charged atoms, Si$^{+}$ and Si$^{-}$, are isoelectronic,
respectively, with the group III and group V elements,
B and P, and therefore they display local bonding
geometries similar to those of B- and P-atoms. Finally, the
threefold coordinated neutral Si$^{0}$ center is the singly-occupied
Si-dangling bond which has received much attention
as a defect in amorphous silicon and amorphous silicon
alloys, and as an E′-center in SiO2 and Si3N4 glasses and
thin films.
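
The isoelectronic bookkeeping used in this section can be summarized in a few lines of Python. The sketch below counts the electrons available for bonding as the neutral valence minus the charge, and then assigns a coordination of min(n, 8 - n). That closed-form rule is an assumption introduced here because it reproduces the coordinations stated in the text for Si, N, and O; it is not a formula given in the paper. H is deliberately excluded, since the text treats H$^{+}$ and H$^{-}$ as special cases.

```python
# Illustrative sketch only. The min(n, 8 - n) rule is an assumption chosen
# to reproduce the coordinations quoted in the text; it is not from the paper.

NEUTRAL_VALENCE = {"Si": 4, "N": 5, "O": 6}

# Periodic-table group of the neutral atom with the same electron count.
GROUP_BY_ELECTRONS = {3: "III", 4: "IV", 5: "V", 6: "VI", 7: "VII"}


def bonding_electrons(element, charge):
    """Electrons available for bonding: neutral valence minus the charge."""
    return NEUTRAL_VALENCE[element] - charge


def isoelectronic_group(element, charge):
    """Group of the neutral atom that is isoelectronic with the charged atom."""
    return GROUP_BY_ELECTRONS[bonding_electrons(element, charge)]


def expected_coordination(element, charge):
    """Number of sigma-bonded neighbors under the assumed min(n, 8 - n) rule."""
    n = bonding_electrons(element, charge)
    return min(n, 8 - n)


if __name__ == "__main__":
    for element, charge in [("O", +1), ("N", +1), ("O", -1), ("N", -1),
                            ("Si", +1), ("Si", -1)]:
        print(element, charge,
              "group", isoelectronic_group(element, charge),
              "coordination", expected_coordination(element, charge))
```

The rule does not describe the threefold coordinated neutral Si-atom, which is a dangling-bond defect rather than a preferred bonding state, so that case is left out of the sketch.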

The application of the VSEPR model to the Si-, N-, O-, and
H-atoms thereby identifies possible local bonding geometries
for these atoms in the body of the oxide, and at the
Si-SiO2 interface. In the discussions that follow the focus is
on formation of two types of defects which are of importance
in NMOS devices, interfacial traps and fixed positive charge.
A parallel set of defect interactions occurs in PMOS structures,
but will not be addressed specifically in this paper.

III. DEFECT BONDING AT Si-SiO2 INTERFACES
AND Si-SiO2 HETEROSTRUCTURES

Because of its importance in Si-based microelectronics,
the Si-SiO2 materials system has received much attention.
For details, the reader is referred to papers in four topical
conferences held between 1976 and 1992. Defect studies
have proceeded in two different ways, addressing
(i) defects in capacitor and FET structures, before
and after current and voltage stress-biasing, and (ii) defects
induced in SiO2 thin films and bulk glasses, or in
Si-SiO2 heterostructures, by exposure to intense UV
radiation, x-rays, γ-rays, and/or charged or neutral particles
(e.g., electrons and neutrons, respectively). The defect concentrations
of interest in electronic devices are typically in
the  range  from  about  10^^  cm“^  to  10^^  cm“^,  whereas  de¬ 
fect  levels  induced  by  radiation,  and/or  charged  and  neutral 
particles  are  usually  substantially  higher,  up  to  10^^  cm~^. 
After  irradiation,  defect  levels  are  high  enough  so  that  they 
can  be  studied  by  spectroscopic  techniques,  including  optical 
absorption  and  ESR,  and  as  noted  above,  there  may  be  quali¬ 
tative  differences  in  defect  bonding  geometries  in  such  ma¬ 
terials  so  that  they  are  not  necessarily  described  in  terms  of 
the  VSEPR  model. 

Recently, the focus has shifted away from bulk oxide defects
to interfacial and near-interfacial defects since these are
the ones that determine performance and reliability in devices
with ultra thin gate dielectrics. Two types of interfacial
defects dominate in n-channel, or NMOS, structures: (i) interface
trapping states, denoted as $D_{it}$, and (ii) fixed positive
charge, denoted as $Q_{ss}$.

The remainder of this paper focuses on these defects in
the context of VSEPR bonding descriptions. The distinguishing
aspect of this approach is that it is based on intrinsic differences
between bonding defects on group V and VI atoms, such as
N and O, respectively, and the group IV atom Si. For example,
within the framework of the VSEPR model, different
charge states of N- and O-atoms are stable for different numbers
of bonding neighbors, whilst Si-atoms bonded to three
neighbors are stable in three different charge states. Bonding
configurations of N- and O-atoms with different charge states
can contribute to shifts in flatband voltages of capacitors,
threshold voltages of FETs, and charge carrier scattering in
channel layers of FETs. Changing the charge state of these
centers, either in a defect creation or neutralization process,
requires permanent trapping (effectively recombination) of a
charged carrier, and a change in bonding coordination that
can be accomplished by acquiring or losing a H-atom bonding
partner. On the other hand, threefold coordinated Si-atoms
can act as trapping sites for charged carriers of either
sign, and this does not require a change in bonding coordination.

IV. DEFECT REACTIONS AT Si-SiO2 INTERFACES

Representative defect-generation reactions that can take
place at, or in the immediate vicinity of, Si-SiO2 interfaces
are described by the nine equations presented below. The
changes of coordination of O-, N-, and H-atoms with
changes in charge state in these reactions are consistent with
the VSEPR model. In order for these reactions to take place,
the energy of the resulting defect state must fall within the
forbidden bandgap of SiO2; however, in several instances,
e.g., examples (i), (iii), and (iv), there are experimental data
suggesting the reactions do indeed occur. Nine representative
reactions are:

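(i) $\equiv\mathrm{Si{-}H} + \mathrm{H} \rightarrow\ \equiv\mathrm{Si}^{0} + \mathrm{H_2}$: generation of $D_{it}$ centers
($\equiv\mathrm{Si}^{0}$ is a Si dangling bond);

(ii) $\equiv\mathrm{Si{-}Si}\equiv + \mathrm{H} + h^{+} \rightarrow\ \equiv\mathrm{Si{-}H^{+}{-}Si}\equiv$: generation of
$Q_{ss}(\mathrm{H}_2^{+})$;

(iii) $\equiv\mathrm{Si{-}O{-}Si}\equiv + \mathrm{H} + h^{+} \rightarrow \mathrm{H{-}O^{+}{-}(Si{\equiv})_2}$: generation
of $Q_{ss}(\mathrm{O}_3^{+})$;

(iv) $\equiv\mathrm{Si{-}H} + \equiv\mathrm{Si{-}O{-}Si}\equiv + h^{+} \rightarrow\ \equiv\mathrm{Si}^{0} + \mathrm{H{-}O^{+}{-}(Si{\equiv})_2}$:
generation of $D_{it}$ and $Q_{ss}(\mathrm{O}_3^{+})$;

(v) $\mathrm{H{-}N{-}(Si{\equiv})_2} + \mathrm{H} + h^{+} \rightarrow \mathrm{H_2{-}N^{+}{-}(Si{\equiv})_2}$: generation
of $Q_{ss}(\mathrm{N}_4^{+})$;

(vi) $\equiv\mathrm{Si{-}H} + \mathrm{H{-}N{-}(Si{\equiv})_2} + h^{+} \rightarrow\ \equiv\mathrm{Si}^{0} + \mathrm{H_2{-}N^{+}{-}(Si{\equiv})_2}$:
generation of $D_{it}$ and $Q_{ss}(\mathrm{N}_4^{+})$;

(vii) $\mathrm{H} + \mathrm{N{-}(Si{\equiv})_3} + h^{+} \nrightarrow \mathrm{H{-}N^{+}{-}(Si{\equiv})_3}$:
$\mathrm{N{-}(Si{\equiv})_3}$ does not act as a defect precursor;

(viii) $\mathrm{H} + \mathrm{H{-}O{-}Si}\equiv + h^{+} \rightarrow \mathrm{H_2{-}O^{+}{-}Si}\equiv$: generation
of $Q_{ss}(\mathrm{O}_3^{+})$; and

(ix) $\mathrm{H{-}O{-}Si}\equiv + \mathrm{H{-}Si}\equiv + h^{+} \rightarrow \mathrm{H_2{-}O^{+}{-}Si}\equiv +\ {}^{0}\mathrm{Si}\equiv$:
generation of $D_{it}$ and $Q_{ss}(\mathrm{O}_3^{+})$.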

The notations $\mathrm{O}_3^{+}$ and $\mathrm{N}_4^{+}$ are shorthand descriptions for
defect centers involving oxonium [e.g., $\mathrm{H{-}O^{+}{-}(Si{\equiv})_2}$] and
ammonium [e.g., $\mathrm{H_2{-}N^{+}{-}(Si{\equiv})_2}$] bonding arrangements,
respectively. A defect reaction in which the coordination of
an atom is changed by virtue of the atom going from a neutral
to a charged state requires a network relaxation to
accommodate the change in bonding geometry that accompanies
that change in coordination. This can be a significant
factor in either driving the defect reaction to the right as in
(v) and (vi), or inhibiting the reaction in (vii) as will be
discussed later on.
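
A simple consistency check on reactions of this kind is that atoms and charge must balance on the two sides, with the trapped hole supplying the positive charge of the oxonium or ammonium center. The Python sketch below is one way to carry out that check; the data layout, the species constants, and the function names are assumptions made for illustration, and only the atoms written explicitly in each bonding group are counted (the three back-bonds of each $\equiv$Si are omitted).

```python
from collections import Counter

# Illustrative sketch only: each species is (atom counts, net charge).
# Names and layout are assumptions introduced here, not from the paper.
HOLE = ({}, +1)
ELECTRON = ({}, -1)
H_ATOM = ({"H": 1}, 0)
H2 = ({"H": 2}, 0)
SI_H = ({"Si": 1, "H": 1}, 0)                  # Si-H bonding site
SI_DANGLING = ({"Si": 1}, 0)                   # neutral Si dangling bond
SI_O_SI = ({"Si": 2, "O": 1}, 0)               # bridging oxygen
OXONIUM = ({"H": 1, "O": 1, "Si": 2}, +1)      # H-O(+)-(Si)2 center


def totals(species):
    """Sum the atom counts and the charge over one side of a reaction."""
    atoms = Counter()
    charge = 0
    for composition, q in species:
        atoms.update(composition)
        charge += q
    return atoms, charge


def is_balanced(reactants, products):
    """True when both sides carry the same atoms and the same net charge."""
    return totals(reactants) == totals(products)


if __name__ == "__main__":
    # Reaction (i): Si-H + H -> Si dangling bond + H2
    print("(i)  ", is_balanced([SI_H, H_ATOM], [SI_DANGLING, H2]))
    # Reaction (iii): Si-O-Si + H + h+ -> oxonium center
    print("(iii)", is_balanced([SI_O_SI, H_ATOM, HOLE], [OXONIUM]))
    # Without the trapped hole the charge does not balance.
    print("(iii) without hole", is_balanced([SI_O_SI, H_ATOM], [OXONIUM]))
```

The same check applied to the reverse processes, with an electron in place of the hole, confirms that electron trapping returns a charged center to a neutral bonding arrangement.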

Figures 3(a) and 3(b) indicate schematically the defect
generation reactions in (iii) and (v). The first reaction, (i)
$\equiv\mathrm{Si{-}H} + \mathrm{H} \rightarrow\ \equiv\mathrm{Si}^{0} + \mathrm{H_2}$, demonstrates that H-atoms transported
to the Si-SiO2 interface can extract H-atoms from
Si-H bonding sites, thereby creating a dangling bond that
acts as an interfacial trap, $D_{it}$. The driving force for this reaction
is the large difference between the bond energies of
Si-H, 3.4 eV, and H-H, 4.5 eV. The Si-H bond that is
attacked can be at the interface, having at least one Si-atom
neighbor, or in the oxide, having three O-atom neighbors. The
electrical activity of the dangling bond created by the
H-atom abstraction reaction depends on its position within
the forbidden gap of Si and/or SiO2. The energy of the neutral
dangling bond in hydrogenated amorphous silicon relative
to the conduction band edge has been shown to be a function
of the chemistry of its nearest neighbors, moving toward
the conduction band edge as the electronegativity sum of the
three nearest neighbors increases. Since trapping kinetics
depend on trap depth, the release times of Si-dangling bond
traps will then vary significantly with their distance from the
Si-SiO2 interface.
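
The driving force quoted for reaction (i) follows from a one-line bond-energy balance: one Si-H bond (3.4 eV) is broken and one H-H bond (4.5 eV) is formed. The Python sketch below carries out that arithmetic. The function name is an illustrative assumption, and the estimate neglects network relaxation and any dependence of the Si-H bond energy on the local environment.

```python
# Bond energies quoted in the text, in eV.
SI_H_BOND_EV = 3.4
H_H_BOND_EV = 4.5


def reaction_energy_ev(bonds_broken, bonds_formed):
    """Energy change: energy spent breaking bonds minus energy gained forming them."""
    return sum(bonds_broken) - sum(bonds_formed)


# Reaction (i): break one Si-H bond, form one H-H bond.
delta_e = reaction_energy_ev([SI_H_BOND_EV], [H_H_BOND_EV])

if __name__ == "__main__":
    # A negative value means the reaction releases energy.
    print(f"reaction (i) energy change: {delta_e:.1f} eV")
```

On this simple estimate the abstraction reaction is exothermic by about 1.1 eV.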

The second reaction, (ii) $\equiv\mathrm{Si{-}Si}\equiv + \mathrm{H} + h^{+} \rightarrow\ \equiv\mathrm{Si{-}H^{+}{-}Si}\equiv$,
is newly proposed in this paper; note that
$\equiv\mathrm{Si{-}H^{+}{-}Si}\equiv$ is isoelectronic to the B-H-B bridge in
B2H6 molecules. Once created, the $\equiv\mathrm{Si{-}H^{+}{-}Si}\equiv$ defect
can be active in two ways: (i) it can contribute to fixed
charge and thereby shift flatband and threshold voltages in
devices, or (ii) it can serve as a recombination center for
electrons, creating a Si-atom dangling bond in the process of
trapping an electron as in Eq. (1):

$$\equiv\mathrm{Si{-}H^{+}{-}Si}\equiv + e^{-} \rightarrow\ \equiv\mathrm{Si{-}H} + \equiv\mathrm{Si}^{0}. \qquad (1)$$

[Figure 3 panel labels: Generation of charged bonding defects by hole trapping and hydrogen atom capture; oxonium bonding; Changes in bonding geometry required for creation of ammonium defect centers.]

Fig. 3. Schematic representation of defect generation reactions for (a) oxonium
centers ($\mathrm{O}_3^{+}$), and (b) ammonium centers ($\mathrm{N}_4^{+}$).

The third reaction, (iii) $\equiv\mathrm{Si{-}O{-}Si}\equiv + \mathrm{H} + h^{+} \rightarrow \mathrm{H{-}O^{+}{-}(Si{\equiv})_2}$,
is similar to (ii) in the sense that it (i) involves
the creation of a positively charged defect [see Fig. 3(a)], and
(ii) requires concurrent H-atom attachment and hole trapping
for creation. However, in this case deactivation of the
$\mathrm{H{-}O^{+}{-}(Si{\equiv})_2}$ center by electron trapping returns the
Si-O-Si bond group to its original neutral state, freeing a
H-atom without the creation of a Si-atom dangling bond, as
in Eq. (2):

$$\mathrm{H{-}O^{+}{-}(Si{\equiv})_2} + e^{-} \rightarrow\ \equiv\mathrm{Si{-}O{-}Si}\equiv + \mathrm{H}. \qquad (2)$$

The fourth reaction has been discussed in Ref. 26; if the
defect pair are near-neighbors, then trapping of an electron
will return the local bonding to its original state, as in Eq.
(3):

$$\equiv\mathrm{Si}^{0} + \mathrm{H{-}O^{+}{-}(Si{\equiv})_2} + e^{-} \rightarrow\ \equiv\mathrm{Si{-}H} + \equiv\mathrm{Si{-}O{-}Si}\equiv. \qquad (3)$$

In this case the H-atom motion will be displacive rather than
diffusive, as would be required if the two bonding groups
were more distant network neighbors. Reactions (v) [see Fig.
3(b)] and (vi) are analogs of reactions (iii) and
(iv), respectively, in the sense that the neutral N-H group is isoelectronic
with a neutral O-atom.


[Figure 4 panel labels: (a) planar bonding, neutral 2Si-N-H; H-atom rotates out of bonding plane, barrier ~ kT; distorted tetrahedron, positively charged 2Si-N-2H. (b) planar bonding, neutral 3Si-N; barrier ~ 2 eV; distorted tetrahedron.]

Fig. 4. Differences in defect precursor reactions between the bonding of
N-atoms to (a) one H- and two Si-atoms, and (b) three Si-atoms.


There  is  another  point  regarding  increases  in  coordination 
that  take  place  when  oxonium  and  ammonium  centers  are 
created.  Following  VSEPR  rules,  the  geometry  of  an  oxo¬ 
nium  center  is  generally  pyramidal  so  that  relatively  small 
changes  in  bond  angles  are  required  when  the  coordination  is 
increased  from  two  to  three.  These  changes  will  not  lead  to 
significant  energy  barriers  as  estimated  from  empirical  bond¬ 
bending  force  constants.  Similarly,  for  creation  of  the  ammo¬ 
nium  center,  the  H-atom  of  the  originally  bonded  N — H 
group  must  rotate  out  of  the  bonding  plane  and  in  combina¬ 
tion  with  hole  trapping  and  attachment  of  a  second  H-atom 
participate  in  the  creation  of  a  distorted  tetrahedral  arrange¬ 
ment  (see  Fig.  4).  The  energy  barrier  for  this  bond  angle 
change  is  at  most  0.02  to  0.05  eV,  as  estimated  from  empiri¬ 
cally  determined  bond-bending  force  constants  and  will  not 
represent  a  significant  limitation  to  room  temperature  reac¬ 
tion  kinetics.  Reverse  reactions  for  (iv)  and  (v)  are  given  in 
Eqs.  (4a)  and  (4b),  respectively: 

$$\mathrm{H_2{-}N^{+}{-}(Si\equiv)_2} + e^{-} \rightarrow \mathrm{H{-}N{-}(Si\equiv)_2} + \mathrm{H}, \qquad (4a)$$

and 

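$$\equiv\mathrm{Si}^{\bullet} + \mathrm{H_2{-}N^{+}{-}(Si\equiv)_2} + e^{-} \rightarrow \equiv\mathrm{Si{-}H} + \mathrm{H{-}N{-}(Si\equiv)_2}. \qquad (4b)$$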

Figure 4 indicates a significant difference between the defect
precursor behavior of N-atoms bonded only to Si-atoms,
and of N-atoms bonded to both Si- and H-atoms, as in an arrangement that includes
an N-H bond. What is illustrated in Fig. 4 is the basis for
the comment in reaction (vii) that $\mathrm{N{-}(Si\equiv)_3}$ does not act as a
defect precursor. The bonding of a N-atom to three Si-atoms
is unique in the stereochemistry of N-atoms bonded to three
neighbors of the same chemical species. With the exception of a
similar bonding between one N-atom and three Ge-atoms,
the bonding arrangements of N-atoms bonded to three similar
atoms or groups are pyramidal, e.g., as in ammonia,
NH3, and N-(CH3)3. However, for symmetric bonding to
three Si- (or Ge-) atoms, as in Si3N4 or silane-derived
molecules such as N-(SiH3)3, the bonding arrangement is
planar. As noted above, these planar arrangements for Si (and
Ge) are stabilized by p$\pi$-d$\pi$ bonding interactions.
Experiments have shown that, unlike the incorporation of N-H
bonding groups, the incorporation of N-atoms bonded only
to Si-atoms at Si-SiO2 interfaces (i) does not degrade initial
device performance, but instead (ii) improves reliability with
respect to un-nitrided interfaces. This aspect of defect/defect
precursor behavior is also supported by ab initio calculations,
which demonstrate that there is an approximately 2 eV
barrier to the conversion of a planar N-Si3 group to a
tetrahedral ammonium center by hole trapping and H-atom
attachment.
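
The contrast between the two barrier heights quoted above (at most 0.02 to 0.05 eV for the N-H rotation, approximately 2 eV for the planar N-Si3 conversion) can be made concrete with a simple Boltzmann factor. The following is a minimal sketch, assuming a thermally activated process at 300 K and ignoring attempt frequencies; the function name and the choice of 300 K for "room temperature" are illustrative assumptions, not taken from the article.

```python
import math

K_B_EV_PER_K = 8.617333e-5  # Boltzmann constant in eV/K


def boltzmann_factor(barrier_ev, temperature_k=300.0):
    """Return exp(-E/kT), the relative probability of surmounting a barrier.

    Assumption: simple Arrhenius-type activation; prefactors are ignored.
    """
    return math.exp(-barrier_ev / (K_B_EV_PER_K * temperature_k))


# Upper end of the bond-angle barrier quoted in the text (0.05 eV).
rotation_factor = boltzmann_factor(0.05)

# Approximate barrier for converting planar N-Si3 to a tetrahedral center (2 eV).
planar_n_factor = boltzmann_factor(2.0)

print(f"0.05 eV barrier at 300 K: {rotation_factor:.3g}")
print(f"2 eV barrier at 300 K:    {planar_n_factor:.3g}")
```

Under these assumptions the small barrier is of order kT and is readily crossed, whereas the 2 eV barrier suppresses the conversion by more than thirty orders of magnitude, in line with the statement that N-atoms bonded only to Si-atoms do not act as defect precursors.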

Finally, an additional intrinsic defect precursor is the
$\mathrm{H{-}O{-}Si}\equiv$ configuration [see examples (viii) and (ix)]. The
hydroxyl  (H — O)  group  can  be  converted  to  a  charged  defect 
by  hole  trapping  as  in  examples  (viii)  and  (ix).  Neutralization 
can  occur  by  electron  capture,  as  for  example  in  Eqs.  (5)  and 
(6): 

$$\mathrm{H_2{-}O^{+}{-}Si}\equiv + e^{-} \rightarrow \mathrm{H{-}O{-}Si}\equiv + \mathrm{H}, \qquad (5)$$

and 

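$$\mathrm{H_2{-}O^{+}{-}Si}\equiv + \equiv\mathrm{Si}^{\bullet} + e^{-} \rightarrow \mathrm{H{-}O{-}Si}\equiv + \mathrm{H{-}Si}\equiv. \qquad (6)$$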

The reactions involving creation of $\mathrm{O_3^+}$ and $\mathrm{N_4^+}$ are supported
by ab initio calculations presented in
Ref. 26, which demonstrated:

(i) The stability of the positively charged oxonium and
ammonium centers, $\mathrm{O_3^+}$ and $\mathrm{N_4^+}$, respectively.

(ii) The instability of neutral oxonium and ammonium
centers, $\mathrm{O_3^0}$ and $\mathrm{N_4^0}$, respectively.

(iii) The absence of significant barriers (i.e., $E_b > kT$) to
defect generation processes in (ii) to (v).

The ab initio calculations on small molecules have shown
that the O-H and N-H bond energies for the $\mathrm{O_3^+}$ and $\mathrm{N_4^+}$ centers
are 5.7 eV and 5.0 eV, respectively, whereas the same bonds
are destabilized by energies of 3.4 and 1.5 eV, respectively,
for the neutral $\mathrm{O_3^0}$ and $\mathrm{N_4^0}$ centers (for details, see Ref. 26). In addition,
other studies (see Refs. 15 and 16) have specifically
addressed bond-centered hydrogen in crystalline Si. Based
on the calculations presented in Refs. 15 and 16, and an
empirical relationship presented in Ref. 17, the binding energy
of a Si-H$^{+}$ bond is estimated to be 2.5 ± 0.5 eV;
i.e., greater than the Si-Si bond energy of 2.34 eV, but less
than the Si-H bond energy of 3.35 eV. For example, an
upper limit can be estimated from the expression given in
Ref. 17 for the bond dissociation energy, $E$, of a Si-H$^{+}$
bond in a three center bonding arrangement,


$$E = k\,\mu_{\mathrm{H-Si}}\,(\Delta I)/R, \qquad (8)$$

where $k$ is an empirical constant equal to 2.44X10^^ C~^,
$\mu_{\mathrm{H-Si}}$ is the bond moment, $\Delta I$ is an effective ionization
energy, and $R$ is the bond length. If we assume that the bond
moment is determined by one positive charge at a distance $R$
from the Si-atom, and take $\Delta I$ = 7.8 eV, following the procedure
given in Refs. 6 and 17, then $E$ is approximately equal
to 3.0 eV, the upper estimate given above.
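
The arithmetic behind this upper estimate can be sketched as follows. If the bond moment is one elementary charge at distance R, it equals e·R, so the bond length cancels and the energy reduces to k·e·ΔI. The exponent of the empirical constant is not legible in this text; the value 2.44 × 10^18 C^-1 used below is an assumption, chosen only because it reproduces the roughly 3.0 eV quoted above. The function and constant names are hypothetical.

```python
ELEMENTARY_CHARGE_C = 1.602176634e-19  # elementary charge in coulombs

# ASSUMPTION: the exponent (10^18) and unit (C^-1) are inferred, not read
# from the source; they are the values consistent with the stated ~3.0 eV.
K_ASSUMED_PER_COULOMB = 2.44e18


def dissociation_energy_ev(k_per_coulomb, delta_i_ev,
                           charge_c=ELEMENTARY_CHARGE_C):
    """Evaluate E = k * mu * delta_I / R with mu = charge * R.

    The bond length R cancels, leaving E = k * charge * delta_I (in eV).
    """
    return k_per_coulomb * charge_c * delta_i_ev


# Effective ionization energy of 7.8 eV, as taken in the text.
energy_ev = dissociation_energy_ev(K_ASSUMED_PER_COULOMB, 7.8)
print(f"Estimated upper limit: {energy_ev:.2f} eV")
```

With the assumed constant the result is close to 3 eV, which matches the upper end of the 2.5 ± 0.5 eV range given for the Si-H$^{+}$ bond.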

V.  DISCUSSION 

This  article  has  made  several  points  relative  to  defects  and 
defect  precursors  at,  or  in  the  immediate  vicinity  of 
Si — Si02  interfaces.  The  first  is  based  on  the  application  of 
the  VSEPR  model  to  amorphous  materials  and  their  inter¬ 
faces  and  deals  with  a  distinction  between  bonding  defects  in 
which  the  number  of  nearest-neighbor  bonds  is  independent 
of  their  charge  state,  and  bonding  defects  in  which  the  num¬ 
ber  of  nearest-neighbors  changes  with  their  charge  state.  The 
threefold  coordinated  Si  atom  represents  the  first  type  of  de¬ 
fect.  It  can  exist  in  three  different  charge  states  according  to 
its electron occupancy without any change in bonding coordination.
The positions of the three trapping levels relative
to the band edges of the crystalline Si substrate, and/or the
SiO2 dielectric, will be different for different nearest- and
more-distant-neighbor bonding arrangements. For example,
for  the  neutral  centers  with  an  electron  occupancy  of  one,  the 
more  electronegative  these  neighbors,  the  higher  into  the  gap 
is  the  energy  state  of  the  defect.  This  means  that  the  release 
times  for  Si-atom  dangling  bonds  that  have  trapped  charge 
carriers  will  vary  considerably  depending  on  their  distance 
from the Si-SiO2 interface, since moving the Si-atom dangling
bond into the oxide increases the effective electronegativity
of the atoms back-bonded to that dangling bond.

In contrast, the over- and under-coordinated bonding arrangements
with respect to the twofold and threefold coordinations
for neutral O- and N-atoms are not stable unless a
change in charge state accompanies the change in coordination.
In this context, the over- and under-coordinated bonding
arrangements are charged positively and negatively,
respectively. The three charge states of these two atoms can
then be represented by $\mathrm{O_1^-}$, $\mathrm{O_2^0}$, and $\mathrm{O_3^+}$, and $\mathrm{N_2^-}$, $\mathrm{N_3^0}$, and $\mathrm{N_4^+}$.
The  inter-conversion  between  these  charge  states  requires 
capture  of  a  hole  or  electron  and  this  is  represented  in  the 
following  symbolic  reactions: 

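$$\mathrm{O_2^0} + e^{-} \rightarrow \mathrm{O_1^-};\quad \mathrm{O_2^0} + h^{+} \rightarrow \mathrm{O_3^+};\quad \mathrm{O_3^+} + e^{-} \rightarrow \mathrm{O_2^0};\quad \mathrm{O_1^-} + h^{+} \rightarrow \mathrm{O_2^0}. \qquad (9)$$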

Each of the hole capture reactions is accompanied
by  capture  of  a  neutral  bonding  partner,  and  each  of  the 
electron  capture  reactions,  by  release  of  a  neutral  bonding 
partner;  in  the  examples  presented  above  these  are  H-atoms. 
A  similar  set  of  symbolic  reactions  can  also  be  written  for  the 
N-atoms,  where  the  coordination  of  the  N-atom  in  the  ex¬ 
pression  below  includes  at  least  one  N — H  bonding  pair: 

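$$\mathrm{N_3^0} + e^{-} \rightarrow \mathrm{N_2^-};\quad \mathrm{N_3^0} + h^{+} \rightarrow \mathrm{N_4^+};\quad \mathrm{N_4^+} + e^{-} \rightarrow \mathrm{N_3^0};\quad \mathrm{N_2^-} + h^{+} \rightarrow \mathrm{N_3^0}. \qquad (10)$$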
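
The bookkeeping in the symbolic reactions of Eqs. (9) and (10) can be illustrated with a small state model in which hole capture raises both the coordination and the charge by one, and electron capture lowers both by one. This is only a sketch of the counting rule described in the text; the class and function names are hypothetical, and the capture or release of the neutral bonding partner (an H-atom in the examples above) is implied rather than modeled.

```python
from dataclasses import dataclass

# Neutral coordination numbers stated in the text: twofold O, threefold N.
NEUTRAL_COORDINATION = {"O": 2, "N": 3}


@dataclass(frozen=True)
class Center:
    element: str
    coordination: int
    charge: int

    def label(self):
        sign = {-1: "-", 0: "0", 1: "+"}[self.charge]
        return f"{self.element}{self.coordination}{sign}"


def neutral(element):
    """Return the neutral center for an element, e.g. O2(0) or N3(0)."""
    return Center(element, NEUTRAL_COORDINATION[element], 0)


def capture_hole(center):
    """Hole capture: one more bonding partner, charge +1 (e.g. O2(0) -> O3(+))."""
    if center.charge >= 1:
        raise ValueError("center is already positively charged")
    return Center(center.element, center.coordination + 1, center.charge + 1)


def capture_electron(center):
    """Electron capture: one fewer bonding partner, charge -1 (e.g. O3(+) -> O2(0))."""
    if center.charge <= -1:
        raise ValueError("center is already negatively charged")
    return Center(center.element, center.coordination - 1, center.charge - 1)


for element in ("O", "N"):
    start = neutral(element)
    print(start.label(), "+ h ->", capture_hole(start).label())
    print(start.label(), "+ e ->", capture_electron(start).label())
```

The model allows only the three charge states per atom that the text lists, so a second hole or electron capture on an already charged center raises an error.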
In addition to O- and N-atoms, H-atoms are stable in the
neutral  and  positively-charged  states  and  the  relevant  sym¬ 
bolic  reaction  is  given  below: 

+  +  (11) 

By  virtue  of  the  changes  in  coordination  that  accompany 
changes  in  charge  state,  these  types  of  defects  are  active  in 
two  different  ways:  in  their  charged  states  (i)  they  change 
surface  potentials  of  the  crystalline  Si  at  the  channel  in  an 
FET,  and  thereby  change  the  threshold  voltage,  and  the  car¬ 
rier  mobility;  or  (ii)  they  act  as  terminal  traps  for  charged 
carriers.  In  their  neutral  states  O-  and  N-atoms  can  trap  free 
carriers  of  either  sign;  however,  these  trapping  events  require 
changes  in  bonding  coordination,  so  that  H-atom  participa¬ 
tion  (or  its  equivalent)  is  required,  and  the  trapping  is  once 
more  terminal. 

There are significant differences between the defect precursor
behaviors of N-atoms bonded to one H-atom and two
Si-atoms, and N-atoms bonded to three Si-atoms. The planar
geometry $\mathrm{H{-}N{-}(Si\equiv)_2}$ centers can convert to a tetrahedral
$\mathrm{H_2{-}N^{+}{-}(Si\equiv)_2}$ arrangement by hole trapping and attachment of a H-atom,
with at most a small reaction barrier associated with a rotation
of the originally bonded H-atom out of the original
bonding plane. However, the conversion of a planar
$\mathrm{N{-}(Si\equiv)_3}$ group to a tetrahedral $\mathrm{H{-}N^{+}{-}(Si\equiv)_3}$ arrangement
presents a significant energy barrier, approximately 2 eV, due to the
bond-length changes required. It has been demonstrated that
N-atom incorporation at Si-SiO2 interfaces improves device
reliability with respect to interfaces which contain either
Si-O or SiN-H bonds.

The  defect  generation  processes  (i)  and  (iii)  through  (ix) 
can  also  take  place  in  bulk  oxides,  nitrides  or  oxynitride 
alloys, or at the interfaces between dielectric layers in
stacked dielectric structures such as an oxide-nitride-oxide
(ONO) composite. Experiments performed on ONO sandwich
structures indicated significant fixed positive charge
(~I0^^  cm“^)  at  the  internal  O — N  and  N — O  interfaces 
prior  to  a  30  s  900  °C  RTA,^^  and  a  significant  reduction  by 
more  than  a  factor  of  10  after  the  anneal.  This  reduction  is 
attributed  to  release  of  H-atoms  from  bonding  configurations 
involving  N — H  groups. 

The  chemical  bonding  framework  for  characterizing  de¬ 
fects,  and  defect  precursors  for  the  Si — Si02  interface,  can 
be  applied  to  other  semiconductor-dielectric  interfaces  in¬ 
cluding  interfaces  between  SiC,  Ge,  (Si,Ge)  alloys  and  ox¬ 
ides  and/or  nitrides.  These  extensions  must  take  into  account 
differences  in  bonding  between  (i)  Si  and  O,  and  (ii)  C  or  Ge 
and  O,  that  may  change  the  qualitative  aspects  of  some  of  the 
types  of  reactions  described  above.  For  example,  one  factor 
to  be  considered  for  Ge  and  O  is  that  there  are  two  different 
bonding  arrangements  between  Ge  and  O,  octahedral  and 
tetrahedral  as  exemplified  by  the  rutile-  and  quartz-like  ox¬ 
ides  (both  of  which  have  the  same  composition  Ge02).^  A 
second factor is the tendency of C and O to form multiple
bonds, as in the $\pi$-bonding in both CO2 and CO; this carries
over to C and N as well, where the gaseous species is CN.


The defect generation processes in (iv), (vi), and (ix) that
result in the charged $\mathrm{H_2^+}$, $\mathrm{O_3^+}$, and $\mathrm{N_4^+}$ centers, and where the
H-atom  is  donated  from  a  near-neighbor  Si — H  bond  thereby 
creating  a  Si-dangling  bond  defect  partner,  can  be  also 
viewed  as  an  extension  of  the  valence  alternation  pair  (VAP) 
model  originally  proposed  for  defects  in  chalcogenide 
glasses.^^  In  its  original  form  the  VAP  model  included  pairs 
of  negatively  and  positively  charged  defects  which  had  co¬ 
ordinations  in  their  charged  states  that  were  different  from 
the  neutral  bonded  atoms.  As  in  the  discussion  above,  the 
bonding  coordination  of  the  negatively-charged  centers  was 
one  lower,  and  the  bonding  coordination  of  the  positively 
charged  center  was  one  higher  than  neutral  bonding,  so  that 
in  amorphous  selenium,  the  defect  generation  reaction  could 
be  written  as 

$$2\,\mathrm{Se_2^0} \rightarrow \mathrm{Se_3^+} + \mathrm{Se_1^-}, \qquad (12)$$

where the subscripts again indicate the number of nearest
neighbors, and the superscripts the charge state. The defect
generation reactions in (iv), (vi), and (ix) are similar to those in
Eq. (12), except that the valence alternation pair defect that
is produced consists of a neutral Si-atom dangling bond, $\mathrm{Si_3^0}$,
and a positively charged O- or N-atom, $\mathrm{O_3^+}$ or $\mathrm{N_4^+}$,
respectively, so that the pair is charged rather than neutral as in Eq.
(11).
It has been observed in many studies of oxidation kinetics that silicon dioxide growth in the thin
regime  (<30  nm)  is  faster  than  the  predictions  of  the  linear-parabolic  Deal-Grove  relationship.  We 
have  developed  a  conceptual  and  mathematical  model  for  the  thermal  oxidation  of  silicon  which  is 
based  on  an  initial  dissociative  chemisorption  of  molecular  oxygen  on  the  2X1  silicon  (100)  surface 
followed  by  subsequent  diffusion  and  dissociative  reactions  of  molecular  oxygen  at  the  Si/Si02 
interface.  This  model  accounts  for  current  experimental  observations  on  the  structural  modification 
of  the  reconstructed  surface  to  a  1 X 1  superlattice  on  limited  exposure  to  molecular  oxygen  and 
provides a mechanistic rationale for the self-limiting mechanism of the oxide film at ~0.6 nm
during  low-temperature  oxidation  processes.  The  rate-equation  model,  consistent  with  the  proposed 
reactions  at  the  Si/Si02  interface,  has  been  refined  to  give  an  excellent  fit  to  experimental  data 
within  all  thickness  regimes,  but  especially  in  the  initial  rapid  growth  regime  where  the  growth  rate 
is proportional to $(P_{\mathrm{O_2}})^{1/2}$. The rate equation reduces to a linear dependence on oxygen partial
pressure  in  the  thicker  regime,  where  it  predicts  classic  Deal-Grove  behavior.  We  present  the 
development of the model for oxidations performed between 780 and 1100 °C, consistent with observed
oxide growth and film properties. The activation energies for the reaction-controlled regime and the
diffusion-controlled regime are consistent with literature data. We observe a sharp transition in a
characteristic length parameter at ~950 °C, which may be due to the experimentally
observed change in oxide film density. A detailed analysis and explanation is presented. © 1996
American  Vacuum  Society. 


I.  INTRODUCTION 

Silicon  dioxide  is,  arguably,  the  most  important  compo¬ 
nent  in  the  fabrication  of  a  metal-oxide-semiconductor 
(MOS)  device.  Despite  extensive  research  over  the  last  thirty 
years, accurate and quantitative models of the initial oxidation
of the silicon surface have proven elusive. Models of
the oxidation process in the thick oxide regime have been
successfully developed, but fail when applied for the prediction
of oxidation rates at thicknesses less than approximately
20 nm. These failures may be ascribed to an incomplete
understanding of the growth mechanisms in the very
early stages of oxidation. Very-large-scale integration (VLSI) and
ultralarge-scale integration (ULSI) device fabrication requires
the formation of silicon dioxide films ranging in thickness
from 3 nm to 1000 nm, and as a consequence there is an
immediate need for models capable of predicting oxidation
rates in the thin regime.

The  thermal  oxidation  of  silicon  has  usually  been  mod¬ 
eled  within  the  framework  originally  formulated  by  Deal  and 
Grove^  in  the  mid-1960’s.  They  modeled  the  oxidation  rate 
in  terms  arising  from  two  processes:  the  chemical  reaction  of 
oxygen  with  silicon  at  the  oxide/substrate  interface  and  the 
diffusion  of  oxygen  through  the  growing  oxide  film.  The 
combination  of  rate  relationships  for  these  processes  yielded 
the  classic  Deal-Grove  “linear-parabolic”  rate  law.  This 
model  does  not,  however,  accurately  predict  the  growth 
curves  observed  for  oxides  of  thicknesses  below  20  nm. 
Within this “thin oxide” regime, significantly higher oxidation
rates are observed than predicted by the Deal-Grove
model. Massoud and Plummer have proposed a modification
of the original Deal-Grove model which incorporates
two  additional  exponential  terms,  one  in  order  to  obtain  a 
better  fit  in  the  initial  regime  of  oxidation  and  the  other  for 
the  intermediate  oxidation  phase.  Their  model  improves 
upon  Deal-Grove  rate  predictions  for  the  thin  oxide  regime, 
but  fails  at  oxide  thicknesses  of  less  than  4  nm.  A  variety  of 
other  models  have  been  reported  in  the  literature,^' but 
none  have  successfully  yielded  accurate  rate  predictions  in 
the  thin  oxide  regime.  Our  model  is  based  upon  kinetic  rela¬ 
tionships  involving  the  interfacial  production  of  a  sub- 
stoichiometric  oxide,  SiO,  in  addition  to  the  final  product 
Si02.  In  this  article,  we  compare  the  predictions  of  an  oxi¬ 
dation  rate  model  that  we  have  developed  with  the  avail¬ 
able  experimental  data  for  both  the  thin  and  thick  oxide  re¬ 
gimes. 
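
For orientation, the classic Deal-Grove linear-parabolic law referred to above is commonly written as x^2 + A x = B (t + tau), which has a closed-form solution for the thickness. The sketch below evaluates that standard form; it is not the model developed in this article, and the numerical values of A and B are purely illustrative assumptions rather than fitted parameters.

```python
import math


def deal_grove_thickness(t, a_coeff, b_coeff, x0=0.0):
    """Solve x^2 + A*x = B*(t + tau) for x, the oxide thickness at time t.

    tau is the time offset that accounts for an initial oxide of thickness x0.
    """
    tau = (x0 * x0 + a_coeff * x0) / b_coeff
    discriminant = 1.0 + 4.0 * b_coeff * (t + tau) / (a_coeff * a_coeff)
    return 0.5 * a_coeff * (math.sqrt(discriminant) - 1.0)


# ASSUMPTION: illustrative coefficients only (A in um, B in um^2/min).
A_UM = 0.2
B_UM2_PER_MIN = 0.01

for minutes in (1.0, 10.0, 100.0, 1000.0):
    x_um = deal_grove_thickness(minutes, A_UM, B_UM2_PER_MIN)
    print(f"t = {minutes:7.1f} min -> x = {x_um:.4f} um")
```

At short times the solution approaches the linear limit x ≈ (B/A) t (reaction control), and at long times the parabolic limit x ≈ (B t)^(1/2) (diffusion control), which are the two regimes the text describes.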


II.  OXIDATION  MECHANISM 

Our  model  is  based  on  the  assumption  that  the  oxidation 
proceeds  via  the  dissociative  chemisorption  of  molecular 
oxygen,  initially  at  the  silicon  surface  and  subsequently  at 
the oxide/silicon interface. We assume that the starting
Si(100) surface has a 2×1 reconstruction with one dangling
bond per silicon atom and a number of unique adsorption
sites for the oxygen molecule. The geometries and energetics
of the possible adsorption sites on the Si(100) 2×1 surface
have been discussed by Oshiyama and Miyamoto and by
Zheng.

The  driving  force  for  the  initial  reaction  between  oxygen 
and  the  silicon  surface  is  orbital  mixing  between  the  oxygen 
molecule  and  the  silicon  surface.  The  highest  occupied  mo¬ 
lecular  orbitals  (HOMOs)  of  the  oxygen  molecule  consist  of 
two  half-filled,  orthogonal  and  degenerate  orbitals  of  anti¬ 
bonding  symmetry.  These  may  accept  electron  density  from 
the highest energy filled orbitals at the silicon surface, presumably
the dangling bonds associated with the 2×1 reconstruction.
The incorporation of electron density from the surface
silicon into the oxygen $\pi^*$ orbital weakens the oxygen-oxygen
bond, leading to dissociation of the oxygen molecule
and the insertion of an oxygen atom into neighboring silicon-silicon
backbonds. Depending on the temperature, three
stages  of  oxidation  at  the  surface  may  exist.  At  low  tempera¬ 
tures,  there  is  insufficient  energy  available  for  dissociation  of 
the  silicon  surface  dimers  and  oxygen  atoms  may  bridge  be¬ 
tween  dimer  rows,  yielding  half  monolayer  coverage  of  the 
surface. Such coverage has been reported for low temperature
exposure of reconstructed Si(100) to molecular
oxygen. At higher temperatures (but insufficient for sustained
oxidation of the silicon), dissociation of the silicon
dimers leads to a 1×1 oxygen superlattice on the surface,
consistent with the literature. Finally, the oxide film thickness
is expected to self-limit at approximately 0.5-0.6 nm at temperatures
below those required to drive oxygen diffusion
through the surface oxide, due to inaccessibility of silicon-silicon
bonds to incoming molecular oxygen. This thickness
is consistent with self-limiting oxide films observed on
Si(100). At temperatures and pressures sufficient for sustained
oxidation of the silicon substrate (i.e., conditions supporting
significant diffusion of molecular oxygen through the
growing oxide), our model suggests a layer-by-layer growth
of the film.


III.  OXIDATION  KINETICS 

Adopting  a  Langmuir  kinetic  formalism, the  chemical 
reaction  sequence  for  the  oxidation  of  silicon  may  be  written 
as 


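$$\mathrm{Si} + \tfrac{1}{2}\mathrm{O_2} \rightleftharpoons \mathrm{SiO}, \qquad (1)$$

$$\mathrm{SiO} + \tfrac{1}{2}\mathrm{O_2} \rightarrow \mathrm{SiO_2}. \qquad (2)$$
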

Based on chemical kinetics and Fick's law of diffusion, the
expression  for  the  total  growth  rate  is  then 


dx  DeffC* 

dt  ■^Deff+A:3X  , 


Thus: 

dx 

dt  N[D^ff-\-k^x] 


+  ■ 


TT' 


-xia 


(3) 


(4) 


where $k_3$ is the linear reaction rate at the interface, $D_{\mathrm{eff}}$ is the
diffusion coefficient, $C^*$ is the equilibrium concentration of
the oxygen molecules in the gas phase, $N$ is the number of
oxygen molecules in a unit volume of the oxide, the
pre-exponential fitting parameter is denoted a, and x is the oxide thickness.
The parameter a that appears inside the exponential
term is a distinct quantity, which we interpret as a characteristic length associated with the onset
of diffusional limits on the transport of oxygen to the oxide/silicon
interface.

Expression (4) for dx/dt may be compared with existing
experimental data on the oxidation rate of silicon. In the thin
limit (<20 nm), our model successfully predicts the observed
oxide growth rates with significantly better accuracy
than previously reported models. Beyond oxide thicknesses
of ca. 20 nm, our relationship gives rates that are comparable
with those predicted by the Deal-Grove model and with
experimental observations.

In the thin regime [$x < (D_{\mathrm{eff}}/k_3)$], the Deal-Grove model
reduces to

$$x = \frac{k_3 C^*}{N}\,t. \qquad (5)$$


In  our  model,  for  x  Eq.  (4)  reduces  to 

^dx, 


- dt= - e 


N 


(6) 


and 


aC* 


(7) 


Equation  (7)  is  found  to  match  the  observed  growth  rates 
within  the  thin  oxide  regime  if 


a 


-<1. 


(8) 


When the fitting parameter a in equation (4) is set to zero,
the relationship reduces to the Deal-Grove form, where the
growth can be characterized by the two Deal-Grove parameters
$k_3$ and D. The model has been fit to the available
experimental data in the thick oxide regime and the corresponding
values of $k_3$ and D are obtained (see Fig. 5). Using
these activation energies for $k_3$ and D in an Arrhenius relationship,
values of $k_3$ and D are obtained for different temperatures,
and the value of the characteristic length a is varied to fit the experimental
data in the thin oxide regime, while the value of the pre-exponential parameter a is kept
constant. The only variable fitting parameter in the thin regime
is the characteristic length a. The value of the pre-exponential parameter a is kept constant at 4.0X10^ μm/min.


IV.  RESULTS  AND  DISCUSSIONS 

We  have  chosen  data  taken  from  Massoud,  Plummer,  and 
Irene,"^  Irene  and  Van  de  Meulen,^^  Kamigaki  and  Itoh,^^  and 
Chao,  Lee,  and  Lei^^  as  experimental  values  of  oxidation 
rates  with  which  to  develop  and  test  our  model.  Massoud’s 
samples^  had  ca.  1.0  nm  of  oxide  prior  to  thermal  oxidation 
whereas  Irene^’^^’^^  reported  the  presence  of  0.3-0.6  nm  of 
oxide  before  the  wafers  were  introduced  into  the  oxidation 
Fig. 1. The diffusion coefficient D and the linear reaction coefficient $k_3$
plotted as a function of temperature [horizontal axis: 1/T (K^-1)]. The activation energies are given next
to the plots.


Fig.  2.  The  data  of  Massoud,  Plummer,  and  Irene  (Ref.  4)  versus  our  model. 
The  corresponding  values  of  the  parameter  a  which  give  the  best  fit  are 
given  against  the  oxidation  temperature. 


chamber. The samples of Kamigaki and Itoh reportedly had
a 0.6 nm thick oxide layer, while the ellipsometry of Chao,
Lee, and Lei detected a 1.6-2.0 nm oxide film prior to
oxidation. Massoud, Irene, and Chao used an in situ ellipsometer,
whereas Kamigaki's samples were taken out of the
oxidation furnace in order to perform the oxide thickness
measurements by ellipsometer. The reported initial oxide
thicknesses agreed quite well with the initial oxide thicknesses
found in fitting the Deal-Grove model to determine
the $k_3$ and D values. All the samples were lightly doped. All
the experiments compared in this paper were performed at
atmospheric pressure. Following the first evaluation, equation
(4) was used to fit each set of data, but with $k_3$ and D
evaluated from Fig. 1. The only variable was the parameter
a.

Figure 1 shows the linear reaction coefficients $k_3$ and the
diffusion coefficients D for all the data on a single plot. The
associated activation energies for $k_3$ and D are 1.54 eV and
2.58 eV, respectively. The activation energy for $k_3$ is close to
the Si-Si bond energy of 1.83 eV, consistent with a mechanism
in which the rate controlling step of the interfacial reaction
is the dissociation of Si-Si back bonds. The higher
activation  energy  observed  for  the  diffusion  coefficient  is 
consistent  with  decreased  growth  rate  observed  in  the  thick 
oxide  regime,  where  the  rate  controlling  step  in  the  oxidation 
is  the  diffusion  of  molecular  oxygen  rather  than  the  interfa¬ 
cial  reaction.  It  has  been  previously  reported^  that  the  acti¬ 
vation  energy  associated  with  the  bulk  diffusion  of  oxygen  in 
fused  silica  is  1.17  eV.  This  is  far  smaller  than  the  value 
derived  from  our  model.  It  may  be  that  the  structural  nature 
of  the  silicon  dioxide  formed  in  thermal  oxidations  of  silicon 
differs  significantly  from  that  of  the  silica  studied  in  these 
earlier reports, with concomitant differences in the characteristics
of oxygen transport through the lattice. This could
lead  to  the  observed  differences  in  activation  energies.  Cer¬ 
tainly,  lattice  matching  considerations  would  suggest  that 
thin  films  of  silicon  dioxide  on  silicon,  at  least  near  the 


oxide/substrate  interface,  are  subject  to  significant  compres¬ 
sive  stress.  This  could  induce  the  formation  of  metastable 
structure  in  the  oxide  film  having  diffusion  characteristics  for 
oxygen  that  differ  significantly  from  those  of  unstressed  sili¬ 
con  dioxide. 
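
The practical meaning of the two activation energies quoted above (1.54 eV for the linear reaction coefficient and 2.58 eV for D) can be illustrated by the ratio of each coefficient at two temperatures, which depends only on the activation energy and not on the pre-exponential factors (which are not given in this text). The following is a minimal sketch assuming a simple Arrhenius form; the function name and the choice of 800 °C and 1000 °C as comparison points are illustrative assumptions.

```python
import math

K_B_EV_PER_K = 8.617333e-5  # Boltzmann constant in eV/K


def arrhenius_ratio(ea_ev, t_low_c, t_high_c):
    """Return coeff(T_high) / coeff(T_low) for coeff = c0 * exp(-Ea / kT).

    The prefactor c0 cancels, so only the activation energy is needed.
    """
    t_low_k = t_low_c + 273.15
    t_high_k = t_high_c + 273.15
    return math.exp(-ea_ev / K_B_EV_PER_K * (1.0 / t_high_k - 1.0 / t_low_k))


# Activation energies taken from the text; temperatures are illustrative.
ratio_k3 = arrhenius_ratio(1.54, 800.0, 1000.0)
ratio_d = arrhenius_ratio(2.58, 800.0, 1000.0)

print(f"linear reaction coefficient, 1000 C vs 800 C: x{ratio_k3:.1f}")
print(f"diffusion coefficient,       1000 C vs 800 C: x{ratio_d:.1f}")
```

Under this assumption the diffusion coefficient rises several times faster with temperature than the reaction coefficient over the same interval, reflecting its larger activation energy.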

Figure  2  shows  the  oxide  thickness  as  a  function  of  oxi¬ 
dation  time  for  experimental  data  of  Massoud,  Plummer,  and 
Irene"^  and  fits  to  our  model.  The  value  of  the  characteristic 
length,  a,  varies  between  0.28  nm  and  1.0  nm  over  the  tem¬ 
peratures  800-1000  °C.  In  Fig.  3,  we  show  the  comparison 
between  our  model  and  Irene’s  experimental  data.  Figure  4 
shows  only  the  growth  curves  for  those  experiments  per¬ 
formed  at  atmospheric  pressure.  In  Fig.  5  we  show  the  com¬ 
parison  between  our  model  and  data  of  Chao,  Lee,  and  Lei. 
The  fit  is  excellent  for  all  temperature  ranges  in  both  sets  of 
data. The range of the characteristic length a is 0.3-1.0 nm


Fig. 3. The data of Irene and van der Meulen (Ref. 21) versus our model.
The  corresponding  values  of  the  parameter  a  which  give  the  best  fit  are 
given  against  the  oxidation  temperature. 
Fig. 4. The data of Chao, Lee, and Lei (Ref. 23) versus our model. The
corresponding values of the parameter a which give the best fit are given
against the oxidation temperature.

Fig. 6. Variation of parameter a with oxidation temperature, where a is kept
constant at ca. 10^ μm/min.


for data of Irene and van der Meulen and 0.33-1.25 nm for
data of Ref. 23, consistent with that observed in the more
complete data set of Ref. 4. Kamigaki's experiments were
done within several different pressure regimes. The range of
a values found for Kamigaki and Itoh's experiments was
from 0.15 to 0.30 nm for temperatures of 950-1100 °C,
a distinctly different range than in any other data
set.  The  origin  of  this  anomalous  range  for  Kamigaki’s  data 
is  not  presently  clear.  Kamigaki  and  Itoh’s  experiments  were 
performed  using  nitrogen  to  dilute  the  oxygen  gas.  It  may  be 
that  the  molecular  nitrogen  blocks  some  of  the  interfacial 
adsorption  sites,  resulting  in  atypical  reaction  rates  at  the 
interface.  This  may  give  rise  to  ranges  of  a  that  are  signifi¬ 
cantly  different  from  those  observed  in  the  other  studies 
compared in this article. Minimal data points in the experiments
of Refs. 22 and 23 prevented us from making a distinction
between reaction and diffusion controlled regimes
for computation of the activation energies.

Fig. 5. The data of Kamigaki and Itoh (Ref. 22) versus our model. The
corresponding values of the parameter a which give the best fit are given
against the oxidation temperature.

In  Fig.  6,  we  plot  the  values  of  the  characteristic  length  a 
as  a  function  of  the  growth  temperature.  The  average  value 
of a for the temperatures up to 950 °C is ca. 0.35 nm, approximately
the height of the Si-O moiety at the Si/SiO2
interface. The rise in a above 950 °C presumably signals
modified  interfacial  processes  or  a  structural  change  in  the 
oxide  film  or  a  combination  of  the  two.  The  parameter  a  may 
be  characteristic  of  an  initial  surface  oxide  formation  analo¬ 
gous  to  the  self-limiting  oxide  formation  observed  at  much 
lower  temperatures.  At  low  temperatures,  a  is  ca.  0.33  nm, 
consistent  with  an  initial  film  formed  by  monolayer  coverage 
of  oxygen  atoms  on  the  silicon  surface.  At  increased  oxida¬ 
tion  temperatures  oxygen  molecules  may  diffuse  further 
through  the  initially  formed  oxide  layer  to  the  point  at  which 
no  unoxidized  silicon  back  bonds  are  available  for  reaction, 
yielding  somewhat  larger  values  for  a.  Alternatively  or  addi¬ 
tionally,  the  variations  observed  in  a  may  arise  from  struc¬ 
tural  differences  within  the  oxide  film  which  affect  diffusion 
processes  for  molecular  oxygen.  It  has  been  reported  by 
Irene,  Tierney,  and  Angiello^^  that  there  is  a  change  in  the 
density  of  the  oxide  film  at  ca.  950  °C.  They  have  observed  a 
3%  change  in  density  between  films  formed  at  600  °C  and 
those  formed  at  1150  °C.  Based  on  the  values  reported  by 
Irene,  Tierney,  and  Angiello^^  2.28-2.20  g/cm^,  comparisons 
with  available  data^^  indicate  the  formation  of  either  tridym- 
ite  or  cristobalite  forms  of  silica.  The  change  in  the  value  of 
a  may  thus  be  related  to  variations  in  the  diffusion  of  oxygen 
through  the  growing  oxide  film  which  in  turn  depends  on  the 
structural  porosity  of  the  silicon  lattice.  Such  a  structural 
variation  would  not  necessarily  need  to  be  present  through¬ 
out  the  oxide  film  but  could  be  localized  to  regions  near  the 
oxide/substrate  interface.  Indeed,  given  the  amorphous  char¬ 
acter  of  thermal  oxides,  it  is  most  likely  that  regions  possess¬ 
ing  crystalline  order  are  localized  near  the  interface.  The 
available  data  do  not  permit  us  to  differentiate  between  pos- 
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sible  forms  of  silica  in  this  system  or  whether  there  is  actu¬ 
ally  a  structural  change  occurring  at  950  °C. 

The  fitting  parameter,  a,  is  kept  constant  so  that  alk'^ 
>  1 .  This  parameter  describes  the  rate  at  which  the  oxygen 
molecules penetrate the Si 2×1 reconstructed surface to oxidize
the uppermost layers of the substrate. A typical value of
a in the temperature range of 800-1000 °C is ~10^ μm/min,
whereas the linear reaction rate coefficient is ~10^ μm/min.
We expect a to have a stronger dependence on gas-phase
oxygen pressure than on the oxidation temperature.
Apparently,  in  this  initial,  extremely  fast  growth  regime,  the 
diffusivity  of  oxygen  does  not  play  a  major  role. 

V.  CONCLUSIONS 

We have compared relevant experimental data for the dry
oxidation of silicon with a model invoking the dissociative
chemisorption of oxygen at the interface between a growing
silicon dioxide film and the silicon substrate. This relationship
has been tested against experimental data taken from the
literature and found to give an excellent fit. The relationship
exhibits a dependence on the value of a characteristic length,
a, which is observed to vary with temperature. The value of
a is believed to be related to the thickness of an initial oxide
film produced by an extremely fast reaction of oxygen with
the 2×1 reconstructed silicon surface. Alternatively, variations
in a may arise from variations in the structural characteristics
of all or part of the silicon dioxide films grown at
varying temperatures. Further experimental evidence is required
before conclusive remarks can be made on the
structural evolution of the initial oxidation phase.

Kinetics of field-induced oxidation of hydrogen-terminated Si(111)
by means of a scanning force microscope

Using an atomic force microscope (AFM) with a highly doped Si tip (0.01-0.02 Ω cm) we induced
the oxidation of hydrogen-terminated Si(111) surfaces by scanning the surface with an applied bias
$U_{ox}$ between tip (negative) and substrate (positive) in air of 60% relative humidity. The
resulting oxide height was measured relative to the unoxidized substrate immediately afterwards
using the same setup in the AFM mode. Typical oxide thicknesses of a few nm were achieved.
p-type as well as n-type samples were investigated, and the oxidation time at each point was varied
by varying the scan speed of the tip during oxidation. Some of the salient results are as follows: (i)
There is a definite threshold voltage $U_{th}$ such that no oxidation takes place below $U_{th}$; (ii) $U_{th}$
depends on doping and is lowest (2.7 V) for n+- and highest (5.4 V) for p+-type material; (iii) $U_{th}$
is independent of scan speed and thus oxidation time for speeds between 0.25 and 7 μm/s. Assuming
that the oxidation time is inversely proportional to the tip scan speed during oxidation we tried to
fit the data to current models of oxide growth (parabolic growth law, linear-quadratic growth law,
and Mott-Cabrera mechanism). None of them fits the experimental data. Instead, we find that the
results are well described by a power law of the form $Z = \alpha_0 (U_{ox} - U_{th})(t_{ox}/t_0)^{\gamma}$, where Z and $t_{ox}$ are
the oxide thickness and the oxidation time, respectively, and $\alpha_0$ is a factor that depends on
experimental conditions such as doping, tip shape, and humidity. The exponent $\gamma$ has a value of 1/4
to within experimental uncertainty. © 1996 American Vacuum Society.


I. INTRODUCTION

Electric fields have been known to enhance the oxidation rate of
Si since the pioneering work of Jorgensen.^ Dagata et al.^
were the first to apply field-induced oxidation (FIO) to
create an oxide pattern on Si with the help of a scanning
tunneling microscope, and since then a number of related
reports have appeared in the literature.^”^ The potential of
these oxides as etch masks for device fabrication on the nanometer
scale has been emphasized by Snow and Campbell,^
and the general field of scanning probe microscopy (SPM)
based techniques to create nanometer-scale structures on different
surfaces has been reviewed in Ref. 7. We recently
demonstrated that oxide patterns written by a scanning probe
microscope on H-terminated Si(100) exhibit an increase in
friction force compared to the unoxidized Si surface that
scales with the oxide thickness.^

In this article we report an investigation of the kinetics of
field-induced oxidation of hydrogen-terminated Si(111) surfaces
using a scanning force microscope (SFM) with a conducting
tip. The advantage of this approach lies in the fact
that the same probe can be used to oxidize and detect the
resulting oxide thickness in consecutive scans without the
need to remove the sample from the microscope.

II.  EXPERIMENTAL  DETAILS 

The SFM was custom built at our laboratory employing
rectangular 0.01-0.02 Ω cm n+-type Si cantilevers (0.2
N/m) with integrated tips (apex radius ~20 nm) which could
be electrically biased without metal coating. Oxidation by
SFM was performed in air with ~60% relative humidity and
at room temperature. The SFM was operated in contact mode
at a constant loading force of ~20 nN.

We investigated oxide growth on p+-type samples with a
resistivity of 0.005-0.01 Ω cm, moderately doped 5.6-10.4
Ω cm p-type samples, and 0.01-0.02 Ω cm n+-type material.
Before oxidation, oxide-free H-terminated Si(111) surfaces
[Si(111):H] were prepared on Czochralski (Cz) grown
wafers. To this end, the Si(111) surfaces were chemically
treated under clean room conditions using procedures after
Ref. 9, finished by a 3 min dip in 40% aqueous NH4F solution
and a 3 s rinse in ultrapure water (18 MΩ cm). As a
result, hydrophobic, H-terminated, and atomically stepped
surfaces with flat terraces are formed which are well stabilized
against oxide formation in air ambient.^’^^ A native oxide
thickness well below 0.1 nm is expected to form within 3
h, the time scale of our experiments.^® In contrast, in most
cases atomic scale roughness remains on Si(100) after
chemical treatment.

As judged by IR-absorption spectra of our samples, their
dominant surface termination is due to Si-H bonds with
less than 10% contributions from either Si-H2 or Si-H3 configurations.
Surface contamination fell short of the XPS detection
limit (~0.7 at. %) and chemically shifted Si 2p components
due to initial oxidation were absent.

For local FIO we used the method of Ref. 3, applying a
positive voltage $U_{ox}$ to the sample with respect to the tip at
selected positions while scanning the sample surface. Subsequently,
the resulting oxide height z relative to the substrate
was measured by the same SFM tip.

Fig. 1. Oxide pattern formed by field-induced oxidation using a conducting-
probe scanning force microscope [p-type Si(111):H, ρ = 5.6-10.4 Ω cm,
$U_{ox}$ = 10 V, $v_{ox}$ = 2 μm/s, $F_N$ = 20 nN, image size: 1.2×1.2 μm^2].

III.  RESULTS 

In Fig. 1 an example of the resulting topography after
oxidation is shown for p-type material. In the bright areas a
voltage $U_{ox}$ of 10 V was applied during the previous patterning
scan, whereas in the dark areas $U_{ox}$ was set to 0 V. During
subsequent imaging tip and sample were grounded. The scan
speed was 2 μm/s during oxidation as well as for imaging.
The image size is 1.2×1.2 μm^2. Bright areas show protrusions
formed by the FIO process, exhibiting an oxide height
z of 3.8±0.5 nm above the unoxidized substrate. We find no
increase of the topographical roughness over the oxidized
areas compared to the unstructured surface. On thermally
and naturally oxidized Si(111) surfaces, i.e., not patterned by
FIO, atomic steps and terraces have been reported by
Homma, Suzuki, and Yabumoto.^^

Fig. 2. Oxide height z as a function of the sample voltage $U_{ox}$ applied for
oxidation for three different doping levels ($v_{ox}$ = 2 μm/s, $F_N$ = 20 nN).

The oxide height for a given voltage and scan speed depends
on the doping of the substrate in the way depicted in
Fig. 2, where we have plotted z vs $U_{ox}$ for three differently
doped substrates. The writing speed $v_{ox}$ was kept fixed at 2
μm/s in all cases. It is apparent that the oxidation requires a
certain threshold voltage $U_{th}$ below which no oxidation is
observed. This threshold voltage depends not only on the
type of doping (n or p type) but also on the dopant concentration.
We measure values for $U_{th}$ of 2.7, 4.2, and 5.4 V for
our n+-, p-, and p+-type material, respectively. The uncertainty
in $U_{th}$ is estimated at ±0.1 V. Above threshold the
oxide thickness increases linearly with $U_{ox}$ in such a way
that the slope $dz/dU_{ox}$ increases from n+- over p- to
p+-type material. It thus appears that in the material (p+)
that requires the highest potential to initiate oxidation the
oxidation rate also depends most sensitively on the field
strength.
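
The threshold voltage and slope discussed above can be read off a straight-line fit to the above-threshold points. The following is only a minimal sketch of that arithmetic, not the procedure used for Fig. 2: the function name is hypothetical, and the data points are synthetic values generated from an assumed threshold of 4.2 V and an assumed slope of 0.65 nm/V.

```python
import numpy as np

def threshold_from_linear_fit(u_ox, z):
    """Fit z = slope * (U_ox - U_th) to above-threshold points.

    Returns (U_th, slope), where U_th is the voltage at which the
    fitted line crosses z = 0.
    """
    slope, intercept = np.polyfit(u_ox, z, 1)  # z = slope * U + intercept
    u_th = -intercept / slope
    return u_th, slope

# Synthetic illustration only (not measured data).
u = np.array([5.0, 6.0, 7.0, 8.0, 9.0, 10.0])   # sample voltage in V
z_nm = 0.65 * (u - 4.2)                          # oxide height in nm
u_th, slope = threshold_from_linear_fit(u, z_nm)
```

With real data, only points clearly above threshold should enter the fit, since the height is zero rather than negative below $U_{th}$.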

In Fig. 3(a) the oxide height is plotted versus $U_{ox}$ for
different scan speeds during oxidation. The sample is lightly
p type. Again, no FIO takes place below $U_{th}$. This threshold
voltage is furthermore independent of the scan speed,
while above $U_{th}$ the slope $dz/dU_{ox}$ increases with decreasing
$v_{ox}$. The data of Fig. 3(a) are replotted in Fig. 3(b) as z
versus the inverse scan speed $v_{ox}^{-1}$ during oxidation for a set
of different voltages $U_{ox}$. Both higher $v_{ox}^{-1}$ and higher $U_{ox}$
lead to thicker oxide films.

Fig. 3. (a) Oxide height z as a function of the sample voltage $U_{ox}$ at different
scan speeds $v_{ox}$ during oxidation. (b) z as a function of the inverse scan
speed $v_{ox}^{-1}$ during oxidation at different voltages $U_{ox}$ [p-type Si(111):H,
ρ = 5.6-10.4 Ω cm, $F_N$ = 20 nN].



IV.  DISCUSSION 

Before discussing these results in the light of models for
the oxidation kinetics and other experiments it is necessary
to establish the relationship between our experimental parameters,
oxide height z and scan speed during oxidation $v_{ox}$,
and the commonly employed parameters, oxide thickness Z
and oxidation time t. Clearly, the oxidation time at each
point is inversely proportional to $v_{ox}$ even if the proportionality
factor is not known exactly. On the other hand, there is
ample evidence^’ that the oxide height z above the original
silicon surface is proportional to the total oxide thickness
with a proportionality factor $z/Z \approx 0.44$.^^ Consequently, we
use in what follows the parameters z and $v_{ox}^{-1}$ synonymously
with Z and t.

There is an extensive literature on the functional relationship
between oxide thickness and oxidation time. The point
of departure is in most cases the Deal-Grove model, which
combines the diffusion-limited rate yielding $Z \propto t^{1/2}$, appropriate
for thick oxides, with the initial linear regime $Z \propto t$ reflecting
the interface reaction. This is the so-called "linear-quadratic"
growth model $AZ^2 + BZ = t - \tau$, where A and B
are parameters and the integration constant $\tau$ takes an initially
present oxide into account. An attempt to fit our data
with this model failed. An extension of the Deal-Grove
model that considers a spatial variation of the diffusion coefficient
in growth direction on account of the strain field
likely to be present in the oxide was proposed by Blanc. It
yields a functional relationship between oxidation time and
oxide thickness of the form $t - \tau = AZ^2 + BZ - C\ln(1 + DZ)$,
where C and D are additional parameters describing the
variation of the oxygen diffusivity through the oxide. This
model fits the highly precise thermal oxidation data of Hopper,
Clarke, and Young^^ exceedingly well. It nevertheless
fails to describe our data despite the fact that Hopper and
co-workers' data also cover the thickness regime below 10
nm. A major difference between Hopper and co-workers'
data and ours is the oxidation temperature: around 830 °C in
their case and room temperature here. Needless to say, our
data also cannot be fitted to the Cabrera-Mott model,
$Z \sim Et$, where E is now proportional to the electric field
across the oxide, which we take to be $U_{ox}/z$.
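
To make the linear-quadratic law concrete, the sketch below inverts $AZ^2 + BZ = t - \tau$ for Z and evaluates the local growth exponent d(ln Z)/d(ln t) numerically. It is an illustration under stated assumptions rather than the fitting procedure used in this work: the function names are hypothetical and A, B are arbitrary positive test values. For $\tau = 0$ and positive A and B the local exponent stays between 1/2 and 1, which is one way to see why a law of this form cannot reproduce an exponent near 1/4.

```python
import math

def linear_quadratic_thickness(t, A, B, tau=0.0):
    """Non-negative root Z of A*Z**2 + B*Z = t - tau."""
    rhs = t - tau
    if rhs < 0:
        raise ValueError("t - tau must be non-negative")
    if A == 0:
        return rhs / B  # purely linear (interface-reaction) regime
    return (-B + math.sqrt(B * B + 4.0 * A * rhs)) / (2.0 * A)

def local_exponent(t, A, B, tau=0.0, eps=1e-3):
    """Central-difference estimate of d(ln Z)/d(ln t)."""
    z_lo = linear_quadratic_thickness(t * (1.0 - eps), A, B, tau)
    z_hi = linear_quadratic_thickness(t * (1.0 + eps), A, B, tau)
    return (math.log(z_hi) - math.log(z_lo)) / (
        math.log(1.0 + eps) - math.log(1.0 - eps)
    )

# Arbitrary test values (assumed, dimensionless): A = B = 1, tau = 0.
z_example = linear_quadratic_thickness(2.0, 1.0, 1.0)   # Z**2 + Z = 2
short_time = local_exponent(1e-4, 1.0, 1.0)             # close to 1
long_time = local_exponent(1e8, 1.0, 1.0)               # close to 1/2
```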

Fig. 4. Parameters $\alpha$, $\gamma$, and $\alpha_0$ from a fit of Eqs. (1) and (2) to the data of
Fig. 3 vs the sample voltage $U_{ox}$ [p-type Si(111):H, ρ = 5.6-10.4 Ω cm,
$F_N$ = 20 nN].

We did, however, succeed in a two-parameter fit of the
whole set of data to an empirical kinetic expression,

$$z = \alpha\,(v_0/v_{ox})^{\gamma}, \qquad (1)$$

with parameters $\alpha$ and $\gamma$ and an arbitrary constant $v_0$ which
was chosen to be $v_0$ = 1 μm/s. The result is represented in
Fig. 3(b) as dashed lines. The parameters $\alpha$ and $\gamma$ obtained
for a given value of $U_{ox}$ are plotted in Fig. 4 versus $U_{ox}$. It is
obvious from this plot that the exponent is independent of
$U_{ox}$ within experimental error: $\gamma$ = 0.23±0.02, while
$\alpha = \alpha_0 (U_{ox} - U_{th})$. We thus rewrite Eq. (1) as

$$z = \alpha_0\,(U_{ox} - U_{th})\,(v_0/v_{ox})^{\gamma} \quad (U_{ox} > U_{th}). \qquad (2)$$

The normalized prefactor $\alpha_0$ is about 0.78 nm/V for the
p-type sample (Fig. 3) and we find a dependence on doping
as well as on experimental conditions, such as tip shape and
humidity. In order to substantiate Eq. (2) we replot the data
of Figs. 2 and 3 in Fig. 5. Here, z vs $(U_{ox} - U_{th})(v_0/v_{ox})^{\gamma}$ is
shown, yielding straight lines for the respective complete
sets of data. To check general applicability, we also fit Eq.
(2) to data for FIO on n-type (2-6 Ω cm) Si(111) covered
with a thin protective oxide, taken from Ref. 4 and shown in
Fig. 5 as crosses. Here, too, we obtain $\gamma \approx 1/4$ while $\alpha_0$ is
smaller than for our samples.
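
As a minimal sketch of how the power law can be evaluated and fitted, the code below implements the height expression with a hard threshold and recovers the prefactor and exponent by a straight-line fit in log-log space. The log-log least-squares fit is an assumption made for illustration, since the fitting method is not specified in the text; the function names are hypothetical and the "data" are synthetic values generated from the quoted parameters (0.78 nm/V, 4.2 V, exponent 0.23).

```python
import numpy as np

def oxide_height(u_ox, v_ox, alpha0, u_th, gamma, v0=1.0):
    """alpha0 * (U_ox - U_th) * (v0 / v_ox)**gamma, zero below threshold."""
    u_ox = np.asarray(u_ox, dtype=float)
    v_ox = np.asarray(v_ox, dtype=float)
    drive = np.clip(u_ox - u_th, 0.0, None)  # no oxidation below U_th
    return alpha0 * drive * (v0 / v_ox) ** gamma

def fit_power_law(v_ox, z, v0=1.0):
    """Fit z = alpha * (v0 / v_ox)**gamma at fixed voltage (log-log line)."""
    x = np.log(v0 / np.asarray(v_ox, dtype=float))
    y = np.log(np.asarray(z, dtype=float))
    gamma, ln_alpha = np.polyfit(x, y, 1)
    return float(np.exp(ln_alpha)), float(gamma)

# Synthetic check only: heights generated from the quoted parameters.
speeds = np.array([0.25, 0.5, 1.0, 2.0, 4.0, 7.0])  # scan speed in um/s
z_syn = oxide_height(10.0, speeds, alpha0=0.78, u_th=4.2, gamma=0.23)
alpha, gamma = fit_power_law(speeds, z_syn)

# Height predicted for 10 V and 2 um/s, the conditions quoted for Fig. 1.
z_fig1 = float(oxide_height(10.0, 2.0, alpha0=0.78, u_th=4.2, gamma=0.23))
```

With these parameter values the predicted height at 10 V and 2 μm/s is about 3.9 nm, which lies within the 3.8±0.5 nm reported for the pattern of Fig. 1.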

Fig. 5. Oxide height z as a function of $(U_{ox} - U_{th})(v_0/v_{ox})^{\gamma}$ for the data of
Figs. 2 and 3 [Si(111):H, $F_N$ = 20 nN], and data from Ref. 4.

Fig. 6. (a) Fit of the oxidation data of Lukes (Ref. 18) to a power law with
exponent 1/4. (b) Residues of the fitting procedure.

Two examples of silicon oxidation not induced by an electric
field that also obey the power law kinetics with an exponent
$\gamma$ = 1/4 are given in Figs. 6 and 7. The data are taken
from Refs. 18 and 19. The upper panels show in each case
the oxide thickness Z versus oxidation time together with the
power law (dashed line), whereas the lower panels give the
residues. We have chosen these two examples because they
both  deal  with  RT  oxidation  in  a  thickness  regime  that  is 
comparable  to  ours.  At  the  same  time  the  oxidation  proce¬ 
dures  differ  in  an  important  aspect.  Carriere  and 
co-workers (Fig.  7)  oxidize  under  ultrahigh- vacuum  condi¬ 
tions  in  a  partial  pressure  of  1.3X10“^  Pa  O2  and  in  the 
presence  of  electron  bombardment.  This  must  therefore  be 
considered a dry oxidation process. In contrast, Lukes oxidizes
under ambient conditions, which implies a certain
amount of humidity. Humidity is also present in our case and
it appears to be a necessary condition for FIO, as demonstrated
recently by Kramer, van den Berg, and Schonberger,
who find oxidation rates that scale with relative air humidity
in an experiment very similar to ours.^^ The results of Figs. 6
and 7 and our results therefore indicate that neither the electric
field nor the presence of water vapor is essential for the
observed kinetics. A description of oxidation kinetics by a
simple power law is in fact not uncommon. Reisman et al.
have demonstrated that all dry oxidation data of Si published
up to 1987 can be described by such a power law, albeit with
exponents ranging generally between 0.57 and 0.90. Only in
one case was an exponent as low as 0.24 observed, which
would agree with the data presented so far.

At present, the exact microscopic mechanisms
underlying the observed doping dependence and the empirical
growth relation remain speculative. Nevertheless, some
aspects should be pointed out. A doped-silicon tip is used for
sample oxidation. This tip is initially covered with a thin native
oxide, which might, however, be worn away by contact
during scanning. While a voltage drop over this tip oxide
may partly account for $U_{th}$, it cannot explain its variation
with substrate doping. This behavior, however, as well as the
variation of $dz/dU_{ox}$ with doping, might be related to carrier
depletion near the apex region of the tip,^^ which we expect
to depend on the doping of the tip as well as on that of the
sample.

Fig. 7. Same as Fig. 6 for the oxidation data of Carriere and co-workers
(Ref. 19).

Further, before layer growth of the oxide commences it is
necessary to reactivate the stabilized Si(111):H surface for
initial oxidation. Becker et al. demonstrated in vacuo
electron-stimulated desorption of hydrogen from n+-type
(0.007 Ω cm) Si(111):H by bombardment with electrons
above 2 eV from a scanning tunneling microscope tip.^^
Schwartzkopff et al. in a similar experiment find ~4.5 V as a
H-desorption threshold from p-type (~1 Ω cm) Si(111):H.^'^
In both cases the electron current was in the nA range, while
in our experiment it remained below 10 pA. This implies in
the case of Fig. 1 that an electron dose of less than 5 pC per scan
line and/or the presence of the intense electric field between
tip and sample, which itself is also dependent on oxide thickness,
was sufficient for oxidation.

The  rate  limiting  mechanism  for  oxide  layer  growth  lead¬ 
ing  to  the  observed  kinetics  will  be  difficult  to  deduce  from 
measurements of z versus oxidation time alone. Power-law
kinetics for Si oxidation have been derived assuming very
different underlying mechanisms.^^’^^

V.  SUMMARY 

In conclusion, we have presented experiments on the doping
dependence of SFM-based FIO of Si(111):H, as well as
on its kinetics. FIO is observed only for voltages exceeding a
doping-dependent threshold, above which the oxidation kinetics
follow a power law incompatible with several classical
models for oxide growth developed in the regime of thicker
oxides. Further work should address differently doped tips,
different tip materials, as well as a controlled variation of
environmental conditions such as humidity, temperature, and
gas partial pressures.

Light induced surface reactions of reverse-biased n-type porous Si with H2O, CH3OH or
CF3COOH are studied by diffuse reflectance and transmission Fourier transform infrared
spectroscopy. The goal of this work is to identify reagents which will react with the irradiated,
electron deficient surface but not the non-irradiated, neutral surface of reverse-biased porous Si. The
porous Si is the anode in an electrochemical cell with the desired reagent present as the electrolyte
solution. H2O/0.5 M NaCl is found to react with the reverse-biased Si in both the light and the dark
to form the native oxide. CH3OH/0.1 M NaBF4 generates a small amount of methoxide-modified Si
in both the light and the dark. CF3COOH/1 M CF3COONa reacts with the surface when the porous
Si is optically excited to create a surface-bound trifluoroacetic ester species. CF3COOH/1 M
CF3COONa does not react with reverse-biased Si in the dark. The Si surface can be
photolithographically patterned with the trifluoroacetic ester species. © 1996 American Vacuum
Society.


I.  INTRODUCTION 

In this article we describe a method for using the electronic
structure of n-type Si to photopattern Si surfaces. The
n-type Si is brought under positive potential control in an
electrochemical cell. This is the reverse-biased condition
where the bands are bent such that upon optical excitation,
electrons are driven into the bulk of the semiconductor,
changing the state of charge of the Si surface. We have investigated
three common oxygen-containing nucleophiles
(water, alcohols and carboxylic acids) to identify reagents
which will react with the irradiated, positively charged Si
surface but not the non-irradiated neutral Si (Scheme 1). The
ability of a nucleophile to discriminate between the irradiated
and non-irradiated surfaces will allow us to photopattern
the species onto the Si surface.

There are few examples of light induced reactions with Si
in the literature. Photoinduced electron transfer involving
semiconductors and solution redox couples is an active area
of investigation.^ Irradiation of n-type Si has been shown to
polymerize and dope polypyrrole films^ and excitation of
porous Si is known to enhance styrene polymerization.^ Reactions
at Si electrodes in solar cells have been studied in an
effort to minimize corrosion.^

The Si used in this work is n-type Si(100) which has
been etched in an HF acid solution to produce a photoluminescent
porous layer. The surface of as-etched porous Si is
H-terminated.^ Some work on the surface chemistry of porous
Si has been published, including electrochemical
alkoxidation^ and modification of porous Si by chemical
means.^ An earlier communication of our work on photoelectrochemical
modification of porous Si has been published.^

There are two advantages in using porous Si to study Si
surface chemistry. Porous Si has a large surface area, aiding
spectroscopic identification of surface species. A second advantage
is that porous Si is photoluminescent^ with an intensity
which depends on the nature of the surface
composition,^ allowing for easy identification of patterned
areas.

II.  EXPERIMENT 

A.  Materials 

The Si was (100) n-type, 1 Ω cm resistivity, polished on
one side, purchased from International Wafer Service and
used as received.

B.  Apparatus 

The  light  source  was  a  tungsten  filament  Kodak  ELH  bulb 
controlled  by  a  Variac.  Light  intensity  was  monitored  by  a 
calibrated  photodiode.  The  potentiostat/galvanostat  was  a 
Princeton Applied Research Model 363. Fourier transform
infrared (FTIR) spectra were obtained on a Nicolet Model
550 with an interchangeable diffuse reflectance apparatus by
Spectra-Tech.

Fig. 1. Transmission FTIR spectrum of porous Si.

The electrochemical cell was custom made from Teflon
and consisted of two pieces, a top with a cylindrical bore of
1.2 cm diameter which acts as the solution bath, and a bottom
piece with edges cut to fit the FTIR spectrometer
transmission-mode sample holder. The Si was cut into
squares approximately 2 cm on a side and secured between
two O-rings by screwing together the bottom and top of the
Teflon cell. Kalrez O-rings were found to withstand the carboxylic
acid solutions much better than Viton. Both O-ring
materials withstand HF. The counter electrode was a Pt wire
coil.


C.  Porous  Si  formation 

The Si working electrode was galvanostatically etched in
1:1 49% HF(aq):CH3CH2OH at 88.5 mA/cm^2 for 2 min
while under ~30 mW/cm^2 white light illumination. The resulting
porous layer was washed thoroughly with CH2Cl2
and dried under N2 stream. The porous Si sample was
handled in air before and after derivatization.

Fig. 2. FTIR spectra following electrolysis of porous Si in H2O/0.5 M NaCl
at 1 V for 10 min while irradiated and not irradiated. Both samples develop
oxide.

Fig. 3. FTIR spectra following electrolysis of porous Si in CH3OH/0.1 M
NaBF4 at 1 V for 10 min while irradiated and not irradiated. Both samples
develop a small amount of the surface methoxide.

The etch conditions were chosen to produce a thin, photoluminescent
layer with large diameter pores. The current
density of 88.5 mA/cm^2, near the electrochemical polishing
regime, makes large diameter pores. An etch time of less
than 2 min results in porous layers that are not photoluminescent,
while longer etch times produce thicker porous layers
of which only a fraction is electrochemically accessible.
The pore diameters have not yet been determined and the
porous layer is estimated to be several microns thick.
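
The quoted etch conditions can be turned into a total current and a charge density with simple arithmetic. The sketch below is illustrative only and rests on one assumption that is not stated in the text: that the etched area equals the cross-section of the 1.2 cm cell bore. The function name is hypothetical.

```python
import math

def etch_current_and_charge(j_ma_per_cm2, bore_diameter_cm, etch_time_s):
    """Return (area in cm^2, current in mA, charge density in C/cm^2)."""
    # Assumption: the exposed Si area is the bore cross-section.
    area_cm2 = math.pi * (bore_diameter_cm / 2.0) ** 2
    current_ma = j_ma_per_cm2 * area_cm2
    charge_c_per_cm2 = j_ma_per_cm2 * 1e-3 * etch_time_s
    return area_cm2, current_ma, charge_c_per_cm2

# 88.5 mA/cm^2 for 2 min in a 1.2 cm diameter bore.
area, current, charge = etch_current_and_charge(88.5, 1.2, 120.0)
```

Under that assumption the area is about 1.13 cm^2, so the current density corresponds to a total current of roughly 100 mA, and the charge passed is about 10.6 C/cm^2.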

Fig. 4. FTIR spectrum following electrolysis of porous Si in
CF3COOH/1 M CF3COONa at 1 V for 10 min with irradiation indicates the
surface has been modified with a trifluoroacetic ester species. In the dark, no
reaction occurs.

Fig. 5. Left: Photograph of the photoluminescence from a porous Si sample photopatterned with an ester-modified square region. Photoluminescence from the
ester-modified region is less intense than that from the H passivated surface. Right: Diffuse reflectance FTIR spectra confirm that the patterned square is
ester-modified porous Si, while the area outside the square retains the initial hydride surface.


D.  Si  Derivatization 

Reagent solutions of H2O/0.5 M NaCl, CH3OH/0.1 M
NaBF4, CH3OH/0.1 M CH3ONa and CF3COOH/1 M
CF3COONa were prepared. All chemicals except H2O were
purchased from Aldrich or Acros and used as received.
H2O was purified by a Barnstead Nanopure filtration system
to a resistivity of 18 MΩ cm. The reagents were used as
electrolyte solutions in the same Teflon electrochemical cell
used to etch the porous Si (after thoroughly rinsing with
CH2Cl2 to remove trace HF). Derivatization of the Si with the
reagent solutions proceeded at a constant potential of 1 V (Si
biased positive) and a light intensity of 20 mW/cm^2 for
10-20 min. The derivatized sample was washed thoroughly
with CH3CN and CH2Cl2 and dried under N2 stream. FTIR
spectra were collected in a N2-purged chamber. The modified
surfaces were stable in air for at least the 30 min time period
of the experiment. The surface-bound alkoxide or ester
groups could be removed by washing with water or other
alcohols, but not with CH2Cl2.

Photopatterned samples were generated by imaging a
square mask placed in front of the light source through a
Vivitar 70-180 mm zoom camera lens onto the Si surface.
The patterned areas were squares of roughly 0.2 cm on a
side, large enough to be sampled by front-face diffuse reflectance
IR spectroscopy. Diffuse reflectance FTIR spectra are
reported in Kubelka-Munk units.
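
For readers unfamiliar with Kubelka-Munk units, the sketch below shows the standard transform f(R) = (1 - R)^2 / (2R) applied to a diffuse reflectance R measured relative to a non-absorbing reference. It is an illustration only; the spectrometer software used in this work is not described here, and the function name is hypothetical.

```python
import numpy as np

def kubelka_munk(reflectance):
    """Kubelka-Munk function f(R) = (1 - R)**2 / (2 * R).

    `reflectance` is the diffuse reflectance relative to a
    non-absorbing reference, with 0 < R <= 1.
    """
    r = np.asarray(reflectance, dtype=float)
    if np.any((r <= 0.0) | (r > 1.0)):
        raise ValueError("reflectance must satisfy 0 < R <= 1")
    return (1.0 - r) ** 2 / (2.0 * r)

# Example: a band that lowers the relative reflectance to 0.5.
km_half = float(kubelka_munk(0.5))
km_none = float(kubelka_munk(1.0))   # no absorption gives zero
```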

III.  RESULTS  AND  DISCUSSION 

The porous Si spectrum, Fig. 1, is dominated by the
ν(Si-Hx) stretching modes at 2089 cm^-1 for ν(Si-H), 2116
cm^-1 for ν(Si-H2) and 2139 cm^-1 for ν(Si-H3).^^’^^ A
δ(Si-H2) scissor mode absorption is present at 908 cm^-1.
An absorption band at 627 cm^-1 has been assigned to a
combination of ν(Si-H) and ν(Si-Si) stretches. No absorption
bands that can be assigned to Si-F stretches were detected.

The porous Si was oxidized by photoelectrolysis in H2O/0.5 M NaCl for 10 min
at 1 V applied bias. The aqueous solution did not penetrate the porous layer
well, but sufficient contact was made to generate a photocurrent within the
cell. The photocurrent increased over time as oxidation of the Si surface
rendered it more hydrophilic. FTIR spectroscopy identified the surface after
reaction as the Si oxide, Fig. 2. Absorbances due to stretches of hydrides
on Si backbonded to O appear at 2250 cm⁻¹ and 2195 cm⁻¹. ν(Si-O) stretching
modes at 1070 cm⁻¹ and 840 cm⁻¹ also appear upon derivatization. The
ν(Si-Si) absorption band at 627 cm⁻¹ is found to decrease upon oxidation.

Electrolysis in the dark generates a small dark current and surface
oxidation almost as extensive as with the illuminated sample, Fig. 2.
Apparently, H2O/0.5 M NaCl is too reactive with Si to discriminate between
illuminated and non-illuminated Si.

Electrochemical derivatization of porous Si in CH3OH/0.1 M NaBF4 in both the
light and the dark results in a slight oxidation of the surface, and minimal
deposition of surface methoxide, Fig. 3. Both the irradiated and
non-irradiated samples have absorptions in the ν(Si-O) and ν(C-H) regions.
The ν(C-H) absorptions exhibit the characteristic double peaks of -CH3
symmetric and anti-symmetric stretching modes. The small absorptions at
2253 cm⁻¹ are attributed to ν(OSi-H) and occur at the same frequency as in
the chemically prepared alkoxide-modified porous Si. Chazalviel and
co-workers studied electrolysis of p-type porous Si in CH3OH and concluded
that formation of Si-OCH3 bonds activates that Si atom to further oxidation,
leading to dissolution of surface Si (similar to the attack on Si by HF)
rather than chemical modification.

Porous Si held at +1 V under 20 mW/cm² white light illumination in
CF3COOH/1 M CF3COONa reacts to produce a Si ester modified surface, Fig. 4.
The ν(OSi-H) stretching frequency is shifted to 2261 cm⁻¹ due to the
back-bonded trifluoroacetic ester species, which has a greater inductive
effect than either the simple oxide or the methoxide. We also observed the
appearance and growth of an absorption at 1771 cm⁻¹, which we assigned to a
carbonyl stretching mode. Shifts in the carbonyl stretching frequency are
useful fingerprints for changes in its chemical environment. The ν(C=O)
stretch of the trifluoroacetic acid starting material is at 1785 cm⁻¹. A
model compound for the surface-bound species, (CH3)3SiOOCCF3 (Aldrich), has
a carbonyl absorption at 1772 cm⁻¹. Absorptions due to the CF3 group are
present at 1377 cm⁻¹. A ν(Si-O-C) asymmetric stretch is observed at
1144 cm⁻¹. A broad absorption around 3500 cm⁻¹ is due to stretches from
surface-bound hydroxide and adsorbed moisture, and is an indication of the
hydrophilicity of the surface. The presence of the ν(Si-O-C) absorptions,
the presence and position of the ν(C=O) absorption and the position of the
ν(OSi-H) absorption all indicate that an ester species has bonded to the
surface. Binding in a bidentate carboxylate mode (where both O atoms of the
CO2 coordinate to the substrate), as is observed on many metal oxides, is
ruled out because such a species would have three ν(CO2⁻) stretches between
1600 cm⁻¹ and 1400 cm⁻¹, none of which are observed.

Samples derivatized in the dark show no sign of surface modification,
Fig. 4. A slight amount of oxidation due to handling is seen by the small
ν(Si-O) band at 1070 cm⁻¹, but no ν(C=O) or ν(OSi-H) stretches are present.

As  another  control  experiment,  the  counter  electrode  and 
Si  were  shorted  together  and  illuminated.  The  photocurrent 
alone  was  sufficient  to  esterify  the  Si,  though  at  a  much 
lower  rate  than  with  an  applied  bias. 
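
The band positions quoted in this section can be collected into a small lookup table, which makes it easy to check a measured peak against the assignments. This is only a convenience sketch: the table contains values stated in the text, but the function name `assign_band` and the 8 cm⁻¹ matching tolerance are assumptions introduced here.

```python
# Band positions (cm^-1) and assignments as quoted in the text above.
BANDS = {
    2089: "nu(Si-H)",
    2116: "nu(Si-H2)",
    2139: "nu(Si-H3)",
    908: "delta(Si-H2) scissor",
    2261: "nu(OSi-H), ester back-bonded",
    1771: "nu(C=O), surface ester",
    1377: "CF3",
    1144: "nu(Si-O-C) asymmetric",
    1070: "nu(Si-O)",
}


def assign_band(position_cm1, tolerance_cm1=8.0):
    """Return the nearest tabulated assignment, or None if nothing is close.

    The tolerance is an arbitrary illustrative choice, not a value from the text.
    """
    nearest = min(BANDS, key=lambda ref: abs(ref - position_cm1))
    if abs(nearest - position_cm1) <= tolerance_cm1:
        return BANDS[nearest]
    return None


if __name__ == "__main__":
    for peak in (1772, 2090, 1500):
        print(peak, "->", assign_band(peak))
```

For example, the carbonyl band of the model compound at 1772 cm⁻¹ falls within tolerance of the surface ester band at 1771 cm⁻¹, whereas the 1600-1400 cm⁻¹ window expected for a bidentate carboxylate has no tabulated match.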

All three reagents, H2O, CH3OH, and CF3COOH, react with porous Si upon
photoelectrolysis. Water is reactive enough that the dark current is
sufficient to oxidize Si. Electrolysis in CH3OH in the light or the dark
leads to dissolution of Si by the methoxide, leaving behind only a small
amount of methoxide-modified Si. Trifluoroacetic acid reacts with the
reverse-biased Si surface when the surface is sufficiently positively
charged by irradiation but does not react with the non-irradiated Si.

The necessity for optical excitation of the Si to initiate reaction with
carboxylic acids allows the Si surface to be photopatterned with the ester.
The apparatus for photoelectrochemical patterning is shown in Scheme 2. A
square region in the middle of the porous Si is illuminated during the
electrolysis in CF3COOH/1 M CF3COONa. Figure 5 shows a photograph of the
photoluminescence from the circular porous Si sample patterned with an
ester-modified square region. The ester-modified porous Si has a lower
photoluminescence quantum yield because the ester provides additional
surface sites for non-radiative recombination. Diffuse reflectance FTIR of
the patterned surface identifies the square area as that containing the
surface-bound ester, while the area outside the square has only the
ν(Si-Hx) absorptions of the starting porous Si, Fig. 5.


IV.  CONCLUSIONS 

H2O, CH3OH, and CF3COOH electrolyte solutions are found to react with the
surface of reverse-biased n-type porous Si. H2O was reactive in both the
light and the dark, producing the simple oxide. CH3OH produced only slight
methoxide modification in either the light or the dark. CF3COOH was
unreactive with the surface in the dark, but optical excitation of the
reverse-biased n-type Si rendered the Si surface susceptible to nucleophilic
attack by CF3COO⁻, producing the trifluoroacetic ester-modified surface.
This discrimination in reactivity between irradiated and nonirradiated areas
allowed the porous Si to be photopatterned with the surface-bound
trifluoroacetic ester species.

Stressing and high field transport studies on device-grade SiO2 by ballistic
electron emission spectroscopy

R. Ludeke, H. J. Wen, and E. Cartier

IBM T. J. Watson Research Center, P.O. Box 218, Yorktown Heights, New York 10598
(Received 22 January 1996; accepted 20 April 1996)

The tip of a scanning tunneling microscope was used to inject hot electrons
across the gate and into the oxide of a metal-oxide-semiconductor structure.
This method, known as ballistic electron emission microscopy (BEEM), allows
an arbitrary choice of the energy of the injected electrons, which may be
further accelerated by the application of a gate bias. The high current
densities and choice of energy make BEEM an attractive method to study hot
electron transport and breakdown phenomena in dielectrics. The studies
reported here were made on Pd/SiO2/Si(100) structures with a SiO2 layer
thickness of 3.8 nm. Monte Carlo techniques were used to calculate the
spreading of the electron beam as it traverses the oxide. A strong
dependence of the spreading on the kinetic energy and oxide thickness was
observed. Using the calculated beam spreads to determine current densities
and injected charge densities, the charge-to-breakdown (Q_bd) was measured
for several breakdown sequences. The Q_bd consistently exceeded by several
orders of magnitude the values obtained by conventional Fowler-Nordheim (FN)
tunnel injection under high field conditions. Most of the time, breakdowns
could not be achieved for 3.8 nm oxides. It is concluded that
impurities/defects still control all observed breakdowns; an intrinsic
limit, although claimed to have been reached, has not yet been established.
Changes in the BEEM spectra with injected electron charge are interpreted in
terms of a three stage process to breakdown: (I) electron trap creation and
filling at the SiO2-Si interface, (II) prebreakdown believed to occur
through thinning of the oxide that starts at the oxide-Si interface, and
(III) oxide punch-through, characterized by an injection threshold ~1 V,
close to that for Si. The role of hot hole injection into the SiO2 valence
band was also assessed and deemed a negligible factor in the degradation
process under the zero or low oxide biases used in the experiment reported
here. © 1996 American Vacuum Society.


I.  INTRODUCTION 

From a design point of view, future generations of integrated circuits (ICs)
can be scaled well below the presently manufactured 0.25 μm technology, a
projection that requires gate oxides ultimately reaching thicknesses below
2 nm. However, the reliability of thin gate oxides is one of the most
important problems in metal-oxide-semiconductor (MOS) technology. Hence, the
ability of manufacturers to mass produce such aggressively scaled ICs will
depend on the successful solution of a number of materials and processing
problems, among which is the oxide integrity for gate oxides in the 4 nm
range and below. This problem has been designated a potential show stopper.
In this thickness range, interface roughness, deviations from stoichiometry,
and impurities in the oxide and at the SiO2-Si interface are believed to be
especially critical. Their roles in affecting device reliability and
eventually oxide breakdown are presently not known, and attempts at gaining
further understanding of these issues are under way at a number of
laboratories. A fundamental physical description of oxide degradation and
failure is of crucial importance in formulating models that predict device
reliability and lifetimes from burn-in and accelerated aging studies.

Oxide breakdown studies are based on failure analysis of electrically
stressed MOS capacitor structures or MOS field effect transistors (FETs).
Conventionally, energetic (hot) electrons are injected into the conduction
band of the SiO2 by means of field assisted injection or Fowler-Nordheim
(FN) tunneling. Following injection, the electrons gain energy through field
acceleration until they reach the opposite interface. The injection may
occur in a constant voltage or a constant current mode. The time is measured
until device failure occurs (time to breakdown). This time, which is
dependent on the fluence or total injected charge, exhibits a complex
dependence on field and oxide thickness, which can be rationalized in terms
of electron heating in the oxide film. The total charge injected at failure
is referred to as the charge to breakdown, Q_bd. An example of the
dependence of Q_bd on oxide field and thickness is shown in Fig. 1. These
Q_bd's represent state-of-the-art values on high quality oxides. Their
values have increased over the years as oxide growth and substrate cleaning
methods have improved, implying that the origin of breakdowns is still
associated with defects or impurities. It is general practice, however, that
for the best oxides, such as those shown in Fig. 1, the breakdown is
addressed as the intrinsic mode, because breakdown distributions are narrow
and quite distinct from early fails or low-field breakdown. In a strict
sense, however, intrinsic would imply an oxide free of defects, with the
Q_bd's reflecting the practical upper limit of oxide reliability.
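
The relation between time to breakdown and charge to breakdown described above can be written out explicitly. The following is a minimal sketch, assuming the injected current density is known: in constant current mode the charge to breakdown is simply the product of current density and time, while in constant voltage mode the (time-varying) current density has to be integrated up to failure. The function names and the example numbers are illustrative assumptions, not values from this work.

```python
def charge_to_breakdown(current_density_a_cm2, time_to_breakdown_s):
    """Constant-current stress: Q_bd = J * t_bd, in C/cm^2."""
    return current_density_a_cm2 * time_to_breakdown_s


def charge_to_breakdown_from_trace(times_s, current_densities_a_cm2):
    """Constant-voltage stress: integrate J(t) up to failure (trapezoid rule)."""
    if len(times_s) != len(current_densities_a_cm2):
        raise ValueError("times and current densities must have equal length")
    total = 0.0
    for i in range(1, len(times_s)):
        dt = times_s[i] - times_s[i - 1]
        mean_j = 0.5 * (current_densities_a_cm2[i] + current_densities_a_cm2[i - 1])
        total += mean_j * dt
    return total


if __name__ == "__main__":
    # Hypothetical stress conditions, for illustration only.
    print(charge_to_breakdown(0.1, 100.0), "C/cm^2")
    print(charge_to_breakdown_from_trace([0.0, 1.0, 2.0], [1.0, 1.0, 1.0]), "C/cm^2")
```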

A fundamental and little disputed aspect of oxide breakdown is the role of
the hot electrons in generating charge traps at the interfaces. Microscopic
details of such traps are either lacking or highly speculative, with the
exception of traps generated at the cathode due to the presence of hydrogen.
The latter is released at the anode through impact with energetic electrons
(kinetic energy ≳2 eV). The hydrogen diffuses to the cathode where it
combines with another hydrogen atom and escapes. The release of the second
hydrogen is believed to leave behind an interface state or an electron trap.
For thicker oxides under high oxide fields, conditions for which some
electrons in the oxide reach energies exceeding 9 eV (the SiO2 band gap
energy), electron-hole pairs may be created in the oxide by impact
ionization, with the holes being swept to the injecting cathode where they
can be trapped by oxygen vacancies, for example. Electron-hole recombination
can then create interface states and electron traps. This mechanism is
important only for oxide thicknesses ≳10 nm, and will not be further
discussed, as these conditions are not encountered in the present
experiments. Hole injection into the oxide may also occur at the Si-SiO2
interface through electron-hole pairs generated by impact ionization of the
hot electrons reaching the anode (Si substrate). Although this is deemed a
low efficiency process, some breakdown models rely exclusively on hole
injection as the primary trap creation mechanism at the cathode, and the
principal mechanism leading to progressive oxide degradation emanating from
the cathode. Other models de-emphasize the role of hole trapping and
attribute the start of oxide degradation to electron trapping at the anode.

Fig. 1. Normalized charge to breakdown as a function of average electric
field for indicated oxide thicknesses at room temperature. These results were
obtained through field stressing (injection by FN tunneling at the polysilicon
gate). Open data points are experimental results, solid lines are model
calculations, adapted from Ref. 16. The solid data points represent oxide
failures achieved with BEEM for 3.8 nm and 7.5 nm oxides. The energies of
the STM injected electrons were converted to the equivalent field necessary
to achieve those values.

In this article, we present results of stressing thin SiO2 layers by
directly injecting hot electrons into the conduction band of SiO2 with the
tip of a scanning tunneling microscope (STM). This method obviates the need
for field acceleration to energize the electrons in the SiO2 conduction
band, although this is still an option that allows additional experimental
flexibility. The STM mediated injection provides unprecedented spatial
resolution and a choice of the location of stressing. The results reported
here were obtained for 3.8 nm device grade oxides. The principal conclusions
of this study are: (i) in general, the oxides were difficult to break down
in spite of injecting fluences exceeding by several orders of magnitude the
reported Q_bd values of conventionally (FN) stressed oxides (such as those
in Fig. 1) for comparable anode energies; (ii) when breakdowns were
observed, the Q_bd far exceeded the best-of-breed published results;
(iii) strong spectral evidence suggests that breakdown emanates from the
anode following a clearly resolved trapping/charging stage and a
prebreakdown phase; and (iv) hole injection appears to be inconsequential in
breakdown processes initiated by STM-mediated hot electron injection.

II.  EXPERIMENTAL  DETAILS 

A.  Ballistic  electron  emission  microscopy/ 
spectroscopy 

Ballistic electron emission microscopy (BEEM) is an STM based microscopy
that differs from conventional STM by the presence of a thin metallic or
other conducting film overlying the substrate, the SiO2-Si structure in the
present case. The sole purpose of this conductive layer is to provide a
potential reference plane (generally at ground potential) relative to which
the STM tip is biased. The electrons injected into this layer by the STM tip
have an energy equal to eV_T, where V_T is the STM tip bias. The thickness
of the metal layer is comparable to or preferably less than the electron
mean free path, so that the electrons can traverse the layer without
scattering (ballistically). If their energy is larger than the potential
step at the interface (in the present case the difference eV_0 between the
Fermi level in the metal and the bottom of the SiO2 conduction band), the
electrons have a finite probability of being injected into the conduction
band, traversing the oxide, and being collected as a collector current I_c
in the Si substrate. This probability is dependent on the overlap of the
conduction band densities of states at the interfaces, the transmission
probability T_ox across the oxide, as well as transmission probabilities of
a quantum mechanical origin. These issues have been detailed elsewhere.
Suffice it to say here that T_ox is strongly dependent on the scattering
rates of electrons with longitudinal optical phonons (energy dispersive) and
acoustical phonons (momentum dispersive), rates which are strongly dependent
on the kinetic energy of the electrons. The latter is determined by a
combination of the energy at the point of injection (eV_T − eV_0) and any
gain in energy from field acceleration in the oxide due to the presence of
an oxide potential V_ox applied by means of a bias potential V_b. An energy
diagram for a BEEM experiment is illustrated in Fig. 2 for conditions of
positive (accelerating) and negative (retarding) oxide potentials.
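
The energy bookkeeping in the preceding paragraph can be sketched numerically. The code below assumes a barrier of V_0 ≈ 4 V, which is not stated at this point but is inferred from the values quoted in the Monte Carlo section (a tip bias of 5.5 V giving 1.5 eV and 9.8 V giving about 5.8 eV in the oxide). The function names are hypothetical, tip biases are entered as magnitudes, and phonon losses are ignored, so the second function is only an upper bound.

```python
V0_ASSUMED = 4.0  # V; inferred barrier height, an assumption (see lead-in)


def injection_energy_ev(tip_bias_v, barrier_v=V0_ASSUMED):
    """Kinetic energy (eV) just inside the oxide conduction band.

    Returns 0 when the tip bias is below the injection threshold.
    """
    return max(0.0, tip_bias_v - barrier_v)


def max_anode_energy_ev(tip_bias_v, oxide_potential_v=0.0, barrier_v=V0_ASSUMED):
    """Upper bound on the energy at the SiO2-Si interface, ignoring losses.

    A positive oxide potential accelerates the electrons, a negative one
    retards them.
    """
    e_inject = tip_bias_v - barrier_v
    if e_inject <= 0.0:
        return 0.0
    return max(0.0, e_inject + oxide_potential_v)


if __name__ == "__main__":
    for v_tip in (5.5, 9.8, 10.0):
        print(f"V_T = {v_tip} V -> {injection_energy_ev(v_tip):.1f} eV at injection")
```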

In the spectroscopy mode, the raster scan of the STM is stopped and the
collector current I_c is measured as V_T is ramped over a range that
includes the barrier step potential V_0. I_c becomes finite once V_T exceeds
V_0 for V_ox ≥ 0, or exceeds V_0 − V_ox for the retarding (V_ox < 0)
conditions illustrated in Fig. 2.

The STM is generally operated at a constant tunneling current I_T. I_c can
also be used for image contrast in the scanning mode, thereby generating an
electron transmission image of the MOS structure, commonly referred to as a
BEEM image, that is recorded simultaneously with the STM topographic image.

Fig. 2. Schematic energy band diagram for a BEEM experiment on an MOS
structure for positive oxide bias (a) and negative oxide bias (b). V_ox is
controlled by means of the gate bias V_b. For Pd/SiO2/Si, V_ox < 0 when
V_b < −0.5 V. The energy of the electrons injected by the STM tip is
determined by the tip bias V_T. Their energy distribution is represented by
the exponential function f_T. The distribution of the electrons reaching the
SiO2-Si interface is shown by the hatched distributions. eV_0 is the
energy difference between the Fermi level E_F of the Pd layer and the bottom
of the SiO2 conduction band.

In stressing studies using BEEM, electrons at a given energy (eV_T) are
injected locally (well within a nm) for a given time, with the collector
current determining the total fluence. Then an I_c versus V_T spectrum is
taken to assess the consequences of the stressing in terms of changes in the
local transmission characteristics, including threshold shifts. This process
is then repeated until either the stressed area moves out of range due to
instrument drifts, or the oxide breaks down as evidenced by a low value of
V_0. Alternatively, spectra over a broad energy range are repeatedly taken
at the same spot until either of the two previously mentioned events occurs.
As a point of reference, an I_c of 50 pA collected over an area of 5 nm²
(realistic for oxides thin enough to be dominated by ballistic transport)
corresponds to a charge injection of 10³ C/cm² s. In our experiments, we can
achieve injection energies of 9 eV. To achieve this value by conventional FN
injection, one would need to apply a field of ~24 MV/cm to a 5 nm oxide.
Still larger effective energies can be achieved with BEEM by applying an
accelerating oxide bias.
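
The two reference numbers in this paragraph can be checked with a few lines of arithmetic. The current density follows directly from the quoted current and area. For the equivalent FN field, the sketch assumes that an FN-injected electron enters the oxide conduction band after tunneling through a barrier of about 3.1 eV and then gains energy ballistically from the field; both the barrier value and the lossless acceleration are assumptions made here to reproduce the quoted ~24 MV/cm, not a procedure stated in the text.

```python
NM2_TO_CM2 = 1e-14  # 1 nm^2 expressed in cm^2


def current_density_a_cm2(current_a, area_nm2):
    """Current density in A/cm^2 (equivalently C/cm^2 per second)."""
    return current_a / (area_nm2 * NM2_TO_CM2)


def fn_equivalent_field_mv_cm(energy_ev, thickness_nm, barrier_ev=3.1):
    """Field needed for an FN electron to reach `energy_ev` at the anode.

    Assumes lossless acceleration and a hypothetical 3.1 eV injection barrier.
    Conversion used: 1 V/nm = 10 MV/cm.
    """
    return 10.0 * (energy_ev + barrier_ev) / thickness_nm


if __name__ == "__main__":
    print(f"J = {current_density_a_cm2(50e-12, 5.0):.3g} A/cm^2")
    print(f"F = {fn_equivalent_field_mv_cm(9.0, 5.0):.1f} MV/cm for 9 eV in 5 nm")
```

Under these assumptions, 50 pA over 5 nm² gives 10³ A/cm², and 9 eV in a 5 nm oxide requires a field of about 24 MV/cm, consistent with the values quoted above.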

B.  Sample  preparation 

Device-grade amorphous SiO2 layers were thermally grown in the range
750-800 °C at 1 atm of dry oxygen for 30-50 min. The substrates were
125-mm-diam Si(100) wafers doped to the low 10*^ range. No additional
treatments were performed after the oxidation. Oxide thicknesses for the
studies reported here are 3.8 and 7.5 nm. Approximately 8×15 mm² samples
were cleaved from the wafers and introduced into an ultra high vacuum (UHV)
preparation chamber. Arrays of Pd dots, 0.2 mm in diameter, were deposited
by thermal evaporation of Pd through a shadow mask, with the substrate held
at ~30 K. The finished sample was allowed to warm up to room temperature and
was subsequently transferred under UHV into the STM chamber. The grounding
contact needed to bias the tip was carefully positioned onto a selected Pd
dot by means of three orthogonally mounted inchworms. Deposition at low
temperatures improves the surface morphology of the thin (~3.5 nm) Pd films,
resulting in a morphology that consisted of a nodular structure typically
8 nm in diameter that protruded less than 1 nm above the valleys. An STM
image is shown in Fig. 3(a). A smooth surface morphology is needed to
minimize BEEM image contrast arising from the surface topography of the
metal, which has previously been observed to be strongly reflected in the
BEEM image. The corresponding BEEM image of the 3.8 nm Pd/3.8 nm
SiO2/Si(100) structure is shown in Fig. 3(b). The surface topography is
clearly replicated in the BEEM image. However, the nodular structure of the
topographic image appears rather flat in the BEEM image, that is, there is
little evidence that thickness differences in the metal film affect the net
transmission. The reason for this is that the mean free paths in the metal
film are larger than the thickness of the metal film. However, near the
grain boundaries the scattering drastically increases, resulting in the
electrons being scattered outside the acceptance cone for transmission
across the metal-SiO2 interface.

The change from the previously used Pt gates to the presently used Pd
metallurgy revealed a somewhat surprising enhancement in the net
transmission characteristics across the MOS device. This enhancement is
shown in Fig. 4, where we compare the BEEM spectrum of a 3.3 nm Pt film with
that of a Pd film of comparable thickness, with both films deposited on
3.8 nm oxides. It should be noted that a tunneling current of 10 nA was used
with the Pt gates, whereas for Pd this current was typically in the 1-5 nA
range. The enhanced transmission for Pd not only provides improved
statistics, but makes stressing studies much easier by allowing larger
currents to be injected into the oxide. An added advantage of lower
tunneling currents lies in the improved collimation of the injected current.
This is a direct result of the increased tunnel gap during injection at
lower currents. A more collimated electron beam is subject to fewer
scattering events, which enhances the transmission across the MOS structure.
This in turn results in crisper spectral features, as shown in Fig. 5. As
the data indicate, under optimal conditions useful spectra can be readily
obtained with tunnel currents of ≲0.5 nA.

Fig. 3. 100×100 nm² STM topograph (a) and BEEM image (b) of a 3 nm
Pd/3.8 nm SiO2/Si(100) structure. Black-white range over the image is 3 nm
in topography, and 100 pA in BEEM. V_T = −6 V, I_T = 2 nA, V_b = 0 V. The
average peak to valley in the topography image is less than 1 nm.

C.  Monte  Carlo  (MC)  simulations 

Details of the MC simulations, which are based on the code developed by
Fischetti et al., have been given elsewhere. In the present application, MC
simulations were used to estimate the spreading of the electron beam as it
traverses the SiO2 layer. Knowledge of the spreading is needed in order to
estimate current and charge densities during electrical stressing of the
SiO2. To calculate the spreading in the oxide, normal injection across the
Pd-SiO2 interface at a single point was assumed (delta function). The
potential in the oxide included image force lowering and the applied oxide
bias. Electrons returning to the metal were discarded. The positions of the
transmitted electrons were recorded and the local current density,
normalized to the injection current, was calculated after sufficient
statistics had been acquired (typically 10^ electrons). The distance r is
measured away from the injection point. In Fig. 6, typical results of the
simulations are summarized for different combinations of the oxide
thickness, the oxide bias, and the tip bias. The two distributions for
V_T = 5.5 V (open symbols) were calculated at a small accelerating bias. At
this tip bias, the electrons travel at an energy of 1.5 eV through the
oxide, where phonon scattering is small and mostly via longitudinal optical
phonons (forward scattering). This results in a streaming type motion in the
presence of a small accelerating field and leads to very little beam
broadening by the transport through the oxide. In these cases, the beam size
on the substrate side is expected to be comparable to the size of the
injection beam. The two distributions were scaled to the same heights in
such a way that the additional broadening in the thicker oxide can be
clearly seen. The peak at r=0 overshoots the y axis of the plot by one order
of magnitude, such that the actual broadening is considerably smaller than
suggested at first glance.

[Plot: BEEM at metal / 38 Å SiO2 / n-Si(100); abscissa: tip bias V_T (V)]

Fig. 4. Comparison of BEEM spectra for similar MOS structures except for
the gate metal. Pd gates allow substantially larger transmission than the
previously used Pt gates. Spectrum (a) is for a Pd gate, (b) for a Pt gate and
(c) is curve (b) normalized to a 2 nA tunnel current.

[Plot: BEEM at Pd / 38 Å SiO2 / n-Si(100); abscissa: tip bias V_T (V)]

Fig. 5. Normalized collector currents for a Pd/SiO2/Si(100) structure for a
range of indicated tunnel currents. The curves have been normalized to the
I_T = 2 nA spectrum, to which the ordinate scale refers. The relative
intensities scale inversely with I_T, which is attributed to the higher
collimation (and transmission) for the smaller I_T's, for which the STM
tip-surface distance is greater.

[Plot: forward injection; abscissa: distance from injection point, r (nm)]

Fig. 6. Spatial distribution of the normalized current densities at the
Si/SiO2 interface after electron transport through 3.8 and 7.5-nm-thick SiO2
films as calculated by MC simulations for various bias conditions. The
figure serves to illustrate the strong dependence of the beam spreading on
the electron energy in the oxide. At low electron energies (open symbols,
V_T = 5.5 V) little broadening is predicted. Strong broadening will occur if
electrons are either injected at high energy (3.8 nm, V_T = 9.8 V, full
diamonds), or if the electrons are heated inside the oxide by a strong
electric field (7.5 nm, full dots).

Most interestingly, considerably broader distributions are obtained if the
electrons are either injected at high energies (3.8 nm film, solid diamonds)
or accelerated by a strong electric field in the oxide (7.5 nm film, solid
circles). The broadening is caused by a rapid increase of the acoustic
phonon scattering (Umklapp dominated, isotropic, quasi-elastic scattering)
with increasing kinetic energy of the electrons. This scattering dominates
at energies above 3 eV and leads to strong backscattering that both reduces
the overall transmission efficiency and quite significantly spreads the
beam. In the 3.8 nm film, the electrons were injected at a tip bias of
9.8 V, which corresponds to an initial electron energy in the oxide of about
5.8 eV. Alternatively, electrons were injected at an energy of 1.5 eV into a
7.5 nm oxide and subsequently heated by a strong electric field, such that
their average energy reaches values of 4-5 eV. Again, significant beam
spreading is predicted and the increased oxide thickness leads to additional
broadening. Under conditions where the electrons reach energies in excess of
3-4 eV, the beam size on the substrate side is predicted to be largely
determined by the spreading in the oxide. This has to be taken into account
when calculating local current densities and charge to breakdown.

[Plot: BEEM for 38 Å SiO2 stressed at V_T = 10 V, I_T = 5 nA]

Fig. 7. Spectra of threshold region for BEEM-stressed Pd/3.8 nm SiO2/Si
structure. The kinetic energy of the injected electrons is ~6 eV. The total
injected charge was ~3×10^ C/cm². This charge is plotted in Fig. 1. No
oxide breakdown occurred. The stressing location is indicated in Fig. 8.

In the present determination of current and charge densities, it was assumed that r = 2.3 nm. For this value 74% of the current injected into the oxide is included in the density determinations. The remaining current in the tail of the distribution contributes little to the density. This leads to an effective area of 17 nm^2 for the beam spread at the SiO2-Si interface for the 3.8 nm oxide at a kinetic energy of 5.8 eV.
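
As a rough consistency check on these numbers (a sketch under stated assumptions, not the procedure used for the measurements): a circular spot of radius 2.3 nm has an area of about 17 nm^2, and an injected charge density follows from current × time / area. The function names and the 50 pA example current are illustrative assumptions, not values taken from the data.

```python
import math

NM2_TO_CM2 = 1e-14  # 1 nm^2 expressed in cm^2


def beam_area_nm2(radius_nm):
    """Area of a circular beam spot, in nm^2."""
    return math.pi * radius_nm ** 2


def charge_density_c_per_cm2(current_a, time_s, area_nm2):
    """Injected charge per unit area (C/cm^2) for a constant current."""
    return current_a * time_s / (area_nm2 * NM2_TO_CM2)


area = beam_area_nm2(2.3)  # close to the 17 nm^2 effective area
# Hypothetical example: 50 pA passing through 17 nm^2 for 1 s
dose = charge_density_c_per_cm2(50e-12, 1.0, 17.0)
print(f"area = {area:.1f} nm^2, dose = {dose:.0f} C/cm^2")
```

With these assumed inputs, a current of a few tens of pA confined to the effective area for about 1 s already corresponds to a dose of order 10^2 C/cm^2.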

III.  STRESS  MEASUREMENTS 

As  pointed  out  before,  the  mere  fact  of  taking  a  BEEM 
spectrum, such as those of Figs. 4 and 5, involves the injection of large charges into the oxide. A conservative, lower estimate of this charge is 300 C/cm^2 over a time span (about 1 s) that corresponds to a voltage change ΔV_T ≈ 0.5 V during a typical spectral sweep. For electrons injected at V_T = 10 V, with V_b = 0 V, their kinetic energy in the oxide is ~6 eV, a value achieved with FN injection only in the presence of an oxide field of ~24 MV/cm for a 3.8 nm oxide. At such large energies or equivalent oxide fields, breakdown should be observed frequently if the local oxide characteristics were similar to those of the quality oxides represented in Fig. 1. However, breakdowns in 3.8 nm oxides were seldom observed during routine spectral scans with |V_T| ≤ 10 V and V_b = 0 V, and never observed on oxides ≥7 nm thick unless a positive bias was applied. An extreme case of stressing without breakdown is illustrated in Fig. 7, which depicts a series of spectral scans taken at the same location of an MOS structure with t_ox = 3.8 nm. A spectrum was taken after each dosage of injected charge. Except for an almost imperceptible increase in the threshold position, indicative of some charging at the SiO2-Si interface, no spectral changes are observed after 3 min of stressing (Q_inj ~ 10^ C/cm^2). After 10 min of stressing
the  spectrum  weakened  and  shifted  to  slightly  higher  ener¬ 
gies.  The  weakening  of  the  spectrum  is  indicative  of  some 
additional  scattering  in  the  oxide,  possibly  due  to  the  genera¬ 
tion  of  interface  defects.  These  may  be  precursors  to  break¬ 
down  (see  following  section),  but  the  evidence  of  a  conven¬ 
tional  BEEM  spectrum  and  threshold  indicates  that  the  oxide 
is still largely intact. Additional stressing was not possible because of instrument drift, which, although extremely small at this particular time (~2.5 nm/h), caused the STM tip to drift out of the stressed area (~1 nm^2) over the duration of the stressing time. Q_inj ~ 3×10^ C/cm^2 after completion of the stressing. This value, marked on Fig. 1 at an equivalent field of 24 MV/cm, is totally incompatible with the rest of the Q_bd's for conventionally stressed oxides, strongly suggesting again that the Q_bd's of Fig. 1 are not intrinsic but defect controlled. The stressing did not cause any observable damage in the Pd film. Topography and BEEM images of the stressed area are shown in Figs. 8(a) and 8(b), respectively. The stressed area has been marked with a cross. The BEEM image barely shows an area of slightly weakened transmission at the stress point.

Stress experiments that resulted in breakdown were successful rather infrequently (of the order of 10%), which suggests that the sites where breakdowns occur are quite dispersed, of rather low density and most likely controlled by impurities and/or defects. A breakdown sequence is illustrated in Fig. 9, which consisted of applying V_T = 10 V (kinetic energy ~6 eV in the oxide) for predetermined time intervals, each of which was followed by a spectral scan. After stressing for 10 s (curve b), a threshold shift to higher energies is clearly observed. A partial breakdown is observed after stressing for 20 s, with the threshold moving down to about 1 V and the collector current increasing by orders of magnitude (the dashed curves represent a 100× demagnification). The low threshold is a clear manifestation that the oxide barrier has disappeared and that a direct contact exists to the Si substrate. The total injected charge to breakdown corresponds to ~6×10^ C/cm^2. Another breakdown sequence is shown in Fig. 10. The oxide was repeatedly stressed at the same point of a 3.8 nm oxide with V_T = 10 V and V_b = 0 V. Spectral scans were taken after each stress application. Only a fraction of the scans are shown in the figure, starting with the first scan (a) on a virgin section of the sample and ending with spectrum (g), which represents the locally broken down oxide. The sequence shows a weakening of the transmitted collector current that is accompanied by threshold shifts to higher energies. After injecting ~7×10^ C/cm^2 the intensities again increase, as shown in spectrum (d), and the thresholds move progressively to lower energies, ultimately reaching the breakdown stage (curve g) after ~3×10^ C/cm^2 have been injected. Following such breakdown events, the surface of the MOS structure is heavily damaged. The damaged area is of the order of 10 nm across, and generally consists of a crater with surrounding rough hillocks of dimensions of 5-10 nm. It should be emphasized that the damage resulting from the breakdown is not due to a tip crash, but appears to be the result of the large currents injected locally and/or discharge of injected trapped charge in the stressed area. The morphology in the prebreakdown phase was not investigated.

Fig. 8. 12.5 × 12.5 nm^2 STM topograph (a) and BEEM image (b) of a 3 nm Pd/3.8 nm SiO2/Si(100) structure. Black-white range over the image is 1.9 nm in topography, and 30 pA in BEEM. V_T = -5 V, I_T ~ 5 nA, V_b = 0 V. The location of the BEEM stressing area is marked by an x.

IV.  DISCUSSION  AND  CONCLUSIONS 

Fig. 9. A successful breakdown sequence for a 3.8 nm oxide stressed with 6 eV (kinetic energy) electrons. The shift to higher V_0 between curves (a) and (b) is due to negative charge at the SiO2-Si interface. V_b = 0 V. Total injected charge to breakdown (defined by reaching V_0 ~ 1 V, as in curve c) is ~6×10^ C/cm^2.

Fig. 10. Oxide breakdown sequence from no stress (curve a) to fully broken down oxide region or punch through (curve g) for a 3.8 nm oxide (panel title: Breakdown sequence for a 38 Å SiO2 film; horizontal axis: tip bias V_T (V)). The oxide was stressed between spectra at V_T = 10 V for 10-30 s periods. Not every spectrum and stress sequence is shown. V_b = 0 V. Of interest is a second threshold in the stressed area beginning with spectrum (d) and clearly seen in the magnified spectrum (f). The threshold is ~2 V above V_0 and appears to be related to trap generation reported in Ref. 9.

Fig. 11. Shifts in the fundamental threshold V_0 as a function of injected charge taken from the data of Fig. 10 (panel title: Electrical stressing of a 38 Å SiO2 film). Three phases of stressing are depicted. Prebreakdown commences after a maximum has been reached and coincides with the appearance of a second threshold, as discussed in Fig. 10.

A key feature of the stressing and breakdown studies with BEEM is the initial shift of the threshold V_0 to higher values. This shift is plotted as a function of injected charge in Fig. 11 for the data of Fig. 10. The increase δV_0, together with the attenuation of the intensity of I_c, depicted in Fig. 10, is equivalent to applying a negative bias to the oxide, as demonstrated in a previous publication. The field responsible for this shift is due to electrons trapped in the vicinity of the SiO2-Si interface. This view is consistent with previous observations based on FN injection. However, one could argue that in the present case the injected electrons are already sufficiently energetic to cause damage at the injecting Pd electrode, a scenario that cannot entirely be excluded. However, any trapped charge in such hypothetical states would be screened by the metal and contribute a negligible amount to the threshold shift. A simple calculation, with the metallic screening treated by the classical image force, gives the following relationship between eδV_0 and the distance d of the charge from the metal-SiO2 interface:


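$$
e\,\delta V_0 = \frac{e^2}{4\pi\epsilon_0\epsilon_i}\left[\frac{1}{r} - \frac{1}{\sqrt{r^2 + 4d^2}}\right], \qquad (1)
$$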


where  r  is  the  distance  between  the  trapped  charge  and  the 
injected electron as it crosses the plane parallel to the interface that includes the trap. ε_0 and ε_i are, respectively, the permittivity of vacuum and the dielectric constant. Clearly, eδV_0 approaches zero as d approaches the metal-SiO2 interface, so that the trapped charge must reside in the vicinity of the SiO2-Si interface to produce a sizeable δV_0. For a shift of 1 eV or more, with the negatively charged trap 3.5 nm from the Pd-SiO2 interface, the electron must pass within 0.35 nm of the trap. Alternatively, if the charge were uniformly distributed, a 1 eV shift corresponds to an interface charge of 5.7×10^12 electrons/cm^2. In a realistic situation there will be a number of filled traps, as evidenced by the increasing shift with dosage, and the net shift will arise from the sum of contributions from each individual charge. During stressing, Fig. 6 indicates that the electron beam has spread over an area of radius ~2 nm with tails of the distribution extending somewhat farther. At the threshold, however, the beam has spread very little and is thus more sensitive to local distributions. Fluctuations in thresholds are indeed observed even for unstressed areas a few nanometers apart, corroborating the highly localized resolution of BEEM, particularly for thin oxides. Consequently, the shifts in V_0 represent interface trap charging on a truly local (<1 nm) scale, but the trap creation from the energetic electrons occurs over an area of ~20 nm^2 for a 3.8 nm thick oxide.
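
The two estimates quoted in this paragraph can be checked numerically. The following is a minimal sketch, assuming the standard image-charge result for a point charge in front of a metal plane (image at distance 2d), a dielectric constant of 3.9 for SiO2, and, for the uniform case, a parallel-plate estimate with the charge sheet placed 3.8 nm from the metal. These parameter choices and the function names are assumptions made here for illustration.

```python
import math

E_CHARGE = 1.602176634e-19  # elementary charge, C
EPS0 = 8.8541878128e-12     # vacuum permittivity, F/m


def image_screened_shift_ev(r_nm, d_nm, eps_r):
    """Energy shift (eV) felt by an electron at lateral distance r from a
    trapped electron located a distance d from a metal plane. The metal is
    replaced by an image charge, which sits sqrt(r^2 + 4 d^2) away."""
    coulomb_ev_nm = E_CHARGE / (4 * math.pi * EPS0) * 1e9  # e^2/(4 pi eps0), eV nm
    direct = 1.0 / r_nm
    image = 1.0 / math.hypot(r_nm, 2.0 * d_nm)
    return coulomb_ev_nm / eps_r * (direct - image)


def sheet_density_per_cm2(shift_v, d_nm, eps_r):
    """Areal density (electrons/cm^2) of a uniform charge sheet a distance d
    from the metal that produces a potential shift of shift_v volts."""
    sigma = shift_v * EPS0 * eps_r / (d_nm * 1e-9)  # C/m^2
    return sigma / E_CHARGE * 1e-4                  # convert m^-2 to cm^-2


print(image_screened_shift_ev(0.35, 3.5, 3.9))  # close to 1 eV
print(sheet_density_per_cm2(1.0, 3.8, 3.9))     # close to 5.7e12 cm^-2
```

Under these assumptions the point-charge estimate gives about 1 eV at r = 0.35 nm and d = 3.5 nm, and the sheet estimate gives about 5.7×10^12 electrons/cm^2, in line with the values stated in the text. The shift also vanishes as d goes to zero, as expected for a charge at the metal interface.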
Trap creation and charging also occur for V_T < 10 V, as can be ascertained from curve (a) of Fig. 10, which represents a spectrum for a previously unstressed area. Yet its intensity for V is uncharacteristically low compared to those of Figs. 4 and 5. The lower intensity implies an ongoing charging as the data is taken. This phenomenon is not observed in spectra taken randomly over the sample and suggests the presence of a weak spot at the point of injection that ultimately results in breakdown. Upon further stressing a maximum in δV_0 is reached, as shown in Fig. 11, which indicates that a maximum was reached in the density of negative charge trapped near the SiO2-Si interface. A negative charge in the absence of an applied bias leads to a local energy profile similar to that depicted in Fig. 2(b). Further injection leads to a gradual decrease of V_0, which may result from an increased charge leakage due to the deterioration of the oxide or from either positive charges generated at the SiO2-Si interface (anode), or electron trapping at the cathode. The maximum in V_0 coincides with the onset of larger currents for eV, as seen in spectra (d)-(f). The enhanced current is due to an ongoing modification of the oxide, the simplest explanation being the destruction of the oxide that gradually progresses into the oxide starting from the SiO2-Si interface. The local thinning explains the dramatic increases in I_c as the stressing progresses, curves (e) and (f). The ×10 enhanced curve (f), shown by the dashed line, clearly reveals a remnant of a conventional BEEM spectrum with a threshold at 4.8 eV. The weakened threshold region in this spectrum is attributed to added scattering at defect sites in the progressively deteriorating oxide. However, once V_T exceeds 7 V, the oxide breakdown proceeds rapidly as suggested by the rapid rise of I_c. Upon completion of spectrum (f) punch-through of the oxide resulted, as indicated by the low V_0 ~ 1 V, and the large currents.

The onset of the current enhancement in what we refer to as the prebreakdown region in Fig. 11, and represented in Fig. 10 by curves (c)-(f), is approximately 2 eV above V_0. Perhaps coincidentally, this energy also corresponds to the threshold for hydrogen release and trap formation at the anode of MOS capacitor structures. The hydrogen then diffuses to the cathode (injecting electrode) to produce electron traps. As discussed above, our measurements are not sensitive to such traps at the injecting interface, and it is not clear at present how the hydrogen release would affect the oxide breakdown scenario depicted here. Nevertheless, if present, electron trapping at the cathode would increase the accelerating field in the oxide, thereby accounting for both a decrease in V_0 in the prebreakdown regime, and an enhancement of oxide deterioration by increasing the energy of the electrons as they reach the anode. The implication that most of the events that lead to breakdown are initiated at the SiO2-Si interface (anode) is also consistent with the MC simulations. These indicate that beam spreading due to acoustic phonon scattering is accentuated in the oxide away from the injecting interface. Since this scattering occurs with little loss/gain in energy, the effectiveness in damaging the oxide is enhanced as the electrons sample a greater volume, which increases the probability that an electron encounters a weak spot where the breakdown may be initiated. This breakdown scenario starting at the oxide-Si interface is consistent with those proposed by several other workers, but inconsistent with oxide degradation attributed to hot hole injection into the oxide at the SiO2-Si interface. The hot holes are created by impact ionization of the hot electrons as they enter the Si. The hole injection model requires the high fields present in conventional FN injection to accelerate the holes toward the cathode. This situation does not apply in the present case for a 3.8 nm oxide with no applied bias. In fact, the electron traps initially created by the stressing produce a retarding field for the holes, keeping them from being injected into the valence band of the SiO2.
The Q_bd values obtained from various breakdown events have been plotted in Fig. 1, as well as the total charge injected in one experiment that did not reach breakdown. The injection energies of the BEEM experiments were converted to an equivalent electric field that would achieve the same energies under FN injection. The stressing current at maximum tip bias was converted to a charge density by multiplying by the time and dividing the product by the area of the electron beam reaching the SiO2-Si interface (~17 nm^2), as determined from the MC simulations depicted in Fig. 6. The Q_bd's thus obtained fall consistently above the curves obtained from conventional stress experiments. These were interpreted in terms of an intrinsic breakdown limit, a conclusion that is challenged by the present results. Even the new points obtained here are not intrinsic, as much higher charge dosages did not routinely cause breakdowns in the 3.8 nm oxides. Consequently we conclude that even the higher observed breakdown dosages are still limited by extrinsic effects, either defects or impurities at the interface. This conclusion offers hope that in the future, device lifetimes and device reliability can be considerably improved. Our observations are consistent with an increase in oxide robustness observed over the years as capacitor areas and device dimensions shrink. It is generally acknowledged that these trends are directly attributable to a decreasing probability of finding defects/impurities in the oxide as the device areas decrease.

It is a reasonable question to ask if Q_bd's obtained with BEEM can be compared to those obtained under FN injection. The conversion of the former from an energy scale to an effective electric field scale assumes that only the electron energy is important and that field effects in the FN stress mode contribute negligibly to the breakdown. Previous studies have shown that the principal trap, defect and interface state generation mechanisms are dependent solely on the kinetic energy of the hot electrons injected into the oxide, with no evidence of a field dependence other than to accelerate the electrons to the required energies for defect generation. This is not to say that the field is unimportant but rather that thus far there is little evidence of the field playing a primary role in determining the breakdown mechanism. For the electric field stressed modes, such as FN, defect formation can also occur from holes generated by impact ionization in both the SiO2 and the Si (anode) with subsequent injection into the SiO2 valence band. The field is thus important to accelerate the holes towards the cathode where they can create traps and interface states. These processes also depend on the energy of the holes. However, hole generation depends on the energy gained by the electrons, with a minimum oxide potential of 8 V being required for hole injection to occur from the Si (anode). This potential threshold is higher than the gate voltages used by Schuegraf and Hu to obtain the data of Fig. 1, with the exception that the data for the 7 nm oxide beyond 11 MV/cm correspond to a gate voltage exceeding 8 V. However, no change in slope is apparent in the data of the 7 nm oxide, strongly suggesting that hole injection for thin oxides and for the given field ranges is not a principal contributor to breakdown. It is also worth noting that the BEEM-stressed Q_bd for the 7.5 nm oxide in Fig. 1 was obtained with an applied field of ~6 MV/cm in addition to a kinetic energy of 4.2 eV at the point of injection. From these discussions, it can be surmised that the oxide field has a yet to be identified role on Q_bd for thin oxides (aside from accelerating the electrons in FN stressing). Consequently, the comparison of FN-stressed and BEEM-stressed data is a reasonable undertaking with any additional, but still unknown, field enhanced contributions in the FN case being compensated by the higher energies sustained by the BEEM-injected electrons throughout their passage across the oxide.

V.  CONCLUSIONS 

In  conclusion,  we  have  demonstrated  that  spatially  highly 
resolved  stress  studies  using  the  STM  can  give  new  insights 
into  breakdown  phenomena  in  ultrathin  oxides.  Unlike  con¬ 
ventional  stressing  by  means  of  FN  injection,  the  present 
technique  allows  an  arbitrary  choice  of  injected  electron  en¬ 
ergies,  thereby  greatly  simplifying  the  injection  and  oxide 
transport  processes.  The  latter  have  also  been  modeled  by 
MC  simulations,  from  which  the  spreading  of  the  electron 
beam  as  it  traverses  the  oxide  was  calculated.  Stress  induced 
breakdowns  under  conditions  that  far  exceed  those  for  con¬ 
ventionally  stressed  state-of-the-art  oxides  imply  that  break¬ 
downs  are  still  controlled  by  defects/impurities  in  the  oxide, 
and  that  an  intrinsic  breakdown  limit  has  not  yet  been 
reached.  When  observed,  the  breakdown  scenario  suggests  a 
three-fold  sequence  of  initial  trap  creation  and  filling,  fol¬ 
lowed  by  a  prebreakdown  phase  that  appeared  to  be  associ¬ 
ated  with  a  progressive  oxide  deterioration  starting  at  the 
Si02-Si  interface,  and  finally  a  catastrophic  punch  through, 
characterized  by  a  very  low  barrier  to  electron  injection.  The 
data  furthermore  suggests  that  hole  transport  in  the  oxide  is 
unlikely  to  occur  for  the  low  oxide  biases  used  in  the  present 
experiments.

We have used ballistic-electron emission microscopy (BEEM) to locally inject and microscopically characterize trapped charge in metal-oxide-semiconductor structures made with Pt metal electrodes, moderately thick (~25 nm) SiO2 films and p-type Si substrates. Electrons injected into the oxide conduction band produce a local suppression in the transmitted BEEM current across the oxide. This suppression depends on the voltage applied across the oxide during injection, indicating that its source is within the oxide film, and does not simply result from hot-electron damage at the metal-oxide interface. This suppression is accompanied by a local increase in the BEEM I-V curve threshold voltage, consistent with a build-up of trapped BEEM electrons in the oxide. For a given trapped charge density we find that the threshold varies systematically with the oxide voltage applied during I-V curve measurement. We use these variations to make estimates of the trapped charge density and approximate depth in the oxide film, and find them consistent with previous macroscopic observations. We further study the lateral extent of the suppression and the lifetime of this effect. We also measure the height of the Pt-SiO2 barrier and observe its lowering due to the image force effect. © 1996 American Vacuum Society.


I. INTRODUCTION

SiO2 is a material of crucial importance for the overwhelming majority of today's active electronic devices. It is chemically stable, has excellent passivating and dielectric properties and its quality can be consistently controlled during production. The properties of SiO2 are well known from many macroscopic measurements, including I-V and C-V measurements, internal photoemission, etc. In these measurements the microscopic properties are averaged over large areas. With the ongoing miniaturization of electronic devices the atomic-scale features of the materials used may soon be comparable with the size of the device features themselves, and will affect the performance of the devices. It is therefore critical to investigate the properties of these materials at the microscopic level. Cuberes et al. first used the technique of ballistic-electron emission microscopy (BEEM) as a new, microscopic method to study electronic properties of very thin CaF2 insulating films. Ludeke et al. later showed that it was possible to use BEEM to perform nanometer scale investigations of thin (<10 nm) SiO2 films sandwiched in metal-oxide-semiconductor (MOS) structures. Ludeke et al. studied hot electron transport processes in the SiO2 films and observed microscopic irregularities in the electronic properties.

Here we report BEEM experiments performed on MOS structures made with thicker (~25 nm) SiO2 films, in which charge trapping effects are found to be very significant (initial measurements are described elsewhere). We have observed a local, time-dependent suppression in the BEEM current through the SiO2 film, which is accompanied by a local increase in the BEEM threshold voltage. Our measurements demonstrate that both of these effects are related to a local build-up of charge in the oxide bulk, due to trapping of some of the injected electrons. We have developed a method to use BEEM to measure the magnitude of the trapped charge and its mean depth in the oxide, essentially by measuring shifts in BEEM threshold voltages as a function of the applied electric field in the oxide. Good agreement is found between our nanometer-scale measurements and previous macroscopic measurements for similar injection conditions. We have also investigated the lateral extent of the current suppression caused by injection of charge at one point, and have determined some lower limits on the lifetime of this effect. We also measure the height of the Pt-SiO2 barrier and observe its lowering due to the image force effect.

II.  EXPERIMENT 
A.  BEEM 

The BEEM technique, which is an extension of scanning tunneling microscopy (STM), was first used by Kaiser and Bell to investigate the microscopic properties of metal-semiconductor Schottky barriers. The configuration of BEEM for investigation of MOS structures is shown in Fig. 1(a). When the STM tip is brought close to the metal film with a negative tip bias [corresponding to V_T > 0 V with polarities defined in Fig. 1(a)], a tunnel current I_T is produced consisting of electrons which tunnel from filled states in the tip into empty states in the metal film. Since the tunnel barrier height is smaller for electrons with higher energies, the flux of hot electrons in the metal film is peaked in energy near the Fermi energy of the tip [as shown in Fig. 1(b)] and is also highly spatially localized and forward-focused. A fraction of these hot electrons is able to traverse the metal layer and reach the metal-oxide interface ballistically, i.e., without losing energy. These ballistic electrons can enter the oxide provided they have enough energy (controlled by V_T) to surmount or tunnel through the barrier at the metal-oxide interface. This barrier has nominal height V_0 when the voltage drop across the oxide V_ox = 0 V, but the barrier will be lowered below V_0 under a “forward” oxide bias (V_ox > 0 V) by so-called “image force lowering,” or raised above V_0 under “reverse” oxide bias (V_ox < 0 V). Once in the oxide, injected electrons may be accelerated by an electric field present in the oxide, and/or be scattered by phonons, defects, or trapped charge. Some injected electrons will be scattered back into the metal film, some may become trapped in the oxide, and some may conduct across the oxide into the Si substrate and be measured as the external collector current I_c. Electrons which become trapped in the oxide will alter the electric field in the oxide and modify the barrier for subsequent electrons, as shown in Fig. 1(c). The model describing this will be discussed in more detail later in the article.

Fig. 1. (a) Schematic configuration of BEEM for investigation of MOS structures. The STM tip with the sample constitute a three terminal device. (b) Energy diagram of the experiment. The horizontal axis corresponds to the z axis in (a). Some of the hot electrons from the tip (current I_T, dashed arrows) can ballistically cross the metal layer and be injected into the oxide, provided they have sufficient energy (controlled by V_T). (c) Charge build-up in the oxide causes increase of the barrier height which the ballistic electrons have to overcome to be collected as current I_c in the substrate. See the text for definition of the remaining symbols.

Fig. 2. An average of “short” I_c-V_T curves taken over pristine area (circles, 100 curves averaged) and over the center of injected region (triangles, 10 curves averaged). The method used to create the injected region is described in the text. The threshold voltage V_th = 3.3 V for the pristine area while V_th for the injected region is increased to 4.6 V. Both I_c-V_T curves were measured at V_ox = +8 V, I_T = 10 nA. (Plot: horizontal axis V_T (V); legend includes “pristine”; annotations “+12 V”, “+8 V”, and I_T = 10 nA.)
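
For orientation, the size of the image force lowering mentioned above can be estimated from the classical Schottky expression, Δφ = sqrt(eE / (4π ε_0 ε)), with a uniform oxide field E = V_ox / t_ox. This is a minimal sketch under assumptions made here: the image-force dielectric constant is taken as 2.15 (a value often used for SiO2), trapped charge is ignored, and the reverse-bias case is not modeled (the function simply returns zero lowering). It is not the model developed later in the article.

```python
import math

E_CHARGE = 1.602176634e-19  # elementary charge, C
EPS0 = 8.8541878128e-12     # vacuum permittivity, F/m


def image_force_lowering_ev(v_ox, t_ox_nm, eps_image=2.15):
    """Classical image-force barrier lowering (eV) for a uniform oxide
    field V_ox / t_ox. Only forward bias (V_ox > 0) lowers the barrier
    in this simple picture; otherwise 0.0 is returned."""
    if v_ox <= 0:
        return 0.0
    field = v_ox / (t_ox_nm * 1e-9)  # V/m
    return math.sqrt(E_CHARGE * field / (4 * math.pi * EPS0 * eps_image))


# Assumed example: +8 V across a 25 nm oxide
print(image_force_lowering_ev(8.0, 25.0))
```

With these assumed inputs the lowering comes out at a few tenths of an eV, so it is large enough to matter when thresholds are compared at different oxide biases.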


B.  Modes  of  collecting  data 

We have used several different modes of collecting data in order to study different aspects of charge injection and charge trapping in oxide films. To monitor the barrier height at the metal-oxide interface, we measure I_c-V_T curves, as illustrated in Fig. 2. Since injected electrons must have enough energy to surmount this barrier, the BEEM threshold voltage V_th (indicated by the arrows in Fig. 2) corresponds to the barrier height at a particular tip location. We extract V_th from measured I_c-V_T curves by fitting the above-threshold region to a functional form which reasonably approximates the fitting model used in Refs. 5, 6, and 7. Due to noise in I_c (~25 fA rms in a 20 Hz bandwidth) the typical uncertainty in V_th is of the order of ±0.1 V. In many cases we measure "full" I_c-V_T curves, with V_T covering a typical range of 2-10 V. However, if we are only interested in extracting the threshold voltage V_th, we often measure truncated or "short" I_c-V_T curves (as shown in Fig. 2), in which V_T is only increased until sufficient current I_c is detectable to determine an accurate value of V_th. We do this to minimize the oxide charging which takes place during measurement of the I_c-V_T curve. This will be discussed in more detail later in the article.
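The threshold extraction described above can be sketched as follows. The functional form actually used for the fit is not reproduced here, so the quadratic onset I_c = A (V_T - V_th)^2 is an assumption made only for this illustration; the function name `fit_threshold` and the synthetic data are likewise hypothetical. Under that assumption sqrt(I_c) is linear in V_T above threshold, and V_th is the V_T-intercept of a straight-line fit.

```python
import math

def fit_threshold(v_t, i_c, i_min):
    """Estimate V_th assuming I_c = A*(V_T - V_th)**2 above threshold.

    The quadratic onset is an assumed model, not the paper's fit function.
    """
    # Keep only points clearly above the noise floor i_min.
    pts = [(v, math.sqrt(i)) for v, i in zip(v_t, i_c) if i > i_min]
    if len(pts) < 2:
        raise ValueError("need at least two above-threshold points")
    n = len(pts)
    mean_v = sum(v for v, _ in pts) / n
    mean_s = sum(s for _, s in pts) / n
    # Ordinary least-squares line: sqrt(I_c) = slope * V_T + intercept.
    sxx = sum((v - mean_v) ** 2 for v, _ in pts)
    sxy = sum((v - mean_v) * (s - mean_s) for v, s in pts)
    slope = sxy / sxx
    intercept = mean_s - slope * mean_v
    return -intercept / slope  # V_T at which the fitted line crosses zero

# Synthetic "short" curve with V_th = 3.3 V (made-up amplitude, in pA/V^2).
voltages = [3.0 + 0.05 * k for k in range(31)]            # 3.0 ... 4.5 V
currents = [2.0 * max(v - 3.3, 0.0) ** 2 for v in voltages]
v_th = fit_threshold(voltages, currents, i_min=0.025)      # 0.025 pA floor
print(f"fitted V_th = {v_th:.2f} V")
```

Restricting the fit to points above the noise floor mirrors the use of "short" curves: only as much above-threshold data is taken as is needed to locate the onset.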
Fig. 3. Combination I_c-V_T/I_c-t curve. Left panel: An initial I_c-V_T curve is taken as V_T is ramped up to ~10 V. Central panel: V_T, I_T, and V_b are held constant to measure an I_c-t curve. Gradual suppression of I_c with increasing time is clearly visible. Right panel: A final I_c-V_T curve is measured by ramping V_T back down. Also note the increase in the threshold voltage (indicated by the arrows) from the initial to the final I_c-V_T curves.


In a second mode of collecting data, we directly address the time dependence of charging effects by measuring "combination I_c-V_T/I_c-t curves," as shown in Fig. 3. Each such curve was measured by recording I_c as V_T is first ramped up to an above-threshold value (to measure an initial I_c-V_T curve), then held at this voltage for a specified time (typically 10-60 s) to measure an I_c-t curve, then ramped back down to its initial (below-threshold) value to measure a final I_c-V_T curve. During the I_c-t curve, any measured changes in I_c are directly attributable to a build-up of trapped charge and/or possible interface damage in the part of the sample directly under the tip.
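The ramp-hold-ramp sequence of a combination curve can be written down as a list of tip-voltage setpoints. This is only a sketch: the function name, the start voltage, the number of ramp points, and the sampling interval are hypothetical choices, while the ~10 V hold value and the 10-60 s hold time follow the text.

```python
def combination_waveform(v_start, v_hold, n_ramp, hold_time, dt):
    """Return (segment, time_s, V_T) setpoints for one combination curve."""
    span = v_hold - v_start
    points = []
    # Ramp up: gives the initial I_c-V_T curve.
    for i in range(n_ramp):
        points.append(("up", i * dt, v_start + span * i / (n_ramp - 1)))
    # Hold at constant V_T: gives the I_c-t curve.
    t0 = n_ramp * dt
    n_hold = round(hold_time / dt)
    for i in range(n_hold):
        points.append(("hold", t0 + i * dt, v_hold))
    # Ramp back down: gives the final I_c-V_T curve.
    t1 = t0 + n_hold * dt
    for i in range(n_ramp):
        points.append(("down", t1 + i * dt, v_hold - span * i / (n_ramp - 1)))
    return points

# Hypothetical settings: 2 V -> 10 V in 0.1 V steps, 30 s hold, 0.1 s sampling.
waveform = combination_waveform(2.0, 10.0, 81, 30.0, 0.1)
print(len(waveform), "setpoints")
```

Comparing the thresholds of the "up" and "down" segments then gives the threshold shift accumulated during the hold.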

In a third data mode, a "BEEM image" is measured to monitor the lateral variations in the BEEM current through the oxide film. In this mode, I_c is measured as a function of tip position as the tip is scanned across the surface, while V_T and I_T are held constant. This yields simultaneously both the top surface topography and an image of the lateral distribution of I_c (provided, of course, that V_T > V_th). Such pairs of topographic and BEEM images are depicted in Fig. 4. Measurement of lateral images can also be combined with the measurement of I_c-V_T or combination curves. The tip is stopped at predetermined locations during a scan and an I_c-V_T or combination I_c-V_T/I_c-t curve is measured. Unfortunately, these interruptions in scanning can cause disruptions in the continuity of the lateral images, due to sample drift and piezocrystal creep. This is particularly severe if the I_c-V_T curves are measured on a rectangular "grid" of sample locations aligned along the normal x- and y-scan directions, since several curves are measured along a single x-scan line. To distribute such disruptions more evenly, we measure the I_c-V_T curves on a rectangular grid which is rotated slightly with respect to the normal x- and y-scan directions, in such a way that at most one I_c-V_T curve is measured on a particular x-scan line.
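The effect of rotating the grid can be checked with a short sketch. The rotation angle is not stated in the text; the choice tan(angle) = 1/n for an n×n grid, the 10 nm spacing, and the 1 nm scan-line pitch are assumptions for illustration. With that angle the y coordinates of the n^2 grid points are all distinct and evenly spaced, so each point falls on its own x-scan line as long as the line pitch is smaller than that spacing.

```python
import math
from collections import Counter

def grid_points(n, spacing, angle_rad):
    """n x n grid with the given spacing, rotated by angle_rad about the origin."""
    c, s = math.cos(angle_rad), math.sin(angle_rad)
    pts = []
    for j in range(n):
        for i in range(n):
            x0, y0 = i * spacing, j * spacing
            pts.append((x0 * c - y0 * s, x0 * s + y0 * c))
    return pts

def max_curves_per_scan_line(points, line_pitch):
    """Largest number of grid points that land on the same x-scan line."""
    lines = Counter(round(y / line_pitch) for _, y in points)
    return max(lines.values())

n, spacing_nm, pitch_nm = 5, 10.0, 1.0          # hypothetical scan settings
aligned = grid_points(n, spacing_nm, 0.0)
rotated = grid_points(n, spacing_nm, math.atan(1.0 / n))   # about 11.3 degrees
print(max_curves_per_scan_line(aligned, pitch_nm),
      max_curves_per_scan_line(rotated, pitch_nm))
```

For the aligned grid every scan line that contains a grid point contains five of them, whereas the rotated grid spreads the 25 interruptions over 25 different scan lines.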

C.  Sample  preparation  and  experimental  set-up 

Samples were prepared from p-Si(100) wafers (resistivity 3-20 Ω cm) with ~25 nm of high quality oxide (provided courtesy of National Semiconductor, Inc.). Samples of size ~5×20 mm^2 were rinsed sequentially in acetone, methanol and deionized water, mounted on sample blocks and introduced into our UHV system, where they were degassed while the system was baked (~180 °C, ~12 h). Then ~5 nm of Pt film was deposited in the shape of several 1 mm diameter dots by means of e-gun evaporation and a shadow mask. The samples were then transferred in UHV into an adjacent STM chamber without any post-metallization anneal. The measured dot was gently contacted with a 0.1 mm diameter Au wire positioned with a mechanical manipulator inside the STM chamber.

As discussed elsewhere, large tip voltages (V_T > 4 V) are required to surmount the large energy barrier at the metal-oxide interface and therefore UHV is essential for these BEEM measurements. For different materials such large voltages tend to seriously alter the tip-surface interface if tunneling is done in air, but cause no observable damage if the tunneling is done in UHV, where V_T is routinely ramped up to 10 V without any apparent damage to the surface. The reason for this is thought to be related to the presence of water vapor. From our own experience, tunneling in air on Pt surfaces damages or alters the Pt interface and/or the tip if V_T is increased above ~3 V (holes "drilled" into the metal layer were sometimes observed), even though stable tunneling is possible at lower voltages.

III.  RESULTS 

A. Dependence of I_c suppression on oxide bias

Much of this work was motivated by the observation of current I_c through the oxide decaying with time (as illustrated in Fig. 3), even when V_T is held constant. This effect is important for two reasons. From a fundamental viewpoint, we would like to understand the origin of the suppression: whether it is due to charge build-up in the oxide or simply due to damage to the metal-oxide interface. There is also a more practical reason for investigating the suppression effect. As was shown by Ludeke et al., a full I_c-V_T curve can carry important information about the dynamics of hot electrons in the conduction band of the oxide. However, since time is always an implicit variable during the measurement of an I_c-V_T curve, any explicit time-dependence of I_c must be understood and taken into account to properly interpret the I_c-V_T curve.

One possible cause for suppression of I_c is damage of the metal-oxide interface by energetic BEEM electrons without any build-up of trapped charge, as was observed by Fernandez and Hallen on certain Au/Si interfaces. To show that the suppression of I_c has its origin in the bulk of the oxide (and therefore not at the interface) we demonstrate that the suppression of I_c depends strongly on the oxide bias voltage V_b. Since the electric field created by V_b does not penetrate significantly into the metal film, the average kinetic energy and flux of the hot electrons reaching the metal-oxide interface should not be significantly affected by V_b. If hot-electron damage to this interface were a major cause of the suppression, then it should not be possible to "turn off" the suppression simply by changing V_b from forward to reverse bias.

Fig. 4. Dependence of suppression of I_c on oxide bias V_b. (a) ~150×150 nm^2 topographic scan before injection, measured with V_T = 5.4 V, I_T = 10 nA and V_b^meas = [?] V. The Pt topography (full gray-scale range 2.5 nm) shows a nodular character with average nodule size 6.5±2.0 nm. (b) Corresponding BEEM image (gray-scale range 0-5 pA). (c) Topography and (d) BEEM image after injection (see the text), over the two regions outlined by dashed lines. Injection in the upper (lower) region was done with a reverse (forward) oxide bias of V_b^inj = -4 V (V_b^inj = +12 V). Significant suppression is observed only for the forward-bias injected region.

To see this dependence of suppression on V_b, a pristine 150×150 nm^2 area was first "pre-scanned" to image the lateral distribution of I_c [Figs. 4(a) and 4(b)]. Then, two nominally identical "injection" scans were performed in two adjacent 50×50 nm^2 regions using a 5×5 grid of I_c-V_T/I_c-t curves (V_T^hold = 5.4 V for ~60 s, I_T = 20 nA) for each area. The two regions are indicated by dashed lines in Fig. 4. The only difference between the two areas was that a reverse bias V_b^inj = -4 V was applied for the upper region, while a forward bias V_b^inj = +12 V was applied for the lower region. For forward oxide bias, the average integrated collector charge density was Q_c ≈ 125 C cm^-2, while for reverse oxide bias it was negligible, as the BEEM electrons do not have sufficient energy to surmount both the ~4 eV barrier at the metal-oxide interface and the ~4 V reverse bias across the oxide film. The whole area was then imaged again. This is shown in Figs. 4(c) and 4(d). The strong suppression of I_c for V_b^inj = +12 V is clearly visible while the suppression for the V_b^inj = -4 V area is negligible. There is also no apparent damage to the metal surface. This shows that the source of the suppression has to be within the oxide bulk, as the only varied parameter, the electric field, is effectively screened at the metal-oxide interface.

B.  Threshold  shift  observed  for  suppressed  region 

If the suppression is caused by electrons trapped in the bulk of the oxide near the suppressed region, then the electric field from this trapped charge should also increase the energy barrier at the metal-oxide interface, which in turn should produce an increase in the BEEM threshold voltage V_th. The two I_c-V_T curves in Fig. 2 exhibit this effect. The threshold voltage V_th = 3.3 V for the pristine region, while the threshold voltage is shifted to 4.6 V for the I_c-V_T curve taken in the center of the V_b^inj = +12 V injected region. Both I_c-V_T curves were measured at V_b^meas = +8 V. These threshold shifts and their dependence on V_b can be used to determine the approximate magnitude of the trapped charge and its depth in the oxide. This will be addressed later in the article.

C.  Suppression  possible  even  for  injection  under 
reverse  oxide  bias 

In Fig. 4(d) no substantial suppression is visible for the area injected under a V_b^inj = -4 V reverse oxide bias when injection was done with a tip bias V_T^hold = 5.4 V. This value of V_T is about 1.5 V higher than the expected barrier height V_0 and therefore we expect some electrons should still enter the oxide even under these reverse bias conditions. Furthermore, if the electrons have sufficient energy, they can completely traverse the oxide and be collected in the Si substrate even under reverse oxide bias, as was observed in Ref. 5. Hence we expect that oxide charging should still take place under reverse bias conditions, provided the energy eV_T of electrons during injection is sufficiently high. To test this, we have measured the suppression resulting from injection at V_b^inj = -4 V as a function of tip voltage V_T^hold during injection. A pristine area was again pre-imaged, similarly to Sec. III A. Then an injection was performed over a 50×50 nm^2 area at V_b^inj = -4 V in a similar manner to that described in Sec. III A, except that the maximum tip voltage V_T^hold was adjusted to different values. The whole area was then imaged again and the ratio of the average I_c over the injected region after (⟨I_c^f⟩) and before (⟨I_c^i⟩) the injection was determined. Figure 5 shows that the suppression of I_c is almost negligible at V_T^hold = 5.4 V (about 1.5 V above V_0), as discussed in Sec. III A. Interestingly, the suppression increases substantially for a rather small increase in eV_T^hold of about 0.5 eV. At this point, it is unclear whether this sharp turn-on is related to a particular "trap creation activation energy" or simply represents the fact that the total charge injected into the oxide increases with V_T^hold. Also note that under these conditions the injected electrons do not have sufficient energy to traverse the whole length of the oxide, so the transmitted BEEM current I_c during injection remains negligible.
D. Imaging suppression after injection at one point

We have also investigated the lateral extent of the suppression caused by injection of charge at a single point. To estimate this the following experiment was performed. A prescan was first taken to get an image of I_c. This is shown in Fig. 6(b) together with the corresponding topography [Fig. 6(a)]. Then charge was injected in the center of that region by taking the combination I_c-V_T/I_c-t curve shown in Fig. 3 (V_T^hold ≈ 10 V). The area was then imaged again, as is shown in Fig. 6(c). Again, the region of suppressed I_c in the center resulting from the injection is clearly visible and has an approximate diameter of 20-30 nm. In order to obtain an average profile of the suppression this was repeated several times on different sample locations. The average profile is depicted in Fig. 7. The FWHM of this suppression profile is approximately 25 nm. The knowledge of this also turns out to be important for determining V_th on pristine areas of the sample, since two I_c-V_T curves must not be taken so close together that the later one would be affected by the trapped charge produced by the first I_c-V_T curve. We also point out that we do not know the direct relationship between the magnitude of I_c suppression and the local amount of trapped charge. Therefore the size of the suppression profile gives us only an estimate of the extent of the trapped charge distribution after injection at one point.

We also addressed the issue of the stability of the effect of suppression. If the lifetime of the effect were comparable with the randomly varying delay between the injection and the measuring of I_c, then this could influence the measurements. To investigate this, the injected area [which was imaged immediately after injection in Fig. 6(c)] was imaged one more time ~14 h after injection. The result is shown in Fig. 6(d). We see that the suppressed region appears roughly the same as in Fig. 6(c). In order not to lose the injected region due to sample drift and piezocrystal creep, we took advantage of the nodular character of the top metal surface to actively lock the tip onto a local extremum and track this extremum until the later image was to be taken. During this tracking, V_T was held well below V_0 (V_T = 1 V) to avoid additional charging of the oxide. While there was no precise quantitative answer, we concluded that the lifetime of the suppression is at least several hours.

E. Influence of I_c-V_T spacing

In order to lower the noise in the I_c-V_T curves taken to obtain the threshold voltage V_th it is common to average together several I_c-V_T curves. However, as shown above, injection of charge into the conduction band of the oxide (which is a necessary consequence of taking an I_c-V_T curve) may leave behind trapped charge which could influence the result obtained from an I_c-V_T curve measured in its close proximity. We therefore devised a simple experiment in which the spacing d between the positions of the short I_c-V_T curves in the 5×5 grid was varied. The V_th extracted from the average of these 25 I_c-V_T curves is plotted in Fig. 8. We see that for d ≳ 25 nm the averaged V_th is independent of spacing, while for d < 25 nm it increases at smaller d. This demonstrates that if the I_c-V_T curves are taken too close together (≲25 nm) the charge left behind from an I_c-V_T curve starts to influence subsequent nearby I_c-V_T curves, which results in an increased apparent value for V_th. Also, a similar result is obtained when two I_c-V_T curves are taken subsequently at the same point (not shown). While the first yielded V_th = 3.30 V (V_b = +8 V, I_T = 10 nA), the second yielded V_th = [?].65 V.

Fig. 5. Suppression of I_c vs tip bias V_T^hold under reverse oxide bias. Significant suppression of I_c is possible even for injection under a reverse oxide bias of V_b^inj = -4 V, provided the energy of electrons during injection (controlled by V_T^hold) is high enough. Almost no suppression is apparent for V_T^hold = 5.4 V, but it increases substantially when V_T^hold is increased by ~0.5 V to ~5.9 V.


Fig. 6. Suppression of collector current I_c caused by injection at one point. (a) ~75×75 nm^2 topographic scan before injection, measured with V_T = 5.4 V, I_T = 10 nA and V_b^meas = [?] V. (b) Corresponding BEEM image (gray-scale range 0-5 pA). (c) BEEM image after injection in the center of the region (Fig. 3). Approximate size of the suppressed region is 20-30 nm. (d) Same region after ~14 h. Suppression is still visible.

F. Determining the magnitude and depth of the trapped charge

We have found that BEEM measurements can be used to estimate the amount and depth of the trapped charge in the oxide film. To do this, we consider a simple model for the trapped charge. In this model the trapped charge is represented by a uniform sheet charge at the distance x from the metal-oxide interface. This trapped charge of assumed electron density n_trap gives rise to an additional voltage V_n, as is shown in Fig. 1(c). In this case, the expression for V_th (which corresponds to the maximum barrier height) is

$$
V_{th} =
\begin{cases}
V_0 - \sqrt{\dfrac{e\,(E_{ox} - V_n/x)}{4\pi\varepsilon_{op}}} & \text{if } V_n < xE_{ox}, \\[2ex]
V_0 + V_n - xE_{ox} & \text{if } V_n \ge xE_{ox},
\end{cases}
\qquad (1)
$$

with V_n given by

$$
V_n = \frac{e\,n_{trap}}{\varepsilon_s}\,\frac{x\,(L - x)}{L}.
$$

Here E_ox = (V_ox/L) is the applied electric field across the oxide without trapped charge, V_ox ≈ V_b - 0.6 V (in the range of measured V_b), L = 25 nm is the oxide film thickness, V_0 is the intrinsic barrier height at the metal-oxide interface [see Fig. 1(c)], ε_s = (3.9 ε_0) and ε_op = (2.15 ε_0) are the static and optical dielectric constants of the oxide film, respectively, ε_0 = 8.85×10^-14 F cm^-1, and e is the elementary charge. For V_n < xE_ox, the electric field due to the trapped negative charge increases V_th by reducing the amount of "image force lowering" of the barrier below V_0, while for V_n ≥ xE_ox the total electric field near the metal-oxide interface is actually reversed and the barrier is increased above V_0. Threshold voltages were measured as a function of V_b for both pristine areas and in the center of regions injected in a fashion similar to those discussed in Sec. III A (with V_b^inj = +12 V). Note that a "fresh" injected region was prepared for each new measurement. By measuring the I_c-V_T curves in the center of the injected region (~50 nm across) we ensure that the lateral extension of the trapped charge is sufficient to be reasonably approximated by an infinite sheet of charge. The model was fitted to both sets of measured threshold voltages by adjusting x, n_trap, and V_0 (note that for pristine regions we assume n_trap = 0). The resulting fit is shown in Fig. 9. The model fits the measured threshold voltages for both pristine and injected areas very well, with fitted parameter values x = 3.9 nm, n_trap = 1.3×10^13 cm^-2, V_0 = 3.8 V. We note that the approximate depth of the charge and its magnitude are in very reasonable agreement with previous macroscopic measurements for similar injection conditions. The value of 3.8 V for V_0 (the Pt-SiO2 barrier height at V_ox = 0) corresponds well to the value measured by Ludeke et al. (~3.9 V) at V_ox = 0 V. Ludeke et al., however, observed a similar threshold voltage even under strong forward bias conditions, while we observe lowering of V_th for positive (forward) bias V_b, in agreement with the predictions of image force lowering. A similar image force lowering effect for Au/Si Schottky diodes was observed using BEEM by Davies and Craighead.
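The sheet-charge model can be evaluated numerically with the fitted parameters. The sketch below is an assumption-laden reading of Eq. (1): it takes the threshold as V_0 minus the image-force lowering sqrt(e(E_ox - V_n/x)/(4π ε_op)) when V_n < xE_ox and as V_0 + V_n - xE_ox otherwise, with V_n = e n_trap x(L - x)/(ε_s L) and V_ox = V_b - 0.6 V. The function names are hypothetical, and only forward oxide bias is handled.

```python
import math

E_CHARGE = 1.602e-19       # elementary charge, C
EPS0 = 8.85e-14            # vacuum permittivity, F/cm (value quoted in the text)
EPS_S = 3.9 * EPS0         # static permittivity of the oxide
EPS_OP = 2.15 * EPS0       # optical permittivity of the oxide
L_OX = 25e-7               # oxide thickness, cm (25 nm)

def sheet_voltage(n_trap, x):
    """V_n (V) from a charge sheet of density n_trap (cm^-2) at depth x (cm)."""
    return E_CHARGE * n_trap * x * (L_OX - x) / (EPS_S * L_OX)

def threshold_voltage(v_b, n_trap=0.0, x=3.9e-7, v0=3.8):
    """Model V_th (V) for oxide bias v_b (V); assumed form of Eq. (1)."""
    e_ox = (v_b - 0.6) / L_OX              # applied oxide field, V/cm
    if e_ox < 0:
        raise ValueError("sketch covers forward oxide bias only")
    v_n = sheet_voltage(n_trap, x)
    if v_n < x * e_ox:
        # Net field still pulls electrons away from the metal: reduced
        # image-force lowering of the barrier below v0.
        net_field = e_ox - v_n / x
        return v0 - math.sqrt(E_CHARGE * net_field / (4 * math.pi * EPS_OP))
    # Field near the interface is reversed: barrier maximum sits at the sheet.
    return v0 + v_n - x * e_ox

v_pristine = threshold_voltage(8.0)                  # n_trap = 0
v_injected = threshold_voltage(8.0, n_trap=1.3e13)   # fitted trap density
print(f"pristine {v_pristine:.2f} V, injected {v_injected:.2f} V")
```

With the fitted values x = 3.9 nm, n_trap = 1.3×10^13 cm^-2 and V_0 = 3.8 V, this reading gives about 3.35 V and 4.63 V at V_b = +8 V, close to the measured thresholds of 3.3 V and 4.6 V.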

IV.  DISCUSSION 

Since we have made estimates of both the trapped charge density following injection and of the lateral extent of the trapped charge, we can now estimate the total number of electrons trapped in a certain area. It turns out that the number of trapped electrons is actually rather small. For example, assuming a trapped charge density ~1.3×10^13 cm^-2 spread over the 50×50 nm^2 injected area shown in Fig. 4(d), we conclude that only about 300-350 electrons became trapped during the entire injection scan. This corresponds to only about 15 electrons for each of the 25 I_c-V_T/I_c-t curves used for the injection. This suggests that it may ultimately be possible to use BEEM to directly observe the trapping or de-trapping of individual electrons in the buried SiO2 film. In fact, we have often observed abrupt fluctuations in I_c-t curves measured over particular sample areas (such as that shown in Fig. 3), which are reminiscent of discrete electron trap "switching" events reported by other techniques for the SiO2 system. Further work is clearly required to determine whether these fluctuations are in fact caused by discrete trapping events, and we will continue to study this in future experiments.
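The electron count quoted above follows from simple arithmetic, sketched here with hypothetical variable names. A uniform density over the full 50×50 nm^2 area is assumed, as in the text.

```python
N_TRAP_CM2 = 1.3e13      # fitted trapped-charge density, cm^-2
SIDE_CM = 50e-7          # side of the injected area, cm (50 nm)
N_CURVES = 25            # combination curves in the 5x5 injection grid

area_cm2 = SIDE_CM ** 2                      # 2.5e-11 cm^2
total_electrons = N_TRAP_CM2 * area_cm2
electrons_per_curve = total_electrons / N_CURVES

print(f"{total_electrons:.0f} electrons in total, "
      f"{electrons_per_curve:.0f} per curve")
```

This gives roughly 325 electrons for the whole scan and about 13 per curve, the same order as the "about 15" quoted in the text.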

Fig. 7. An average profile of I_c suppression after injection in the center (r = 0 nm) calculated from several different suppressed regions generated on different sample locations (similar to Fig. 6). The FWHM of the average profile of ~25 nm can be taken as an estimate of the extent of the area affected by injection at one point.

Fig. 9. Dependence of V_th on the oxide bias V_b during I_c-V_T curve measurements, for pristine (circles) and forward-bias injected (triangles) regions. Curves show model fits according to Eq. (1). The fit yields the approximate depth of trapped charge x = 3.9 nm, the trapped charge density n_trap = 1.3×10^13 cm^-2 (for the injected area), and V_0 = 3.8 V.

Past macroscopic studies of oxide charging indicate that some traps are already present in the "as-fabricated" oxide film while others can be generated as a consequence of the hot-electron current flowing through the oxide. It is of interest to consider whether as-fabricated or generated traps are dominant in our measurements. If we assume that the traps we observe were already present in the oxide at the moment the injection started, we can roughly estimate their cross-section. The cross-section of such traps is σ ≈ eA/(τI), where A is the total area through which the "charging" current I is flowing and τ is the average time required to fill a trap. We can estimate A from the approximate width of the suppressed region (Fig. 7), roughly 25^2 nm^2. A very conservative estimate of the current I is the final current I_c in Fig. 3, i.e., about 2 pA. This is in fact the lowest possible limit for the following additional reasons. I_c is the current of electrons that make it all the way through the oxide. However, the current responsible for charging can consist of both I_c and electrons which travel a distance into the oxide and then are scattered back into the metal. Also, if scattering of electrons is significant in the region of trapping, then the effective current is higher by the factor l/l_0, where l_0 is the total depth of the region of trapping and l is the average path of an electron through this region. Finally, from Fig. 3 we estimate conservatively τ = 10 s as the characteristic time of the suppression decay. This yields σ ≈ 5×10^-20 cm^2, which is much smaller than cross sections for typical as-fabricated electron traps in SiO2 films. We therefore conclude that filling of as-fabricated traps is not the main cause of the suppression of I_c and that electron traps generated by hot electron flux are the dominant effect and the rate-determining process in our experiments. In this regard, we note that past macroscopic experiments reported that generated traps were dominant for injected charge densities Q_c ≳ 1 C cm^-2, which is in agreement with our experiments where the typical charge densities are of the order of ~100 C cm^-2.
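The cross-section estimate can be reproduced numerically. This is a sketch of the relation σ ≈ eA/(τI) with the values given in the text (A ≈ 25^2 nm^2, I ≈ 2 pA, τ ≈ 10 s); the function name is hypothetical. Because I is a lower limit on the charging current, the result is best read as an upper bound on σ.

```python
E_CHARGE = 1.602e-19     # elementary charge, C

def trap_cross_section(area_cm2, tau_s, current_a):
    """Capture cross-section (cm^2) from sigma = e*A / (tau*I)."""
    return E_CHARGE * area_cm2 / (tau_s * current_a)

area_cm2 = (25e-7) ** 2          # 25 nm x 25 nm expressed in cm^2
sigma = trap_cross_section(area_cm2, tau_s=10.0, current_a=2e-12)
print(f"sigma = {sigma:.1e} cm^2")
```

The result, about 5×10^-20 cm^2, matches the value quoted in the text.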

It should be noted that the lateral extent of the trapped charge as determined in Figs. 6-8 appears surprisingly large in comparison with the estimated depth of the trapped charge in the oxide x ≈ 4 nm, as determined from Eq. (1) and Fig. 9 for the injection at V_T = 5.4 V. Since lateral scattering of the BEEM electrons in the 5 nm thick metal film is usually assumed to be relatively small, most of this "widening" must presumably occur in the oxide. Below, we consider several factors possibly contributing to this apparent large value of the width of the trapped charge compared with its depth. (1) A build-up of trapped electrons directly under the tip could scatter subsequent incoming BEEM electrons sideways, thus widening the distribution. Monte Carlo simulations similar to those performed by Ludeke et al. could help determine the importance of this effect. (2) It is possible that the simple charge trapping model considered here somehow underestimates the depth of the centroid of the actual trapped charge distribution. In our simple model we have assumed the trapped charge in the form of a thin sheet, whereas the true distribution has some finite spread around the centroid depth in the oxide. We have, however, compared our experiments with a model which permits a finite uniform spread, and found that it does not significantly affect our estimate of the centroid. Also note that the charge centroid depth was determined for injection done with V_T = 5.4 V, whereas the lateral width of the suppressed region shown in Figs. 6 and 7 was determined for injection at a single point done at V_T = 10 V. It is possible that the charge centroid may be deeper for this higher-voltage injection. (3) It is possible that the large lateral extent of the trapped charge is a consequence of the actual mechanism for trap generation. In particular, we note that a leading model for trap creation assumes that injected hot electrons must completely traverse the oxide film to liberate/activate a trap-creating species (thought to be hydrogen-related), which then diffuses back across the oxide to create traps near the injecting interface. Lateral diffusion of the injected electrons and/or the back-diffusing hydrogen could naturally produce a lateral spread in generated traps comparable to the oxide film thickness (about 25 nm in our samples). While we do not yet know whether this particular mechanism is important in our experiments, we note here that BEEM should actually be a very useful tool to study this (and other) possible trap generating mechanisms.

Fig. 8. Dependence of the apparent average threshold voltage V_th on the spacing d between adjacent I_c-V_T curves. 25 I_c-V_T curves measured on a 5×5 grid of spacing d were averaged together and used to determine the threshold voltage V_th. If the curves are spaced too closely, the residual charge from the previous I_c-V_T curve will influence the following I_c-V_T curves, resulting in an increase in V_th. For d ≳ 25 nm the averaged V_th is independent of spacing.

V.  SUMMARY 

We used BEEM to locally inject electrons into a ~25 nm thick SiO2 film built into a MOS structure. Some of these electrons are trapped in the bulk of the oxide and cause an increase in the local barrier height and suppression of the BEEM current. We devised a method using BEEM to determine locally the density of this trapped charge and its depth in the oxide. We estimated the lateral extent of the charge trapped during injection at one point, and concluded that generation of traps in the oxide by hot BEEM electrons is likely the dominant trapping mechanism in our experiments. We also observed lowering of the barrier due to the image force effect, and were able to extract the intrinsic barrier height at the Pt-SiO2 interface.
Electronic properties of the interfaces between Si and ultrathin (≲10 Å) oxides formed by various low-temperature processes were characterized in contactless fashion, using contactless capacitance-voltage and photoluminescence surface state spectroscopy techniques together with x-ray photoelectron spectroscopy measurement. Hydrogen (H) terminated Si(111) surfaces were used as the initial surface. Ultrathin oxides were formed at low temperatures by chemical oxidation processes (hot HNO3, H2SO4+H2O2), long-time air exposure, and low-temperature oxidation processes below 350 °C. The initial H-terminated surfaces showed the presence of Fermi-level pinning at E_0 = E_V + 0.65 eV due to a high density of amphoteric discrete states probably originating from Si dangling bonds. On the other hand, all the ultrathin oxide-Si interfaces exhibited very limited capacitance variations with voltage at low capacitance levels similar to GaAs metal-insulator-semiconductor systems, and indicated that the Fermi level is pinned near the hybrid orbital charge neutrality level E_HO due to the presence of interface states with narrow U-shaped continuous distributions. Low-temperature oxidation at 350 °C slightly weakens such pinning. The present work indicates the difficulty of realizing an unpinned ultrathin oxide-silicon interface by low-temperature processes. © 1996 American Vacuum Society.


I.  INTRODUCTION 

Since  feature  sizes  used  in  silicon  ultralarge-scale  inte¬ 
grated  circuits  (ULSIs)  have  been  scaled  down  into  the  sub¬ 
micron  range,  the  importance  of  the  quality  of  thin  oxide 
layers  as  well  as  their  formation  process  and  preformation 
surface  treatments  related  to  surface  roughness  on  the  atomic 
scale has been recognized. Obviously, the importance of atomic-
scale control of surfaces and interfaces will further increase
in the future in the Si ULSI industry. In addition, the realization
of next-generation advanced devices in the quantum regime,
such as single electron transistors, will also depend
strongly on successful formation of a highly controlled ultrathin
oxide-Si interface, because the availability of ultrathin
pinning-free tunneling barriers is crucial in such devices.

High-temperature  dry  oxidation  is  the  standard  technique 
to produce gate oxide in current ULSI processes. However, it
is becoming increasingly difficult to form extremely thin (a few
tens of Å) oxide films with precise control of the thickness and
of the electrical bulk and interface properties using the high-
temperature dry oxidation technique only. This is due to the
atomic-scale  roughness  and  presence  of  native  oxides/ 
suboxides  on  the  initial  Si  surfaces  as  well  as  due  to  rapid 
initial  oxidation  rates  of  the  Si  surface  at  elevated  tempera¬ 
tures,  whose  values  depend  sensitively  on  initial  surface  con¬ 
ditions  and  the  temperature/gas  conditions. 

One approach to overcome this difficulty is to perform a
suitable initial surface treatment to realize a well-defined
atomically flat surface and subsequently to grow a well-
controlled ultrathin (~10 Å) "pre-oxide" at low temperatures
before final oxidation is made to a desired thickness at
an elevated but optimal temperature with a well-controllable
oxidation rate. In fact, such an approach has recently been
attempted by Ohmi et al., who have found it useful for realizing
an atomically flat interface and high-quality oxide with
precise control of thickness.

As  a  possible  candidate  for  the  preprocessing  surface 
treatment,  particular  attention  has  been  paid  recently  to  the 
hydrogen  termination  of  the  silicon  surface.  According  to  the 
literature, it  is  capable  of  producing  atomically  well- 
defined  and  flat  Si  surfaces  with  strong  surface  passivation 
properties  and  high  reproducibility. 

As for controlled formation of ultrathin preoxides, one
could try various low-temperature processes to find a
simple and optimal process. In order to successfully develop
the most suitable surface treatment and preoxide formation
process, surface/interface characterization techniques play
important roles.

For characterization of atomic-scale structures of surfaces
and interfaces, surface-science-type characterization techniques
have recently made great progress; these include
high-resolution x-ray photoelectron spectroscopy (XPS),
scanning tunneling microscopy (STM)/atomic force microscopy
(AFM), grazing angle x-ray diffraction (XRD),
crystal truncation rod (CTR) scattering, Fourier transform
infrared-absorption spectroscopy (FTIR), etc.

However, there has been no well-established suitable
method to characterize more device-oriented electronic properties
of the hydrogen-terminated surfaces and thin oxidized
surfaces. The standard metal-insulator-semiconductor
(MIS) capacitance-voltage (C-V) technique fails due to
Schottky or tunneling MIS conduction in such an ultrathin
regime. Recently, use of the old surface photovoltage (SPV)
technique, which detects the surface potential change due to
photoinduced band flattening, has been proposed. However,
it requires a complicated parameter-fitting analysis due to the
presence of the Dember effect, and therefore it is difficult to
judge the quantitative validity of the results.

In order to overcome this difficulty, we previously proposed
the photoluminescence surface state spectroscopy
(PLS³) technique and applied it to silicon surfaces. More
recently, we applied a contactless C-V technique developed
by Sakai et al. to the characterization of hydrogen-terminated
and air-exposed silicon (111) surfaces together with the
PLS³ technique, and showed that these two techniques are
complementary to each other and provide a powerful means
for characterization of free surfaces.

In a continuation of efforts to establish well-defined preprocessing
conditions for formation of well-controlled extremely
thin thermal oxides on Si, the purpose of the present
article is to characterize the electronic properties of the ultrathin
oxide (~10 Å)-Si(111) interfaces prepared by various
low-temperature oxidation processes. The contactless
C-V and the PLS³ techniques were used together with x-ray
photoelectron spectroscopy (XPS) measurement. As the initial
surface treatment prior to formation of ultrathin oxides,
hydrogen termination was applied in order to take full
advantage of its well-defined atomic structure, flatness, and high
reproducibility. The low-temperature processes used to grow ultrathin
oxides include chemical oxidation processes (hot HNO3,
H2SO4+H2O2), long-time air exposure, and low-
temperature thermal oxidation below 350 °C.

XPS band-bending, C-V, and PLS³ measurements gave
results that are consistent with each other. The initial
H-terminated surfaces showed the presence of Fermi-level
pinning at E_0 = E_V + 0.65 eV due to a high density of amphoteric
discrete states, probably due to Si dangling bonds. On the
other hand, the results on all the ultrathin-oxide-covered surfaces
indicated that the Fermi level is pinned near the charge
neutrality level E_HO due to a high-density, narrow U-shaped
interface state continuum similar to that of GaAs MIS
interfaces. Low-temperature oxidation at 350 °C slightly
weakens such pinning.

The  present  work  indicates  that  realization  of  an  unpinned 
ultrathin  oxide-silicon  interface  by  a  low-temperature  pro¬ 
cess  only  is  unexpectedly  difficult  even  with  the  thermal  oxi¬ 
dation. 

II.  EXPERIMENT 

A.  Sample  preparation  and  XPS  analysis 

Wafers  used  in  the  present  study  were  n-type  silicon  (111) 
with  the  carrier  concentration  of  1-2X10^^  cm“^.  The  fol¬ 
lowing  surfaces  were  prepared  and  characterized: 

(1)  a  thick  thermally  oxidized  surface; 

(2)  a  hydrogen  terminated  surface; 

(3) chemically oxidized surfaces;

(4)  an  air-exposed  surface;  and 


(5)  low-temperature  thermally  oxidized  surfaces  in  a 
vacuum  chamber. 

The ultrathin oxidized surfaces of (3)-(5) were formed on
the hydrogen-terminated surface (2).

The  thermally  oxidized  surfaces  were  prepared  in  dry  O2 
at  1000  °C  for  3  h  and  annealed  at  for  30  min  in  dry  N2.  This 
surface  was  used  as  a  reference. 

Hydrogen termination of the surface was carried out according
to the method of Higashi et al. Namely, thick thermal
oxide (about 150 nm) films were formed in dry oxygen
at 1000 °C. Then, after removing the oxide films in buffered
hydrofluoric acid, native oxide films were formed by immersion
of the surface in a HCl-H2O2 solution at 80 °C for 10
min. Finally, the surface was immersed in a 40% NH4F solution.
After each process, the sample was rinsed for 10 min
in de-ionized water. According to the literature, such a process
gives an atomically flat hydrogen-terminated silicon
(111) surface.

Chemical oxidation of the hydrogen-terminated surfaces
was done following the recent work by Ogawa and Hattori,
i.e., one sample was immersed in a hot HNO3 solution (60 °C) for 5
min, and the other in a solution of H2SO4:H2O2 = 4:1 for 10
min. Formation of ultrathin oxide by air exposure was done by
leaving the hydrogen-terminated surface in an air-nitrogen
mixture for 2 weeks. For comparison, as-received wafers
with saturated formation of native oxide by air exposure were
also studied.

Ultrathin low-temperature thermal oxides were formed
by heating the hydrogen-terminated surface at 300-350 °C
for 30 min in dry oxygen with a pressure of 133 Pa in an
ultrahigh-vacuum (UHV) chamber (base pressure <10“^
Pa). The chemical states of the various surfaces were investigated
by XPS. For this, a Perkin-Elmer PHI 5100C spherical
capacitor energy analyzer was used under excitation by
Mg Kα (hν = 1253.6 eV).

B. Contactless C-V method

The principle of the contactless C-V method used in this
study  is  shown  in  Fig.  1(a).  Instead  of  depositing  a  gate 
electrode  directly  on  the  sample  surface,  C-V  measurement 
is  performed  from  the  field  electrode  that  is  placed  above  the 
sample,  being  separated  from  the  sample  surface  by  a  thin 
“air  gap.”  Thus,  a  MIS  assessment  of  “free”  surfaces  be¬ 
comes  possible  by  being  assisted  by  an  air-gap  insulator. 

In  the  present  study,  such  contactless  C-V  measurement 
was  performed  using  a  commercial  measurement  system 
(CV-8000,  Dainippon  Screen  Mfg.  Co.,  Ltd.).  In  this  system, 
an  electrode  head  shown  in  Fig.  1(b)  is  used  in  order  to 
maintain  parallelism  and  a  constant  distance  between  the 
field  electrode  and  the  sample  surface.  The  head  has  three 
additional  surrounding  “parallelism  electrodes”  which  main¬ 
tain  a  constant  air-gap  distance  of  300-350  nm  via  a  piezo 
mechanism  with  capacitance  feedback.  The  capacitance  is 
then  measured  between  the  field  electrode  and  the  sample 
mounted on the sample stage. The area of the electrode for
C-V measurement was 7.5 X 10“^ cm“^ and Cr was used as
the field plate metal. The absolute value of the air-gap distance
can be measured optically using the "Goos-Haenchen
effect" of optics in this system.

Fig. 1. (a) Basic principle of the contactless C-V method and (b) electrodes
used for the contactless C-V system.

Formation  of  an  ohmic  contact  at  the  back  surface  of  the 
samples  was  not  necessary,  because  the  capacitance  between 
the  sample  and  the  sample  stage  is  much  larger  than  the 
capacitance  to  be  measured. 
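
The argument above, that a large coupling capacitance in series with the air gap has a negligible effect, can be checked with a short sketch. It computes the parallel-plate capacitance per unit area for the 300-350 nm gap quoted in the text and then places a much larger back-side capacitance in series. The factor of 1000 for the back-side capacitance is a purely illustrative assumption, not a value from this work, and the gap is treated as an ideal parallel-plate capacitor with relative permittivity 1.

```python
EPS0 = 8.854e-12  # vacuum permittivity, F/m


def gap_capacitance_per_area(gap_m, eps_r=1.0):
    """Parallel-plate capacitance per unit area (F/m^2) of a gas gap."""
    return eps_r * EPS0 / gap_m


def series(*caps):
    """Equivalent capacitance of capacitors connected in series."""
    return 1.0 / sum(1.0 / c for c in caps)


# Gap range quoted in the text: 300-350 nm
c_gap_300 = gap_capacitance_per_area(300e-9)
c_gap_350 = gap_capacitance_per_area(350e-9)

# Hypothetical back-side (sample-to-stage) capacitance, assumed to be
# 1000 times larger than the gap capacitance for illustration only.
c_back = 1000.0 * c_gap_300
c_total = series(c_gap_300, c_back)
relative_error = (c_gap_300 - c_total) / c_gap_300

# 1 F/m^2 = 1e5 nF/cm^2
print(f"gap capacitance: {c_gap_350 * 1e5:.2f}-{c_gap_300 * 1e5:.2f} nF/cm^2")
print(f"error from back-side capacitance: {relative_error * 100:.3f} %")
```

Under these assumptions the series combination differs from the gap capacitance by about 0.1%, which is why an ohmic back contact can be omitted.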

C. Photoluminescence surface state spectroscopy
(PLS³)

It is well known that the intensity of band-edge photoluminescence
(PL) is very sensitive to surface properties; however,
it is generally used only for qualitative assessment of the
surface quality. The photoluminescence surface state spectroscopy
(PLS³) technique recently developed by our
group allows, on the other hand, a quantitative assessment
of the surface state properties of free surfaces in a
contactless fashion.

The experimental setup and the principle of the PLS³
technique are shown in Fig. 2. In this system, the band-edge
PL efficiency I_PL/φ, where I_PL is the band-edge PL intensity
and φ the excitation intensity, is measured as a function of
φ using an automated setup schematically shown in Fig. 2(a). A
typical behavior of the PL efficiency obtained in such a measurement
is shown in Fig. 2(b). In high-quality Si single crystals,
the bulk recombination is of little importance at low excitation
intensities and the PL efficiency takes a low constant
value due to a relatively large surface recombination velocity
caused by Fermi-level pinning. On the other hand, at high excitation
intensities, the PL efficiency approaches an intrinsic
value limited by bulk radiative and Auger recombination
processes.

Fig. 2. (a) Experimental setup of the PLS³ technique. (b) Schematic illustration
of typical behavior of band-edge PL efficiency.

In  the  transition  region  between  these  two  limits,  the  PL 
efficiency  becomes  very  sensitive  to  surface  state  properties. 
Here,  as  the  light  intensity  is  increased,  the  Shockley-Read- 
Hall  (SRH)  recombination  rate  of  each  surface  state  becomes 
saturated.  At  the  same  time,  the  quasi-Fermi  levels  for  elec¬ 
trons  and  holes  scan  the  surface  band  gap  as  the  photoin- 
duced  band  flattening  proceeds  and  this  gradually  increases 
the  number  of  surface  states  causing  recombination.  Thus, 
the  former  tends  to  increase  the  PL  efficiency  whereas  the 
latter tends to reduce it, making the behavior of the PL efficiency
in this region strongly reflect the N_ss distribution.
Roughly  speaking,  the  slope  of  the  PL  efficiency  gives  a 
good  indication  concerning  the  shape  of  the  distribution.  It  is 
equal  to  unity  for  discrete  states  due  to  complete  saturation 
of  the  SRH  process.  On  the  other  hand  it  becomes  less  steep 
for  continuous  distribution,  reflecting  the  aforementioned  in¬ 
crease  of  surface  states  taking  part  in  recombination.  The 
density (N_ss) distribution can be determined by fitting the
measured PL efficiency curve with the results of rigorous computer
simulation taking account of all possible recombination processes.
The procedure for the computer analysis was given in
detail in Ref. 15. In the present study, a PL measurement
system which is fully automated by computer control was
used. The excitation source was an Ar+ laser with a
wavelength of 514.5 nm.

Fig. 3. XPS core-level spectrum of the Si 2p3/2 level of the H-terminated Si(111)
surface.
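
The slope criterion described above (a slope of unity for discrete states, a less steep slope for continuous distributions) refers to the PL efficiency plotted against excitation intensity on logarithmic axes. The following is a minimal sketch of how such a local log-log slope could be extracted from measured points. It is not the rigorous simulation-based fitting used in this work, and the two data sets are synthetic power laws chosen only to illustrate the two limiting behaviors.

```python
import math


def loglog_slopes(phi, eff):
    """Local slope d(log eff)/d(log phi) between neighbouring data points."""
    slopes = []
    for i in range(len(phi) - 1):
        d_log_eff = math.log10(eff[i + 1]) - math.log10(eff[i])
        d_log_phi = math.log10(phi[i + 1]) - math.log10(phi[i])
        slopes.append(d_log_eff / d_log_phi)
    return slopes


# Synthetic excitation intensities (arbitrary units), one point per decade
phi = [10.0 ** k for k in range(5)]

# Synthetic efficiencies: slope 1 mimics saturating discrete states,
# slope 0.5 mimics a continuous distribution (illustrative values only).
eff_discrete = [1e-6 * p for p in phi]
eff_continuous = [1e-6 * p ** 0.5 for p in phi]

slopes_discrete = loglog_slopes(phi, eff_discrete)
slopes_continuous = loglog_slopes(phi, eff_continuous)
print(slopes_discrete)
print(slopes_continuous)
```

In practice the slope would be evaluated only in the transition region between the low-excitation plateau and the high-excitation intrinsic limit.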

III.  RESULTS  AND  DISCUSSION 
A.  XPS  analysis 

Figure 3 shows the spectrum of the Si 2p3/2 core level obtained
on the H-terminated surfaces. The Si 2p1/2 peak was
subtracted from the original signal by employing a spin-
orbit-splitting value of 0.6 eV and a Si 2p1/2 to Si 2p3/2
intensity ratio of 1/2, based on previous results. As can
be seen, there is no oxide component around 103 eV on the
H-terminated surface. In addition, the peak position of the
Si-Si bond was found to be 99.65 eV, consistent with those
reported by other workers.
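
The subtraction of the Si 2p1/2 component described above can be sketched as a simple recursive stripping procedure. The sketch assumes that the measured signal is the Si 2p3/2 line shape plus a copy of it shifted by 0.6 eV to higher binding energy and scaled by 1/2 (the splitting and ratio quoted in the text), that the background has already been removed, and that the data lie on a uniform binding-energy grid whose step divides the splitting. The function name and the synthetic Gaussian test peak are illustrative assumptions, not the authors' actual procedure.

```python
import math


def strip_2p_half(signal, step_ev, splitting_ev=0.6, ratio=0.5):
    """Recover the 2p3/2 component from a 2p doublet.

    `signal` is sampled at ascending binding energy with spacing `step_ev`.
    Model: signal[i] = p32[i] + ratio * p32[i - shift].
    """
    shift = round(splitting_ev / step_ev)
    p32 = []
    for i, s in enumerate(signal):
        # Contribution of the 2p1/2 partner, zero before the spectrum starts
        partner = p32[i - shift] if i >= shift else 0.0
        p32.append(s - ratio * partner)
    return p32


# Synthetic test: Gaussian 2p3/2 peak at 99.65 eV on a 0.05 eV grid
step = 0.05
energies = [97.0 + step * i for i in range(101)]
true_p32 = [math.exp(-((e - 99.65) / 0.3) ** 2) for e in energies]
shift = round(0.6 / step)
measured = [
    true_p32[i] + 0.5 * (true_p32[i - shift] if i >= shift else 0.0)
    for i in range(len(energies))
]

recovered = strip_2p_half(measured, step)
max_error = max(abs(r - t) for r, t in zip(recovered, true_p32))
print(f"max reconstruction error: {max_error:.2e}")
```

The recursion starts at the low-binding-energy end, where no 2p1/2 intensity is present, so noise in real data propagates toward higher binding energy; smoothing or fitting is usually combined with this step.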

Figure 4 compares the Si 2p3/2 spectra obtained after oxidation
of the H-terminated surfaces by various low-
temperature processes. In Fig. 4(a), the peak position and
height of the main Si peak corresponding to the Si-Si bond
were shifted and adjusted for each oxidation process so that
they agree with those of the H-terminated surface. It is
clearly seen in Fig. 4(a) that an ultrathin oxide was formed
after each of the oxidation processes tried. The oxide film
thickness can be calculated by analyzing these Si 2p photoelectron
spectra, if the electron escape depth is known. By
assuming that the electron escape depths in Si and Si oxide
are 2.6 and 3.4 nm, respectively, the thickness estimates
shown in Table I were obtained for the various low-
temperature oxides. These values are in excellent agreement
with those in the literature.
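
The text does not state the formula used to convert the Si 2p intensities into an oxide thickness. A commonly used choice is the uniform-overlayer attenuation model, sketched below under several stated assumptions: the oxide is a uniform layer, the substrate signal is attenuated with the oxide escape depth, the bulk intensity ratio is approximated from Si atom densities (assumed values of 5.0e22 cm^-3 for Si and 2.2e22 cm^-3 for SiO2), and the take-off angle is measured from the surface. Only the escape depths of 2.6 and 3.4 nm come from the text. Because the take-off angle and normalization used in this work are not given, the sketch does not attempt to reproduce the entries of Table I.

```python
import math

LAMBDA_SI = 2.6  # nm, escape depth in Si (value quoted in the text)
LAMBDA_OX = 3.4  # nm, escape depth in Si oxide (value quoted in the text)
N_SI = 5.0e22    # Si atoms per cm^3 in crystalline Si (assumed)
N_OX = 2.2e22    # Si atoms per cm^3 in SiO2 (assumed)


def bulk_ratio():
    """Oxide-to-substrate intensity ratio for infinitely thick layers."""
    return (N_OX * LAMBDA_OX) / (N_SI * LAMBDA_SI)


def intensity_ratio(d_nm, takeoff_deg=90.0):
    """I_oxide / I_Si for an oxide overlayer of thickness d_nm."""
    x = d_nm / (LAMBDA_OX * math.sin(math.radians(takeoff_deg)))
    # (1 - exp(-x)) / exp(-x) = exp(x) - 1
    return bulk_ratio() * math.expm1(x)


def oxide_thickness(ratio, takeoff_deg=90.0):
    """Invert intensity_ratio: thickness (nm) from a measured ratio."""
    scale = LAMBDA_OX * math.sin(math.radians(takeoff_deg))
    return scale * math.log1p(ratio / bulk_ratio())


# Round trip for a hypothetical 1.0 nm oxide at normal emission
r = intensity_ratio(1.0)
d_back = oxide_thickness(r)
print(f"ratio for 1.0 nm: {r:.3f}, recovered thickness: {d_back:.3f} nm")
```

The logarithmic dependence means that, for films much thinner than the escape depth, the thickness is nearly proportional to the measured intensity ratio.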

In order to see the intermediate oxidation states more
carefully, the oxide peaks are shown on a magnified scale in Figs.
4(b) and 4(c). The positions corresponding to the Si oxidation
states Si^1+, Si^2+, Si^3+, and Si^4+ are shown by arrows
using the data of Himpsel et al. The relative amounts of
the suboxide components were separated by curve fitting,
and the results are summarized in Table I in terms of the
integrated intensity ratio with respect to the main Si^0 peak.
As seen in Table I, not only the amount of the SiO2 component
(Si^4+) but also the amount of each suboxide component
is different for different oxidation processes and conditions.

Fig. 4. (a) XPS spectra of the Si 2p3/2 level of H-terminated Si(111) surfaces after
formation of ultrathin oxide by various processes. The peak position and
height of the main Si peak corresponding to the Si-Si bond were shifted and
adjusted for each oxidation process so that they agree with those of the initial
H-terminated surface. Spectra are shown on a magnified scale for (b) air-
exposed and chemically oxidized surfaces, and (c) low-temperature thermally
oxidized surfaces.

For example, the Si^1+ phase is emphasized at the surface
chemically oxidized in H2SO4 solution, whereas the surface
chemically oxidized in HNO3 solution exhibited a larger Si^3+
component, in agreement with the spectra reported by Ogawa
and Hattori. For the low-temperature thermal oxidation, a
remarkable change in the interface suboxide phase was observed
with a slight change in the oxidation temperature:
namely, the Si^3+ component was dominant in the oxidized
interface prepared at 300 °C, whereas a sharp Si^4+ peak appeared
at 350 °C, while the amounts of the other
suboxide components remained more or less the same.

Table I. Summary of oxide thickness and suboxide components for various oxidation methods at low temperatures.

Method                       Oxide thickness (nm)   Si^1+/Si^0   Si^2+/Si^0   Si^3+/Si^0   Si^4+/Si^0
Air exposure                 1.0±0.1                0.04±0.01    0.02±0.01    0.03±0.01    0.24±0.005
Chemical oxidation: H2SO4    0.8±0.1                0.07±0.01    0.02±0.01    0.02±0.01    0.20±0.005
Chemical oxidation: HNO3     1.1±0.1                0.06±0.01    0.01±0.01    0.08±0.01    0.26±0.005
Thermal oxidation: 300 °C    0.4±0.1                0.04±0.01    0.02±0.01    0.07±0.01    0.02±0.01
Thermal oxidation: 350 °C    0.8±0.1                0.05±0.01    0.02±0.01    0.02±0.01    0.22±0.005
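
To make the comparison between oxidation methods easier to follow, the entries of Table I can be tabulated and reduced to two simple figures per method: the summed intermediate-suboxide intensity and the most intense intermediate oxidation state. The sketch below assumes that the four ratio columns of the table are ordered Si^1+, Si^2+, Si^3+, Si^4+ relative to Si^0; this ordering is inferred from the extracted table and is an assumption. Uncertainties are omitted.

```python
# (Si1+/Si0, Si2+/Si0, Si3+/Si0, Si4+/Si0) from Table I, column order assumed
TABLE_I = {
    "Air exposure": (0.04, 0.02, 0.03, 0.24),
    "Chemical oxidation: H2SO4": (0.07, 0.02, 0.02, 0.20),
    "Chemical oxidation: HNO3": (0.06, 0.01, 0.08, 0.26),
    "Thermal oxidation: 300 C": (0.04, 0.02, 0.07, 0.02),
    "Thermal oxidation: 350 C": (0.05, 0.02, 0.02, 0.22),
}
STATES = ("Si1+", "Si2+", "Si3+")


def suboxide_sum(row):
    """Summed intensity ratio of the intermediate states Si1+..Si3+."""
    return sum(row[:3])


def dominant_suboxide(row):
    """Label of the most intense intermediate oxidation state."""
    return STATES[max(range(3), key=lambda i: row[i])]


summary = {
    name: (suboxide_sum(row), row[3], dominant_suboxide(row))
    for name, row in TABLE_I.items()
}

for name, (sub, si4, dom) in summary.items():
    print(f"{name:28s} suboxides={sub:.2f}  Si4+={si4:.2f}  dominant={dom}")
```

With this column assignment, the 300 °C oxide is the only surface whose summed suboxide signal exceeds its Si^4+ signal, which is the contrast the text draws between the 300 and 350 °C oxides.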

Finally, an attempt was made to determine the zero-bias
Fermi-level positions of the sample surfaces from the XPS
peak shifts of the Si 2p core levels. To do this, one needs to
know the energy separation between the Si 2p core level and
the valence-band edge. Since the resolution of our XPS
equipment was not high enough to resolve the positions of the
Si 2p3/2 and Si 2p1/2 peaks very accurately by curve fitting,
we adopted the procedure of Schwaltz et al., who used a
weighted average peak energy of the Si 2p level of
E_V - E(Si 2p) = 99.01 eV. This value was derived from a
high-resolution work on nearly intrinsic Si wafers by
Himpsel et al., which gave E_V - E(Si 2p3/2) = 98.86 eV and
E_V - E(Si 2p1/2) = 99.46 eV. The estimated Fermi-level positions
on various sample surfaces are summarized in Fig. 5.

As  seen  in  Fig.  5  the  Fermi  level  at  the  H-terminated 
surface  lies  at  0.60-0.70  eV  from  the  valence-band  edge.  On 
the  other  hand,  a  clear  energy  shift  of  the  core  level  was  seen 
on  the  surfaces  after  oxidation.  The  resultant  positions  de¬ 
pended  rather  weakly  on  the  oxidation  process  and  they  lie  in 
the  range  of  0.30-0.40  eV  from  the  valence-band  edge  as 
shown in Fig. 5. Obviously, the present determination of the
E_F position depends on the accuracy of the value of
E_V - E(Si 2p). For example, a more recent work by
Himpsel, Hollinger, and Pollak gave
E_V - E(Si 2p3/2) = 98.74 eV and E_V - E(Si 2p1/2) = 99.34 eV,
and this would shift our result upward by 0.12 eV, making
the E_F positions on the H-terminated and oxidized surfaces
lie at 0.72-0.82 and 0.42-0.52 eV, respectively. In order to
check the validity of the reference value of E_V - E(Si 2p),
we further attempted to directly measure the valence-band
spectra of H-terminated Si surfaces. This gave
E_F - E_V = 0.62 ± 0.05 eV, and this seems to favor the result in
Fig. 5 rather than the more recent result by Himpsel and
co-workers. This latter work by Himpsel and co-workers
assumed that deposition of a Ga overlayer or a (Cs+O) overlayer
on the clean Si surface brings the Fermi level exactly to
the valence- or conduction-band edge, respectively. Such an
assumption might be wrong, judging from the results on
Schottky barrier formation. On the other hand, the result
in Fig. 5 also agrees very well with the C-V results shown in
the following subsection.

Fig. 5. Zero-bias Fermi-level positions of the sample surfaces determined
from the XPS peak shifts of Si 2p core levels before and after ultrathin
oxidation of the H-terminated surfaces.
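
The 0.12 eV shift quoted above follows directly from the difference between the two sets of reference separations. The short sketch below reproduces that arithmetic using only values stated in the text; the variable names are illustrative.

```python
# E_V - E(Si 2p) reference separations in eV, as quoted in the text
REF_EARLIER = {"2p3/2": 98.86, "2p1/2": 99.46}
REF_LATER = {"2p3/2": 98.74, "2p1/2": 99.34}

# A smaller reference separation moves the deduced E_F - E_V upward
shift = {k: round(REF_EARLIER[k] - REF_LATER[k], 2) for k in REF_EARLIER}


def shifted(range_ev, delta):
    """Apply a rigid shift to an (E_low, E_high) Fermi-level range."""
    return tuple(round(x + delta, 2) for x in range_ev)


# E_F - E_V ranges from Fig. 5 with the earlier reference values
h_terminated = (0.60, 0.70)
oxidized = (0.30, 0.40)

h_terminated_later = shifted(h_terminated, shift["2p3/2"])
oxidized_later = shifted(oxidized, shift["2p3/2"])
print(shift, h_terminated_later, oxidized_later)
```

Both spin-orbit components shift by the same amount, so the weighted-average reference moves rigidly by 0.12 eV as well.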

B. Contactless C-V measurements

C-V measurement was performed at the small-signal frequency
of 500 kHz. The insulator capacitance C_I was estimated
from the optically determined air-gap distance. All
measurements were carried out in a pure N2 (99.9999%) atmosphere,
utilizing a "nitrogen gap" rather than an air gap.

The  results  of  C-V  measurement  for  the  samples  having 
high-temperature  thermal  oxide  film  (100  nm)  are  shown  in 
Fig.  6.  The  calculated  ideal  C-V  curve  is  shown  by  a  broken 
line.  It  was  calculated  from  the  doping  density  and  the  insu¬ 
lator  capacitance  including  the  air-gap  contribution,  assum¬ 
ing  flatband  at  zero  bias,  and  ignoring  the  effect  of  work- 
function  difference  between  the  metal  field  plate  and  the 
semiconductor  surface.  A  well-behaved  C-V  curve  was  ob¬ 
tained  for  the  high-temperature  thermally  oxidized  surface 
with  the  presence  of  some  amount  of  positive  charges.  The 
hysteresis  which  is  seen  only  in  the  deep  depletion  region 
corresponds  to  the  nonequilibrium  supply  of  the  minority 
carriers.  Pulsed  C-V  measurement  was  also  performed  in 
which  gate  voltages  were  supplied  by  pulse  wave  forms.  As 
expected, the response of minority carriers disappeared, and
the 1/C^2-V plot gave the correct carrier concentration.
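
The statement that the 1/C^2-V plot gives the carrier concentration rests on the standard deep-depletion relation for a MIS capacitor, 1/C^2 = 1/C_I^2 + 2V/(q ε_s N_D) per unit area, where V is the magnitude of the depleting bias; the slope is therefore independent of the insulator (here gas-gap) capacitance. The sketch below illustrates the extraction on synthetic data. The doping value of 1.5e15 cm^-3 and the 325 nm gap are hypothetical inputs chosen for illustration, and the flatband voltage is taken as zero.

```python
Q = 1.602e-19            # elementary charge, C
EPS0 = 8.854e-12         # vacuum permittivity, F/m
EPS_SI = 11.7 * EPS0     # permittivity of Si, F/m


def inv_c2(v_dep, n_d, c_i):
    """1/C^2 per unit area in deep depletion for bias magnitude v_dep (V)."""
    return 1.0 / c_i ** 2 + 2.0 * v_dep / (Q * EPS_SI * n_d)


def fit_slope(xs, ys):
    """Least-squares slope of ys versus xs."""
    n = len(xs)
    mx = sum(xs) / n
    my = sum(ys) / n
    num = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    den = sum((x - mx) ** 2 for x in xs)
    return num / den


def doping_from_slope(slope):
    """Donor density (m^-3) from the slope of 1/C^2 versus bias."""
    return 2.0 / (Q * EPS_SI * slope)


# Synthetic pulsed C-V data (hypothetical parameters)
n_true = 1.5e21                  # m^-3, i.e. 1.5e15 cm^-3
c_gap = EPS0 / 325e-9            # F/m^2 for a 325 nm gas gap
biases = [0.5 * k for k in range(1, 21)]
data = [inv_c2(v, n_true, c_gap) for v in biases]

n_fit = doping_from_slope(fit_slope(biases, data))
print(f"recovered doping: {n_fit * 1e-6:.3e} cm^-3")
```

The intercept of the same plot depends on the gap capacitance and the flatband voltage, so only the slope is used here.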

The contactless C-V curves taken on a H-terminated surface
are shown in Fig. 7(a). Flattening of capacitance took
place under positive and negative bias sweeps with a very
large hysteresis, resulting in a very limited range of C-V
variation. Complete flattening under positive bias indicates
the existence of a strong Fermi-level pinning at slightly above
the midgap. By comparing the measured capacitance and the
ideal C-V curve, the pinning position of E_F corresponding
to the flattening position of capacitance was estimated to be
0.69 eV above the valence-band edge. This estimation includes
a possible error of ±0.1 eV mainly caused by the
measurement error of the air-gap distance of Δd = ±5 nm.
The pinning position thus obtained is in good agreement with
the XPS band-bending data shown in Fig. 5. This flattening
persisted in pulsed C-V curves and took place at the same
position.

Fig. 6. Contactless C-V curves for the samples having high-temperature
thermal oxide film (100 nm). The calculated ideal C-V curve is shown by
the broken line.

Fig. 7. Contactless C-V curves taken (a) on the initial H-terminated surface
and (b) after second H termination done on chemically oxidized surface.

Fig. 8. Contactless C-V curves after chemical oxidation of H-terminated
surfaces.

On  the  other  hand,  the  width  of  hysteresis  under  a  nega¬ 
tive  bias  increased  as  the  sweep  rate  of  bias  decreased.  This 
large hysteresis was also observed for the pulsed C-V curve.
The  flat  portion  in  the  negative  going  sweep  corresponds  to 
the  high-frequency  inversion  capacitance,  indicating  that  an 
inversion  layer  is  produced  rapidly  through  a  rapid  surface 
generation  of  minority  carriers.  However,  this  inversion  level 
is  not  stable  under  positive-going  sweep  and  the  capacitance 
quickly comes up to an upper plateau. Furthermore, our previous
analysis of the capacitance transients also showed
anomalous two-step transients under a negative bias, indicating
that the inversion is not stable on this surface. As discussed
in detail previously, the whole behavior can only be
explained by the presence of amphoteric discrete states which
have both donor and acceptor characters. Such a state may be due
to Si dangling bonds surrounded by Si-H bonds.

Another remarkable feature of the H-terminated surface
was the reproducibility of its electronic properties. As an example,
Fig. 7(b) shows the C-V curve taken after a second
H termination following chemical oxidation (hot HNO3) of the
initial H-terminated surface. It is very interesting that the
characteristic C-V behavior with a large hysteresis is completely
reproduced after removal of the chemical oxide by a
NH4F solution, i.e., upon reappearance of the H-terminated
Si(111) surface, although the C-V behavior of the chemically
oxidized surface is totally different, as shown in the
following. This reproducibility makes the H termination particularly
attractive as the initial surface treatment for atomically
controlled processing.

Figure 8 shows the C-V curves after two types of chemical
oxidation processes of H-terminated surfaces. As seen in
Fig. 8, the behavior is completely different from that of the
H-terminated surface. An extremely limited capacitance
change is seen at a lower capacitance level. The shape of the
curves is similar to that encountered in GaAs MIS systems,
where the C-V curve under accumulation bias shows a
strong frequency dispersion and eventually flattens out to a
small value which is far below the insulator capacitance.
This behavior indicates the presence of a high density of interface
states which cause a strong Fermi-level pinning at a
much deeper position than on the H-terminated surface. The
capacitance analysis indicates that the pinning position is at
about 0.35-0.40 eV above the valence-band edge. This
agrees again with the XPS band-bending result. It should be
noted that the observed energy position is close to the hybrid
orbital charge neutrality level E_HO of 0.32 eV, or to the
Tersoff midgap energy level of 0.36 eV for Si.

Fig. 9. Contactless C-V curves obtained at the air-exposed surfaces.

Fig. 10. Contactless C-V curves after low-temperature thermal oxidation of
H-terminated surfaces.

Figure 9 shows the results of C-V measurements on surfaces
after air exposure of the H-terminated surface. The
result obtained on the as-received air-exposed surface is also
included. Again, a very limited range of capacitance variation
was observed, indicating Fermi-level pinning near E_HO.
In both samples, the capacitance flattens under a positive
bias, and then further increases as shown in Fig. 9. This
shape of the curve is again similar to that of an n-type GaAs
MIS capacitor with strong frequency dispersion of capacitance.
Thus, it is most likely that the second increase of
capacitance will flatten out at high measurement frequencies.

Figure  10  shows  the  C-V  curves  after  low-temperature 
thermal  oxidation  of  the  H-terminated  surfaces.  When  the 
surface  was  oxidized  at  300  °C,  a  very  flat  C-V  curve  was 
obtained.  For  the  surfaces  after  oxidation  at  350  °C,  on  the 
other  hand,  the  capacitance  change  was  enhanced  as  seen  in 
Fig.  10.  This  appears  to  be  correlated  with  the  result  of  XPS 
analysis. Namely, the surfaces having a large amount of suboxide
components exhibited a small capacitance variation,
whereas an increase of the Si^4+ component at a higher oxidation
temperature led to a larger capacitance variation.

On the basis of the measured C-V results, one should be
able to determine the N_ss distribution by applying the Terman
method. However, this was not attempted here because
of the following ambiguities. First, the Terman method assumes
that a true high-frequency limit is reached in the C-V
measurement. In the present contactless C-V measurement
system with a complex capacitance feedback mechanism, the
measurement frequency was limited to a single frequency of
500 kHz. This frequency is high enough for standard
thermally oxidized Si MOS capacitors to reach the high-
frequency limit. However, since rather nonstandard interfaces
are dealt with here, it may not be high enough, judging from
our previous experience with GaAs MIS systems, where the
true high-frequency limit was reached only at microwave frequencies.
Thus, the nonflattened portion of the C-V curves under high
accumulation bias requires a more careful examination in the
future to determine the N_ss distribution quantitatively. Another
ambiguity lies in the determination of the insulator
capacitance. Although the present system can determine the
air-gap distance within ±5 nm on thermally oxidized surfaces
by the optical method, further investigation is necessary as
to its applicability to surfaces covered with other types of
oxides.

On the other hand, a relative comparison of the density of
interface states between different surfaces can still be made
by comparing the range of capacitance variation with bias.
Namely, as seen in Figs. 6, 8, 9, and 10, the capacitance
corresponding to the capacitance flattening decreases in the
order of the high-temperature thermally oxidized surface, the
low-temperature thermally oxidized surface at 350 °C, the
air-exposed surface, the chemically oxidized surface, and the
low-temperature thermally oxidized surface at 300 °C. In
C-V curves, the lower the flattening capacitance level under
positive bias, the narrower the surface potential sweep with
bias and, therefore, the higher the interface state density.
Thus, it can be concluded that the interface state density
increases in the aforementioned order.

Fig. 11. The results of PLS³ measurements for various Si(111)
surfaces. The PL efficiencies are plotted as a function of
excitation intensity.

C. PLS³ measurements

Figure 11 summarizes the results of the PLS³ measurements,
where the measured values of the PL efficiency are plotted
against excitation intensity for the various surfaces
investigated here. It should be noted that the efficiency values
are normalized by the PL intensity from the reference
metal-oxide-semiconductor (MOS) sample and given in arbitrary
units, since absolute determination of the efficiency was
extremely difficult. It is seen in Fig. 11 that the PL
efficiency takes very different values for different samples.
More precisely, the PL intensity decreases in the order of the
high-temperature thermally oxidized surface, the air-exposed
surface, the chemically oxidized surfaces, and the
low-temperature thermally oxidized surface at 300 °C. One would
naturally expect that the higher the PL efficiency, the lower
the interface state density. In fact, this order in the PL
efficiency values corresponds qualitatively well to the C-V
results mentioned in the previous section concerning the
flattening levels of the C-V curves under positive bias. Thus,
the PLS³ and C-V results are consistent with each other and
indicate that the interface state density increases in the
aforementioned order.

As mentioned previously, the shape of the PL efficiency
versus φ curve provides very reliable information concerning
the shape of the distribution of the surface states causing
surface recombination. On the hydrogen-terminated surface,
the slope of the PL efficiency was exactly unity, indicating
that discrete surface states exist within the forbidden gap.
This is again consistent with the interpretation of the C-V
curves shown in Fig. 7. For all the oxide-covered surfaces,
on the other hand, the slope of the curve was less than unity
and decreased continuously with increasing photon flux
density. This indicates that the Nss distribution is U shaped
for these surfaces.

Fig. 12. Schematic illustration of Nss distributions estimated
from the results of XPS, C-V, and PLS³ measurements for the
various Si surfaces. Energy-axis labels: Ev (= 0 eV), Eho, Epi,
Eo, Ec; marked energies: 0.36 eV and 0.65 eV.
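The slope criterion described above can be illustrated with a short numerical sketch. It assumes that the slope is evaluated on a log-log plot of PL efficiency against photon flux density; the function names, the tolerance, and the sample numbers are hypothetical and are not data from this work.

```python
import math

def loglog_slopes(flux, efficiency):
    """Local slopes d(log10 efficiency)/d(log10 flux) between neighboring points."""
    slopes = []
    for i in range(len(flux) - 1):
        d_eff = math.log10(efficiency[i + 1]) - math.log10(efficiency[i])
        d_flux = math.log10(flux[i + 1]) - math.log10(flux[i])
        slopes.append(d_eff / d_flux)
    return slopes

def classify(slopes, tol=0.05):
    """Apply the qualitative rule from the text (tol is an assumed tolerance)."""
    # Unity slope everywhere: discrete surface states.
    if all(abs(s - 1.0) <= tol for s in slopes):
        return "discrete"
    # Slope below unity and not increasing with flux: U-shaped continuum.
    below_unity = all(s < 1.0 for s in slopes)
    decreasing = all(b <= a + tol for a, b in zip(slopes, slopes[1:]))
    if below_unity and decreasing:
        return "U-shaped"
    return "undetermined"

# Hypothetical curves (arbitrary units), for illustration only.
flux = [1e14, 1e15, 1e16, 1e17]
eff_h_terminated = [1e-6, 1e-5, 1e-4, 1e-3]
eff_oxide = [10 ** -5.0, 10 ** -4.2, 10 ** -3.6, 10 ** -3.2]

kind_h = classify(loglog_slopes(flux, eff_h_terminated))
kind_oxide = classify(loglog_slopes(flux, eff_oxide))
```

Such a classification only mirrors the qualitative argument; extracting the magnitude of the state density would require the full simulation fit discussed in the text.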

The PLS³ method can determine not only the shape of the
Nss distribution but also its magnitude by fitting the data
with a computer simulation. However, this requires knowledge of
the electron and hole capture cross sections of the interface
states as well as the absolute values of the PL efficiency.
Because this knowledge is lacking and only a limited amount of
data could be taken in the high excitation range owing to weak
PL intensities, quantitative determination of the Nss
distributions by this method was not attempted in this article.

D.  Models  of  Nss  distribution  for  hydrogen 
terminated  and  ultrathin-oxide-covered  Si(111) 
surfaces 

The models of the Nss distribution deduced by combining all
the results of the XPS, C-V, and PLS³ measurements are shown in
Fig. 12 for the various Si surfaces studied here.

Starting from the H-terminated surface, the unity slope of
the PL efficiency indicates that there exists a high density of
discrete states which cause Fermi-level pinning. The C-V
results indicate the amphoteric nature of these states. The
pinning position of the H-terminated surface determined by the
XPS analysis shown in Fig. 5 is in fairly good agreement with
that estimated from the flattening value of the capacitance in
the C-V curves. It lies at about 0.65 ± 0.05 eV above the
valence-band maximum.

As for the concentration of the amphoteric discrete states,
it was impossible to determine its value due to the danger of
dielectric breakdown in the C-V measurement. From the applied
voltage sweep corresponding to the flattening of capacitance,
it should be larger than 1X10*^ cm⁻².

This  value  is  much  larger  than  the  number  of  recombina¬ 
tion  centers  of  10^-10^  cm~^  obtained  by  Yablonovitch 
et  al.^^  for  Si  surfaces  immersed  in  HF  acid  using  the  pho¬ 
todecay  method.  This  large  difference  seems  to  be  due  to  the 
fact  that  the  H-terminated  surface  in  the  present  study  is 
exposed  directly  to  dry  N2,  whereas  the  sample  studied  by 
Yablonovitch et al. was held in equilibrium with hydrogen-containing
liquid. In fact, a recent SPV study of a
H-terminated  surface  in  liquid  has  shown  that  the  surface 
state  density  is  very  low  in  such  a  liquid,  unlike  the  present 
result. 

As for the structural origin of the discrete states, it is
highly unlikely that they are due to H termination itself,
because a theoretical calculation^^’^^ and recent STM/scanning
tunneling spectroscopy (STS) observations^’^^ indicate that
no gap states are produced on perfectly H-terminated surfaces.
We have recently proposed that these states are due to Si
dangling bonds on the H-terminated surface. Although STM
studies have demonstrated perfectly H-terminated surfaces over
nanometer-scale fields, it is highly likely that on a real
surface of wide extent some hydrogen atoms leave the surface,
leaving dangling bonds, which possess amphoteric properties.

Since no theoretical calculation has so far been made, to our
knowledge, on the validity of such a structural model, further
theoretical work is needed.

As for the surfaces covered with various low-temperature
ultrathin oxides, their C-V and PLS³ behaviors were rather
similar to each other and were completely different from that
of the initial H-terminated surface. As shown in the models
in Fig. 12, the Nss distributions of these surfaces are all U
shaped, having a minimum at 0.35 eV above the valence-band
maximum, although the magnitude of Nss itself showed
considerable process dependence.

According to the disorder-induced gap state (DIGS)
model,^^ disorder of bonds near the surface produces a donor-
and acceptorlike state continuum within the energy gap
whose boundary is given by the hybrid orbital charge
neutrality level E_HO. Thus, when the state density is high, the
Fermi level is pinned at E_HO, which lies at about E_V + 0.32 eV
for Si. This seems to be more or less the situation which took
place on all the ultrathin-oxide-covered surfaces studied
here, and may be due to the presence of nonstoichiometric
and nonuniform oxides. The present work indicates a rather
unexpected difficulty in obtaining pinning-free
ultrathin-oxide-Si interfaces by low-temperature processing.
Further work is necessary to establish a process for growth of
pinning-free ultrathin oxides. In this connection, the result on
the low-temperature thermal oxidation at 350 °C gives hope
for future successful optimization.

IV.  CONCLUSION 

Electronic properties of ultrathin-oxide-formed Si(111)
surfaces were characterized in contactless fashion, using
contactless capacitance-voltage (C-V) and photoluminescence
surface state spectroscopy (PLS³) techniques together with
x-ray photoelectron spectroscopy (XPS). Hydrogen (H)
terminated Si surfaces were used as the initial surface prior to
ultrathin oxidation. Ultrathin oxides were formed by chemical
oxidation (hot HNO3, H2SO4+H2O2), long-time air exposure, and
low-temperature oxidation (below 350 °C). The main conclusions
are listed below.

(1) The initial H-terminated surfaces showed the presence
of Fermi-level pinning at E_0 = E_V + 0.65 eV due to a high
density of amphoteric discrete states, probably originating
from Si dangling bonds.

(2) All the ultrathin-oxide-covered surfaces exhibited
very limited capacitance variation with bias, similar to
GaAs capacitors. The results indicate that the Fermi level is
pinned near the hybrid orbital charge neutrality level E_HO
due to the presence of high-density interface states with narrow
U-shaped continuous distributions.

(3)  Low-temperature  thermal  oxidation  at  350  °C  slightly 
weakens  such  pinning. 

The present work indicates the difficulty of realizing
unpinned ultrathin-oxide-silicon interfaces by the
low-temperature processes studied here.
High-resolution x-ray photoelectron spectroscopy (XPS) in conjunction with secondary-ion-mass
spectrometry was used to study the chemical nature and distribution of N in oxynitride films formed
by rapid thermal N2O processes (RTPs) or conventional furnace methods. The kinetics of furnace
oxide growth in N2O are slower than those in O2. During reoxidation the oxidation rate increased to
that in pure O2 and the N in the SiO2-Si interface region is displaced into the bulk of the oxide.
High-resolution synchrotron Si 2p core-level photoemission spectroscopy (PES) was used to study
the oxide-Si(100) interface suboxide structures produced by RTP with and without the presence of
N. XPS N 1s studies indicated that there are two types of N in the RTP oxynitride films. The
chemical bond configuration of the first type of N is similar to that of N in Si3N4 and is mainly
distributed within the first 1 nm from the interface. The second type of N is distributed mainly
outside of the first 1 nm region, and the N is likely bonded to two Si and one oxygen atom. PES
studies showed that Si formed suboxides with oxygen at the interface for all oxynitride films. It is
found that there is no change in the Si¹⁺ structure while there is a dramatic decrease in the Si²⁺ and
Si³⁺ states with the inclusion of N in the oxide. Both the XPS and PES results are explained in terms
of a strain reduction as N is incorporated in the film near the interface region, where Si3N4 functions
as a buffer layer which reduces the stress caused by the large Si lattice mismatch between the bulk
Si and the oxide overlayer. About 1/5 of the Si²⁺ and 1/3 of the Si³⁺ atoms at the SiO2-Si interface have
been replaced by the Si3N4 buffer layer at the oxynitride-Si interface. © 1996 American Vacuum
Society.


I.  INTRODUCTION 

The gate-dielectric-Si(100) interface has been one of the
prime issues in the development of deep submicron Si-based
ultralarge-scale integrated circuit (ULSI) technology.
Because of their superior electrical properties, nitrided oxides
or oxynitrides are currently under intensive study for use as
gate dielectrics.^"^ Most reported work has focused on the
growth method and the electrical properties of these thin
films. There have been some articles dealing with the physical
and chemical structure of the oxynitride films. For example,
it has been reported^ that the total amount of N is
very small (a few monolayers). The N accumulates at the
SiO2-Si interface and leads to improved electrical
properties; however, questions remain regarding the effect
of N on the oxide-Si interface structure. The introduction of
N into the films using NH3 was hitherto the usual method,
but subsequent reoxidation was required to reduce electron
trapping caused by hydrogen incorporation.^^ The alternative
use of N2O to introduce N into the SiO2 film avoids this
problem and constitutes a single-step process by direct
oxidation in N2O without further reoxidation. There has
been a great deal of interest in the oxidation of Si in N2O
over the last few years because of these factors, and studies
include examination of both the N distribution in and the
electrical properties of the SiO2 oxide films.^^’^^’^^"^^ In these
studies very thin oxide films have been grown in N2O ambients
both in conventional furnaces and by rapid thermal oxidation
(RTO) methods. Oxide growth rates were determined for
temperatures in the range 950-1200 °C under a variety of
conditions. N accumulation at or near the SiO2-Si
interface has been determined by Auger electron spectroscopy
(AES)^^’^^’^^ and secondary-ion-mass spectrometry (SIMS).
The nature of the N-containing species
at or near the SiO2-Si interface formed during oxidation in
N2O has been examined by x-ray photoelectron spectroscopy
(XPS).^^’^^’^^ The aim of this article is to study the chemical
nature and depth distribution of N in oxynitride films and the
effects of N on the interface structure using XPS, SIMS, and
photoemission spectroscopy (PES). The surface-analytical
data are complemented by kinetic studies in O2 and N2O.

II.  EXPERIMENT 

Two types of oxide and oxynitride films were grown. The
first is grown in low-pressure O2 and N2O in a quartz tube
heated in a conventional furnace. The experimental procedures
used, as well as details of the determination of oxide
thicknesses and N content in the films and of the SIMS analysis,
are described more fully elsewhere. This type of oxide
was used to study oxide growth kinetics. The second type of
oxide and oxynitride is grown at Northern Telecom’s standard
0.5 μm complementary metal-oxide-semiconductor
(CMOS) process facilities. The high quality of these films
was confirmed by electrical measurements on both n- and
p-channel MOS field-effect transistors (MOSFETs) fabricated
on these oxynitride films. In order to focus on the
effect of N on the interface structure, we concentrate on
oxynitride films processed at 1050 °C. This avoids the
complications of other effects such as growth temperature on the
interface suboxide distributions. In a previous study^^"^ we
have established, in the case of furnace-grown oxide, that
high-temperature treatment leads to an increase of Si
suboxides. Here we focus our study on several representative
growth conditions used in fabrication of CMOS circuits:

(a)  rapid  thermal  oxidation  using  pure  oxygen  (RTO)  on  a 
bare  Si  surface; 

(b)  rapid  thermal  oxidation  using  N2O  gas  (RTN)  on  a  bare 
Si  surface; 

(c)  RTN  treatment  of  the  RTO  oxide  (RTN/RTO);  and 

(d)  rapid  thermal  reoxidation  of  RTN/RTO  (ReOx  RTN/ 
RTO)  films. 

III.  RESULTS  AND  DISCUSSION 
A.  Growth  kinetics 

The growth of the film formed during the oxidation of
Si(100) in N2O at 1050 °C and 10 Torr pressure can best be
expressed in terms of a direct logarithmic equation of the
form

x = k_log log(t + t_0) + A,    (1)

where x is the oxide thickness, k_log is the logarithmic rate
constant, t is the time, t_0 is the time at x = 0, and A is a
constant. This is in contrast to film growth in pure O2 at
1050 °C, where a parabolic rate relation was found to best fit
the kinetic data.^'^ This is shown in Fig. 1(a), where kinetic
data are compared for the oxidation of Si(100) in O2 and
N2O at 10 Torr pressure and 1050 °C. The extent of oxidation
is much less in N2O than in O2 at any given time. These
data support the view that the formation of a N-rich
interfacial layer retards oxidant diffusion to the substrate
surface; the more N at the interface, the less the extent of
oxidation. Lange et al. and Ting et al. also found that
the oxide thicknesses formed during RTO are much less in
N2O than in O2. They also found that the oxide growth rate
became very slow after the film thicknesses reached 2-6 nm,
depending on the temperature. They also support the view of
a N-rich interfacial layer retarding oxidant diffusion to the
substrate.

Fig. 1. (a) A comparison of the oxidation kinetics for Si(100) at 1050 °C in
N2O and in pure O2 at 10 Torr pressure. The points on the solid lines are for
individual experiments, the lines being least-squares fits to a direct
logarithmic equation for N2O and to a parabolic equation for pure O2, respectively.
(b) Kinetics for Si(100) previously oxidized in N2O for 1 h followed by
reoxidation in ¹⁸O2 for various times, both oxidations being at 1050 °C and
10 Torr pressure. The points on the solid line (a parabolic least-squares fit to
the data) are for individual experiments. Kinetic curves for oxidation in O2
and N2O are shown for comparison.

Several reoxidation experiments were also carried
out, in which samples were first oxidized in N2O for 1 h and
then oxidized in ¹⁸O2 (nominally 97.0%-99.0% and containing
a few ppm H2O) for periods from 1 to 16 h at
1050 °C. The kinetics of reoxidation are compared with
those for oxidation in N2O and in O2 only. The data for N2O
have been extrapolated to longer times. As shown in Fig.
1(b), during subsequent oxidation in ¹⁸O2 the oxidation rate
increased gradually, becoming similar to that in pure O2 after
about 2 h. The longer-term data points from separate runs are
fitted to a parabolic rate equation of the form

x^2 = k_p t + B,    (2)

where k_p is the parabolic rate constant and B is a constant.
SIMS analysis showed that during reoxidation in ¹⁸O2 the
amount of N in the region near the SiO2-Si interface becomes
reduced, the N being displaced into the bulk of the
oxide. The displaced N-enriched zone moves progressively
outward with increasing time; the amount originally at the
SiO2-Si interface is reduced significantly within 2 h of
reoxidation. After 16 h the amount displaced into the bulk of
the film relative to the total oxide becomes very small and
the N is now concentrated near the gas-SiO2 interface. The
total percentage of N in the reoxidized film relative to that in
the oxide formed in N2O was found to be reduced to about
half its original value after 2 h of reoxidation, and this value
remains roughly constant out to 16 h of reoxidation. This
indicates that some N is lost to the ambient initially, but the
remainder is stable within the oxide thereafter. The
replacement of N by O in the SiO2 network occurs because of the
diffusion of O to the SiO2-Si interface, where new oxide
forms. Some of the N in the very thin Si3N4 and oxynitride
layers adjacent to the SiO2-Si interface (see below) is
replaced by O and these oxynitride layers are displaced
outward by the new oxide formation at this interface. This
displacement process has also been observed by Saks et al.,
Okada et al., and Green et al.^ These SIMS results may be
correlated with the kinetic data, which show that after about
2 h the reoxidation kinetics are similar to those for oxidation
with no prior oxynitride present [see Fig. 1(b)]. This is the
point at which the total amount of N in the oxide film
becomes constant. Evidently this level of N in the oxide film is
now too low to cause any significant effect on the inward
diffusion of oxygen, and oxidation essentially proceeds at the
same rate as in pure O2.

Fig. 2. N 1s XPS spectrum recorded on a RTN etchback sample. The open
circles are as-recorded data. The dotted and dashed lines are curve-fitted
peaks with the chemical state as labeled. The solid line is the sum of these
two peaks.
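The two rate laws of Eqs. (1) and (2) can both be fitted by ordinary linear least squares once the data are transformed appropriately. The following is a minimal sketch under stated assumptions: the logarithm in Eq. (1) is taken as base 10 (another base only rescales k_log), t_0 is treated as a known input rather than a fitted parameter, and the data points are synthetic values generated for illustration, not the measurements of Fig. 1.

```python
import math

def fit_line(xs, ys):
    """Ordinary least-squares fit of ys = slope * xs + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x

def fit_log_law(times, thicknesses, t0):
    """Fit Eq. (1), x = k_log*log10(t + t0) + A, for an assumed t0.

    Returns (k_log, A)."""
    return fit_line([math.log10(t + t0) for t in times], thicknesses)

def fit_parabolic_law(times, thicknesses):
    """Fit Eq. (2), x^2 = k_p*t + B, as a straight line in (t, x^2).

    Returns (k_p, B)."""
    return fit_line(times, [x * x for x in thicknesses])

# Synthetic, hypothetical data (time in h, thickness in nm).
times_h = [1.0, 2.0, 4.0, 8.0, 16.0]
x_n2o = [2.0 * math.log10(t + 0.5) + 3.0 for t in times_h]
x_o2 = [math.sqrt(50.0 * t + 4.0) for t in times_h]

k_log, a_const = fit_log_law(times_h, x_n2o, t0=0.5)
k_p, b_const = fit_parabolic_law(times_h, x_o2)
```

Fitting t_0 as a free parameter would make the problem nonlinear; scanning a few trial values of t_0 and keeping the one with the smallest residual is one simple way around that.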

B.  Chemical  structure  and  distribution  of  N 

X-ray photoelectron spectroscopy (XPS) was used to
determine the nature and distribution of N in the film. The
measurements were carried out on a PHI 5500 system which
is equipped with a monochromated Al Kα source and a
hemispherical electron analyzer. Thin oxide films of various
thicknesses were obtained by etchback in a dilute (1%) HF
solution. No thickness variation was found on the etchback
samples. Si 2p core levels were recorded and were used to
calculate the film thickness using the method described in
Ref. 26 for an electron mean free path of 3.5 nm. In order to
increase the depth resolution, the measurements were made
at a 30° take-off angle. Figure 2 shows a high-resolution
N 1s spectrum taken on an etchback RTN sample. The
spectrum was taken with a 5.8 eV pass energy. It normally takes
several hours to obtain such a spectrum on an oxynitride
sample which has a very dilute N concentration (as low as a
few at. %). As can be seen in the figure, there are two
components in the spectrum. Curve-fitting analysis indicated one
peak located at 396.97 eV and another with a chemical shift
of 0.85 eV at 397.82 eV. Measurements on Si3N4 films yield
a N 1s peak at 397.0 eV. This suggests that the chemical
environment of the N at 396.97 eV in the oxynitride is similar
to that in Si3N4, i.e., one N bonded to three Si atoms; it
is referred to as N(-Si3) in the figure and the following text.
Based on its binding energy, the N at 397.82 eV is
tentatively assigned to N bonded to two Si and one O, which
is more electronegative than N and thus causes the N 1s shift
to the higher binding energy with the formation of a N-O
bond. This N species is referred to as N(-Si2O) in the figure
and following text. From synchrotron Si 2p core-level
measurements to be presented, it was found that the
oxynitride-Si interface is formed by suboxides. This means
that the N(-Si3) second-nearest-neighbor atoms are either N
or O. Theoretical calculations^^ on the SiO2-Si system found
that the core-level shift is determined by first-nearest-
neighbor atoms and contributions from the second-nearest-
neighbor atoms are negligible.

Fig. 3. N concentration profiles as a function of oxynitride film thickness for
(a) the RTN sample and (b) the reoxidized RTN/RTO sample. Horizontal
axis: film thickness (Å).
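The thickness determination from the Si 2p core levels mentioned above follows the method of Ref. 26, which is not reproduced in the text. As an illustration only, the sketch below uses the commonly quoted uniform-overlayer attenuation model, d = λ sin θ ln(1 + R/R0), where R is the measured oxide-to-substrate Si 2p intensity ratio; it is an assumption that this matches Ref. 26. The take-off angle θ is assumed to be measured from the surface plane, and R0 (the intensity ratio for infinitely thick oxide and bare substrate) is a hypothetical calibration constant set to 1.0 as a placeholder.

```python
import math

def oxide_thickness_nm(i_oxide, i_substrate, mfp_nm=3.5,
                       takeoff_deg=30.0, r0=1.0):
    """Overlayer thickness from the oxide/substrate Si 2p intensity ratio.

    mfp_nm      -- electron mean free path (3.5 nm quoted in the text)
    takeoff_deg -- take-off angle from the surface plane (30 deg in the text)
    r0          -- assumed bulk intensity ratio; must be calibrated in practice
    """
    ratio = (i_oxide / i_substrate) / r0
    return mfp_nm * math.sin(math.radians(takeoff_deg)) * math.log(1.0 + ratio)

# Hypothetical intensities (arbitrary units), for illustration only.
d_thin = oxide_thickness_nm(0.5, 1.0)
d_thick = oxide_thickness_nm(5.0, 1.0)
```

A smaller take-off angle shortens the effective sampling depth λ sin θ, which is why grazing collection improves depth resolution for films of about 1 nm.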

The concentration of N in the oxynitride can be calculated
from the integrated peak intensities of N 1s, Si⁴⁺ (i.e.,
SiO2) 2p, and O 1s. Their relative sensitivity factors were
determined from thermally produced SiO2 and Si3N4
samples. We have found that there are some residual
contaminants such as hydrocarbon and water present on the
etchback samples. In order to eliminate this artificial
fluctuation in the N profile, we calculate the N concentration
using only the N 1s and the Si⁴⁺ 2p peaks. Figure 3(a) shows N
distributions obtained from the RTN sample. The data show
that the N(-Si3) is distributed mainly within the first 1 nm
region of the interface while the N(-Si2O) is distributed mainly
about 2 nm away from the interface on as-grown oxynitride films.
There is no apparent difference in the N distribution between
the RTN and the RTN/RTO films. A possible explanation for
the observed N distribution is related to the Si atomic density
mismatch. It is known that there is a large mismatch in the Si
atomic density across the oxide-Si interface, from 2.2×10²²
atoms cm⁻³ in the SiO2 to 5×10²² atoms cm⁻³ in bulk Si.
The transition of this interface is realized through the
formation of an extra sacrificial buffer layer of suboxide.^^ Details
of this suboxide structure are presented in the following text.
It is also known that the Si density in Si3N4 is 4×10²²
atoms cm⁻³, between that of bulk Si and SiO2. The inclusion
of Si3N4 near the oxide-Si interface therefore creates a
buffer layer which reduces the mismatch-induced strain at
the oxide-Si(100) interface.
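The Si atomic densities quoted in this argument can be checked from mass densities and molar masses. The sketch below is a back-of-the-envelope calculation; the mass densities (2.33 g/cm³ for Si, 2.20 g/cm³ for amorphous SiO2, 3.10 g/cm³ for Si3N4) are typical handbook values assumed here and are not given in the text.

```python
AVOGADRO = 6.02214076e23  # atoms per mole

# (mass density in g/cm^3, molar mass in g/mol, Si atoms per formula unit)
# The densities are assumed typical values, not taken from the text.
MATERIALS = {
    "Si": (2.33, 28.086, 1),
    "SiO2": (2.20, 60.084, 1),
    "Si3N4": (3.10, 140.283, 3),
}

def si_atoms_per_cm3(density, molar_mass, si_per_formula):
    """Number density of Si atoms for a compound of given mass density."""
    return density / molar_mass * AVOGADRO * si_per_formula

si_density = {name: si_atoms_per_cm3(*props)
              for name, props in MATERIALS.items()}
```

With these assumed inputs the three values come out near 5×10²², 2.2×10²², and 4×10²² atoms cm⁻³ for Si, SiO2, and Si3N4, respectively, so the nitride indeed lies between the substrate and the oxide.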

Figure 3(b) shows the N profile of the reoxidized
oxynitride film (ReOx RTN/RTO film). The reoxidation was done
at 1050 °C for 1 min, which produces a nominal 8 nm oxide
on a bare surface. Here the high diffusion barrier of the
oxynitride film restricts the oxide growth to about 1 nm. The
new oxide film, as shown in the figure, is formed between
the Si substrate and the “old” oxynitride. The reoxidation
process is also shown to reduce the amount of Si3N4, which
is likely converted into N(-Si2O). These observations appear
to contradict our above conclusion, based on a strain energy
minimization principle, that a dense Si nitride layer between
the Si substrate and oxide film is favored. This paradox may
be explained by the growth kinetics. New oxide is formed on
the substrate surface by a continuous flow of O atoms; there is
no mobile N-associated species readily available. The N in
the old film is chemically bonded to the oxynitride network
and a large activation energy is required to break the network
into mobile species. As the new oxide grows and the Si3N4
species move away from the interface region, the strain on
the N(-Si3) species increases as it moves into a less
dense SiO2 network. The strained N(-Si3) is then gradually
converted into a less strained N(-Si2O) species, as observed.

C.  The  effect  of  N  inclusion  on  interface  suboxide 
structure 

Considering the interface structure, it is well established
that the SiO2-Si interface is formed through a transition
region made up of Si suboxides denoted as Si¹⁺, Si²⁺, and
Si³⁺, where the Si atoms have one, two, and three first-
nearest-neighbor oxygen atoms, respectively. Synchrotron
photoelectron spectroscopy (PES) is probably the only
technique that can unambiguously identify these suboxides on
thin etchback oxides. We have shown recently^^’^® that
PES can be used to effectively monitor SiO2-Si(100)
interfacial changes under various industrial ULSI process
conditions. Previous PES measurements on Si 2p core levels have
found that the chemical shifts of these Si^{x+} peaks are 0.97,
1.8, and 2.6 eV for x = 1, 2, and 3, respectively.^^’^^ A recent
theoretical first-principles calculation by Pasquarello and
co-workers^^ found excellent agreement with these chemical
shifts.

The synchrotron PES measurements were carried out at
the Stanford Synchrotron Radiation Laboratory (SSRL) using
a 6 m toroidal-grating monochromator (TGM) beam line
(experimental station 8-1). The photoelectrons were collected
by an angle-integrated PHI double-pass cylindrical-
mirror analyzer in a UHV chamber with a load-lock chamber.
The photoelectron take-off angle was 45°. Si 2p core
shell electrons were excited by 130 eV photons. Figure 4
shows a Si 2p3/2 spectrum recorded from the RTO sample
with an etchback SiO2 thickness of 1 nm. The data were
analyzed using the same curve-fitting method as previously
used for furnace and UHV-grown SiO2-Si interfaces.^^’^^ In
Fig. 4 the dashed lines indicate various Si peaks with the
oxidation states as labeled. As can be seen in the figure, the
SiO2 peak is very strong while the substrate and suboxide Si
peaks are very weak, which requires several hours of data
acquisition time in order to obtain usable data in the
suboxide region.

Fig. 4. Si 2p3/2 PES spectrum recorded on the RTO sample using 130 eV
photons. The open circles are raw data with the 2p1/2 component and the
Shirley background subtracted. The dotted lines represent various Si species
as labeled. The solid line is the sum of all of these Si peaks.

Since the relative interfacial suboxide distribution is the main focus of this study, we have subtracted the SiO2 contribution from Fig. 4; the results are shown in Fig. 5(a), where the suboxide distribution is now much clearer. Figure 5(b) shows the curve-fitting analysis for the RTN sample. The results show that the Si^{1+}, Si^{2+}, and Si^{3+} peaks have the same chemical shifts as those found for the RTO sample. This indicates that the interface suboxide chemical bonds are Si-O bonds. The intensities of these suboxides relative to that of the substrate Si, however, are quite different. The suboxide intensities from the RTN oxynitride-Si(100) interface are significantly reduced compared with those of the RTO SiO2-Si(100) interface. Based on their relative intensities and using the photoionization cross section and mean free path given by Himpsel et al., the amounts of the various suboxides can be calculated, and the results from all three samples are listed in Table I. The table shows that the amount of Si^{1+} is the same for all samples. This may not be surprising, as Si^{1+} is located right at the substrate surface and is therefore less sensitive to structural changes in the oxide film. The amounts of both Si^{2+} and Si^{3+}, however, decrease significantly, by about 1/5 for Si^{2+} and 1/3 for Si^{3+}, at the oxynitride-Si(100) interface. This can again be explained by a reduction in the mismatch-induced strain with the inclusion of N near the interface. In
previous studies, we established that a large number of Si^{2+} and Si^{3+} states is produced at high temperature. This was attributed to a structural relaxation that minimizes the SiO2-Si interfacial strain through the production of extra sacrificial layers of Si^{2+} and Si^{3+} states as a Si density buffer.

Fig. 5. Si 2p3/2 PES spectra recorded from (a) the RTO sample and (b) the RTN sample. The 2p1/2, the Shirley background, and the SiO2 contributions have been subtracted. The open circles are the raw data. The dotted lines are contributions from the substrate silicon Si^{0} and the various interface suboxides Si^{x+}, as labeled. The solid line is the sum of all of these Si peaks.

Here, in the case of the oxynitride-Si interface, the above XPS data showed that N(-Si3) is distributed mainly near the interface. The nitrided region with high Si density also functions as a buffer layer to help reduce the strains caused by the density mismatch at the interface. This means that a relatively small amount of sacrificial Si^{2+} and Si^{3+} is required at the interface to accommodate the interface lattice mismatch. Upon reoxidation, the Si^{3+} is shown to increase while Si^{1+} and Si^{2+} remain unchanged. This can be explained by the above observation of N moving away from the interface region. The role of atomic density buffer layer in the reoxidized film therefore has to be assumed again by suboxides. However, the amount of suboxide is still less than that of pure oxide. This may be explained by the fact that the oxynitride-Si interface is less strained, even without Si3N4 as a buffer layer, compared with the SiO2-Si interface. It is also possible that there may be a small amount of N (below the XPS sensitivity of ~0.1 at. %) existing in the suboxide region. The dramatic


effect of a small amount of impurity on the Si lattice structure is well known.

Table I. The amount of suboxide N_SiOx at the dielectric-Si(100) interface formed by various rapid thermal processes (RTPs) at 1050 °C.

RTP process       N_SiOx (atoms cm^-2)
                  Si^{1+}    Si^{2+}    Si^{3+}
RTO               2.22       3.49       4.16
RTN               2.25       2.98       2.43
RTN/RTO           2.24       2.90       2.91
ReOx RTN/RTO      2.31       2.70       3.61
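The fractional changes quoted in the text can be checked against the entries of Table I. The short sketch below assumes that the three numeric columns of the table correspond, in order, to Si^{1+}, Si^{2+}, and Si^{3+} (the column headers are not fully legible in the source); the helper name `relative_change` is purely illustrative.

```python
# Suboxide amounts from Table I (units as given in the table).
# Assumption: the three columns are ordered Si1+, Si2+, Si3+.
TABLE_I = {
    "RTO": {"Si1+": 2.22, "Si2+": 3.49, "Si3+": 4.16},
    "RTN": {"Si1+": 2.25, "Si2+": 2.98, "Si3+": 2.43},
}


def relative_change(reference, sample):
    """Fractional change of each suboxide amount relative to the reference."""
    return {k: (sample[k] - reference[k]) / reference[k] for k in reference}


changes = relative_change(TABLE_I["RTO"], TABLE_I["RTN"])
for species, delta in changes.items():
    print(f"{species}: {delta:+.1%}")
```

Under this column assignment the Si^{1+} amount changes by only about 1%, while Si^{2+} and Si^{3+} fall by roughly 15% and 42%, respectively.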

IV.  CONCLUSIONS 

A.  Growth  kinetics 

The oxidation of p-type Si(100) in N2O at a pressure of 10 Torr and 1050 °C produced oxide films significantly thinner than those that would have been produced by oxidation in O2 under the same conditions. During oxidation in N2O, a N-rich layer is produced near the SiO2-Si interface, which inhibits the influx of oxidant, probably because of slower oxygen diffusion through this layer. The conformity of the oxidation kinetics to a direct logarithmic rate relation lends support to the notion of a reduction in rate caused by a second phase at the reaction zone. Reoxidation caused the N in the SiO2-Si interface region to be displaced into the bulk of the oxide and reduced the total N content of the film to a constant 50% of its original value. The oxidation rate on reoxidation increased to that for oxidation in pure O2, and the time at which this occurred corresponded to that at which the total N content of the film became constant.
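For reference, a direct logarithmic rate relation is commonly written in the form below, where x is the oxide thickness at time t and K and \tau are fitting constants. The symbols are chosen here for illustration and are not taken from the article, which does not state its fitted form in this section.

```latex
x(t) = K \ln\!\left(1 + \frac{t}{\tau}\right),
\qquad
\frac{dx}{dt} = \frac{K}{t + \tau} = \frac{K}{\tau}\, e^{-x/K}
```

In this form the growth rate falls off exponentially with the thickness already grown, which is the behavior expected when a growing barrier layer progressively impedes the supply of oxidant.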

B.  N  structure  and  distribution 

XPS studies found two types of N present in the N2O oxynitride films: one with a N 1s binding energy of 396.97 eV, attributed to N in the form of Si3N4, and another with a chemical shift of 0.85 eV, at 397.82 eV, attributed to N situated in a more electronegative environment, such as N bonded to two Si atoms and one O atom. XPS etch profiling found that Si3N4 is located within the first 1 nm from the interface and that N(-Si2O) is mainly distributed adjacent to the Si3N4 layer. Both types of N are pushed away from the interface upon reoxidation. The amount of Si3N4 species is also found to decrease significantly after reoxidation. The observed data are explained by the strain-energy minimization principle.

C.  Interface  structure 

Synchrotron Si 2p core-level studies found that the topological link between the oxynitride film and the Si surface is made by various Si suboxides, similar to that for the SiO2-Si interface. The amounts of the various suboxides, however, are found to differ dramatically. The data showed that the amount of Si^{2+} and Si^{3+} states at the oxynitride-Si interface is about 30% less than that at the SiO2-Si interface, while the amount of Si^{1+} states remains the same. The current results, together with previous data on the effect of temperature, where suboxides are found to vary significantly with growth and anneal temperature, are explained by a new interface model in which the suboxides have two functions. The first function of the suboxides is to provide a topological link between the crystalline Si substrate and the amorphous SiO2 network. The current theoretical model is focused only on this type of suboxide structure. The second function of the suboxides, mainly Si^{2+} and Si^{3+}, is to act as "buffer" layers that minimize strains caused by the large (about 50%) Si density gradient across the interface. In the case of the oxynitride-Si interface, the role of the "buffer" suboxides has been taken over by Si3N4. Since the Si atomic density in the nitride is between that in bulk Si and that in SiO2, the interface is less strained. The less strained oxynitride-Si interface may be correlated with the observations of a lower amount of interface states generated under current stress and radiation, compared with conventional thermal SiO2-Si interfaces. The results also indicate that oxynitride films are very attractive in terms of a reduced interface transition region for future 0.1 μm or smaller ULSI manufacturing, which uses gate dielectrics of 3.4 nm or less.

Surface state effects in In0.53Ga0.47As one-dimensional quantum wires and the effectiveness of the Si interface control layer (Si ICL)-based passivation technique are investigated using photoluminescence (PL) as the probe. Scanning electron microscope and x-ray photoelectron spectroscopy measurements were made to characterize the structure and the interface properties. The In0.53Ga0.47As quantum wires embedded in In0.52Al0.48As barrier material were fabricated by selective molecular beam epitaxy growth on patterned InP substrates. Unpassivated near-surface quantum wires showed an exponential decrease of PL intensity with reduction of surface-to-well distance, similarly to the near-surface quantum wells. By applying the Si ICL-based passivation, a nearly complete recovery of PL intensity was achieved with an observed maximum recovery factor of 250 for the InGaAs quantum wire directly passivated with SiO2/Si ICL (t_ws = 0). The mechanism for the PL recovery is explained in terms of suppression of surface states by passivation. © 1996 American Vacuum Society.


I.  INTRODUCTION 

Recent intensive research efforts in the study of quantum effects indicate that compound semiconductor quantum nanostructures such as quantum wires and quantum dots may become fundamental building blocks of next-generation ultralarge-scale integrated circuits (ULSIs) based on quantum effects. For successful large-scale planar integration, such structures should be placed near the surface.

However, the surface of compound semiconductors is generally known to possess a high density of surface states, and an interaction between the surface states and the quantum structure may spoil the proper operation of the devices. In fact, strong surface state effects on the photoluminescence (PL) properties of near-surface quantum wells have been reported recently by several groups. According to these reports, the PL intensity from a quantum well lying near the surface is reduced exponentially as the distance between the surface and the well is reduced. This reduction can be explained in terms of the interaction between the quantum well states and the surface states, leading to leakage of confined carriers to the surface and subsequent nonradiative recombination. Therefore, a suitable passivation technology applicable to compound semiconductor quantum structures must be established for the successful planar integration of quantum devices.

As a powerful approach for successful passivation of compound semiconductor surfaces, we proposed and analyzed a new passivation structure including a silicon interface control layer (Si ICL). In this structure, an ultrathin Si layer grown by molecular beam epitaxy (MBE) is inserted between the compound semiconductor and the outer thick SiO2 passivation dielectric. The basic idea is to achieve coherent termination of surface bonds of the compound semiconductor by the Si ICL and subsequently to passivate the Si ICL with a Si-based dielectric. This technique has been successfully applied to the fabrication of InGaAs metal-insulator-semiconductor field effect transistors (MISFETs). More recently, this passivation technique has been applied to GaAs/AlGaAs near-surface quantum wells and has realized more than 10^ times recovery of PL intensity at the surface-to-well distance of 5 nm.

As an alternative approach for passivation, low-energy ion-beam hydrogenation has also been reported to be powerful for recovery of the PL intensity of near-surface quantum wells. However, this technique seems to be applicable only to bare AlGaAs surfaces covered with native oxide, and does not seem to be compatible with standard dielectric layer passivation.

Encouraged by the success of the Si ICL-based passivation of two-dimensional quantum wells, we investigate in the present article the surface state effects in In0.53Ga0.47As one-dimensional quantum wires and demonstrate the effectiveness of the Si ICL passivation technique on such structures. The sample structure is shown schematically in Fig. 1(a). The surface state effects and the effectiveness of passivation were investigated by measuring the PL intensity from the wires as a function of the wire-to-surface distance, t_ws. Scanning electron microscope (SEM) and x-ray photoelectron spectroscopy (XPS) measurements were also made to characterize the structure and the interface properties of the samples.

The cross-sectional views of the specific unpassivated and passivated wire structures used in this study are shown in Figs. 1(b) and 1(c), respectively. They are In0.53Ga0.47As quantum wires embedded in In0.52Al0.48As barrier material. As compared with the standard AlGaAs/GaAs system, this InP-based In0.53Ga0.47As/In0.52Al0.48As material system possesses a larger conduction band discontinuity and superior electron transport properties, both of which are favorable for constructing high performance quantum devices operating at high temperatures. The wires were fabricated by selective MBE growth on patterned InP substrates. Details of wire growth as well as PL and cathodoluminescence (CL) properties of deeply embedded wires with large wire-to-surface distances have been published elsewhere.

Fig. 1. (a) Schematic illustration of the passivated quantum wire. The surface state effects and the effectiveness of passivation were investigated by measuring the PL intensity from the wires as a function of the wire-to-surface distance t_ws. (b) and (c) are the cross-sectional views of the specific unpassivated and passivated In0.53Ga0.47As wire structures used in this study, respectively.

In this article, it is shown that unpassivated near-surface quantum wires exhibit an exponential decrease of PL intensity with reduction of surface-to-well distance and that a nearly complete recovery of PL intensity can be achieved by the Si ICL-based passivation technique, with an observed maximum recovery factor of 250 for the quantum wire directly passivated with SiO2/Si ICL (t_ws = 0). The mechanism for the PL recovery is explained in terms of suppression of surface states by passivation.

II. EXPERIMENT

All the samples were fabricated using an ultrahigh vacuum (UHV)-based integrated fabrication/characterization system in which a standard solid-source MBE chamber, a photoenhanced chemical vapor deposition (photo-CVD) chamber, an XPS chamber, and other chambers are connected by a UHV transfer chamber.

The sequence for sample preparation is shown in Figs. 2(a) and 2(b). The preparation starts with the formation of mesa patterns on InP substrates with a (001) terrace width, W, of 1 μm, as shown in Fig. 2(a), using photolithography and wet chemical etching. The mesa stripes were oriented along the (110) direction and the pitch, P, was 4 μm. Then, wires were formed by following the UHV-based sequence shown in Fig. 2(b). As the first step, a thick InGaAs buffer layer was grown on the patterned InP substrates. During the growth of this layer, sharp ridge structures consisting of (311)A sidewall facets were formed on the mesa. Subsequent growth of an InAlAs lower barrier layer resulted in the formation of a narrow (411)A facet region on the top of the ridge. Then, InGaAs was grown preferentially on the (411)A facet region of the InAlAs ridge, leading to the formation of an InGaAs quantum wire surrounded by (311)A and (411)A high-index facets.


Fig. 2. (a) Mesa-patterned InP substrates used in this study. The (001) terrace width, W, and the pitch, P, were 1 and 4 μm, respectively. (b) UHV-based selective MBE growth and passivation sequence for fabrication of passivated and unpassivated InGaAs quantum wires.


Details of the conditions and the mechanism of such selective growth have been discussed elsewhere. In the present study, the growth conditions were adjusted to obtain wires whose base width was 100 nm. Then, the top InAlAs barrier layer with a thickness of t_ws was grown. Here, t_ws is measured perpendicular to the (311)A facet and is defined as the wire-to-surface distance in this study. Samples with t_ws = 0-300 nm were grown, where t_ws = 0 corresponds to direct deposition of the Si ICL passivation structure on the In0.53Ga0.47As wire as the top barrier. All the MBE layers were undoped. For the preparation of the unpassivated wires, the samples were taken out of the UHV system and exposed to air.

For the preparation of passivated quantum wires, a Si ICL and an outer thick SiO2 layer were grown successively on top of the wire by in situ MBE and photo-CVD processes, respectively, without breaking vacuum. For the MBE growth of the Si ICL, a Si Knudsen cell (K cell) was used as the Si source and the substrate temperature was 250 °C. The Si ICL should be thin enough to maintain pseudomorphic lattice matching, but thick enough to prevent penetration of oxygen radicals during the photo-CVD process. Based on previous studies by our group, the thickness of the Si ICL was chosen to be 1 nm. For the photo-CVD SiO2 formation, a gas mixture of SiH4, N2O, and Ar was used with ArF excimer laser (193 nm) irradiation. The deposition temperature was 250 °C.

The cross-sectional structures of the fabricated InGaAs quantum wires were characterized by scanning electron microscope (SEM). To investigate the interface structure, XPS measurements were made on separately prepared planar samples; a Perkin-Elmer spectrophotometer was used with the Al Kα line (1486.6 eV) as the excitation source. To characterize the optical properties of the unpassivated and passivated InGaAs ridge quantum wires, low-temperature PL measurements were performed using an Ar+ laser (514.5 nm) as the excitation source.
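The laser wavelengths quoted in this section can be converted to photon energies with E = hc/λ, which makes them easier to compare with the emission energies discussed later. The sketch below is only an illustration; the function name `photon_energy_ev` is not from the article, and the constants are the exact SI values.

```python
# Physical constants (exact SI values).
PLANCK_J_S = 6.62607015e-34
LIGHT_SPEED_M_S = 299_792_458.0
ELECTRON_VOLT_J = 1.602176634e-19


def photon_energy_ev(wavelength_nm):
    """Photon energy in eV for a vacuum wavelength given in nm (E = h*c/lambda)."""
    if wavelength_nm <= 0:
        raise ValueError("wavelength must be positive")
    wavelength_m = wavelength_nm * 1e-9
    return PLANCK_J_S * LIGHT_SPEED_M_S / wavelength_m / ELECTRON_VOLT_J


for name, nm in [("ArF excimer laser", 193.0), ("Ar+ laser", 514.5)]:
    print(f"{name}: {nm} nm -> {photon_energy_ev(nm):.2f} eV")
```

This gives about 6.42 eV for the 193 nm ArF line used in the photo-CVD step and about 2.41 eV for the 514.5 nm Ar+ line used for PL excitation.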

III.  RESULTS  AND  DISCUSSION 

A.  SEM  observation  of  the  InGaAs  quantum  wire 

Figure 3(a) shows schematically the cross-sectional structure of the fabricated wire. SEM photographs taken at the main wire portion and at the bottom groove portion are shown in Figs. 3(b) and 3(c), respectively. As seen clearly in Fig. 3(b), a well-defined InGaAs quantum wire with a lateral width of 100 nm was formed at the top of the InAlAs ridge structure. It is also seen that very thin quantum wells are formed on the (311)A sidewall facets. Additionally, wells having larger thickness were formed at the bottom of the grooves, as seen in Fig. 3(c). In the groove region, considerable waving of the growth boundary and fluctuation of the well width took place due to irregular growth involving large amounts of mass transfer.

B.  XPS  analysis  of  surface  and  interface 

To clarify the effect of the insertion of the Si ICL on the chemical structure of the interface, air-exposed surfaces and passivated interfaces of InAlAs and InGaAs were analyzed by XPS using separately grown planar samples. In the passivated samples, the thickness of the outer SiO2 layer was kept at 1.5 nm in order to allow photoelectrons to escape from the interface region.

The results of the XPS analysis of InAlAs samples are summarized in Fig. 4, where the In 3d, Al 2p, and As 2p core level spectra from air-exposed and Si ICL-passivated samples are compared. It is clearly seen that large amounts of In, Al, and As oxide components exist on the air-exposed surface and that they are drastically reduced by the SiO2/Si ICL passivation process. Although certain amounts of Al and As oxide components as well as the elemental As component still exist at the Si ICL-passivated interface, the interface structure becomes much more coherent and stoichiometric.

As  for  the  InGaAs,  the  XPS  results  have  already  been 
reported  by  our  group  elsewhere.  It  has  been  found  that  the 
surface  oxides  were  drastically  reduced  again  by  the  Si  ICL 
passivation  process  as  compared  with  the  air-exposed 
InGaAs  surface. 

C.  PL  properties  of  unpassivated  wires 

For a systematic investigation of the surface state effects on the optical properties of the InGaAs ridge quantum wires, unpassivated InGaAs ridge quantum wires having various values of the wire-to-surface distance, t_ws, were fabricated and their PL properties were studied.


Fig. 3. Results of the SEM observation of the InGaAs quantum wire. (a) The schematic cross-sectional structure of the fabricated wire. (b) and (c) The SEM photographs taken at the main wire portion and at the bottom groove portion, respectively.


Three examples of the PL spectra of the unpassivated wires taken at 20 K are shown in Fig. 5. The wires had t_ws values of 300, 30, and 5 nm. Gaussian deconvolutions of the PL spectra are shown as dashed curves.


J. Vac. Sci. Technol. B, Vol. 14, No. 4, Jul/Aug 1996

Fujikura et al.: Surface passivation of In0.53Ga0.47As


Fig. 4. The results of XPS analysis of planar InAlAs samples. The In 3d, Al 2p, and As 2p core level spectra from air-exposed and Si ICL-passivated samples are compared. (Panels: air-exposed InAlAs and SiO2/Si/InAlAs; horizontal axis: binding energy in eV.)


In the PL spectrum of the wire having t_ws of 300 nm, an intense peak was observed at 1.03 eV. The peak has two shoulders on the higher and lower energy sides. The positions of the main peak and the two shoulders after Gaussian deconvolution are indicated by arrows in Fig. 5.

As for the identification of the spatial origin of these peaks, we previously performed a detailed low-temperature spatially resolved CL study. According to this study, the intense peak seen at 1.03 eV originates from the wires themselves. The two shoulders seen on the higher and lower energy sides of the main peak come from the parasitic quantum wells formed on the (311)A sidewall facets and those formed at the bottom of the grooves, respectively. Additionally, a peak and a weak tail were observed at around 0.8 eV and above 1.2 eV, respectively. The peak at around 0.8 eV, which corresponds to the energy gap of InGaAs, is due to the emission from the InGaAs buffer layer. The peak observed below 0.8 eV may correspond to an impurity-related transition in the InGaAs buffer layer. On the other hand, the tails seen above 1.2 eV were identified as emissions from the InAlAs barrier, including deep levels, since they were also found in the PL spectrum of InAlAs layers grown on the mesa-patterned substrate without the InGaAs wire and buffer layers.

Fig. 5. Three examples of the PL spectra of the unpassivated wires taken at 20 K. The wires had t_ws values of 300, 30, and 5 nm. Gaussian deconvolutions of the PL spectra are shown as dashed curves.

With the reduction of the wire-to-surface distance t_ws, the essential features of the PL spectra remained unchanged, as can also be seen in Fig. 5, although the PL intensity decreased, as discussed in detail later.

The position of the main wire peak remained remarkably constant as t_ws was reduced, indicating the stability of the selective growth process employed here. It also indicates that peak energy shifts caused by interaction of quantum confined states and the surface states, as reported by Moisson et al. for in situ PL studies of AlGaAs/GaAs near-surface quantum wells, are absent here. Our previous study on ex situ PL of AlGaAs/GaAs near-surface quantum wells did not detect such shifts either. On the other hand, considerable differences were observed in the energy positions of the subpeaks from the two types of parasitic quantum wells. These differences may be attributed to fluctuations of the quantum well thickness caused by fluctuation in the long-range material transfer during growth.

The observed variation of the PL intensity of the main wire peak from the unpassivated quantum wires is summarized in Fig. 6 as a function of the wire-to-surface distance t_ws. Here, the PL intensities are normalized to the value obtained for the wire with t_ws = 300 nm.

The result of Gaussian deconvolution was used to accurately determine the main peak intensity in each sample. As seen in Fig. 6, the PL intensity decreases gradually as t_ws is reduced from 300 down to 10 nm. Then, it starts to drop exponentially as t_ws is reduced below 10 nm. Thus, the PL intensity of the unpassivated wire with t_ws = 5 nm was 60 times smaller than that of the wire having t_ws of 20 nm and 400 times smaller than that of the wire having t_ws = 300 nm.
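The two ratios quoted above fix the intensity of the t_ws = 20 nm wire on the normalized scale of Fig. 6. The following sketch simply carries out that arithmetic; the variable names are illustrative and only the factors 60 and 400 come from the text.

```python
# Ratios quoted in the text for the unpassivated wires.
ratio_300_over_5 = 400.0  # I(t_ws = 300 nm) / I(t_ws = 5 nm)
ratio_20_over_5 = 60.0    # I(t_ws = 20 nm) / I(t_ws = 5 nm)

# Intensities normalized to the t_ws = 300 nm wire, as in Fig. 6.
normalized = {
    300: 1.0,
    20: ratio_20_over_5 / ratio_300_over_5,
    5: 1.0 / ratio_300_over_5,
}

for t_ws in sorted(normalized, reverse=True):
    print(f"t_ws = {t_ws:3d} nm: I/I(300 nm) = {normalized[t_ws]:.4f}")
```

On this scale the 20 nm wire retains 0.15 of the reference intensity, so the loss between 300 and 20 nm (a factor of about 7) is small compared with the factor of 60 lost between 20 and 5 nm.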

As in the case of the previous reports on quantum wells, the mechanism of the strong exponential reduction of PL intensity from the near-surface quantum wire as the surface-to-wire distance t_ws is reduced below 10 nm can be explained in terms of the interaction between confined quantum wire states and surface states. Namely, surface states provide a t_ws-dependent channel for nonradiative surface recombination in competition with the radiative transition between confined electron and hole quantum states.

Fig. 6. The observed variation of the PL intensity of the main wire peak from the unpassivated quantum wires as a function of the wire-to-surface distance, t_ws. Here, the PL intensities are normalized to the value obtained for the wire with t_ws = 300 nm.
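The competition between radiative and nonradiative channels described above can be illustrated with a simple rate model. The sketch below assumes, purely for illustration, that the nonradiative surface rate decays exponentially with t_ws; the functional form, the function name `pl_efficiency`, and the parameter values are hypothetical and are not taken from the article.

```python
import math


def pl_efficiency(t_ws_nm, rate_ratio_at_surface, decay_length_nm):
    """Radiative efficiency k_r / (k_r + k_nr) with k_nr = k_nr0 * exp(-t_ws / L).

    rate_ratio_at_surface is k_nr0 / k_r, the nonradiative-to-radiative rate
    ratio at t_ws = 0; decay_length_nm is the assumed decay length L.
    """
    nonradiative_ratio = rate_ratio_at_surface * math.exp(-t_ws_nm / decay_length_nm)
    return 1.0 / (1.0 + nonradiative_ratio)


# Hypothetical parameters, chosen only to illustrate the trend.
RATE_RATIO = 1.0e4
DECAY_LENGTH_NM = 1.0

for t_ws in (2.0, 5.0, 10.0, 30.0):
    eta = pl_efficiency(t_ws, RATE_RATIO, DECAY_LENGTH_NM)
    print(f"t_ws = {t_ws:4.1f} nm: efficiency = {eta:.3e}")
```

In such a model the efficiency rises exponentially with t_ws while the nonradiative channel dominates and then saturates near unity, which is qualitatively the behavior reported for the near-surface regime.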


In addition, the absence of appreciable shifts of the PL peak energy from the wire with the change of surface-to-wire distance suggests that the observed reduction of PL intensity in this study is not due to surface states with spatially extended wave functions but due to highly localized ones, as has been discussed previously.

On the other hand, the more gradual PL intensity reduction that took place from t_ws = 300 nm down to t_ws = 10 nm in Fig. 6 is a new feature that was not observed in the near-surface quantum wells. This is not due to the surface states. We believe that it was caused by the particular sample structure used here. Namely, the thickness of the bottom InAlAs barrier was limited to about 100 nm in order to realize a narrow wire with a width of 100 nm using the present selective MBE growth. Thus, when t_ws is much larger than the penetration depth of the excitation light (about 100 nm), almost all the excitation light is absorbed in the top InAlAs layer, and the generated carriers diffuse and recombine mainly in the InGaAs wire. On the other hand, as the thickness of the top InAlAs barrier layer becomes smaller than the penetration depth, the carriers are generated mainly in the thin bottom InAlAs barrier layer and the InGaAs buffer layers. Only a certain portion of the carriers generated in the bottom InAlAs barrier can be supplied to the wire, because some of them escape into the InGaAs buffer layer. Thus, the decrease of t_ws reduces the net carrier supply into the wire, resulting in the PL intensity reduction. This model is consistent with the fact that the relative intensity of the peak due to the InGaAs buffer becomes larger with the reduction of t_ws, as can be seen in Fig. 5.
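The absorption argument above can be made semi-quantitative with the Beer-Lambert law. The sketch below assumes normal incidence on a planar layer, neglects surface reflection, and treats t_ws as the optical path length in the top barrier; these simplifications and the function name `fraction_absorbed` are assumptions made here, while the 100 nm penetration depth is the value quoted in the text.

```python
import math

PENETRATION_DEPTH_NM = 100.0  # value quoted in the text for the excitation light


def fraction_absorbed(thickness_nm, penetration_depth_nm=PENETRATION_DEPTH_NM):
    """Fraction of incident light absorbed within a layer (Beer-Lambert law)."""
    if thickness_nm < 0:
        raise ValueError("thickness must be non-negative")
    return 1.0 - math.exp(-thickness_nm / penetration_depth_nm)


for t_ws in (300.0, 100.0, 30.0, 10.0):
    print(f"top barrier {t_ws:5.1f} nm: {fraction_absorbed(t_ws):.1%} absorbed")
```

Under these assumptions about 95% of the light is absorbed in a 300 nm top barrier but only about 10% in a 10 nm barrier, so for thin barriers most carriers are generated below the wire, in line with the explanation given in the text.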

As for the variation of the PL intensities of the subpeaks, the reduction of the PL intensity of the (311)A well showed nearly the same trend as that of the quantum wire, as can be seen in Fig. 5. This is exactly what is expected from the explanations given above for the two ranges of t_ws. On the other hand, the bottom well showed a weaker dependence on t_ws, as also seen in Fig. 5. This seems to be due to the larger thickness of the top InAlAs barrier layer caused by material transfer, which can be seen in Fig. 3(c).

Fig. 7. PL spectra of the wires passivated with Si ICL having t_ws of 10 nm and 0 nm are compared with that of an unpassivated wire having t_ws = 300 nm. The PL spectrum of the wire directly passivated with SiO2 without insertion of Si ICL is also shown at the bottom.

D. Recovery of PL intensity in wires with Si ICL-based passivation

Figure 7 compares the PL spectra of the wires passivated with Si ICL having t_ws of 10 and 0 nm with that of an unpassivated wire having t_ws = 300 nm. The PL spectrum of the wire directly passivated with SiO2 without insertion of Si ICL is also shown at the bottom. By comparing Fig. 7 with Fig. 5, it is clearly seen that the wires passivated with the Si ICL technique give rise to much increased PL emission from the wire, although the shape of the PL spectrum remained the same. Additionally, the emission intensities from the parasitic quantum wells on the (311)A facets and at the bottom grooves are also increased. On the other hand, the wire with t_ws = 0 passivated by SiO2 without Si ICL showed a much weaker PL intensity. It is also noted in Fig. 7 that the intensities of the emission peak from the InGaAs buffer layers are very much increased in the two types of passivated wires with t_ws = 0. This is evidently caused by increased carrier flow into the InGaAs buffer layer due to the absence of the top InAlAs barrier, as discussed above.

Figure 8 shows the PL intensity of the quantum wires
passivated by the Si ICL-based process by open circles as a
function of the wire-to-surface distance t_ws. For comparison,
the PL intensity data on the unpassivated wires and that on
the wire passivated by SiO2 without Si ICL are also included
in Fig. 8 by filled circles and an open square, respectively. As
seen in Fig. 8, the PL intensity of the unpassivated wire is
reduced by a factor of 400 by decreasing the thickness of
the top InAlAs barrier layer from 300 to 5 nm, as shown in
Fig. 6. In contrast to this, the wire passivated with Si ICL
showed PL emission only 10 times smaller than that of the
wire with t_ws=300 nm even in the severest case of t_ws=0.
On the other hand, the emission intensity of the wire directly
passivated by SiO2 without Si ICL is 2500 times smaller than
that of the wire with t_ws=300 nm. Thus, the Si ICL-based
passivation technique is very powerful in enhancing PL
emission from near-surface quantum wires.

Fig. 8. PL intensity of the quantum wires passivated by the Si ICL-based
process by open circles as a function of the wire-to-surface distance t_ws. For
comparison, the PL intensity data on the unpassivated wires and that on the
wire passivated by SiO2 without Si ICL are also included by closed circles
and an open square, respectively.
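
The intensity ratios quoted for Fig. 8 can be cross-checked with a few lines of arithmetic. The sketch below is only an illustration: the variable names are hypothetical, and all intensities are normalized to the unpassivated wire with a 300 nm wire-to-surface distance.

```python
# Relative PL intensities, normalized to the unpassivated wire with a
# 300 nm wire-to-surface distance. The reduction factors are the ones
# quoted in the text; the variable names are illustrative only.
reference = 1.0
unpassivated_5nm = reference / 400     # unpassivated wire, barrier thinned to 5 nm
si_icl_passivated = reference / 10     # Si ICL-passivated wire, severest case
sio2_passivated = reference / 2500     # wire passivated by SiO2 without Si ICL

# Gain of the Si ICL process relative to direct SiO2 passivation.
recovery_factor = si_icl_passivated / sio2_passivated
print(f"recovery factor vs. direct SiO2 passivation: {recovery_factor:.0f}")
```

Under this reading, the ratio of the two passivated intensities reproduces the recovery factor of 250 stated in the conclusions.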

It is also noted in Fig. 8 that the Si ICL-passivated wires
no longer show as strong an exponential decrease of PL
intensity as the unpassivated wire. In fact, the data points of
the wires passivated with Si ICL lie on a line that is obtained
by naturally extrapolating the data points of the gradually
decreasing portion of the unpassivated wires having t_ws
above 10 nm. This indicates that the nonradiative
recombination process through the surface states becomes almost
negligible in the wires passivated with Si ICL having t_ws values
smaller than 10 nm. Thus, it can be said that the Si ICL-based
passivation process can almost entirely remove
surface-state-induced PL deterioration from the InGaAs ridge
quantum wires formed near the surface.

The observed PL recovery can be attributed to reduction
of surface states achieved by the Si ICL-based passivation
process. Referring to the XPS results in Fig. 4 and following
the previous discussion made on a similar improvement
obtained in the case of Si ICL-based passivation of near-surface
quantum wells, reduction of the surface states can be
regarded as a consequence of the formation of a more ordered,
coherent, and stoichiometric interface that minimizes bond
disorder at the interface.

Fig. 9. PL intensity of the wire with t_ws=0 nm passivated with Si ICL as a
function of measurement temperature. Data on the PL intensities of the
unpassivated wire with t_ws=300 nm and that of a planar InGaAs quantum
well simultaneously grown with the unpassivated wire having t_ws of 300 nm
are also included by the open circles and open squares, respectively. Each of
the PL intensities is normalized by its value at 20 K.


Finally, in view of the application of such passivated
wires to devices operating at higher temperatures, the
temperature dependence of the PL intensity of the wire
passivated with Si ICL was investigated. Figure 9 compares the
PL intensity of a wire with t_ws=0 nm passivated with Si ICL
with that of an unpassivated wire with t_ws=300 nm as a
function of measurement temperature. For the purpose of
comparison, the data on the PL intensity of a planar InGaAs
quantum well simultaneously grown with the unpassivated
wire having t_ws=300 nm are also included. In Fig. 9, the PL
intensities are normalized by their values at 20 K. As seen in
Fig. 9, the PL intensities of these three samples showed the
same trend. Namely, the PL intensities were almost constant
up to 100 K. Then, they started to decrease rapidly above
100 K, falling by over two orders of magnitude up to 290 K,
although the peaks still maintained reasonably strong
intensity even at 290 K. Since the passivated and unpassivated
samples behave alike, this reduction in PL intensity with
temperature is not due to surface passivation but is related
either to the InGaAs crystal or to the InGaAs-InAlAs
interface. Similar decreases of the PL intensities have been
reported in GaAs-based wires by several other groups.
However, quantitatively speaking, the present InGaAs
quantum wires show a much smaller decrease of the PL intensity
with temperature than these GaAs-based wires.

IV.  CONCLUSIONS 

In view of the urgent necessity to establish a suitable
surface passivation technique applicable to near-surface
compound semiconductor quantum structures, the Si ICL-based
passivation technique was applied to passivation of
In0.53Ga0.47As ridge quantum wires having different
wire-to-surface distances that were formed by selective MBE. The
main results are listed below.

(1) The PL intensity of the unpassivated wires decreases
exponentially with reduction of the wire-to-surface distance
below about 10 nm. The behavior can be explained in terms
of tunneling capture and subsequent nonradiative
recombination of carriers from the quantum wire by surface states
present on air-exposed surfaces.

(2) A nearly complete recovery of PL intensity can be
achieved by use of the Si ICL-based passivation process. A
maximum recovery factor of the PL intensity of 250 was
obtained in the severest case of t_ws=0. The wire with t_ws=0
passivated by the Si ICL technique showed reasonably strong
PL emission even at room temperature.

(3) The recovery of PL intensity is due to reduction of
interface state densities by the Si ICL process, which realizes
reasonably ordered and coherent interface structures, as
indicated by the XPS analysis.

The SiN_x/GaAs (110) interface structure of metal-insulator-semiconductor (MIS) capacitors
modified with ultrathin Si or Si/Ge (5 to 15 Å) heterolayers and sulfur passivation was studied by
capacitance-voltage (CV) measurements, high resolution medium energy ion scattering, and
extended x-ray absorption fine structure with synchrotron radiation. The results showed that Si/S
multilayer passivation can improve the interface properties of the MIS capacitors made on n-type
GaAs (110). The element depth profiling on a subnanometer scale and the atomic structure
measurements showed that a Si or Si/Ge heterolayer on sulfur passivated GaAs (110) can
substantially limit Ga(As) precipitation at the interface but cannot completely suppress the
incorporation of Ga(As) into the Si heterolayer. These results can be used to interpret the observed
CV characteristics of the capacitors and to understand the limitation of the passivation approach.

I. INTRODUCTION

Understanding the mechanism for reducing the density of
states at the insulator/GaAs interface is crucial for the
potential of metal-insulator-semiconductor (MIS) device
technology on GaAs. Such a problematic interface has been studied
for years. The predominant issues have been attributed to
interfacial disordering, including that from local stress and
defects, oxygen induced deep levels, and arsenic segregation.
All of these defects can lead to GaAs surface Fermi level
pinning and make modulation of the GaAs surface potential
very difficult. To eliminate the interface defects, an interface
with atomic abruptness is naturally a model to pursue. The
approach of using a thin interfacial control layer (ICL) of
pseudomorphic silicon by Hasegawa and co-workers has
been considered a success. Limited improvement of the
capacitance-voltage (CV) characteristics with the Si ICL
approach for the MIS capacitors made on GaAs (100) has
encouraged many researchers to improve the quality of the
silicon ICL with the expectation that a more ordered
interface will further lower the interface state density and hence
improve device performance. This includes the use of a
Si/Ge multilayer to replace a silicon single layer with the
hope of a better lattice match (atomic abruptness), since the
lattice of germanium is closer to that of GaAs than that of
silicon. In addition, reasonably good pseudomorphic growth
of silicon on germanium has proved feasible. On the
other hand, to bring down the cost of a complete in situ
fabrication, sulfur passivation has been identified as an
effective approach to inhibit GaAs surface oxidation and form an
ordered surface for ex situ device fabrication. Nevertheless,
the interface atomic structure of the MIS capacitors
fabricated with these approaches remains unclear and the present
results are still far from those required for practical applications
of MIS capacitors.

In this article, MIS capacitors were fabricated on a GaAs
(110) substrate with its interface modified by Si or Si/Ge and
sulfur passivation. To understand the CV characteristics of
the MIS capacitors, high resolution medium energy ion
scattering (MEIS) and extended x-ray absorption fine structure
(EXAFS) with synchrotron radiation were used to measure
the interface structure. The results are significant for
understanding the MIS capacitor performance obtained with the present
interface passivation approach.

II.  EXPERIMENT 

Both n-type (Si doped) and p-type (Zn doped) GaAs
(110) wafers supplied by American Xtal Technology were
used for the fabrication of the MIS capacitors. The substrate
etch pit densities were less than 10^/cm^ and both the n- and
p-dopant concentrations were 1×10^^ at./cm^. The as-received
wafers were treated using two cycles of UV/ozone oxidation
for 10 minutes followed by etching in a 1:60 hydrofluoric
(HF) acid solution for 2 minutes. The samples were then
rinsed with deionized water for 30 seconds and blown dry
with pure nitrogen gas. The oxide-free samples were subjected
to gas phase polysulfide passivation for 20 minutes and the
as-passivated samples were immediately transferred into a
VG V80 molecular beam epitaxy system for e-beam
deposition of Si or Si/Ge ICLs. The layer thickness varied from 3 Å
to 20 Å and the substrate temperature during deposition was
250 °C. For MIS capacitor fabrication, 350 Å of silicon
nitride was deposited on the samples immediately after ICL
deposition in a remote plasma chemical vapor deposition system,
and then 400 Å of aluminum top contacts were evaporated
onto the nitride surface through a shadow mask. The
diameter of the contacts varied from 0.2 to 1.0 mm. The
capacitors were annealed at 450 °C for 15 minutes in a forming gas
mixture of 4.2% hydrogen in nitrogen. Back contacts to the
capacitors were made by In/Ga eutectic. CV data for the
fabricated MIS capacitors were collected with a Hewlett-Packard
4061A semiconductor component test system. For
the quasi-static (QS) measurements the potential of the gate
was scanned from inversion to accumulation, while for the
high frequency measurements it was scanned from negative
to positive bias and then immediately scanned back to
negative bias. A detailed discussion of the CV analysis can be
found in our previous publications.

To understand the CV characteristics of the MIS
capacitors, MEIS and EXAFS were employed to study the interface
structure. MEIS was carried out with the 1.7 MV Tandetron
accelerator at the University of Western Ontario by using a
collimated ion beam at 280 keV and a high resolution
hemispherical electrostatic energy analyzer. The spectra were
collected in a double crystal alignment configuration in
which the scattering angle between the incoming channelling
string and outgoing ion direction was 90°. The energy
resolution for all the MEIS data collected was about 0.24 keV,
which is equivalent to about 4 Å for a GaAs single crystal.
EXAFS spectra were measured at the Canadian Synchrotron
Radiation Facility (Aladdin) on the 1 GeV electron storage
ring, University of Wisconsin-Madison. The x-ray beam was
monochromatized and the photon resolution at 1800 eV
(silicon K-edge at 1839 eV) was about 0.8 eV. The total electron
yield (TEY) mode was used for the EXAFS measurements.
In order to eliminate unrelated structures in the
measurements, all the TEY spectra were normalized to the
simultaneously recorded TEY signal from a gold mesh reference
monitor. A detailed description of the above experimental
systems and the relevant experimentation can be found in the
literature or our previous publications.

III.  RESULTS  AND  DISCUSSION 
A.  CV  characteristics 

Figure 1 shows the CV characteristics for the MIS
capacitors fabricated on n-type GaAs (110) with the interface
modified by (a) HF etch, (b) HF etch and 15 Å Si ICL, and (c)
sulfur passivation and 15 Å Si ICL. Some measurable
parameters from the curves are listed in Table I. The
capacitor treated with HF etch had only a very poor high
frequency CV characteristic [Fig. 1(a)]. The large frequency
dispersion indicated strong Fermi level pinning away from
the conduction band minimum (CBM), and the shift of the
high frequency characteristics to the right with increasing
frequency shows a high density of shallow states at the
insulator/semiconductor interface. In addition, the difference
between the maximum and minimum values of the high
frequency curves was very small and the amount of hysteresis
was large (about 1.1 V), which came from a large amount of
negative charge trapping in slow states. The very large offset
between the QS and the high frequency CV curve in
accumulation also suggested that there was a high density of
interface states near the CBM edge. All of these
characteristics indicated that such an interface could not be used for any
practical applications. Applying a Si ICL to the interface of
the capacitor [Fig. 1(b)] could, however, improve the
CV characteristics only to a limited extent. The difference
between the maximum and minimum values of the high
frequency curves increased by a factor of 2, the amount of
hysteresis (0.7 V) decreased by a factor of 1.6, and the offset
between the QS and high frequency curves in accumulation
decreased by a factor of 1.5. But the overall CV
characteristics were still poor. The observed large frequency dispersion
showed that the interface Fermi level was still pinned away
from the CBM. The QS curve at negative bias was
dominated by the slow movement of charges and the hysteresis
was still very large (about 1 V). These characteristics are
typical for the MIS capacitors made on GaAs (100), and the
results observed had no substantial difference as compared to
those in the literature. Additional improvements were
observed for the capacitor whose interface was modified by
sulfur passivation and a 15 Å Si ICL. Improvement of the
capacitor interfacial properties is evident from the measured
CV characteristics [Fig. 1(c)], such as an even deeper QS
integral dip and a smaller offset between the QS curve and
the high frequency data in accumulation compared with that
shown in Fig. 1(b). The frequency dispersion was also
substantially reduced and the measured hysteresis was decreased
to about 0.5 V. Nonetheless, the frequency dispersion was
still present and deep depletion was seen in the CV
characteristics. By applying a Si/Ge/S multilayer interface
modification approach, the CV characteristics can be further
improved; however, there was no substantial improvement
in the interface state densities (~10^12 eV^-1 cm^-2)
calculated by comparing the QS and high frequency (1 MHz)
capacitance (high-low method).

Fig. 1. The CV characteristics for the MIS capacitors made on n-type GaAs
(110) with the interface modified by (a) HF etch, (b) HF etch and 15 Å Si
ICL, and (c) sulfur passivation and 15 Å Si ICL.

Table I. CV characteristic parameters measured for the n-type samples
discussed in the text.

Passivation   Frequency    Hysteresis   QS dip   Offset between     Interface state
approach      dispersion   (V)                   quasi-static and   density
              (V)                                1 MHz CV curve     (10^12 eV^-1 cm^-2)
HF etch       -            1.1          0.11     0.27               -
Si            1.8          0.8          0.19     0.29               5.1
Si/S          1.0          0.7          0.21     0.10               3.4
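
The high-low method mentioned above compares the quasi-static and high frequency capacitance at the same gate bias. The following is a minimal sketch of the standard textbook form of that comparison, not the authors' analysis code; the function name and the example capacitance values are hypothetical, and all capacitances are assumed to be per unit area in F/cm^2.

```python
Q_E = 1.602176634e-19  # elementary charge in coulombs

def dit_high_low(c_lf, c_hf, c_ox):
    """Interface state density (cm^-2 eV^-1) from the high-low frequency method.

    c_lf: quasi-static (low frequency) capacitance, F/cm^2
    c_hf: high frequency capacitance at the same gate bias, F/cm^2
    c_ox: insulator capacitance, F/cm^2
    """
    if not (0 < c_hf <= c_lf < c_ox):
        raise ValueError("expected 0 < c_hf <= c_lf < c_ox")
    # C/(C_ox - C) equals (C/C_ox)/(1 - C/C_ox) for each measured curve.
    lf_term = c_lf / (c_ox - c_lf)
    hf_term = c_hf / (c_ox - c_hf)
    return (c_ox / Q_E) * (lf_term - hf_term)

# Illustrative numbers only (not taken from the measurements in this paper).
example_dit = dit_high_low(c_lf=0.6e-7, c_hf=0.5e-7, c_ox=1.0e-7)
print(f"D_it = {example_dit:.2e} cm^-2 eV^-1")
```

A larger offset between the two curves gives a larger interface state density, which is why the offset column in Table I tracks the interface quality.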

B.  Interface  structure 

It was previously proposed that using a Si ICL could
prevent the As at the interface from precipitating by
stabilizing the free As, hence eliminating one of the critical
factors leading to interface Fermi level pinning. It was also
suggested that the dangling bonds in Si or the high
conduction band density of Si (assuming a fixed 0.2 eV relative
conduction band offset between Si and GaAs) would limit
further improvement of the MIS interface characteristics. To
clarify the interface structure of the MIS capacitors with
interface modification by the Si ICL and sulfur passivation,
high resolution MEIS and EXAFS were used to measure the
element depth profile and atomic structure at the interface with
Si or Si/S passivation. Figure
2 shows the MEIS spectrum for an n-type GaAs (110)
cleaned by UV/ozone oxidation and a 1:60 HF etch.
The spectrum was collected in a double alignment
configuration in which (101) was chosen as the incoming ion
channelling direction and the outgoing ions were collected at a
scattering angle of 90° along (121). The surface peak shown
in the spectrum was fitted for 75As, 69Ga, and 71Ga. In the
present peak fitting approach, the 75As, 69Ga, and 71Ga peak
positions were determined by the corresponding ion
scattering energy E = kE_0, where E_0 was the analytical beam
energy and k was the kinematic factor for 75As, 69Ga, and
71Ga, respectively. The ratio of the peak area intensities of the
two Ga isotopes was also fixed at 0.576 according to the natural
isotope abundance. A Gaussian peak was chosen for the
fitting and the data were fitted to the minimum. It was found
that the surface As to Ga ratio was about 1, i.e., the surface
was close to stoichiometry. The number of surface GaAs
atoms, (Nt)_GaAs, can be calculated from the measured area,
A, of the surface peaks in the spectrum by

$$(Nt)_{\mathrm{GaAs}} = \frac{A}{\Omega Q \sigma}\cos\theta,$$

where σ is the cross section, Ω is the solid angle of the
detector, Q is the total number of He+ ions of the analytical
beam, and θ is the angle between the direction of the
beam and the sample normal. From the spectrum, we found
that the calculated areal density for the surface peak was
about 3×10^15/cm^2. Figure 3 shows the MEIS spectra for a
15 Å Si heterolayer on sulfur passivated [Fig. 3(a)] and
UV/ozone/HF cleaned [Fig. 3(b)] GaAs (110). It can be measured
from the spectra that the edge of the As surface peak shifted
about 0.9 keV towards the lower energy end, which indicated
that there was no As or Ga on the surface of the Si ICL. Such
an energy shift was equivalent to a depth of 9 Å. Since the
samples were exposed to air during the transfer to the MEIS
analytical chamber, surface oxidation would occur. If we
assume that the oxide thickness was the same as that for native
silicon oxide (~10 Å), or that the top 5 Å of the silicon was
oxidized, the front surface of the As or Ga was distributed
below 3-4 Å of the Si ICL. The striking feature shown in
Fig. 3 is that for the sample with sulfur passivation, the
surface As intensity was substantially lower. The measured
atomic areal density for the surface As was about
3×10^14/cm^2, or more than 5 times lower than that of the
virgin GaAs (110) surface shown in Fig. 2, while for the
sample without sulfur passivation the measured As areal
density was about 1.2×10^15/cm^2, which is about 4 times higher
than that with sulfur passivation and about 0.8 times
that of virgin GaAs (110). This could be one of the
reasons for the improvement of the CV characteristics of the
corresponding MIS capacitors with sulfur passivation
(Fig. 1); i.e., sulfur passivation removed the surface As by
forming S-Ga bonds, which further suppressed As
precipitation. This matters because As segregation would induce a high
interface state density.

Fig. 2. MEIS spectrum for a HF cleaned n-type GaAs (110).

Fig. 3. MEIS spectra for a 15 Å Si heterolayer on (a) sulfur passivated and
(b) UV/ozone/HF cleaned n-type GaAs (110).
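
The two relations used in the MEIS analysis above, the peak position E = kE_0 and the conversion of a surface peak area into an areal density, can be sketched as follows. This is an illustration under stated assumptions rather than the fitting procedure actually used: the projectile is taken to be 4He, integer mass numbers stand in for the atomic masses, and the function names are hypothetical.

```python
import math

def kinematic_factor(m_ion, m_target, theta_deg):
    """Elastic scattering kinematic factor k = E/E_0 for lab angle theta."""
    theta = math.radians(theta_deg)
    root = math.sqrt(m_target**2 - (m_ion * math.sin(theta))**2)
    return ((root + m_ion * math.cos(theta)) / (m_ion + m_target))**2

def areal_density(area, solid_angle, n_ions, cross_section, theta_deg):
    """(Nt) = A cos(theta) / (Omega Q sigma), as in the expression in the text."""
    cos_theta = math.cos(math.radians(theta_deg))
    return area * cos_theta / (solid_angle * n_ions * cross_section)

E0_KEV = 280.0   # analytical beam energy quoted in the text
M_HE = 4         # assumption: 4He projectile, integer mass number

# Peak positions for the three isotopes at the 90 degree scattering angle.
for name, mass in (("75As", 75), ("69Ga", 69), ("71Ga", 71)):
    k = kinematic_factor(M_HE, mass, 90.0)
    print(f"{name}: k = {k:.4f}, E = {k * E0_KEV:.1f} keV")
```

At a scattering angle of 90° the kinematic factor reduces to (M_target - M_ion)/(M_target + M_ion), so the three peaks lie within a few keV of each other, which is why the Ga isotope ratio is fixed during the fit.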

The result that the As(Ga) peak moved into the Si ICL, as
shown in Fig. 3, also indicated that neither Si nor Si/S
passivation could inhibit the site exchange between Ga and Si
or completely suppress the As precipitation at the interface,
as a considerable amount of As(Ga) had been incorporated into the
Si ICL. This implied that the proposed S-Ga terminated
GaAs surface formed by sulfur passivation would mostly be
destroyed after the Si ICL deposition. The actual ICL was in
fact a layer mixed with Si-Ga(As). The amount of sulfur
diffused into the Si ICL could be measured using particle
induced x-ray emission. The total amount of sulfur at the
interface was measured to be about 4×10^14 at./cm^2, or about
half a monolayer. Therefore, even if we assume that all the
sulfur diffused into the Si ICL, the concentration of sulfur in
the "mixed" layer would be less than 5 at.%. Our results
showed that sulfur diffused towards both the Si ICL and the
substrate. Therefore, the amount of sulfur in the ICL was
much less than that of Ga(As), which was about 20 at.% in
this mixed layer. The atomic structure of this mixed layer
was measured by EXAFS. In Fig. 4 the weighted EXAFS
data are shown for the Si ICL on GaAs (110) with and
without the sulfur passivation. The data measured from a Si (100)
single crystal are also plotted for comparison. Compared to
the Si EXAFS data, two extra peaks at wave vectors of
4.8 and 8.2 Å^-1 could be seen clearly. Figure 5 shows the
Fourier transformation (FT) of the corresponding EXAFS
spectra shown in Fig. 4. Since the local structure of a thin
amorphous or microcrystalline silicon film is close to that of
a silicon single crystal, the data showed clearly that there
were foreign atoms in this Si ICL whose distance to the silicon
host atoms contributed the additional peak between the Si-Si
first neighbor and the second neighbor in both the Si and Si/S
heterolayers on GaAs (110). Since such a distance is longer
than that for Si-Si and there were large amounts of Ga(As)
atoms in the heterolayer, it is reasonable to assume that this
peak was contributed by the Si-Ga(As) neighbors. Such a
distance and the Si-Si neighbor distances in these
heterolayers can be calculated from the FT data based on the
experimental phase function, Φ(k) = sin(2kr + φ_ij), where k is the
photoelectron wave vector and φ_ij is a theoretical atomic
pair phase function for a Si-Si absorber-scatterer pair. The
measured absorption coefficient, χ(k), was weighted with
k^n (n = 1,2,3) for different regions of the data in k space. By
filtering and back-transforming the FT data, the calculations
showed that the first shell Si-Si nearest interatomic distances
in the Si ICL are 2.43 Å with sulfur passivation and 2.37 Å
without sulfur passivation (compared to 2.35 Å for the first
neighbor distance in a Si (100) single crystal). The distances
calculated from the foreign atom contributed peak using a
k^n weight are 3.34 Å for the sample with sulfur
passivation and 3.32 Å for the sample without sulfur
passivation. The slightly longer distances for the Si-Si and Si-Ga(As) in
the heterolayer with sulfur passivation could be due to either
the trace amount of sulfur incorporated during the break-up
of the Ga-S bonds or the defects in these layers. The data
shown in Fig. 5 also indicated that for the heterolayers with
and without sulfur passivation the Si-Si bonds were still
dominant, but the relative amplitude for the Si-Ga(As) in
the heterolayer with sulfur passivation was lower compared
to that without sulfur passivation. This observation supported
the results measured by MEIS that sulfur passivation can
further suppress As precipitation.

Fig. 4. The weighted EXAFS spectra for a 15 Å Si heterolayer on (a)
sulfur passivated and (b) UV/ozone/HF cleaned n-type GaAs (110). The
weighted EXAFS spectrum of Si (100) is shown in (c).

Fig. 5. Fourier transformed data for the corresponding spectra shown in
Fig. 4.
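
The way a k-weighted EXAFS oscillation turns into a peak at the neighbor distance after Fourier transformation can be illustrated with a synthetic single-shell signal. The sketch below is a toy example and not the analysis used for Fig. 5: the signal is an assumed undamped sine with the pair phase function set to zero, the k range and step are arbitrary choices, and the function name is hypothetical. In real data the peak is displaced by the phase function, which is why the text applies a phase correction before quoting distances.

```python
import cmath
import math

def ft_magnitude(k_values, chi_values, r, n=2):
    """Magnitude of the k^n-weighted Fourier transform of chi(k) at distance r."""
    dk = k_values[1] - k_values[0]
    total = sum((k**n) * chi * cmath.exp(2j * k * r)
                for k, chi in zip(k_values, chi_values))
    return abs(total) * dk

R_TRUE = 2.35  # Si-Si first neighbor distance in angstroms, from the text

# Synthetic single-shell signal on an assumed k grid of 3 to 12 inverse angstroms.
k_values = [3.0 + 0.05 * i for i in range(181)]
chi_values = [math.sin(2.0 * k * R_TRUE) for k in k_values]

# Scan r from 0.01 to 6.00 angstroms and locate the strongest FT peak.
r_grid = [0.01 * i for i in range(1, 601)]
r_peak = max(r_grid, key=lambda r: ft_magnitude(k_values, chi_values, r))
print(f"FT peak at r = {r_peak:.2f} angstroms")
```

A second shell at a different distance would add a second sine term and a second peak, which is the situation described for the Si-Ga(As) contribution between the first and second Si-Si neighbors.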

If the improvement of the CV characteristics of the MIS
capacitors whose interface was modified by the Si ICL and
the sulfur passivation is interpreted as being caused only
by less As precipitation at the insulator/GaAs interface, and
the lack of long-range atomic abruptness or the Si-Ga mixing
in the Si ICL is to be blamed for the limitation of further
interface state density reduction, one would expect
Si/Ge/S multilayer passivation to be a better choice, since Ge
is better lattice-matched to GaAs and the pseudomorphic
growth of Si on Ge is readily achievable. This had been
confirmed by the results of Falta et al., where Ge growth on
an epitaxial GaAs layer led to a better interface than
Si growth. The growth on the GaAs also led to surface
segregation of both Ga and As; however, the mixing of the
heterolayer was not an issue. This difference could come
from the dopant modulated growth. We have applied the
Si/Ge/S approach to both n- and p-type GaAs (110) substrates.
The CV characteristics of the MIS capacitors made on the
n-type substrate indeed improved impressively, but the results
on the p-type substrate were just the opposite to those of Si/S
passivation. Our subsequent interface atomic abruptness
measurements by x-ray scattering showed, surprisingly, that
the Si/Ge/S passivation led to a smoother interface for the
MIS capacitors on the p-type substrate than for those on the
n-type substrate, yet the former had worse CV characteristics. The
atomic abruptness, therefore, does not seem to be the decisive
factor that determines the CV characteristics of the MIS
capacitors discussed here. Our preliminary results using
x-ray absorption near-edge structure measurement and x-ray
photoemission spectroscopy showed that the band gap of the
Si-Ga mixed layer was much wider than that of the bare Si
ICL or Si/Ge multilayer. We speculate that band gap
engineering might be a key for further improvement of the
interface properties of the MIS capacitors made on GaAs.
Additional work in this direction is currently under way.

IV.  SUMMARY 

In summary, we found that using Si/S or Si/Ge/S
multilayer interface passivation for MIS capacitors made
on GaAs(110) can improve the CV characteristics of these
capacitors. Both the Si ICL and sulfur passivation can suppress
As precipitation at the interface, and the effect is
substantial when Si and S, or Si/Ge and S, are applied at the
same time. However, the sulfur passivation could not prevent the
Ga(As)-Si site exchange or the incorporation of Ga(As) into
the Si ICL, and the mixture was dominated by Ga
incorporation.

We present the results of theoretical calculations of the electronic structure of the Sb/GaAs(110)
interface in the submonolayer coverage regime, performed in the fully ab initio self-consistent
pseudopotential scheme. Different structural models for the edges of the extended Sb islands have
been considered, and their equilibrium geometries have been determined by total energy minimization.
The single-particle band structure shows interface states, arising mainly from the incomplete bonding
of the Sb adatoms at the terrace edge to the substrate, which fall within the optical gap. Distinct
features are associated with different island terminations. The interface turns out to be metallic in all
the considered cases, with a partially occupied peak at the Fermi level. We have also studied the
effect of explicitly including the on-site Hubbard electron-electron correlation in the calculation of
the quasiparticle spectrum, obtaining the observed semiconducting interface when the Coulomb
interaction parameter U is larger than 3 eV. The interface states within the optical band gap can
also be present at higher coverages when some disorder exists, pointing to a general mechanism for
Fermi level pinning at this interface. © 1996 American Vacuum Society.


I. INTRODUCTION

The interface between Sb and the GaAs(110) surface can
be considered the prototypical example of a metal-semiconductor
interface, as the deposition of one Sb monolayer
on GaAs is known to produce a well-characterized 1×1
ordered structure. Theoretical calculations have been performed
in the first-principles pseudopotential approach
and in the semi-empirical tight-binding scheme, assuming in
both cases a full monolayer deposition with Sb atoms periodically
arranged along zigzag chains following the same
geometry as the Ga and As atoms in the topmost layer of the
GaAs(110) surface. This structure has been denoted the epitaxial
continued layer structure (ECLS) and has provided a
description of the electronic structure of the full monolayer
deposition consistent with the experimental evidence deduced
from direct and inverse photoemission spectroscopy
and from scanning tunneling microscopy (STM) and spectroscopy
measurements. In the ECLS the Sb-derived states have
been found in the valence band and close to the bottom of
the conduction band; the absence of interface states in the
band gap implies that the ordered Sb monolayer cannot alter
the flat-band condition at the GaAs surface.

Photoemission measurements have shown that the
Fermi level pinning in this system is strongly correlated with
surface disorder: thermal annealing has been shown to alter
the Fermi level position, and a greater band bending has been
found for unannealed (disordered) samples. Moreover, in the
submonolayer deposition regime, STM measurements have
found electronic states within the optical gap which are localized
near the edges of the Sb terraces. These states are
assumed to be responsible for the Fermi level pinning at the
Sb/GaAs(110) interface. The band-gap states were found on
samples whose Sb coverage was about 0.7 ML. They form
two separate bands, one (seen only in some spectra) filled
and the other (observed in most spectra) empty. The surface
Fermi level turns out to be pinned between the two bands.
The states within the optical gap were observed only when
the spectrum was acquired at the edges of the Sb terraces and
probably arise from dangling bonds localized near the edge
of the islands, but their precise structure is unknown.

The shape of these features may change at different locations
near the edge of the terraces, but the presence of a large
peak near the top of the band gap and of a minimum of the
measured density of states at the Fermi level seems to be a
general result. These features have also been observed for
other metal adsorbates in the submonolayer coverage regime,
such as Au/GaAs(110) and Fe/GaAs(110), and even in the case
of a simple two-atom cluster of Sb on GaAs(110). The tails
of these features decay toward the minimum
at the Fermi level, in some cases giving rise to a small gap
and in other cases to almost no gap.

Little is known about the submonolayer case. Recent
theoretical calculations seem to confirm the experimental
observation that in the submonolayer coverage regime antimony
tends to form flat islands, approximately one monolayer
thick, with a local 1×1 symmetry. However, no calculations
have been performed so far for extended Sb islands.
Preliminary tight-binding calculations of the band structure
of the half-monolayer 2×1 missing-chain model (every second
110 Sb chain is missing in an ECLS) did not show any
electronic state within the optical band gap.

In this article we present a theoretical investigation of the
electronic structure of the Sb/GaAs(110) interface at submonolayer
coverages in order to identify the interface states
associated with the incomplete bonding of Sb atoms to the
substrate as possible candidates for pinning states. We show
that electronic states within the optical gap are indeed obtained
when the Sb zigzag chains are truncated. They are
associated mainly with Sb dangling bonds; however, the dangling
bonds of substrate atoms contribute somewhat to the
band-gap bands. The states are strongly localized near the Sb
adatoms at the terrace edges. Our first-principles calculations
predict metallic interfaces, with a partially occupied peak at
the Fermi level, for all the considered geometries.

Fig. 1. Top view of the Sb/GaAs(110) interface for submonolayer coverage:
(A) Θ=0.625 ML deposition with Sb terraces terminated with Sb on
Ga atoms, structure A; (B) half-monolayer deposition, structure B; and (C)
Θ=0.625 ML deposition with Sb terraces terminated with Sb on As atoms,
structure C. a, b, c, and d indicate the inequivalent Sb adatoms for each
structure.


II. COMPUTATIONAL DETAILS

As shown in Fig. 1, we have considered different cases
corresponding to different coverages: a deposition of 0.625
ML [cases (A) and (C)], realized by truncating the Sb chains
and eliminating 3 out of 8 Sb adatoms, and a deposition of half
a monolayer [case (B)], where 4 out of 8 adatoms have been
eliminated. Since Sb atoms bonded to Ga and As substrate
atoms are not equivalent, the configuration corresponding to
0.625 ML can be obtained in two different ways, with terraces
terminated by Sb atoms bonded to Ga substrate atoms
[case (A)] or by Sb atoms bonded to As substrate
atoms [case (C)]. These geometries correspond to 4×1 surface
unit cells.

In the ECLS there are two inequivalent Sb adatoms in the
surface unit cell: Sb on Ga atoms of the substrate, Sb(Ga),
and Sb on As atoms of the substrate, Sb(As). In the structures
A, B, and C of Fig. 1 there are, respectively, 3, 4, and 3
inequivalent Sb adatoms, since we have to add to Sb(Ga) and
Sb(As) also the Sb adatoms at the terrace edges, which we
indicate in the following as Sb_e(Ga) and Sb_e(As). In Fig. 1
these inequivalent Sb adatoms are indicated with the letters
a, b, c, and d.

We calculated the equilibrium geometry for each structure
by performing full ab initio total energy calculations in the
pseudopotential approach, using density functional
theory in the local density approximation (LDA). The
Ceperley-Alder form of the exchange and correlation energy has
been used. The Sb-covered GaAs(110) surface has been
modelled by a repeated slab. Each slab contained five atomic
(110) layers of GaAs plus two Sb overlayers on both sides of
the slab. The vacuum region had a thickness of three atomic
layers. The equilibrium geometry has been determined with the
total energy converged to less than 1 meV per atom and
forces on the atoms smaller than 0.01 eV/Å. The search for the
minimum-energy configuration has been performed using a
plane-wave cutoff of 8 Ry, while the electronic structure has
been calculated using a higher cutoff, 11 Ry.

This computational recipe has provided well-converged
geometries for the fully relaxed clean GaAs(110) surface and
the ECLS, in good agreement with the experimental
results and with previous calculations, as well as for
their band structures.

In the case of the three structural models of Fig. 1, we
have found that the Sb atoms move down toward the substrate. The
shortening of the distance of the adatoms from the substrate
is very pronounced for the adatoms at the terrace edge, which
move toward the free surface area, trying to saturate
their partially occupied dangling bonds by forming new
bonds with other substrate atoms, restoring in this way the
threefold coordination they had originally in the zigzag chain
before truncation. However, probably because the model islands
are only a few atoms wide in the 110 direction, these
new bonds with the substrate are not completely formed and
the saturation of the dangling bonds is only partial. The
buckling between the inequivalent Sb atoms turns out to be
strongly enhanced with respect to the ECLS case.


III.  RESULTS  AND  DISCUSSION 

A.  Electronic  states  at  the  overlayer  terrace  edges 

In Fig. 2 the total density of states (DOS) of structures A,
B, and C is compared with the total DOS of the ECLS. The
DOS have been calculated for the optimized minimum-energy
atomic configurations using the plane-wave cutoff of 11 Ry
and sets of 27, 9, and 10 equivalent k points for the ECLS,
the A (C) structures, and the B structure, respectively. To compare
the positions of the peaks we have aligned the onsets of
the fundamental GaAs-related structures. As a consequence,
the entire DOS of the ECLS has been rigidly shifted 1.2 eV
toward lower energy, while the DOS of the B structure has
been shifted 0.4 eV toward higher energy, with respect to
the DOS of the A and C structures. The DOS are depicted in
an energy region including the band-gap region of the ECLS.

Fig. 2. Total DOS of the (a) ECLS, (b) A, (c) B, and (d) C structures in an
energy region around the optical band gap. The calculated Fermi levels are
indicated by dashed lines. The zero of the energy axis is at the VBM of
the ECLS.
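
The rigid alignment described above amounts to translating each DOS curve along the energy axis and re-sampling it on a common grid. The following is only a minimal sketch of that bookkeeping, assuming NumPy; the function name `rigid_shift` and the single Gaussian peaks standing in for the calculated DOS are hypothetical illustrations, not the data or code used in this work. Only the shift values (1.2 eV down for the ECLS, 0.4 eV up for structure B) are taken from the text.

```python
import numpy as np

def rigid_shift(energies, dos, shift_ev, grid):
    """Shift a DOS curve by shift_ev along the energy axis and re-sample it on grid."""
    # Points of the common grid that fall outside the shifted curve are set to zero.
    return np.interp(grid, energies + shift_ev, dos, left=0.0, right=0.0)

# Hypothetical toy curves: one Gaussian peak stands in for each calculated DOS.
energies = np.linspace(-6.0, 6.0, 1201)  # eV, 0.01 eV spacing

def gaussian(center, width=0.2):
    return np.exp(-0.5 * ((energies - center) / width) ** 2)

dos_a = gaussian(0.0)      # reference curve (structures A and C are not shifted)
dos_ecls = gaussian(1.2)   # same feature, lying 1.2 eV too high
dos_b = gaussian(-0.4)     # same feature, lying 0.4 eV too low

aligned_ecls = rigid_shift(energies, dos_ecls, -1.2, energies)  # toward lower energy
aligned_b = rigid_shift(energies, dos_b, +0.4, energies)        # toward higher energy
```

After the shift, the reference feature of all three toy curves sits at the same energy, which is all that is needed to compare the positions of the remaining peaks.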

It is possible to recognize in the DOS of the ECLS the
surface S5 and S6 bands, which are attributed to the bonding
of the Sb chains to the substrate. The band related to the
Sb-As bonds contributes mostly to the peak at about 1.1
eV below the VBM, while the features lying between -1.0
eV and the VBM can be attributed to the contribution of the
band related to the Sb-Ga bonds. Since
there are more inequivalent Sb adatoms in the A, B, and C
structures, and they are in different relaxed configurations
with respect to the ECLS, more peaks appear in the DOS and
the contribution of the Sb-substrate bonds is spread much more
widely over the energy region ranging from -1.5 eV to
the band-gap peaks.

We can see that the main effect of truncating the Sb chains is
to introduce prominent features in the region of the
optical band gap. The peaks have different shapes and positions
in structures A, B, and C.

Structure A [Fig. 2(b)] presents a well-defined peak about
0.2 eV above the top of the broad valence-band feature,
where another sharp feature appears, much more distinct
than in the case of the ECLS. In the case of structure B [Fig.
2(c)] the gap separating the band-gap states from the valence-band
states is smaller, about 0.1 eV. The band-gap states in
this case are organized in two sharp peaks separated by a
very small gap. Finally, in the case of structure C there is no
distinct gap between the band-gap states and the valence
band maximum. They form an extension of the broad
valence-band feature, and the peaks are much smaller than in
the previously considered cases.

Fig. 3. LDOS of the inequivalent Sb surface sites in structure A. a, b, and c
refer to the Sb adatoms (see Fig. 1).

For all the considered geometries the electronic states appearing
in the band-gap region are only partially occupied,
so the calculations predict metallic interfaces.

To gain insight into the nature of these states we analyzed
the atom- and angular-momentum-resolved local density
of states (LDOS) and the electronic charge density distribution
associated with them. In Figs. 3-5 the calculated
LDOS of the inequivalent Sb adatoms in structures A, B, and
C, respectively, are depicted. From the analysis of these results
we can deduce that a large contribution to the partially
occupied peaks at the Fermi level in the band-gap region
comes from the Sb adatoms at the terrace edge. For example,
in the case of structure A, the p states of planar x and z
character (see below), which give rise to the planar bonding
between Sb adatoms in the chain, become dangling bonds
after the chain truncation. These dangling bonds contribute
up to 50% of the prominent peak at the Fermi level and
are by far the predominant contribution. The p states related
to the Ga and As dangling bonds of the free substrate also
contribute to the peak, but their contribution is much smaller,
only a few percent. We can also interpret these states in terms
of one-dimensional Tamm states, showing weight predominantly
on the last atom of the chain and decaying into the
body of the chain.

We can understand the outcome of a metallic interface for
all the considered geometries through a simple electron
counting rule. We consider for simplicity an unrelaxed ideal
configuration. Each Sb adatom at the terrace edge has five
valence electrons. In an unrelaxed position it forms only two
bonds, one with the substrate and the other with only one
other Sb adatom. Two dangling bonds (if we still assume a
tetrahedral configuration around the atom, which is approximately
true) have to be occupied by three electrons, which
results in one dangling bond being only half occupied.

Fig. 4. Same as Fig. 3 for structure B.
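
The electron counting argument above can be written out as a few lines of arithmetic. The sketch below is only an illustration under two stated assumptions: the atom contributes one electron to each two-electron bond it forms, and it has four tetrahedral hybrid orbitals. The function name `dangling_bond_occupations` is hypothetical.

```python
def dangling_bond_occupations(valence_electrons, bonds_formed, hybrid_orbitals=4):
    """Distribute the electrons left after bonding over the dangling bonds."""
    # Assumption: one electron of the atom goes into each bond it forms.
    remaining = valence_electrons - bonds_formed
    n_dangling = hybrid_orbitals - bonds_formed
    occupations = []
    for _ in range(n_dangling):
        filled = min(2, remaining)  # a dangling bond holds at most two electrons
        occupations.append(filled)
        remaining -= filled
    return occupations

# Sb adatom at the terrace edge: five valence electrons, two bonds.
edge_sb = dangling_bond_occupations(valence_electrons=5, bonds_formed=2)
# Sb adatom inside the chain: five valence electrons, three bonds.
chain_sb = dangling_bond_occupations(valence_electrons=5, bonds_formed=3)

print(edge_sb)   # [2, 1]
print(chain_sb)  # [2]
```

The edge adatom is left with one doubly occupied and one singly occupied dangling bond, whereas the threefold-coordinated chain adatom has only a filled lone pair; the half-filled orbital is what places a partially occupied peak at the Fermi level.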

The situation is only slightly modified if we include
atomic relaxation in the description. In the case of structures A
and C, the Sb adatoms at the terrace edges are both on Ga
(structure A) or on As (structure C) atoms, and by symmetry
their energies have to be the same. Therefore even in this
case we should have partially occupied dangling bonds.
Moreover, the atoms of the GaAs substrate, in its free portion,
are only partially relaxed. We can suppose that, since these
substrate atoms are not fully relaxed, their dangling bonds
overlap the band region and contribute mostly to the states at
the band edges and somewhat to the band-gap peaks (indeed,
in the A and C structures, the dangling bonds of the As
atoms in the substrate are no longer equal in number to
the dangling bonds of the Ga atoms).

Fig. 6. Contour plots of the valence charge density of the electronic
states within the intra-gap peak of structure A, plotted along the surface
plane. The contour step is 15 in units of electrons per unit cell volume.

The situation is different for configuration B, where
one of the Sb adatoms at the terrace edges is on a Ga substrate
atom and the other on an As atom. The energies of the two
dangling bonds can now be different and, in theory, one
could be fully occupied and the other empty. In practice the
interplay with the dangling bonds of the free substrate and
the only partial saturation of the Sb dangling bonds with the
substrate again gives rise to a metallic interface.

The second important observation about the LDOS of the
Sb adatoms, Figs. 3-5, is that the terrace edges on As atoms
produce two distinct peaks, while those on Ga atoms produce
only one peak at higher energy. This can be concluded by
comparing Fig. 3(c) with Fig. 4(d) and Fig. 5(c) with Fig.
4(c). The identification of the band-gap peaks has been obtained
by inspection of the electronic charge density distribution
associated with the states falling in this energy region.
The assignment of these DOS features to band-gap states
was straightforward because of the very characteristic nonbonding
and spatially localized nature of these states. As an
example we show in Fig. 6 and Fig. 7 the contour plots of the
electronic charge density associated with the states within the
prominent band-gap peak in the case of structure A [see also
Fig. 2(b)], plotted along the surface plane and along a
(110) plane perpendicular to the surface and passing through
the Sb_e(Ga) atoms, respectively. These plots reveal the nature
of these states as nonbonding states, strongly localized
at the terrace edges, with the typical shape of dangling bonds
of p character with maxima of charge pointing out of the
surface. Identical results have been obtained in the case of
the band-gap states of structures B and C. The investigation
of the electronic charge distributions associated with the double
peak of the Sb(As) termination and the single peak of the
Sb(Ga) termination reveals that the nature of these states is
essentially the same. The fact that, in the case of the Sb(As)
termination, the states give rise to a doublet is probably due
to a different dispersion of the four bands derived from the
four terrace edges in the unit cell (two on each side of the
slab). In the limit of an infinite distance between the terrace
edges, these four bands would be degenerate, with a single
peak well within the optical band gap. The single peak would
be closer to the VBM in the case of the Sb(As) termination.
However, because of the finite (and relatively small) dimension
of the slab used to model the system, these states interact
with one another and split into their bonding and antibonding
combinations. In the case of the Sb(As) termination these
four bands are very close to (even slightly overlapping) the
states at the top of the valence band. The interaction with
these other states further influences their dispersion, giving
rise to the double peak.

Fig. 7. Same as Fig. 6 but plotted along the (110) plane perpendicular to the
surface plane passing through the Sb_e(Ga) atoms. The contour step is 15 in
units of electrons per unit cell volume.

Concluding this subsection, we have found electronic
states within the band gap when the Sb surface chains of the
ECLS are truncated. These electronic states are organized in
distinct peaks which can be assigned mainly to dangling
bonds of Sb adatoms bonded to Ga and As atoms of the
substrate, respectively, and are strongly localized at the terrace
edges, in agreement with the experimental result. However,
our first-principles calculations predict metallic interfaces
for all the considered configurations, while the
experiment shows a semiconducting interface with a very
small gap between the filled and the empty bands. In particular,
we have been unable to identify the prominent empty
peak shown in Fig. 4 of Ref. 8. One possible reason could be
that the atomic relaxation of the Sb adatoms has been limited
by the small dimension along the [110] direction of our
model islands. In a more realistic (and larger) island the position
of the Sb adatoms at the edge could give rise to a more
effective saturation of their dangling bonds with the dangling
bonds of the substrate, changing their energy. It is also
possible to think of some geometry in which the interplay
between edges on Ga atoms and edges on As atoms could
create a new phase at the island boundary, leading to electronic
states similar to those seen in the experiment.

Another explanation could be related to the following observations:

(a) The band-gap states are highly localized in space and
nonbonding, and because of the lower atomic coordination
of the adatoms at the terrace edges they are more
similar to atomic p states.

(b) The LDA method, like all theories based on a mean-field
approach, does not properly take into account the
dynamical fluctuations of the orbital occupation and often
provides an inaccurate description of the dynamical
correlation between electrons localized on the same
atom. Because of the high spatial localization of the
terrace-edge states this effect might not be negligible.

(c) The experiment probes excited states with N-1 and
N+1 particles. The experimental result should therefore
be compared more correctly with a quasiparticle spectrum
instead of the spectrum of the eigenstates of the
ground state of the system provided by the LDA theory.

And,  most  importantly: 

(d)  The  experimental  results  seem  to  be  independent  of  the 
actual  geometry  of  the  terrace  edge. 

The hypothesis that the Coulomb repulsion U between
electrons in the partially occupied band-gap states could play
a substantial role has also been suggested in Ref. 11. Theoretical
work on Au/GaAs(110) and on alkali-metal adsorbates
on GaAs(110) has given an explanation of the experimental
results in terms of negative-U systems, in which
doubly occupied levels are energetically favored.

In this article we try, instead, to interpret the experiment
on the Sb/GaAs(110) system in the submonolayer coverage
regime in terms of a Mott-Hubbard system. We have calculated
a quasiparticle spectrum where the electron-electron
correlation beyond the mean-field approach of the LDA has
been treated explicitly through a Hubbard term. We will
illustrate this calculation in the next subsection, applying the
method to the case of structure A.


B.  Effects  of  the  inclusion  of  the  on-site 
electron-electron  correlation 

The Hubbard Hamiltonian has often been used to modify
single-particle eigenvalues, but most of the studies are based
either on a crude description of the single-particle states or
on simplified approaches to the electron-electron correlation.
Starting from the first-principles single-particle results,
we obtained the quasiparticle spectrum for electron and hole
states by adopting a multiband extension of a recent approach
which treats the on-site Hubbard repulsion beyond
mean-field theory. The method makes it possible both to include
the full details of the single-particle band structure and to treat
correlation effects through a nonperturbative solution of the
Hubbard Hamiltonian. In this approach the many-body states
of the interacting system are expanded in terms of uncorrelated
states with a different number of electron-hole pairs.
The expansion is here truncated to include three-body interactions.
The interactions between configurations with the
same number of electron-hole pairs are represented by a set
of two-body scattering T-matrices, which can be calculated exactly.
The Faddeev theory is used to find the total scattering
matrix and, ultimately, the resolvent giving the energy of
the many-body system and the quasiparticle spectrum.

In this theory the quasiparticle states are given in terms of
the spectral functions for hole and electron states as

$$D^{\mp}_{\mathbf{k}n}(\omega) = \frac{1}{\pi}\,\mathrm{Im}\,\frac{1}{\omega - \epsilon_{n\mathbf{k}} - \Sigma^{\mp}_{\mathbf{k}n}(\omega)}, \tag{1}$$

with $\omega \le \epsilon_F$ for (−) and $\omega \ge \epsilon_F$ for (+), and

$$\Sigma^{\mp}_{\mathbf{k}n}(\omega) = \sum_{\beta} \left| c^{n}_{\beta}(\mathbf{k}) \right|^{2} \Big[\, \cdots \,\Big] \tag{2}$$

for the hole (−) and electron (+) self-energy. Here α and β
label different atomic orbitals, $\{c^{n}_{\alpha}(\mathbf{k})\}$ are the orbital
coefficients of the single-particle state (k,n), $U_{\alpha\beta}$ is the Coulomb
repulsion between electrons on orbitals α and β on the same
site, and the occupation factor entering Eq. (2) is the percentage
of empty (filled) states of orbital character α over the total.
Ultimately we calculate the total density of quasiparticle states as

$$D^{\mp}(\omega) = \sum_{\mathbf{k},n} D^{\mp}_{\mathbf{k}n}(\omega). \tag{3}$$

Fig. 8. Decomposition of the Sb_e(Ga) adatom p states into the p_x, p_y, and p_z
components. The dashed line indicates the position of the Fermi level.

A detailed description of the method is given in a previous
paper. The inputs of the calculation of the quasiparticle
spectra are simply an orbital-resolved single-particle density
of states, the single-particle eigenstates, and the values of the
intra-site correlation parameter $U_{\alpha\beta}$. In order to obtain the
orbital composition of the crystal wavefunctions $\psi_{n\mathbf{k}}$ from
the first-principles pseudopotential calculations (which are
expanded in plane waves), we calculated their projections
onto s, p, and d symmetry atomic orbitals. The projection
coefficients are given by

$$C(\tau_i, lm, n\mathbf{k}) = \int_0^{R} \left| \langle Y_{lm} | \psi_{n\mathbf{k}} \rangle \right|^{2} r^{2}\, dr, \tag{4}$$

where the integration extends over the Wigner-Seitz radius R
centered at the lattice site $\tau_i$ and $Y_{lm}$ are the spherical
harmonics. Brackets imply integration over angular coordinates.
The projection coefficients are normalized such that

$$\sum_{\tau_i} \sum_{lm} C(\tau_i, lm, n\mathbf{k}) = 1, \tag{5}$$

and symmetrized over the star of the k point. Through the
projection coefficients we are able to obtain the orbital-resolved
single-particle density of states $n_{\tau_i,lm}(\epsilon)$:

$$n_{\tau_i,lm}(\epsilon) = \sum_{n\mathbf{k}} \delta(\epsilon - \epsilon_{n\mathbf{k}})\, C(\tau_i, lm, n\mathbf{k}). \tag{6}$$
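
The orbital-resolved density of states defined above is a sum of delta functions at the single-particle eigenvalues, each weighted by a projection coefficient. A minimal numerical sketch, assuming NumPy, replaces each delta function by a normalized Gaussian of width `sigma`; this broadening, the random eigenvalues and weights, and the function name `orbital_resolved_dos` are all assumptions made for illustration and are not taken from the calculation reported here. The second array index lumps the site and (l,m) labels into a single orbital index.

```python
import numpy as np

def orbital_resolved_dos(grid, eigenvalues, weights, sigma=0.1):
    """Gaussian-broadened sum of delta functions weighted by projection coefficients.

    eigenvalues: shape (n_states,)
    weights:     shape (n_states, n_orbitals), each row summing to 1
    returns:     shape (n_grid, n_orbitals)
    """
    diff = grid[:, None] - eigenvalues[None, :]
    # Normalized Gaussian standing in for delta(e - e_nk).
    delta = np.exp(-0.5 * (diff / sigma) ** 2) / (sigma * np.sqrt(2.0 * np.pi))
    return delta @ weights

# Hypothetical input: 50 states and 3 orbital channels with random weights.
rng = np.random.default_rng(0)
eigenvalues = rng.uniform(-3.0, 3.0, size=50)
raw = rng.random((50, 3))
weights = raw / raw.sum(axis=1, keepdims=True)  # enforce the normalization of the coefficients

grid = np.linspace(-5.0, 5.0, 2001)
dos = orbital_resolved_dos(grid, eigenvalues, weights)

# Because the weights of each state sum to 1, the orbital-resolved curves
# integrate to the total number of states.
total_states = dos.sum() * (grid[1] - grid[0])
```

The normalization of the coefficients is what guarantees that summing the orbital-resolved curves recovers the total density of states, so no spectral weight is lost or double counted in the decomposition.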

We have found by inspection of the electronic charge distribution
that only the electronic states within the optical band
gap are highly localized in space (apart from the nonbonding s
states of the same Sb adatoms, whose energies are, however,
much lower and which are not the object of our investigation). We
assumed that the on-site electron-electron correlation has not
been adequately described in the framework of the LDA only
for these states. These states have a planar p_x (p_y) and p_z
character. Figure 8 shows the angular-momentum-decomposed
LDOS of a Sb_e(Ga) atom for the p symmetry
states. Only the planar p_x (p_y for the other terrace-edge atom
in the surface unit cell) and p_z states contribute substantially
to the band-gap peak, while their weight in other energy
regions is quite small. The other orbital, p_y (p_x), contributes
mainly to the energy region corresponding to the top of the
broad valence feature. The electronic charge distribution associated
with this energy region (Fig. 9) indeed reveals bonding
states with maxima of charge between the Sb_e adatoms and
the substrate Ga atoms and between the As and Ga atoms of
the substrate. The electronic charge is much less localized
(the contour step in the figure is 6) than in the case of the electronic
states lying within the optical gap shown in Fig. 7
(where the contour step is 15).

Since the key quantity measuring the “strength” of the
electron-electron correlation is the ratio U/W (W being the
orbital bandwidth), we have included electron-electron correlation
only on those states with a narrow bandwidth, that is,
between the p_x (p_y) and p_z states. We assume a model planar
Hamiltonian,

$$H = H^{DF} + \sum_{i,j=1}^{2} U_{ij}\, n_{i\sigma}\, n_{j,-\sigma}, \tag{7}$$

where i,j run over the planar p states of Sb_e(Ga) and $H^{DF}$
indicates the density functional (LDA) Hamiltonian, which has
been solved self-consistently and treats all the correlation
effects in a mean-field fashion. Because the value of
the Hubbard Coulomb integral U is reasonably of the same
order for all the p orbitals, we take the Coulomb repulsion
parameters U_ij = U for all i,j.

Fig. 9. Contour plots of the valence charge density of the electronic states
falling in the peak at the top of the broad valence feature in structure A,
drawn along a (110) plane perpendicular to the surface plane passing
through Sb_e(Ga). The contour step is 6 (in units of electrons per unit cell
volume).

Figure 10 shows our results for D(ω), the quasiparticle
density of states, when the Coulomb correlation parameter
ranges from 0 to 6 eV. Figure 10(a) is the total density of
states of Fig. 2(b), translated in the formalism of the
spectral function [Eq. (1)] for hole and electron states. Increasing
the Hubbard correlation in the Sbe(Ga) p-symmetry states, we
can see that the peak at the Fermi level tends to disappear,
while the features at the valence and conduction band edges
broaden and move toward the Fermi level. The separation
between the valence peak and the conduction peak is about 1
eV, which is in reasonable agreement with the experimental
results.

Fig. 10. Density of quasiparticle states in the energy region around the
optical gap in the cases: (a) without the Hubbard on-site correlation; (b) with
a Hubbard parameter U=4; and (c) with a Hubbard parameter U=6. The
Fermi level position is indicated by a dashed line.

We have analyzed the DOS projected on the atomic sites
in order to assign the band-gap features to those observed in
the experimental spectra. As U increases, a feature in the
calculated DOS broadens at about Ey+1.0 eV.
This peak can be associated with the substrate dangling bonds,
mainly with states on the substrate Ga atom. Moreover, the
feature in the occupied part of the spectrum which moves
closer to the Fermi level is related to terrace edge states
pushed down by the Hubbard interaction, plus some minor
contribution from substrate states. We can explain the result
of the calculation in this way: once the localized states are
removed from the gap through the Hubbard mechanism, the
band-gap states are dominated by features associated with
substrate states. This could explain the experimental observation
that the peak at about 1.0 eV from the Fermi level is
substantially independent of the actual geometry of the
adsorbate, while the tail of states in the valence band seems to be
more affected by it.

Considering  that  the  model  applied  is  very  simple  and  the 
correlation  effects  have  been  taken  into  account  only  in  the 
case  of  two  orbitals,  which  constitute  only  half  of  the  wave 
functions  of  the  band-gap  states  (probably  this  fact  has  led  to 
the  need  to  double  the  value  of  the  parameter  U  necessary  to 
obtain  these  results),  this  result  is  quite  interesting,  as  we  can 
infer  that  the  on-site  electron-electron  correlation  could  play 
a  substantial  role  in  determining  the  energies  of  the  terrace 
edge  states. 

IV.  SUMMARY 

In conclusion, we have studied the Sb/GaAs(110) interface
in the submonolayer coverage regime using first-principles
band structure methods. Different structural models for the
Sb island edges have been examined, and we have shown that
the pinning of the Fermi level found for Sb deposited on the
GaAs(110) surface could be associated with unsaturated Sb and
substrate dangling bonds, which may well be present even
at full monolayer coverage when some residual disorder
still exists. Terrace edge states have been found within the
optical band gap, with distinctive features associated with the
possible different terminations of the Sb islands. Better
agreement with experiment on the Fermi level position
is achieved when the effects of the on-site
electron-electron correlation are explicitly included in
the calculation.
As-rich GaAs(110) is prepared by ion bombardment and annealing, followed by chlorination and
reannealing. The surface is then reacted at room temperature with Cl2 gas and examined with soft
x-ray photoelectron spectroscopy of the Ga and As 3d core levels. After low exposures (<5X10"^
L), the surface appears to passivate with half a monolayer of Cl adsorbed, primarily as AsCl.
Following sufficiently large (>5X10"^ L) exposures, however, the surface begins to etch, as
indicated by the continuous uptake of chlorine and the formation of As and Ga chlorides. After the
largest exposures, the distribution of As chlorides still favors the monochloride, whereas the Ga
chlorides favor GaCl2. It is proposed that the heavily reacted surface is covered with -AsCl-GaCl2
treelike structures. The addition of Cl to form GaCl3 from GaCl2 is identified as the rate-limiting
step in the overall etching reaction. © 1996 American Vacuum Society.


I.  INTRODUCTION 

It is becoming increasingly important to understand the
basic mechanisms involved in the chemical etching of
semiconductor wafers as processing technology is applied to
III-V substrates. The interactions of Cl2 with GaAs are
particularly significant, since chlorine is the primary chemical
etchant involved in GaAs device fabrication. The study of
Cl2/GaAs reactions is complicated somewhat by the fact that
many GaAs surface stoichiometries are stable and that the
composition and crystallinity of the starting surface affect the
path of the reaction.

For example, in the room-temperature reaction of Cl2 with
GaAs(110), there is a competition between passivation and
etching, as well as between Ga-Cl and As-Cl bonding.
Under certain conditions, GaAs(110) is observed to passivate
against further reaction with Cl2 at room temperature,
presumably via the formation of an AsCl overlayer.^ To explain
this result, surface order was suggested to be the controlling
factor in determining whether or not passivation occurs. On
the other hand, soft x-ray photoelectron spectroscopy
(SXPS) studies of the room-temperature reaction of Cl2 with
cleaved GaAs(110), which is highly crystalline, observed
both Ga and As chloride formation accompanied by continuous
chlorine uptake, which is suggestive of etching.^"^
Moreover, scanning tunneling microscopy (STM)^ and
high-resolution electron energy loss spectroscopy (HREELS)^ studies of the
etching reaction showed that there is a preference, at least
initially, for Ga-Cl bonding over As-Cl bonding.

A possible means of reconciling these apparently
contradictory results lies in the sample preparation technique used
in each study. In Refs. 3 and 5-7, clean starting surfaces
were prepared by cleavage in vacuum, whereas in Ref. 4 they
were prepared by ion bombardment and annealing (IBA). In
contrast, GaAs(110) wafers were prepared in Ref. 2 by IBA
followed by exposure to Cl2 and reannealing to ~535 °C,
which likely produced a starting surface different from those
in Refs. 3-7.

In this investigation, it is shown that cleaning GaAs(110)
with Cl2 in this manner produces an As-rich surface. The
subsequent room-temperature reaction of Cl2 with such an
As-rich surface is then followed with core-level SXPS. At
low exposures, the reaction appears to passivate with a
half-monolayer (ML) of AsCl formed, as reported in Ref. 2. After
sufficiently large Cl2 exposures, however, further reaction
does occur, generating AsCl and GaCl2 as the primary
surface products.

II.  EXPERIMENTAL  PROCEDURE 

The SXPS measurements were performed in a multichamber
ultrahigh vacuum (UHV) system on beamline UV-8a at
the National Synchrotron Light Source. Spectra were
collected using a 3 m toroidal grating monochromator and an
angle-integrating ellipsoidal mirror analyzer (EMA). Ga 3d
and As 3d spectra were collected with photon energies of 57
and 79 eV, respectively, to generate core-level photoelectrons
with kinetic energies of ~35 eV, which maximizes the
surface sensitivity of the measurements. The combined
resolution of the analyzer and monochromator at these photon
energies is better than 150 meV.
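
As a rough consistency check on the photon energies quoted above, the sketch below estimates the photoelectron kinetic energy from the photoelectric relation KE = hν − BE − φ. The photon energies are taken from the text. The binding energies (about 19 eV for Ga 3d and 41 eV for As 3d) and the 4.5 eV work function are rounded, assumed values chosen only for illustration, so the result should be read as approximate.

```python
# Photon energies (eV) quoted in the text for each core level.
PHOTON_ENERGY_EV = {"Ga 3d": 57.0, "As 3d": 79.0}

# Assumed, rounded binding energies (eV); not taken from the article.
APPROX_BINDING_ENERGY_EV = {"Ga 3d": 19.0, "As 3d": 41.0}

# Assumed work function (eV); purely illustrative.
ASSUMED_WORK_FUNCTION_EV = 4.5


def kinetic_energy(core_level):
    """Estimate the photoelectron kinetic energy: KE = hv - BE - phi."""
    return (PHOTON_ENERGY_EV[core_level]
            - APPROX_BINDING_ENERGY_EV[core_level]
            - ASSUMED_WORK_FUNCTION_EV)


for level in PHOTON_ENERGY_EV:
    print(level, kinetic_energy(level), "eV")
```

Under these assumptions both core levels come out within a few eV of the ~35 eV quoted in the text, which is why two different photon energies are used for the two levels.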

Prior to being placed in the UHV system, GaAs(110)
wafers (n type, Si doped, 10^^ cm^-3) were chemically etched in
a dilute solution of HNO3/H2O2/H2O, rinsed in isopropyl
alcohol, and then dried with N2 gas. The samples were
degassed in UHV, then ion bombarded with 500 eV Ar+ ions
and annealed to ~550 °C in order to remove any remaining
oxides. As a final step, the wafers were exposed to a small
(<100 L) dose of Cl2 and reannealed to ~550 °C for ~10
min (note: 1 L = 1 langmuir = 10^-6 Torr s). Exposure to Cl2
gas was carried out in a separate UHV dosing chamber (base
pressure=8X10“^^ Torr). Following chlorination, the sample
was transferred to the spectrometer chamber (base
pressure=2XlO"^® Torr) for measurement. The sample
remained in UHV throughout the cleaning, dosing,
transferring, and measuring procedures.


III.  RESULTS 

The  Ga  and  As  3d  core-level  spectra  were  numerically  fit, 
via  the  method  outlined  in  Ref.  4,  to  a  sum  of  spin-orbit  split, 
Gaussian-broadened  Lorentzian  line  shapes.  A  Lorentzian 
width  of  0.15  eV  was  used,  as  were  spin-orbit  splittings  and 
branching  ratios  of  0.45  eV  and  0.65  for  Ga  3d  and  0.68  eV 
and  0.65  for  As  3d,  respectively.  In  addition  to  the  signal 
arising  from  atoms  in  the  bulk,  the  spectra  collected  from  the 
clean  surface  have  contributions  from  surface  Ga  and  As 
atoms,  which  are  in  a  tricoordinate  geometry,  and  therefore 
have  different  core-level  binding  energies  (BEs)  than  the 
bulk  atoms.  The  magnitude  and  direction  of  shifts  obtained 
for  these  surface-shifted  core  levels  (SSCLs)  are  identical  to 
those  previously  reported  for  cleaved  GaAs.^ 
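
The line shape described above can be sketched in a few lines of code. This is a minimal illustration, not the fitting procedure of Ref. 4: it builds a spin-orbit doublet of Gaussian-broadened Lorentzians (Voigt profiles) by direct numerical convolution. The splitting, branching ratio, and Lorentzian width are the Ga 3d values from the text. The Gaussian width of 0.40 eV is hypothetical, and the code assumes that the quoted widths are full widths at half maximum and that the branching ratio is the intensity of the higher-BE (3d3/2) line relative to the lower-BE (3d5/2) line.

```python
import math


def lorentzian(x, fwhm):
    """Unit-area Lorentzian with the given full width at half maximum."""
    half = fwhm / 2.0
    return (half / math.pi) / (x * x + half * half)


def gaussian(x, fwhm):
    """Unit-area Gaussian with the given full width at half maximum."""
    sigma = fwhm / (2.0 * math.sqrt(2.0 * math.log(2.0)))
    return math.exp(-0.5 * (x / sigma) ** 2) / (sigma * math.sqrt(2.0 * math.pi))


def voigt(x, lor_fwhm, gau_fwhm, n=301):
    """Gaussian-broadened Lorentzian by direct numerical convolution."""
    span = 3.0 * gau_fwhm          # about 7 standard deviations on each side
    step = 2.0 * span / (n - 1)
    total = 0.0
    for k in range(n):
        t = -span + k * step
        total += lorentzian(x - t, lor_fwhm) * gaussian(t, gau_fwhm)
    return total * step


def doublet(energy, position, area, splitting, branching, lor_fwhm, gau_fwhm):
    """Spin-orbit doublet with total area `area`; `position` is the lower-BE line."""
    main = voigt(energy - position, lor_fwhm, gau_fwhm)
    minor = voigt(energy - position - splitting, lor_fwhm, gau_fwhm)
    return area * (main + branching * minor) / (1.0 + branching)


# Ga 3d values from the text; the Gaussian width is a hypothetical fit parameter.
GA_3D = dict(splitting=0.45, branching=0.65, lor_fwhm=0.15, gau_fwhm=0.40)


def model(energy, components):
    """Sum of doublets; `components` is a list of (position, area) pairs."""
    return sum(doublet(energy, pos, area, **GA_3D) for pos, area in components)
```

In an actual fit, the positions, areas, and Gaussian width of each chemically shifted component would be varied to minimize the residual against the background-subtracted spectrum; the component areas are what feed the coverage estimates.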

Examples of spectra from the reacted surfaces are shown
in Figs. 1 and 2, which contain As 3d and Ga 3d spectra
collected after exposures of 1.5X10^ L and 1.2X10^ L of
Cl2. In both figures, the filled circles are the
background-subtracted raw data, dashed lines are the individual
contributions to the spectra from each chemical species, and the solid
line is the sum of all these contributions. Except for the
low-BE Ga peak, each spectral component is labeled by the
chemical species with which it is associated.

In addition to the signal originating from bulk atoms, the
spectra collected from the reacted surfaces have several other
core-level components, which have been identified
previously. The As 3d spectra contain features shifted ~0.45 and
~1.2 eV higher in BE than the bulk component, which are
identified as arising from AsCl and AsCl2, respectively.
Additional Ga 3d components are also observed, shifted
~0.9 and ~1.7 eV higher in BE than the bulk component,
and are identified as due to GaCl and GaCl2,
respectively.

There are also contributions in the spectra from
tricoordinate Ga and As atoms, which have been observed previously
in the reactions of Cl2 and XeF2 with GaAs.^’^“^^ The
production of subsurface tricoordinate Ga and As atoms has
been shown to be a consequence of the etching reaction.
Because of the valences of Ga and As, tricoordinate
configurations are reasonably stable. Ga, with three valence
electrons, prefers a tricoordinate bonding geometry, while As,
with five valence electrons, has a stable lone pair of electrons
remaining when the other three are involved in bonding.


Fig. 1. High-resolution SXPS spectra of the As 3d core level collected from
GaAs(110) exposed at room temperature to (a) 1.5X10^ L of Cl2 and (b)
1.2X10^ L of Cl2. Dots are the background-subtracted raw data. The dashed
lines are the individual fit components. The solid line is the sum of the fit
components.

Finally, for Cl2 exposures above ~5X10"^ L, an additional
component shifted 0.5-0.6 eV lower in BE than the bulk
component is present in the Ga 3d spectra (see Fig. 2). A
similar shift, 0.5-0.6 eV to lower BE, is observed for Ga
surface atoms on clean GaAs(001)-4×6, which persists with
chlorination.^ It is therefore proposed that this low-BE shift
is due to Ga-Ga bonds and that Ga dimers, or possibly some
other type of Ga cluster, may be responsible for this shift.
The persistence of the low-BE shifted component with
increasing chlorination may indicate that Ga atoms in such a
configuration are reasonably inert to further reaction.

Fig. 2. High-resolution SXPS spectra of the Ga 3d core level collected from
GaAs(110) exposed at room temperature to (a) 1.5X10^ L of Cl2 and (b)
1.2X10^ L of Cl2. Dots are the background-subtracted raw data. The dashed
lines are the individual fit components. The solid line is the sum of the fit
components.

A simple calculation based on the areas of the core-level
components provides a quantitative estimate of the coverage
of each chemical species present on the surface following
reaction. The coverage of each species is simply proportional
to its relative contribution to the total core-level intensity, if
attenuation within the overlayer and chemically induced
variations in the photoionization cross sections are neglected.
The proportionality constant is determined straightforwardly
by comparing intensity ratios of the reacted surfaces to those
of a surface with known composition. This is done by
assuming that the ratio of the SSCL area to the total core-level area,
in a spectrum collected with the same photon energy from a
cleaved GaAs(110) surface, represents the signal from 0.5
ML of surface atoms. Coverages estimated in this manner,
using the core-level intensities reported in Ref. 3 for cleaved
GaAs(110), are given in Fig. 3.^12 The amount of Cl adsorbed
on the surface is found by summing the coverages of each of
the Ga and As chlorides, appropriately weighted by the
number of Cl atoms, and is given in Fig. 4.
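
The coverage estimate described above amounts to a ratio calculation, sketched here under stated assumptions. The function names, the fitted areas, and the reference SSCL fraction of 0.30 are all hypothetical values chosen for illustration. Only the 0.5 ML calibration and the weighting by the number of Cl atoms come from the text, and attenuation and cross-section variations are neglected, as in the text.

```python
# The SSCL signal on cleaved GaAs(110) is taken to represent 0.5 ML (from the text).
REFERENCE_COVERAGE_ML = 0.5


def coverages(component_areas, reference_sscl_fraction):
    """Convert fitted component areas to coverages in ML.

    component_areas: fitted area of each component of one core level,
        including the bulk component under the key "bulk".
    reference_sscl_fraction: SSCL area divided by total core-level area
        measured on the reference (cleaved) surface.
    """
    total = sum(component_areas.values())
    scale = REFERENCE_COVERAGE_ML / reference_sscl_fraction
    return {name: scale * area / total
            for name, area in component_areas.items() if name != "bulk"}


def chlorine_coverage(species_coverage, cl_per_species):
    """Sum the species coverages, weighted by the number of Cl atoms in each."""
    return sum(cov * cl_per_species.get(name, 0)
               for name, cov in species_coverage.items())


# Hypothetical fitted areas in arbitrary units (not data from the article).
as_areas = {"bulk": 60.0, "AsCl": 30.0, "AsCl2": 10.0}
ga_areas = {"bulk": 70.0, "GaCl": 6.0, "GaCl2": 24.0}
CL_ATOMS = {"AsCl": 1, "AsCl2": 2, "GaCl": 1, "GaCl2": 2}

as_cov = coverages(as_areas, reference_sscl_fraction=0.30)
ga_cov = coverages(ga_areas, reference_sscl_fraction=0.30)
cl_total = chlorine_coverage(as_cov, CL_ATOMS) + chlorine_coverage(ga_cov, CL_ATOMS)
```

Components without chlorine, such as tricoordinate atoms, would simply carry a weight of zero in the Cl sum.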


IV.  DISCUSSION 

The starting surface, which was cleaned by Cl2 adsorption
and annealing, is slightly As-rich. This is apparent from the
coverage estimates, described above, which indicate that
there is ~0.65 ML of surface-shifted As and only ~0.35 ML
of surface-shifted Ga on the clean surface. These data are
shown at the left side of the top panel of Fig. 3 by the two
points labeled "SSCL." The atoms that contribute to the
SSCL components are all exposed at the surface, as their
total coverage is not more than ~1 ML. Experimentally, it
has been shown that up to 1 ML of As can be grown on
GaAs(110), and that the excess As is stable on the surface for
temperatures below ~575 °C. The structure of As-rich
GaAs(110) is unknown, but a theoretical investigation
suggests that the excess As atoms form chains, bonding to both
Ga and As atoms below. Presumably, the Cl2 adsorption
and annealing procedure generates an As-rich surface via the
preferential formation and removal of Ga chlorides. Note
that, in other systems, halogen adsorption followed by
annealing also produces a group-V terminated surface via the
preferential removal of group-III halides.


Fig. 3. Coverage of each surface species, in monolayers (ML), as a function
of Cl2 exposure.


From Figs. 3 and 4, it is seen that this As-rich surface is
remarkably resistant to chlorine attack. That is, once ~0.5
ML of Cl is adsorbed (primarily as AsCl), there is minimal
Cl uptake over an order-of-magnitude increase in Cl2
exposure. As previously noted, this apparent passivation to
chlorine attack is not observed for cleaved GaAs(110),^’^ nor for
sputtered and annealed GaAs(110),‘^ but it is seen for
sputtered and annealed GaAs(110) wafers that are further cleaned
by exposure to Cl2 and reannealing.^ Since the cleaved
surfaces, which are highly crystalline, show no sign of
passivation, surface order does not appear to be the only factor
governing whether or not the surface passivates. Instead, it is
suggested that excess surface As is also involved in impeding
the reaction. How this occurs is unclear, however, especially
since the surface is not entirely covered with As, but has only
~0.15 ML of excess As. A possibility is that As migrates to
defects, thereby impeding the dissociation of Cl2 by tying up
the active sites.

Fig. 4. Cl coverage, in monolayers (ML), as a function of Cl2 exposure.

It is unexpected for the principal chloride on the
passivated surface to be AsCl, since photoemission^’^ and STM^
studies indicate that there is a clear preference for Ga-Cl
bonding in the initial stages of the reaction on GaAs(110). It
is possible that differences in the atomic structure or the
morphology of As-rich GaAs(110), as compared to cleaved
GaAs(110), are responsible for altering the distribution of
surface reaction products. STM measurements of As-rich
GaAs(110), both before and after reaction with Cl2, would be
of great use for understanding this unexpected result.

This apparently passive surface is not absolutely
unreactive, however, since the reaction does continue provided that
the Cl2 exposure is sufficiently large. That is, for exposures
greater than ~5X10"^ L, the amount of AsCl increases
noticeably and a measurable amount of GaCl2 forms. The
chlorine uptake also begins to increase at this point (see Fig. 4),
signaling the onset of etching. Once etching has begun, the
Cl coverage increases linearly with the logarithm of the Cl2
exposure, as is observed for the chlorination of GaAs(001).^
This linear dependence of the coverage on the logarithm of
the exposure implies that the sticking coefficient decays
exponentially with coverage, i.e., S(θ) = a exp(−bθ). Thus,
although excess surface As does slow the reaction
significantly, once this As is removed the etching reaction
continues normally.
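
The step from a logarithmic uptake curve to an exponentially decaying sticking coefficient can be made explicit. The short derivation below is a sketch and is not taken from the article. It writes the sticking coefficient as S(θ) = a exp(−bθ), with θ the Cl coverage, ε the Cl2 exposure, and a and b positive constants. It also assumes that the uptake rate per unit exposure is proportional to S through a hypothetical conversion factor κ, and that the coverage is zero at zero exposure.

```latex
\begin{align}
\frac{d\theta}{d\varepsilon} &= \kappa\, S(\theta) = \kappa a\, e^{-b\theta}, \\
\int_0^{\theta} e^{b\theta'}\, d\theta' &= \kappa a \int_0^{\varepsilon} d\varepsilon'
\;\Longrightarrow\; \frac{e^{b\theta} - 1}{b} = \kappa a\, \varepsilon, \\
\theta(\varepsilon) &= \frac{1}{b} \ln\!\left(1 + \kappa a b\, \varepsilon\right)
\approx \frac{1}{b}\ln \varepsilon + \frac{1}{b}\ln(\kappa a b)
\quad \text{for } \kappa a b\, \varepsilon \gg 1 .
\end{align}
```

At large exposure the coverage is therefore linear in ln ε with slope 1/b, so the slope of a coverage versus log-exposure plot gives the decay constant of the sticking coefficient directly.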

By carrying out the reaction at a sufficiently low
temperature, in this case room temperature, the lifetimes of the
reaction intermediates are sufficiently long that a determination
of the rate-limiting step can be made. It is seen from Fig. 3
that, following the largest Cl2 exposures, the principal As
surface reaction product continues to be AsCl, whereas
GaCl2 is the primary Ga product. The Cl2/GaAs reaction
occurs in a stepwise fashion, with the primary volatile etch
products at or near room temperature being AsCl3 and
GaCl3.^^’^^ Because the crystal structure of GaAs constrains
the availability of reaction sites, the etching reaction cannot
be simply divided into discrete decoupled pathways. Hence,
a build-up of AsCl and GaCl2 on the surface suggests that the
rate-limiting step in the overall etching reaction is either

GaCl2 + Cl → GaCl3 (1)

or

AsCl + Cl → AsCl2. (2)

The rate of one of these steps probably limits the other,
which then results in the simultaneous build-up of both
GaCl2 and AsCl.
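
The inference from surface build-up to rate-limiting step can be illustrated with a toy kinetic model. The sketch below is a deliberate simplification and not the authors' analysis: it treats the Ga pathway as a decoupled chain of first-order steps fed at a constant rate, whereas the text stresses that the real pathways are coupled through the availability of reaction sites. The rate constants are hypothetical. At steady state each intermediate settles at (feed rate)/(its consumption rate constant), so the species consumed by the slowest step accumulates the most.

```python
def simulate_chain(rate_constants, feed=1.0, dt=0.001, steps=40000):
    """Integrate a chain of first-order steps with simple Euler time stepping.

    rate_constants[i] is the rate constant at which intermediate i is
    consumed (converted to the next species, or desorbed for the last one).
    `feed` is the constant rate at which the first intermediate is formed.
    Returns the populations of the intermediates after steps * dt time units.
    """
    pops = [0.0] * len(rate_constants)
    for _ in range(steps):
        inflow = feed
        updated = []
        for n, k in zip(pops, rate_constants):
            outflow = k * n
            updated.append(n + dt * (inflow - outflow))
            inflow = outflow      # product of this step feeds the next one
        pops = updated
    return pops


# Hypothetical rate constants: GaCl -> GaCl2 is fast, GaCl2 -> GaCl3 is slow.
species = ["GaCl", "GaCl2"]
populations = simulate_chain([4.0, 0.5])
for name, n in zip(species, populations):
    print(name, round(n, 3))
```

With these illustrative numbers the GaCl2 population is eight times that of GaCl, mirroring the argument that an accumulated intermediate marks the slow step that consumes it.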

After the highest Cl2 exposures, the coverages of AsCl
and GaCl2 are nearly identical (see Fig. 3). A likely reason
for there being equal amounts of AsCl and GaCl2 on the
surface, and for the stability of one of them to limit the
removal of the other, is that they form a common moiety. If
such an assumption is made, then the simplest and most
reasonable arrangement having AsCl and GaCl2 bound together,
and to the surface, is -AsCl-GaCl2. In this moiety, the Ga
and As atoms are bound to each other, while the Ga atom is
also bonded to two Cl atoms and the As atom is bonded to a
single Cl atom and to the surface. This structure has three
requisite properties: (1) it consists of an equal number of
AsCl and GaCl2 moieties; (2) it can bind to the surface, in
this case through the As atom; and (3) both Ga and As are
tricoordinately bonded, which they prefer in the absence of a
crystal field. Any other possible arrangement does not satisfy
these criteria. In this configuration, it must be the
terminal -GaCl2 group that is the more stable of the two
species, i.e., the one that limits the reaction rate. Otherwise, it
would quickly form GaCl3 and desorb. Furthermore, due to
its stability, the terminal -GaCl2 group prevents further
chlorination of the AsCl moiety by tying up an otherwise
available reaction site. Thus, it appears that the rate-limiting step
in the reaction is that given by Eq. (1).

Note that the formation of an -AsCl-GaCl2 structure in
the Cl2 etching of GaAs is analogous to the generation of
fluorosilyl trees (e.g., -SiF-SiF3 and -SiF2-SiF3) in the
XeF2/Si etching reaction.^^ An interesting experiment, which
could confirm the existence of such a structure, would be to
study heavily chlorinated GaAs(110) with molecular beam
scattering or secondary ion mass spectrometry, and to look
for desorbing species that contain both Ga and As atoms.

The Cl2 exposure at which significant chlorine uptake
begins on As-rich GaAs(110) (~5X10"^ L) is uncommonly
large compared to the exposures that cause appreciable
reaction on other GaAs surfaces. In addition, prior to the
onset of reaction, the AsCl coverage dips slightly and the
AsCl2 and GaCl coverages begin to rise (see Fig. 3). The
gradual transformation of the surface product distribution
away from AsCl, combined with the very large exposures
needed to produce appreciable reaction, suggests that the
reaction nucleates at defects on an otherwise unreactive
AsCl-stabilized surface. Thus, barring such defects, an
AsCl-covered surface would be inert to further reaction.


V.  SUMMARY 

In summary, As-rich GaAs(110) was exposed at room
temperature to large doses of Cl2 and examined with SXPS.
Following the initial Cl2 exposures, the surface appears to
passivate with half a monolayer of AsCl formed, in a manner
similar to that observed previously.^ The reaction continues,
however, following much larger Cl2 exposures, with both As
and Ga chlorides formed on the surface. For the most heavily
reacted surfaces, the distribution of chlorides favors AsCl
and GaCl2. It is concluded that the chlorination of GaCl2 to
form GaCl3 is the rate-limiting step in the reaction, and that
GaCl2 stabilizes AsCl on the surface, possibly through the
formation of a treelike -AsCl-GaCl2 structure.


J.  Vac.  Sci.  Technol.  B,  Vol.  14,  No.  4,  Jul/Aug  1996 




Simpson,  Shuh,  and  Yarmoff:  RT  chlorination  of  As-rich  GaAs  (110) 




ACKNOWLEDGMENTS 

This work is based upon work supported by the U.S.
Army Research Office under Grant No. DAAH04-94-G-0410.
This work was performed at the NSLS, which is
supported by the Department of Energy (Division of Materials
Sciences and Division of Chemical Sciences of Basic Energy
Sciences) under Contract No. DE-AC0276CH0016. The
authors recognize the contributions of J. Yurkas and A. Marx
for the construction of the EMA. J. Yurkas and C. Costas are
also thanked for their efforts in upgrading the 3 m beamline.

^1 W. C. Simpson, D. K. Shuh, W. H. Hung, M. C. Hakansson, J. Kanski, U. O. Karlsson, and J. A. Yarmoff, J. Vac. Sci. Technol. A 14, 1815 (1996).
^2 D. J. D. Sullivan, H. C. Flaum, and A. C. Kummel, J. Chem. Phys. 101, 1582 (1994).
^3 D. K. Shuh, C. W. Lo, J. A. Yarmoff, A. Santoni, L. J. Terminello, and F. R. McFeely, Surf. Sci. 303, 89 (1994).
^4 W. C. Simpson, W. M. Tong, C. B. Weare, D. K. Shuh, and J. A. Yarmoff, J. Chem. Phys. 104, 320 (1995).
^5 F. Stepniak, D. Rioux, and J. H. Weaver, Phys. Rev. B 50, 1929 (1994).
^6 J. C. Patrin and J. H. Weaver, Phys. Rev. B 48, 17913 (1993).
^7 J. Pankratz, H. Niehaus, and W. Mönch, Surf. Sci. 307-309, 211 (1994).
^8 R. D. Schnell, D. Rieger, A. Bogen, K. Wandelt, and W. Steinmann, Solid State Commun. 53, 205 (1985).
^9 W. C. Simpson, T. D. Durbin, P. R. Varekamp, and J. A. Yarmoff, J. Appl. Phys. 77, 2751 (1995).
^10 W. C. Simpson, P. R. Varekamp, D. K. Shuh, and J. A. Yarmoff, J. Vac. Sci. Technol. A 13, 1709 (1995).
^11 P. R. Varekamp, W. C. Simpson, D. K. Shuh, T. D. Durbin, V. Chakarian, and J. A. Yarmoff, Phys. Rev. B 50, 14 267 (1994).
^12 Note that the photon energies used here are ~5 eV higher than those used in Ref. 3. This introduces only a negligible variation in the electron escape length, however, and therefore does not significantly affect the coverage calibration.
^13 T. T. Chiang and W. E. Spicer, J. Vac. Sci. Technol. A 7, 724 (1988).
^14 J. E. Northrup, Phys. Rev. B 44, 1349 (1991).
^15 K. Jacobi, G. Steinert, and W. Ranke, Surf. Sci. 57, 571 (1976).
^16 R. G. Jones, N. K. Singh, and C. F. McConville, Surf. Sci. 208, L34 (1989).
^17 A. Ludviksson, M. Xu, and R. M. Martin, Surf. Sci. 277, 282 (1992).
^18 A. P. Mowbray and R. G. Jones, Vacuum 41, 672 (1990).
^19 P. R. Varekamp, M. C. Hakansson, D. K. Shuh, J. Kanski, L. Ilver, Z. Q. He, J. A. Yarmoff, and U. O. Karlsson, Vacuum 46, 1231 (1995).
^20 P. R. Varekamp, M. C. Hakansson, J. Kanski, D. K. Shuh, M. Bjorkqvist, M. Gothelid, W. C. Simpson, U. O. Karlsson, and J. A. Yarmoff, Phys. Rev. B (in press).
^21 H. Hou, Z. Zhang, S. Chen, C. Su, W. Yan, and M. Vernon, Appl. Phys. Lett. 55, 801 (1989).
^22 L. A. DeLouise, J. Chem. Phys. 94, 1528 (1991).
^23 C. W. Lo, D. K. Shuh, V. Chakarian, T. D. Durbin, P. R. Varekamp, and J. A. Yarmoff, Phys. Rev. B 47, 15648 (1993).


JVST  B  -  Microelectronics  and  Nanometer  Structures 


Interaction of hydrogen ions with oxidized GaAs(100) and AlAs(100)
surfaces


Ying-Lan Chang^a)

Center  for  Quantized  Electronic  Structures  (QUEST),  University  of  California,  Santa  Barbara, 

California  93106 

R.  Cao,  W.  E.  Spicer,  and  P.  Pianetta 

Stanford  Synchrotron  Radiation  Lab  (SSRL),  Stanford  University,  Stanford,  California  94305 

Song  Shi  and  Evelyn  Hu 

Center  for  Quantized  Electronic  Structures  (QUEST),  University  of  California,  Santa  Barbara, 

California  93106 

James  Merz 

Department  of  Electrical  Engineering,  University  of  Notre  Dame,  Notre  Dame,  Indiana  46556 
(Received  22  January  1996;  accepted  20  April  1996) 

We  have  performed  photoemission  experiments,  using  a  tunable  soft  x-ray  synchrotron  radiation 
source  to  study  the  chemical  changes  of  oxidized  GaAs  and  AlAs  surfaces  subject  to  exposure  from 
hydrogen  ions.  Results  indicate  that  the  net  effects  for  hydrogen  ion  irradiation  are  (i)  the  reduction 
of  arsenic  and  (ii)  the  growth  of  the  cation  oxide  components.  The  reduction  of  arsenic  can  result 
from  the  formation/desorption  of  arsine.  The  oxide  overlayer  after  hydrogen  ion  treatments  is 
dominated  by  cation  oxides  which  are  the  more  stable  chemical  species  as  described  in  the  phase 
diagram.  This  oxide  layer  should  then  remain  stable  in  atmosphere.  These  results  can  provide  insight 
into  the  chemical  reaction  between  hydrogen  ions  and  oxidized  AlGaAs  surfaces.  ©  1996 
American  Vacuum  Society. 


I.  INTRODUCTION 

Surface passivation has been an important issue for the
fabrication of semiconductor devices. The ability to reduce
or control surface states can result in enhanced optical and
electronic device properties, particularly with the increasing
surface-to-volume ratio of the devices involved.

The common approach to obtain surface passivation has
been to remove the oxide layers and terminate the surface
dangling bonds. However, those treatments lack long-term
stability. The study of the stability of the GaAs surface
passivated by different methods has shown that the reason
for the subsequent degradation of electronic properties is
reoxidation of the surface.^ To improve the chemical stability,
one needs to understand the chemical reactions that may take
place between the passivated surface and the atmosphere,
and try to prevent the passivated surface from reoxidation.

We  have  previously  demonstrated  a  new  approach  to  sur¬ 
face  passivation,  utilizing  hydrogen  ion  irradiation.  The  as¬ 
sessment  of  passivation  is  related  to  the  luminescence  effi¬ 
ciency  of  a  near-surface/surface  quantum  well  (QW).^“^  The 
passivation  process  takes  place  in  the  presence  of  the  native 
oxide.  The  improvement  in  luminescence  has  been  found  to 
persist  for  periods  as  long  as  three  years.^’^  In  situ  Auger 
electron  spectroscopy  (AES)  and  temperature  programmed 
desorption  measurements  suggest  that  the  passivation  is 
linked to the removal of arsenic in the form of arsine^’ with
the consequent reduction in formation of arsenic antisite defects
which may be responsible for high surface recombination velocity.

Present address: Hewlett-Packard Co., Hewlett-Packard Laboratory, Palo
Alto, CA 94304; Electronic mail: ying-len chang@hpl.hp.com

A more direct assessment of the hydrogen-ion surface
chemistry requires the use of techniques such as photoemission
spectroscopy (PES). This article describes in situ PES
measurements carried out with a tunable soft x-ray synchrotron
radiation source. Such experiments provide good surface
sensitivity and valuable information about the evolution
of surface oxide components due to hydrogen ion treatments;
therefore, they can provide further insight into the mechanism
of hydrogen passivation and long-term stability.

II.  EXPERIMENT 

In  our  earlier  studies,  the  substrates  used  had  surface  lay¬ 
ers  of  AlojGaojAs  cladding  GaAs  QWs.^  To  simplify  the 
analysis  of  the  data,  these  studies  were  carried  out  on  GaAs 
and  AlAs  surfaces  separately,  and  the  behavior  of 
Al$_x$Ga$_{1-x}$As surfaces will be extrapolated from the behavior
of  the  binary  substrates. 

The GaAs epilayer was grown by molecular beam epitaxy
(MBE) on GaAs(100) substrates. The sample was treated with
buffered HF solution to remove the surface oxide layer formed
during long-term air exposure. The sample was then immediately
transferred to the ultrahigh vacuum (UHV) chamber for hydrogen
ion treatments and photoemission measurements. The
total exposure time during this sample transfer was ~15 min.

The AlAs epilayer was also grown by MBE on
GaAs(100) substrates and was capped by As before removal
from the MBE chamber. To prevent any degradation of the
protecting As cap, the samples were kept in a nitrogen box at
all times between removal from the MBE chamber and insertion
into the UHV analysis chamber. The As capping layer
was thermally desorbed in the photoemission chamber by
heating the sample at 450 °C for ~15 min. Subsequently, an
oxide layer was grown by exposing this clean AlAs surface
either to air for ~15 min or to oxygen in the UHV chamber.

Hydrogen ions were generated by using an ion sputter gun
with a differential gas inlet, operated at an ion energy of 100
eV. The flux was measured using the ion current collected on
the sample. A typical ion current density of 1.5 μA/cm² was
used. The hydrogenation process was done at room temperature.
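
The quoted ion current density fixes the ion flux, and hence the irradiation time needed to reach a given exposure. The short sketch below works through that arithmetic. It assumes singly charged ions and ignores secondary-electron emission from the sample, neither of which is stated in the text, and the target exposure of 1e15 cm^-2 is a placeholder chosen only for illustration, not a value taken from this work.

```python
# Convert a measured ion current density into an ion flux and an exposure time.
# Assumptions (not from the article): singly charged ions, no correction for
# secondary-electron emission, and a purely illustrative target exposure.

ELEMENTARY_CHARGE_C = 1.602176634e-19  # exact SI value, in coulombs


def ion_flux_per_cm2_s(current_density_a_per_cm2, charge_state=1):
    """Ions arriving per cm^2 per second for a given current density."""
    return current_density_a_per_cm2 / (charge_state * ELEMENTARY_CHARGE_C)


def time_for_exposure_s(target_exposure_per_cm2, flux_per_cm2_s):
    """Irradiation time (s) needed to accumulate the target exposure."""
    return target_exposure_per_cm2 / flux_per_cm2_s


flux = ion_flux_per_cm2_s(1.5e-6)   # 1.5 uA/cm^2, as quoted in the text
example_exposure = 1e15             # hypothetical target, ions/cm^2
seconds = time_for_exposure_s(example_exposure, flux)
print(f"flux = {flux:.3g} ions/cm^2/s, time = {seconds:.0f} s")
```

Under these assumptions the flux is just under 1e13 ions/cm²/s, so exposures of this order are reached on a time scale of minutes.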

The photoemission measurements were performed on
beam line 8-1 at the Stanford Synchrotron Radiation Laboratory
with tunable synchrotron radiation as the source for
photoelectron spectroscopy. The photon energy could be tuned in
the range between 10 and 200 eV. Typically, the photon energy
used for the Ga-3d and As-3d core level spectra and the valence
band spectra was 80 eV, and the photon energy used for the
Al-2p core level spectra was 120 eV. Under these measurement
conditions, surface sensitive (~5-10 Å escape depth) valence
band and Ga, As, and Al core level spectra could be
obtained.
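
A rough way to see what a ~5-10 Å escape depth means for surface sensitivity is the standard exponential attenuation model, in which the fraction of the detected signal originating within a depth d of the surface is 1 - exp(-d/λ). The sketch below assumes normal emission and a homogeneous layer, neither of which is stated in the text, and the 10 Å probing depth is an illustrative choice.

```python
import math

# Fraction of photoemission signal that originates within a given depth,
# using simple exponential attenuation with escape depth lambda.
# Assumptions (not from the article): normal emission, homogeneous material.


def signal_fraction(depth_angstrom, escape_depth_angstrom):
    """Return the fraction of detected signal coming from the top `depth`."""
    return 1.0 - math.exp(-depth_angstrom / escape_depth_angstrom)


for escape_depth in (5.0, 10.0):  # the range quoted in the text, in angstroms
    frac = signal_fraction(10.0, escape_depth)
    print(f"lambda = {escape_depth} A: {frac:.0%} of the signal from the top 10 A")
```

Within this model, most of the signal comes from the outermost 10 Å, which is comparable to the thickness of a thin native oxide.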

The investigation of the surface chemical reaction requires
appropriate deconvolution of the core level spectra.
Although the resolution of our PES system is about 0.3 eV,
several factors, e.g., the inhomogeneity of the native oxide,
may contribute to the broadening of the core level peaks. The
large linewidth complicates the interpretation of the core
level spectra. For clarity, we studied the interaction between
hydrogen ions and oxidized binary compounds, including
GaAs and AlAs surfaces. The results on these surfaces can
provide some insight into the chemical reaction on ternary
compounds, such as AlGaAs.

III.  RESULTS 

A.  Chemical  reaction  of  hydrogen  ions  with  oxidized 
GaAs  surface 

The As-3d core level spectrum for a native-oxide-covered
GaAs surface is shown in Fig. 1(a). Deconvolution of the
As-3d core level spectrum reveals the substrate component
(at ~33.2 eV) and As2O3 (at ~30.3 eV). Elemental As was
not  observed  on  this  native-oxide-covered  surface.  Previous 
results  on  a  clean  GaAs  surface  indicate  that  the  surface 
Fermi  level  can  be  pinned  at  submonolayer  oxygen 
coverage,  which  is  much  lower  than  the  coverage  required 
for  the  formation  of  elemental  As.^^’’^  As  a  matter  of  fact, 
elemental  As  has  only  been  observed  on  surfaces  subjected 
to  heavy  oxidation,  e.g.,  ionized  oxygen  treatments,  etc.^^ 

The As-3d core level spectrum for the oxidized GaAs
surface  after  in  situ  room  temperature  hydrogen  ion  treat¬ 
ments  with  exposure  of  10^^  cm“^  is  shown  in  Fig.  1(b).  The 
As(oxide)/As(substrate)  ratio  was  found  to  decrease  by  about 
11%  after  hydrogen  ion  treatments. 

Fig. 1. The As-3d core level spectra for an oxidized GaAs surface (a) before
and (b) after room temperature hydrogen ion treatments. The horizontal axis
is kinetic energy (eV). The hydrogen ion exposure was 10**^ cm“^. The dots
represent the raw data, the solid line through the dots is the curve fit to
the data, and the various components are labeled according to the shading:
the dark area represents the substrate components, and the light area
represents the oxide component.

The Ga core level spectra similarly show a substrate component
(at ~55.1 eV) and Ga2O3 (at ~54.0 eV), as shown in
Figs. 2(a) and 2(b). In contrast to As, the Ga(oxide)/
Ga(substrate) ratio increases by about 27% after hydrogen
ion treatments.
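
The peak positions quoted for the As-3d and Ga-3d spectra give the oxide-to-substrate chemical shifts directly. The sketch below does that subtraction. It assumes that the quoted positions are kinetic energies, as the axis label of Fig. 1 suggests, so that a lower kinetic energy means a higher binding energy.

```python
# Oxide-to-substrate chemical shifts from the peak positions quoted in the text.
# Assumption: the quoted positions are kinetic energies (the Fig. 1 axis label),
# so a lower kinetic energy corresponds to a higher binding energy.

peaks_ev = {
    "As-3d": {"substrate": 33.2, "oxide": 30.3},  # oxide component: As2O3
    "Ga-3d": {"substrate": 55.1, "oxide": 54.0},  # oxide component: Ga2O3
}

shifts_ev = {
    level: round(p["substrate"] - p["oxide"], 2)
    for level, p in peaks_ev.items()
}

for level, shift in shifts_ev.items():
    print(f"{level}: oxide peak lies {shift} eV to higher binding energy")
```

The As oxide component is separated from its substrate peak by nearly three times as much as the Ga oxide component, which is one reason the Ga-3d deconvolution is more sensitive to peak broadening.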

The  As/Ga  integrated  intensity  ratio  decreases  by  about 
33%  after  room  temperature  hydrogen  ion  treatment  with 
exposure  of  10^^  cm~^.  The  reduction  of  the  As/Ga  ratio  is 
consistent  with  our  previous  observation  by  AES 
measurements.^^ 

It was also noticed that after room temperature hydrogen
passivation at low exposure, a small amount of As2O3 still
exists on the surface, as shown in Fig. 1(b).

B.  Chemical  reaction  of  hydrogen  ions  with  oxidized 
AlAs  surface 

Studies  of  the  AlAs  surface  produced  similar  results;  that 
is,  the  As(oxide)/As(substrate)  ratio  decreases  after  hydrogen 
ion  treatment. 

Since Al oxidizes rapidly in air, only the oxide component
can be observed in the Al-2p core level spectrum for the
native-oxide-covered AlAs surface. It is therefore difficult to
observe the modification of the Al core level components
after hydrogen ion treatments. The studies of the interaction
between hydrogen ions and oxidized AlAs surfaces had to be
carried out by capping the grown AlAs layer with arsenic,
and then carefully desorbing the arsenic and performing a
controlled, in situ oxidation of the surface.

Fig. 2. The Ga-3d core level spectra for an oxidized GaAs surface (a)
before and (b) after room temperature hydrogen ion treatments. The hydrogen
ion exposure was 10^^ cm~^. The dots represent the raw data, the solid
line through the dots is the curve fit to the data, and the various components
are labeled according to the shading: the dark area represents the substrate
components, and the light area represents the oxide component.

The  Al-2p  core  level  spectra  for  the  clean  AlAs  surface 
and  the  surface  treated  with  10^  L  oxygen  exposure  are 
shown  in  Figs.  3(a)  and  3(b),  respectively.  Both  the  substrate 
and  oxide  components  can  be  clearly  observed  in  Fig.  3(b). 
The  spectrum  obtained  from  the  oxide  covered  surface  after 
hydrogen ion treatment is presented in Fig. 3(c). After room
temperature hydrogen ion treatments, the
Al(oxide)/Al(substrate) ratio clearly increases.


Fig. 3. The Al-2p core level spectra for a clean AlAs surface before and
after  different  surface  treatments,  (a)  Clean  (prepared  by  As-decapping),  (b) 
after  10^  L  oxygen  exposure,  and  (c)  after  room  temperature  hydrogen  ion 
treatment  with  exposure  of  10^^  cm“^.  The  dots  represent  the  raw  data,  the 
solid  line  through  the  dots  is  the  curve  fit  to  the  data,  and  the  various 
components  are  labeled  according  to  the  shading:  The  dark  area  represents 
the  substrate  components,  and  the  light  area  represents  the  oxide  component. 


IV.  DISCUSSION 

As  mentioned  above,  the  reduction  of  As  from  the  oxi¬ 
dized  surfaces  was  observed  after  hydrogen  ion  treatments. 
Our earlier experiments validated the evolution of AsH3 from
the  hydrogenated  substrate  and  the  attendant  reduction  in 
surface  arsenic  concentration.^’^®  Two  possibilities  for  the 
mechanism of reduction involve either (1) reaction of hydrogen
and arsenic at the oxide-semiconductor interface or (2)
the reaction of the hydrogen ions with the arsenic oxide.

Reaction (1) requires the formation of AsH3 below the
surface of the native oxide, and the subsequent desorption of
the AsH3. This process can be limited by the transport of
AsH3 molecules through the oxide to the uppermost surface.

However, this may not be an issue if the formation of
AsH3 takes place through the oxide. Such a reaction would
account for the preferential reduction of the arsenic bound as
As2O3. What could then explain the increase of Ga2O3?

The interaction between hydrogen and As2O3 can produce
H2O.^^ If the generated H2O remained physisorbed on the
surface of the substrate, further oxidation of GaAs could take
place, with gallium oxidizing more readily than arsenic and
some portion of the As2O3 being formed reacting with the
hydrogen and being desorbed as arsine. That is, during the
treatments, the reduction of As2O3 and As is accompanied by
formation of Ga2O3, and the net reaction can be written as
$$\mathrm{2\,GaAs + As_2O_3 + 12\,H \rightarrow Ga_2O_3 + 4\,AsH_3}.$$
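
As a quick consistency check, the net reaction 2GaAs + As2O3 + 12H → Ga2O3 + 4AsH3 can be verified to be atom-balanced. The sketch below counts atoms on each side. Hydrogen is treated as neutral atomic H purely for bookkeeping, and charge is not tracked; that simplification is an assumption of this example.

```python
# Check that the net reaction 2GaAs + As2O3 + 12H -> Ga2O3 + 4AsH3 is
# atom-balanced. Hydrogen is counted as atomic H; charge is not tracked.
from collections import Counter


def total_atoms(side):
    """Sum element counts over (coefficient, composition) pairs."""
    totals = Counter()
    for coefficient, composition in side:
        for element, count in composition.items():
            totals[element] += coefficient * count
    return totals


reactants = [
    (2, {"Ga": 1, "As": 1}),  # 2 GaAs
    (1, {"As": 2, "O": 3}),   # As2O3
    (12, {"H": 1}),           # 12 H
]
products = [
    (1, {"Ga": 2, "O": 3}),   # Ga2O3
    (4, {"As": 1, "H": 3}),   # 4 AsH3
]

left, right = total_atoms(reactants), total_atoms(products)
print("balanced:", left == right, dict(left))
```

The oxygen count is the same on both sides, which matches the statement that the oxygen concentration remains unchanged while the As/Ga ratio falls.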

As  a  result,  the  As/Ga  ratio  is  reduced.  However,  the  oxy¬ 
gen  concentration  remains  unchanged,  as  observed  by  our 
AES  measurements.^^ 

Evidence that this reaction can be taking place has been
provided by previous in situ real-time Fourier transform infrared
spectroscopy measurements which were carried out for
a closely related process, that of hydrogen plasma treatment
of an oxidized GaAs surface. In this study, it was found
that the concentration of physisorbed H2O on the GaAs surface
increases during hydrogen treatments. The formation of
Ga2O3 was observed as a result of the oxidation of the GaAs
through reaction with the physisorbed water. Although the
oxidized GaAs surface is treated by hydrogen ions, rather
than hydrogen plasma, in the present case, it is likely that
similar reactions are taking place.

We  have  previously  reported  that  the  enhancement  of  lu¬ 
minescence  efficiency  from  a  near-surface  GaAs/AlGaAs 
QW  due  to  hydrogen  passivation  remains  stable  in 
atmosphere.^  Regarding  the  long-term  stability  of  passivation 
effects,  two  things  need  to  be  considered.  First,  the  surface 
termination  layer  needs  to  remain  intact  after  the  exposure  to 
atmosphere.  Second,  the  overlayer  needs  to  be  tightly  packed 
so  that  the  diffusion  of  oxygen  through  the  overlayer  can  be 
minimized. 

Based on our PES results shown above, the chemical reactions
taking place on oxidized GaAs and AlAs surfaces appear
to be similar. That is, the net effects of the hydrogen
ion treatments on oxidized surfaces are (i) the reduction of
As and (ii) the growth of cation oxide components. One can
expect that similar results can be obtained on an oxidized
AlGaAs surface after hydrogen ion irradiation.

That  is,  hydrogen  ion  treatments  not  only  reduce  the 
amount  of  As  and  the  density  of  midgap  states  at  the  oxide/ 
semiconductor  interfaces, but  also  improve  the  chemical 
stability  of  the  oxides.^  As  a  result,  the  chemical  reaction 
between  the  surface  layer  and  the  atmosphere  can  be  elimi¬ 
nated.  Contrary  to  other  passivation  approaches,  we  use  this 
oxide  as  an  overpassivation  layer,  rather  than  remove  it  to 
achieve  a  clean  surface. 

It has been suggested that the complete removal of As and
As2O3 is important in surface passivation.^ However, the
peak corresponding to As2O3 is still observable in the surface
sensitive PES spectra taken from samples after hydrogen ion
treatments. On the basis of the phase diagram,^ if As2O3 does
not directly contact the substrate, the reaction resulting in the
formation of free As will not take place. Although we do not
have direct evidence, we believe that this is likely for an
As-deficient surface. Moreover, since the thickness of the oxide
overlayer is small, the presence of a small amount of As2O3
may be important in forming a closely packed oxide.

V.  CONCLUSIONS 

In summary, we report in situ monitoring of the chemical
changes of oxide-covered GaAs and AlAs surfaces caused by
hydrogen ion treatments. Experimental evidence shows that
the hydrogen ion irradiation decomposes the arsenic oxide
and reduces the amount of arsenic at the oxide/semiconductor
interface. The oxide overlayer after hydrogen
ion treatments is dominated by the cation oxides, which are
the more stable chemical species as described in the phase
diagram.^ This oxide layer remains stable in atmosphere, and
can serve as an effective capping layer for the hydrogen
ion-treated semiconductor surface, preventing surface reoxidation.

It was also noted that a small amount of As2O3 was still
present on the surface after hydrogen passivation. It is likely
that the residual As2O3 does not come into direct contact
with the semiconductor substrate and therefore does not react
with the substrate to form free arsenic at the interface. The
oxide-semiconductor  interface  remains  arsenic  deficient.  As 
a  result,  the  improved  interface  quality  can  be  preserved,  as 
indicated  by  the  long-term  stability  of  the  passivation 
effects.^’^

We report a study of the effect of source switching sequences on the interface abruptness of
InGaAs/InGaAsP quantum wells (QWs) grown by gas-source molecular beam epitaxy. Four
methods were investigated: i.e., at each interface during growth interruption, (1) introducing a
residual group-V source evacuation time (RSE) t where all sources are shut off; (2) no RSE; (3) no
RSE but adding a group-III In+Ga prelayer of m monolayers; or (4) using RSE plus the In+Ga
prelayer.  InGaAs/InGaAsP  QWs  grown  by  the  different  methods  were  characterized  by 
low-temperature  photoluminescence  (PL).  The  results  show  that  by  optimizing  t  or  m,  both  RSE  and 
prelayer  methods  can  improve  the  heterostructure  interface  abruptness,  which  is  evidenced  by  the 
stronger  PL  intensity  and  narrower  PL  linewidth,  and  that  combining  the  two  with  optimized  t  and 
m  gives  the  best  result.  This  is  due  to  the  minimization  of  the  memory  effect  by  using  RSE  and  of 
the  As/P  exchange  by  using  the  group-III  prelayer.  ©  1996  American  Vacuum  Society. 


I.  INTRODUCTION 

InGaAs/InGaAsP quantum well (QW) structures are used
extensively for optoelectronic device applications, such as
long-wavelength lasers and modulators. It is important to
obtain QW structures with abrupt heterointerfaces, as interface
roughness will randomize quantum-confined levels, increase
the linewidth of optical emission, and greatly influence
the properties of heterostructure devices. There are two
main issues that need to be solved when growing high-
quality  arsenide/phosphide  heterostructures.  One  is  the  so- 
called  memory  effect  or  carry over,^”^  i.e.,  the  residual 
group-V  source  will  be  incorporated  into  the  succeeding 
layer,  forming  a  compositionally  graded  transitional  layer  at 
the heterointerface. This usually happens when phosphide-containing
compounds are grown on arsenide-containing
compounds as a result of the higher incorporation coefficient
of As compared with P. The reverse case can also happen
when the amount of P2 required is much larger than that of As2,
where the residual P2 will also be incorporated into the subsequent
As-containing layers. Another problem encountered is the
exchange of As and P when switching the group-V
sources. The new group-V source will replace the old
one  in  the  underlying  layer,  resulting  in  a  strained  interfacial 
layer.  It  has  been  reported  that  relatively  low-temperature 
growth  can  reduce  the  thickness  of  the  intermixing  layer^^  or 
can  obtain  abrupt  interfaces.^^  The  optical  quality  of  the 
grown structure, however, is usually poor, and
post-growth  annealing  is  required  to  improve  the  material 
quality.  On  the  other  hand,  using  one  monolayer  of  In  on  an 
InGaAs  layer  before  P2  exposure  has  been  proposed  to  im¬ 
prove  the  InGaAs/InP  interface.^  Due  to  the  reaction  of  the 
deposited  In  metal  with  the  As2  in  the  underlying  InGaAs 
layer,  a  transition  region  still  exists  by  using  this  method,  and 
ways to further improve the interface quality are needed.

In this article, we investigate different ways to improve
the interface quality of InGaAs/InGaAsP heterostructures
by minimizing the memory effect and the As/P exchange.
Besides the conventional method, three approaches
are explored, and InGaAs/InGaAsP QWs with improved
interfaces are obtained, as evidenced by stronger
photoluminescence (PL) intensity and narrower PL linewidth.


II.  EXPERIMENT 

The experiment was carried out in a Varian Gen II MBE
system, which had been modified to handle group-V gas
sources. The hydride gas sources for arsenic and phosphorus
dimers were pure arsine and phosphine, which were introduced
into the growth chamber via two hydride injectors
operated at 970 °C. The use of separate crackers is thought to
be effective in minimizing the cross contamination of As/P.
The samples were grown on a (100)-oriented semi-insulating
InP substrate. The growth rate of In0.84Ga0.16As0.32P0.68 was 1
monolayer per second (ML/s), and that of In0.53Ga0.47As was
0.3-0.4 ML/s. The flow rate of phosphine used was 2.8 sccm
and that of arsine was 0.5 sccm. The typical chamber pressure
was between 2X10“^ and 1X10“^ Torr. Reflection
high-energy electron diffraction (RHEED) was used to examine
the surface roughness of the As/P interface. The intensity
of the RHEED specular beam was detected by an
optical-fiber-coupled photodiode, and the signal was enhanced
with a dual-channel differential amplifier before it
was recorded by a computer. Low-temperature photoluminescence
measurement was carried out by mounting the
samples in a liquid He cryostat and using the 514.5 nm line
of an argon ion laser as the excitation source. A thermoelectrically
cooled Ge photodiode was used to detect the signal at
the exit of a 50-cm monochromator through an amplifier. The
pumping power density was ~2 W/cm² and the measurement
temperature was 15 K.

Fig. 1. Shutter switching sequences used for the growth of InGaAs/InGaAsP
QWs (not drawn to scale in time), where the time period t1 is used to smooth
the grown surface and t3 to stabilize the gas flow after switching the
group-V sources (conventional method).


III.  RESULTS  AND  DISCUSSION 

Our control sample was grown by the source switching
sequences shown in Fig. 1, where the solid lines indicate that
the corresponding sources are turned on. This is a conventional
way to grow arsenide/phosphide heterostructures; i.e.,
at each interface during growth interruption, after a continuing
supply of a group-V source for t1 sec to smooth the
grown surface, a new group-V source is switched on for t3
sec to stabilize the gas flow before growing the subsequent
layer (the conventional method). As a result, a transient
interfacial layer caused by both the memory effect and the
As/P exchange might be formed by using this method. The
memory effect is more pronounced at the InGaAs-on-InGaAsP
interface because of the incorporation of the residual
P2 into the succeeding InGaAs layer, while the As/P
substitution is the main source responsible for the transient
layer at the InGaAsP-on-InGaAs interface. The latter can be
evidenced by examining the change of the RHEED pattern of
InGaAs under the exposure of As2+P2, as surface roughening
is direct evidence of As/P substitution. Figures 2(a) and
2(b) show the RHEED patterns of an In0.53Ga0.47As layer
exposed to As2 and to As2+P2 for 12 sec at 500 °C, respectively.
The freshly grown In0.53Ga0.47As surface at
500 °C shows an initial (2×1) streaky RHEED pattern,
which is quite stable under As2 exposure. Upon being exposed
to As2+P2, however, the RHEED pattern gradually
becomes spotty within 12 sec [Fig. 2(b)], indicating that the
In0.53Ga0.47As surface becomes rough as a result of the As/P
exchange.

Fig. 2. RHEED patterns of InGaAs under As2 (a) and As2+P2 (b) exposure
for 12 sec at 500 °C measured from [110] azimuth.

Fig. 3. Shutter switching sequences used for the growth of InGaAs/InGaAsP
QWs (not drawn to scale in time): (a) RSE method, where a time period
t2 is introduced between t1 and t3 to evacuate the excess group-V species;
(b) prelayer method, where one ML In+Ga is deposited between t1 and t3 to
avoid As/P exchange; and (c) combined method, where RSE plus prelayer
are used.

To solve these problems, we tried three other
source switching sequences as illustrated in Figs. 3(a), 3(b),
and 3(c). The first method [Fig. 3(a)] introduces a
residual group-V source evacuation (RSE) time period t2
between t1 and t3, where all sources are shut off (the RSE
method). The second method [Fig. 3(b)] does not use RSE
but inserts one monolayer (ML) of In and Ga metals
(In+Ga) before the switching of the group-V sources (the
prelayer method). In method three [Fig. 3(c)], RSE plus one
ML of In+Ga prelayer is used at each interface (the combined
method). To compare the effectiveness of the three
methods in improving interface abruptness, we grew four
samples with the same sample structure, i.e., three
In0.53Ga0.47As/In0.84Ga0.16As0.32P0.68 single quantum wells
(SQWs) with well widths of 0.5, 1, and 2 nm and an InGaAsP
barrier width of 20 nm, except that at each interface a different
source switching method was used. The smoothing time
t1 was determined by examining the RHEED intensity recovery
time after growth interruption,^'^ as the magnitude of the
RHEED intensity reflects the degree of surface roughness of
the grown surface. The RSE time was optimized previously by
growing different samples with the same sample structure but
different RSE times.^'^ An RSE time of 6 sec was used
here.
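
To put the quoted well widths and growth rates on a common scale, they can be converted into monolayers and shutter-open times. The sketch below is only an estimate: it assumes the In0.53Ga0.47As wells are lattice matched to InP with a lattice constant of 0.58687 nm, and that one (100) monolayer is half a lattice constant thick. Neither number is given in the text.

```python
# Rough conversion of the quoted well widths into monolayers and growth times.
# Assumptions (not from the article): layers are lattice matched to InP with
# a = 0.58687 nm, and one (100) monolayer is a/2 thick.

INP_LATTICE_CONSTANT_NM = 0.58687
ML_THICKNESS_NM = INP_LATTICE_CONSTANT_NM / 2

WELL_WIDTHS_NM = [0.5, 1.0, 2.0]        # from the text
INGAAS_RATE_ML_PER_S = (0.3, 0.4)       # quoted growth-rate range


def monolayers(width_nm):
    """Number of (100) monolayers in a layer of the given width."""
    return width_nm / ML_THICKNESS_NM


for width in WELL_WIDTHS_NM:
    n_ml = monolayers(width)
    t_fast = n_ml / INGAAS_RATE_ML_PER_S[1]
    t_slow = n_ml / INGAAS_RATE_ML_PER_S[0]
    print(f"{width} nm = {n_ml:.1f} ML, grown in {t_fast:.1f}-{t_slow:.1f} s")
```

On this estimate the thinnest well is less than two monolayers thick, so a transitional layer of even a fraction of a monolayer at each interface is a sizable part of the well, which is why the PL of these narrow wells is a sensitive probe of interface abruptness.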

Figures 4(a) and 4(b) show a comparison of the PL linewidth
and PL intensity of the four samples, respectively. The
open squares are the results of the sample grown with the
combined method; the solid triangles, with the RSE method;
the solid squares, with the prelayer method; and the open
circles, with the conventional method. Compared with the
conventional method, both the RSE and the prelayer methods
clearly improve the
In0.53Ga0.47As/In0.84Ga0.16As0.32P0.68 interfaces, while combining
the two gives the best result. This is evidenced by the
stronger PL intensity and narrower PL linewidth of the
InGaAs/InGaAsP SQWs grown by these methods as compared
with those grown with the conventional method. The
improvement from using RSE is thought to be due to minimization
of the memory effect. When no RSE is used, excess
As and/or P accumulated at the grown surface during time
t1 might be incorporated into the succeeding layer, resulting
in a transitional layer. By introducing an RSE time, the residual
group-V sources will be evacuated, thus minimizing
the memory effect. The prelayer method also improves
the interface abruptness because it eliminates
As/P exchange at the interface. Usually As/P exchange will
occur when an arsenide compound (phosphide compound) is
exposed to phosphorus (arsenic) for a period of time, resulting
in a rough surface of the grown layer, as has already been
shown by observing the RHEED pattern when a freshly
grown In0.53Ga0.47As surface is exposed to an As2+P2 flux
(Fig. 2). This As/P exchange can be eliminated by inserting
an In+Ga prelayer before switching the group-V sources;
e.g., when one ML of In+Ga is deposited after the In0.53Ga0.47As
growth and before As2 and P2 are supplied, the direct exposure
of the InGaAs surface to P2 is prevented, thus avoiding the
exchange of As/P and leading to an improved interface. Studies
on the InGaAs/InP interfaces^ also show that adding
one ML of In on the InGaAs surface before P2 beam exposure
can improve the interface quality. The reason we use
one ML of In+Ga instead of In here is to avoid a strained
transitional layer that might be formed by the In-only method.
For InGaAs/InGaAsP growth, the deposited one ML of
In+Ga or In will react with the group-V species in the underlying
InGaAs layer and also with the As2 and P2 that are
supplied subsequently in the time period t3 before growing
the next InGaAsP layer. If In alone were used, a strained
layer of InAs and/or InAsP might be
formed at the InGaAsP-on-InGaAs heterointerface, thereby
deforming the shape of the well and influencing the optical
properties of the heterostructures. By using a mixture of
In+Ga, the effect can be minimized. When combining the RSE
and prelayer methods, both the memory and As/P substitution
effects can be minimized, and the best interface is expected,
as evidenced by our results.

Fig. 4. A comparison of the PL linewidth (a) and intensity (b), plotted
against well width (nm), of the InGaAs/InGaAsP QWs grown by the four
different shutter switching methods. The open squares are the results of the
sample grown with the combined method, the solid triangles are the results
of the sample grown with the RSE method, the solid squares are the results
of the sample grown with the prelayer method, and the open circles are the
results of the sample grown with the conventional method.

IV.  SUMMARY 

In summary, InGaAs/InGaAsP QWs have been grown by
gas-source molecular beam epitaxy (GSMBE). Interface abruptness
was investigated by changing the source switching sequences
and characterized by low-temperature
PL. By introducing a short period of growth interruption
during which all sources are shut off and the residual group-V
species are evacuated, followed by one monolayer of
group-III elements before growing the next layer, both the
memory effect and the As/P exchange can be minimized,
resulting in improved interfaces of InGaAs/InGaAsP QWs.

We have examined the influence of strain relaxation on the excitonic recombination and diffusion in
In0.2Ga0.8As/Al$_x$Ga$_{1-x}$As quantum-well (QW) samples designed for high-electron-mobility
transistors, using spectrally and spatially resolved polarized cathodoluminescence (CL). Six
molecular-beam epitaxial grown samples, with varying channel thicknesses ranging from 75 to 300
Å, were examined at various temperatures between 87 and 300 K. An increase in misfit dislocation
density occurred with increasing channel thicknesses and resulted in changes in the dark line defect
(DLD) density, polarization anisotropy, QW excitonic luminescence energy, and luminescence
activation energy, as observed in CL. The influence of misfit dislocations on the ambipolar diffusion
of excess carriers in a direction parallel to the dislocation line, in varying proximity to the DLDs,
was examined with a CL-based diffusion experiment. The temperature dependence of the CL
imaging was examined, enabling a study of the spatial variation of the activation energies associated
with thermal quenching of the GaAs/Al0.25Ga0.75As multiple QW (MQW) and In0.2Ga0.8As QW
luminescence. The CL intensity exhibits an Arrhenius-type dependence on temperature and is
controlled by thermally activated nonradiative recombination. The activation energies for both the
In0.2Ga0.8As QW and Al0.25Ga0.75As MQW luminescence are found to vary spatially in close
proximity to the misfit dislocations. We have utilized a new approach to obtain 2D images of the
activation energies. The influence of the strain relaxation on the polarization and energy of the
In0.2Ga0.8As QW excitonic luminescence was examined with linearly polarized CL and CL
wavelength imaging. A strain-induced modification of the luminescence energy and an increase in
the polarization anisotropy were measured near DLDs. Thus, we find that certain DLDs exhibit
significant polarization and energy variations in their optical properties, in addition to their more
familiar nonradiative behavior. © 1996 American Vacuum Society.


I.  INTRODUCTION 

The growth of high-quality strained films of InxGa1-xAs
on GaAs substrates has several diverse applications in
millimeter-wave electronic devices, such as high-electron-mobility
transistors (HEMTs), and near-infrared photonic
devices, such as lasers and light modulators. The primary
deleterious effect of strain relaxation is to introduce
misfit dislocations at the InxGa1-xAs/GaAs interfaces, which
can serve as nonradiative recombination centers and impede
the transport of carriers. The generation of an asymmetrical
density of 60° dislocations along the orthogonal
⟨110⟩ directions will also modify the biaxial symmetry of the
strain tensor, and induce an asymmetry in the transport of
carriers and a polarization anisotropy in the excitonic
luminescence. Electron-beam probes that provide a spatial
resolution of ~1 μm or better, such as spatially and spectrally
resolved cathodoluminescence (CL), can help advance
our understanding of the influence of defects on the optical
and transport properties of quantum heterostructures. A
dislocation-induced asymmetry in the ambipolar diffusion of
excess carriers in nipi-doped In0.2Ga0.8As/GaAs multiple
quantum wells (MQWs) has been previously measured using
an electron-beam-induced absorption modulation technique.
In other studies, changes in the polarization anisotropy
and energy of excitonic luminescence in thick
InxGa1-xAs/GaAs films were found to correlate with the
position of dark line defects (DLDs) from studies of linearly
polarized cathodoluminescence (LPCL) and CL wavelength
imaging (CLWI).

In this study we aim to further explore the interplay between
the strain relaxation, optical properties, and transport
properties in In0.2Ga0.8As/AlxGa1-xAs QW samples, designed
for HEMTs, using LPCL and CLWI. A four-band k·p
calculation  is  used  to  examine  the  energy  and  polarization 
variations  caused  by  the  strain  relaxation.  A  self-consistent- 
field  calculation  using  the  transfer-matrix  method  is  used  to 
determine  the  electron  and  hole  eigenstates  in  the  channel 
and  the  excitonic  transition  energies. 

The temperature dependence of luminescence in QWs and
superlattices has been investigated previously; however,
only recently have we begun to examine the effects of thermal
quenching of luminescence by misfit dislocations. Thermal
quenching of the luminescence has been interpreted in
several ways by different authors, and has been attributed
either to thermal dissociation of excitons and thermally
activated nonradiative recombination or to thermal emission
of carriers out of the QWs, resulting in a reduction of
luminescence intensity at higher temperatures. The temperature
dependence of the CL imaging was examined here
in the 87-250 K range, enabling a study of the spatial variation
of the activation energies associated with thermal
quenching of luminescence near individual dislocations. The
CL intensity exhibits an Arrhenius-type dependence on the
temperature and is controlled by thermally activated nonradiative
recombination. We have utilized a new approach to
obtain 2D images of the activation energy.

In  particular,  we  have  also  examined  the  change  in  the 
ambipolar  diffusion  length  of  carriers  parallel  to  and  in  close 
proximity  to  misfit  dislocations  using  a  CL-based  diffusion 
experiment.  The  changes  in  diffusion  length  are  measured  in 
varying  proximity  to  DLDs,  and  correlations  with  the  CL 
intensity, activation energy, polarization anisotropy, and luminescence
energy are established. A detailed study of the
CLWI, LPCL, activation energy imaging, and ambipolar diffusion
length variations is performed for two samples whose
In0.2Ga0.8As channel thicknesses are just beyond the critical
thickness, so that fundamental optical and transport
properties can be examined near individual dislocations with
a ~1 μm spatial resolution. An examination of relaxation-
and defect-induced changes in carrier diffusion and luminescence,
on a μm scale, is further essential in enabling an
evaluation of fundamental design and growth parameters
necessary for optimized HEMT performance.

II.  EXPERIMENT 

Six HEMT devices were grown by molecular-beam
epitaxy. Each sample consisted of the following layers
(in order from substrate to the surface) grown on semi-insulating
GaAs(001) substrates: a 1750-Å-thick undoped GaAs
buffer layer; a GaAs/Al0.25Ga0.75As MQW (~42-Å-thick
QWs); a 3000-Å-thick undoped GaAs barrier; an
In0.2Ga0.8As QW (channel) varying in thickness from 75 to
300 Å; a 530 Å layer of Al0.25Ga0.75As, containing a δ-doped
Si layer (~5×10^^ cm“^) within about 30 Å of the channel;
and a 50-Å-thick undoped GaAs cap. These samples were
previously analyzed with transmission electron microscopy
(TEM), triple-axis x-ray diffraction (XRD), and Hall measurements
to establish a relationship between degradation of
the device performance and the formation of ⟨110⟩-oriented
misfit dislocations. That study established that the onset
of substantial device degradation occurred when misfit dislocations
formed along both ⟨110⟩ directions. High-resolution
XRD further showed that the average In composition
x in the samples was in the range 0.203 ≤ x ≤ 0.214.

Scanning monochromatic CL, panchromatic CL, CLWI,
and LPCL were performed with a modified JEOL 840-A
scanning electron microscope. A rotatable linear polarizer
was mounted in vacuo to perform polarization
measurements. The light collected was dispersed by a 0.25
m monochromator and detected with a liquid-nitrogen-cooled
Ge p-i-n diode detector. CL spectra were obtained
with a spectral resolution of 1 nm. An electron-beam energy
of 12 keV with beam currents varying from 0.1 to 10 nA was
used to probe the samples. The temperature of the samples
was varied between 87 and 300 K for the various CL measurements.

Fig. 1. Schematic diagram of the HEMT sample structure showing the
ambipolar diffusion length experiment.

In CLWI, the wavelength λm at which the intensity of
luminescence is a maximum is mapped as a function of the
spatial (x,y) position, and a gray-scale or false-color image
representing these wavelengths is generated. A scanning
area of 128×94 μm^2 is discretized into 640×480 pixels.
In order to determine λm(x,y), a spectrum consisting of
20 wavelength points (obtained from 20 discrete monochromatic
CL images) was obtained at each (x,y) position,
thereby enabling a mapping of the In0.2Ga0.8As channel
interband transition energy. The wavelength ranges for the 150
and 185 Å In0.2Ga0.8As samples were 960 to 965 nm and
967 to 977 nm, respectively. Spectrally integrated CL images
(panchromatic in the specified wavelength range) were likewise
obtained by summing the 20 discrete monochromatic
images at each (x,y) pixel position.
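
The CLWI procedure described above can be sketched in a few lines. This is only an illustrative sketch under stated assumptions, not the authors' acquisition software: the function names, the array layout (wavelength index first), and the synthetic test data are hypothetical, and the peak wavelength is taken simply as the brightest of the sampled wavelength points at each pixel.

```python
import numpy as np

def clwi_maps(stack, wavelengths_nm):
    """Return (peak-wavelength map, panchromatic image).

    stack          : array of shape (n_wavelengths, ny, nx), one
                     monochromatic CL image per wavelength point
    wavelengths_nm : array of shape (n_wavelengths,)
    """
    stack = np.asarray(stack, dtype=float)
    wavelengths_nm = np.asarray(wavelengths_nm, dtype=float)
    # Index of the brightest wavelength point at every pixel.
    peak_index = np.argmax(stack, axis=0)
    # Wavelength of maximum intensity at each (x, y) position.
    peak_wavelength = wavelengths_nm[peak_index]
    # Spectrally integrated image: sum of the monochromatic images.
    panchromatic = stack.sum(axis=0)
    return peak_wavelength, panchromatic

def wavelength_to_energy_ev(wavelength_nm):
    """Photon energy in eV from wavelength in nm (hc = 1239.84 eV nm)."""
    return 1239.84 / np.asarray(wavelength_nm, dtype=float)

# Synthetic example (hypothetical data): 20 wavelength points between
# 960 and 965 nm, with one blue-shifted column of pixels.
wavelengths = np.linspace(960.0, 965.0, 20)
ny, nx = 4, 6
centers = np.full((ny, nx), 963.0)
centers[:, 3] = 961.0
stack = np.exp(-((wavelengths[:, None, None] - centers[None]) / 1.0) ** 2)
peak_map, pan = clwi_maps(stack, wavelengths)
```

With only 20 wavelength points, the peak position is quantized to the sampling step; a finer estimate would require interpolation, which is not attempted here.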

LPCL imaging and spectroscopy measurements were
taken with the polarizer rotated to detect the excitonic
luminescence with E ⊥ [110] or E ∥ [110], where E is the electric
field of the detected light. In order to emphasize the polarization
variations, the ratio of the LPCL images at each (x,y)
position is represented as log[I⊥(x,y)/I∥(x,y)], where I⊥
and I∥ are the pixel intensities under the E ⊥ [110] and E ∥ [110]
detection orientations, normalized to a 256-level gray scale.
The ⊥ and ∥ subscripts are defined with respect to the [110]
direction in this study.
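
A minimal sketch of the ratio-image construction is given below. It assumes a base-10 logarithm (the text does not specify the base), a small intensity floor to avoid division by zero, and a linear mapping onto 256 gray levels; the function names are hypothetical.

```python
import numpy as np

def lpcl_log_ratio(i_perp, i_par, floor=1e-12):
    """Pixel-wise log10 of I_perp / I_par for two LPCL images."""
    i_perp = np.maximum(np.asarray(i_perp, dtype=float), floor)
    i_par = np.maximum(np.asarray(i_par, dtype=float), floor)
    return np.log10(i_perp / i_par)

def to_gray256(image):
    """Linearly rescale an image onto a 256-level gray scale (0-255)."""
    image = np.asarray(image, dtype=float)
    lo, hi = image.min(), image.max()
    if hi == lo:
        # A flat image carries no contrast; map it to level 0.
        return np.zeros(image.shape, dtype=np.uint8)
    return np.round(255.0 * (image - lo) / (hi - lo)).astype(np.uint8)
```

In such an image, pixels with I⊥ < I∥ give negative values and appear darker than the unpolarized background.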

The ambipolar diffusion lengths were measured using an
approach illustrated in Fig. 1. The HEMT samples were
coated with an 800-Å-thick Ag mask over part of the sample.
The Ag film had lateral dimensions of 100×100 μm^2, and
the edges of these squares were oriented along the ⟨110⟩
directions. The e-beam energy of 12 keV was sufficiently
large that ~80% of the beam penetrated the Ag film and
generated electrons and holes in the region just below the
mask. The Ag mask prevented light from radiative recombination
in the generation region just below the mask from
being detected by the CL collection system; however, luminescence
from carriers which diffused along the [110] direction
(as shown in Fig. 1) and recombined just beyond the
edge of the mask was detected. By scanning
the e beam toward the edge of the mask while simultaneously recording the
integrated intensity of excitonic luminescence coming from
the In0.2Ga0.8As channel as a function of x, the distance from
the mask edge, we have measured the ambipolar diffusion
length, L_D, of carriers in the channel. From a simple diffusion
model, the CL intensity is proportional to exp(-x/L_D),
as first demonstrated by Zarem et al. for transport in
GaAs/AlxGa1-xAs heterostructures using a similar CL experiment.
Our diffusion length experiment was performed for
various line scans parallel to and in varying proximity to the
DLDs in the samples with 150 and 185 Å In0.2Ga0.8As channel
thickness. Owing to the formation of an orthogonal network
of dislocations for channel thicknesses greater than 185
Å, we did not attempt this diffusion experiment in samples
with thicker In0.2Ga0.8As channels. A nonexponential dependence
of the CL intensity would be expected for an e beam
crossing misfit dislocations in this situation, thus requiring
more elaborate means to extract the diffusion lengths.
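
For a single-exponential decay, the diffusion length follows from a straight-line fit of the logarithm of the CL intensity against the distance from the mask edge. The sketch below illustrates this under the simple diffusion model quoted above; the function name and the synthetic line-scan data are hypothetical, and no background subtraction or noise weighting is included.

```python
import numpy as np

def fit_diffusion_length(x_um, intensity):
    """Fit I(x) = I0 * exp(-x / L) and return (L, I0).

    x_um      : distances from the mask edge (micrometers)
    intensity : integrated CL intensity at each distance
    """
    x = np.asarray(x_um, dtype=float)
    y = np.asarray(intensity, dtype=float)
    mask = y > 0                      # log is only defined for positive counts
    slope, intercept = np.polyfit(x[mask], np.log(y[mask]), 1)
    return -1.0 / slope, np.exp(intercept)

# Synthetic line scan (hypothetical values): L = 1.2 um, I0 = 3.0
x = np.linspace(0.0, 5.0, 26)
intensity = 3.0 * np.exp(-x / 1.2)
L_fit, I0_fit = fit_diffusion_length(x, intensity)
```

A line scan that crosses misfit dislocations would not be expected to follow a single exponential, so a fit of this kind applies only to scans running parallel to the DLDs.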


III.  RESULTS  AND  DISCUSSION 

A. Evaluation of the average strain relaxation in the
HEMT samples

An anisotropy was observed in the density of ⟨110⟩-oriented
dislocations, where dislocations first form along
[11̄0] and continue to have a greater density along this direction
as the channel thickness increases. The two types of
60° dislocations are chemically inequivalent, owing to the
difference in termination of the extra half-plane which, e.g.,
in the type-I (shuffle) set has a Ga and As termination,
respectively, for the unreconstructed α ([11̄0] line direction)
and β ([110] line direction) dislocation cores. For a nonvicinal
GaAs(001) substrate (i.e., nominally no misorientation) it
is well established that for single thin InxGa1-xAs (x ≤ 0.2)
films grown on GaAs(001), α dislocations are the first to
form in relaxing the strain. This has previously been attributed
to the different levels of stress required to nucleate α
and β dislocations and the differences in α and β dislocation
propagation velocities on nonvicinal GaAs(001)
substrates. Recently, studies have shown that a misorientation
in the GaAs(001) substrate can also affect the degree
of the asymmetry in the α and β dislocation density.
Assuming a predominance of 60°-type misfit dislocations in
these samples, the average strain relaxation along a ⟨110⟩
direction is 0.02% for a linear dislocation density (LDD) of
1×10^4 cm^-1. The maximum in-plane strain of 1.41% in the
In0.2Ga0.8As channel is therefore reduced in proportion to the
orthogonal LDD. The average [110]- and [11̄0]-oriented
LDD and resulting in-plane strains, ε_110 and ε_11̄0, are shown
in Table I for the HEMT samples with various In0.2Ga0.8As
channel thicknesses. The linear dislocation densities were
obtained from a combination of CL imaging and plan-view
TEM. For CL imaging of DLDs, a maximum area of ~0.4
×0.4 mm^2 limited by the field of view of the ellipsoidal
mirror was used to determine the LDD.

Table I. Linear dislocation densities (LDD) along the [11̄0] and [110]
directions for the various In0.2Ga0.8As channel thicknesses. The calculated
ε_110 and ε_11̄0 strains for each sample are shown.

Channel thickness (Å)   [11̄0] LDD (cm^-1)   [110] LDD (cm^-1)   ε_110      ε_11̄0
75                      <1X10'               <1X10'              0.014129   0.014129
150                     3×10^3               <1×10^2             0.014068   0.014127
185                     5×10^3               4.4×10^2            0.014027   0.014120
205                     1.7×10^4             5.5×10^2            0.013785   0.014117
250                     4×10^4               1.8×10^3            0.013313   0.014092
300                     1.16×10^5            2.5×10^3            0.011769   0.014078
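
The proportionality between dislocation density and strain relaxation quoted above (0.02% relaxation per 1×10^4 cm^-1 of 60° dislocations, starting from a maximum in-plane strain of 1.41%) can be written as a short calculation. This is a sketch under those two numbers only; the function names are hypothetical, and the LDD used in the example (1.16×10^5 cm^-1 for the 300 Å sample) is the Table I entry as read here.

```python
# Strain relaxation per unit linear dislocation density, from the text:
# 0.02% (2e-4) for an LDD of 1e4 cm^-1.
RELAX_PER_LDD_CM = 0.0002 / 1.0e4
# Maximum (pseudomorphic) in-plane strain of the channel, from the text.
EPS_MAX = 0.0141

def residual_strain(ldd_per_cm, eps_max=EPS_MAX, relax_per_ldd=RELAX_PER_LDD_CM):
    """In-plane strain remaining along the direction orthogonal to the
    dislocation lines, for a linear dislocation density in cm^-1."""
    return eps_max - relax_per_ldd * ldd_per_cm

def fraction_strained(ldd_per_cm, eps_max=EPS_MAX):
    """Residual strain as a fraction of the pseudomorphic strain."""
    return residual_strain(ldd_per_cm, eps_max) / eps_max

# Example: largest dislocation density in the series (300 A channel).
eps_300 = residual_strain(1.16e5)
frac_300 = fraction_strained(1.16e5)
```

With these rounded inputs the 300 Å channel retains roughly 84% of its pseudomorphic strain along the more relaxed direction, in line with the tabulated strain values, which appear to have been computed with slightly less rounded constants.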

In order to evaluate the average strain relaxation in the
In0.2Ga0.8As HEMT samples, we have examined spatially integrated
CL spectra of these samples at room temperature, as
shown in Fig. 2. An area of 128×94 μm^2 was scanned during
the acquisition of these spectra. The energy of the
peak position is found to decrease from 1.217 to 1.179 eV as
the channel thickness is increased from 150 to 300 Å. The
first confined electron to heavy-hole (e1-hh1) transition energy
was calculated as a function of the channel thickness for
the cases of

(i) fully biaxially strained (pseudomorphic),

(ii) uniaxially strained,

(iii) fully relaxed, and

(iv) partially relaxed In0.2Ga0.8As channels.

Fig. 2. Stack plot of spatially integrated CL spectra at room temperature for
all HEMT samples.

Fig. 3. Self-consistent-field calculation of the band profile for the
In0.2Ga0.8As HEMT sample (channel thickness of 150 Å) showing the (a) e1
and hh1 wave functions and Fermi-level position and (b) plots of the theoretical
e1-hh1 luminescence energy position vs the In0.2Ga0.8As QW width
for the (i) fully strained, (ii) fully relaxed, (iii) uniaxially strained, and (iv)
partially relaxed strain (squares) conditions using the data in Table I. The
experimental CL peak positions from Fig. 2 are shown with dots in (b).
Calculations of e1-hh1 vs the QW width are also shown for a fully strained
In0.21Ga0.79As QW, to illustrate the effect of an In composition variation.

The purpose of the top Al0.25Ga0.75As barrier is to increase
the confinement energy for electrons, thereby effectively
increasing the electron density in the channel. The resulting
asymmetrical QW structure (with a 150 Å width) subject to
an electric field created by the ionized δ-doped Si layer is
shown in Fig. 3(a).

The electron and hole envelope wave-function calculations
were performed with a transfer-matrix method (TMM)
technique using a self-consistent-field approximation that
includes the Hartree term. All occupied confined electron
states, as determined by the Fermi-level position, were
used in calculating the field as a function of position in the
channel. A similar method was employed to calculate the
field self-consistently in a nipi-doped In0.2Ga0.8As MQW
structure, as discussed in Ref. 10. The wave functions and
potential were calculated self-consistently using Airy functions
in the TMM by discretizing the potential into discrete
linear field regions with ~5 Å widths. The use of narrower
widths only negligibly affected the calculated electron and
hole eigenstates. The conduction to valence-band offset ratios
(ΔEc/ΔEv) at both interfaces of the InxGa1-xAs QW
were taken as linear interpolations between 70/30 and 60/40
for GaAs/InxGa1-xAs and GaAs/AlyGa1-yAs interfaces,
respectively. The results of a calculation showing the band
diagram, ground-state electron and hole wave functions, and
Fermi level for the pseudomorphic case are shown in Fig.
3(a). The ground-state electron and heavy-hole wave functions
are shown superimposed on the calculated band profile
in Fig. 3(a). The maximum electric field F_m in the
In0.2Ga0.8As channel (for a 150 Å width) and the e1-hh1
transition energy versus the electron concentration in the
channel at room temperature are shown in Fig. 4. The
maximum field in the channel occurs at the
Al0.25Ga0.75As/In0.2Ga0.8As interface, resulting in Stark shifts
of the confined electron and hole states. The field-induced
Stark shift decreases for a reduced electron concentration in
the channel and Si δ-doping concentration, resulting in a
larger e1-hh1 transition energy as the electron concentration
decreases (as shown in Fig. 4).

Fig. 4. Self-consistent-field calculation of the maximum electric field F_m in
the In0.2Ga0.8As channel and change in the e1-hh1 transition energy vs
electron concentration at room temperature.
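
The sign of the field-induced (Stark) shift can be illustrated with a much simpler model than the self-consistent Airy-function TMM used in this work. The sketch below swaps in a plain finite-difference solution of the one-band Schrödinger equation for an infinitely deep well in a uniform field; the effective mass, well model, grid size, and field value are all illustrative assumptions and are not parameters taken from the calculation described above.

```python
import numpy as np

HBAR2_OVER_2M0 = 3.80998  # hbar^2 / (2 m0) in eV * Angstrom^2

def ground_state_energy(width_A=150.0, field_eV_per_A=0.0, m_eff=0.067, n=300):
    """Lowest eigenvalue (eV) of an infinite well of the given width with a
    linear potential e*F*(z - width/2), using a three-point finite-difference
    Hamiltonian on n interior grid points."""
    h = width_A / (n + 1)                       # grid spacing (Angstrom)
    z = h * np.arange(1, n + 1)                 # interior grid points
    t = HBAR2_OVER_2M0 / (m_eff * h ** 2)       # kinetic energy scale (eV)
    potential = field_eV_per_A * (z - width_A / 2.0)
    hamiltonian = (np.diag(2.0 * t + potential)
                   - t * np.diag(np.ones(n - 1), 1)
                   - t * np.diag(np.ones(n - 1), -1))
    return np.linalg.eigvalsh(hamiltonian)[0]

# Zero-field level compared with the analytic infinite-well result.
e_flat = ground_state_energy()
e_analytic = HBAR2_OVER_2M0 / 0.067 * (np.pi / 150.0) ** 2

# A field of 1e-3 V/Angstrom (100 kV/cm) lowers the confined level.
e_field = ground_state_energy(field_eV_per_A=1.0e-3)
stark_shift = e_field - e_flat
```

Because the field lowers the confined electron level (and, by the same argument, raises the hole level toward it), a reduction of the field increases the transition energy, which is the trend described for Fig. 4.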

The strain-induced changes in the In0.2Ga0.8As band edges
were calculated using the 4×4 Luttinger-Kohn and Pikus-Bir
Hamiltonian for a general in-plane strain, ε_110 ≠ ε_11̄0,
referred to the ⟨110⟩ dislocation directions. The strain-induced
energy change ΔE of the band gap involving the
J = 3/2 valence bands is given by the following solution of
the orbital-strain Hamiltonian for k = 0:

\Delta E = a(\varepsilon_{xx} + \varepsilon_{yy} + \varepsilon_{zz}) \pm \left[ b^{2}\left( \varepsilon_{zz} - \frac{\varepsilon_{xx} + \varepsilon_{yy}}{2} \right)^{2} + \frac{3}{4} b^{2} (\varepsilon_{xx} - \varepsilon_{yy})^{2} + d^{2} \varepsilon_{xy}^{2} \right]^{1/2},   (1)

where ε_xx = ε_yy = (ε_110 + ε_11̄0)/2, ε_xy = (ε_11̄0 − ε_110)/2, and
ε_zz = −2ε_xx C12/C11; ε_110 and ε_11̄0 are the strains along the [110]
and [11̄0] directions, respectively. The constant a is the hydrostatic
deformation potential; b and d are uniaxial deformation
potentials associated with strains of tetragonal and
rhombohedral symmetries, respectively, which remove the
degeneracy of the bands as indicated by the ± sign; C11 and
C12 are elastic constants; these constants for InxGa1-xAs are
found by interpolating between values for GaAs and
InAs. The low symmetry of an ε_110 ≠ ε_11̄0 strain in the
In0.2Ga0.8As channel required the use of the Luttinger-Kohn
Hamiltonian to determine the In0.2Ga0.8As effective mass of
holes along the [001] growth direction. Standard hole masses
and band gaps are used for AlxGa1-xAs. The Luttinger parameters
are taken as a linear interpolation, again, between the values
for InAs and GaAs. The effective masses and strain-modified
barrier heights were then used in the TMM
calculation to calculate the e1-hh1 transition energies for

(i) the pseudomorphic case (ε_110 = ε_11̄0 = 0.0141),

(ii) the uniaxial strain condition (ε_110 = 0 and ε_11̄0
= 0.0141),

(iii) the fully relaxed case (ε_110 = ε_11̄0 = 0), and

(iv) the partially relaxed case with general strain values,
ε_110 ≠ ε_11̄0, obtained from the measured dislocation
densities shown in Table I for each channel thickness.

The results are shown in Fig. 3(b) as solid lines, medium-dashed
lines, long-dashed lines, and squares for cases (i)-(iv),
respectively. The experimental e1-hh1 peak energies
(dots) are found to lie closest to the theoretical curve for the
pseudomorphic case (ε_110 = ε_11̄0 = 0.0141).
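
The four strain cases can be compared numerically with a short routine for the strain-induced band-gap change of the J = 3/2 bands at k = 0. The sketch assumes the standard Pikus-Bir result, a hydrostatic term a(ε_xx + ε_yy + ε_zz) plus or minus the square root of (b^2/2)[(ε_xx − ε_yy)^2 + (ε_yy − ε_zz)^2 + (ε_zz − ε_xx)^2] + d^2 ε_xy^2, together with the strain definitions given in the text. The function name and the numerical deformation potentials and elastic constants below are hypothetical placeholders, not the interpolated values used in this work, and the sign convention of the strains is left to the caller.

```python
import math

def band_gap_shift(eps_110, eps_1m10, a, b, d, c11, c12):
    """Return (dE_plus, dE_minus), the two strain-induced band-gap changes.

    eps_110, eps_1m10 : in-plane strains along [110] and [1-10]
    a, b, d           : deformation potentials (eV)
    c11, c12          : elastic constants (any common unit)
    """
    eps_xx = eps_yy = 0.5 * (eps_110 + eps_1m10)
    eps_xy = 0.5 * (eps_1m10 - eps_110)
    eps_zz = -2.0 * eps_xx * c12 / c11
    hydrostatic = a * (eps_xx + eps_yy + eps_zz)
    splitting = math.sqrt(
        b ** 2 * (eps_zz - 0.5 * (eps_xx + eps_yy)) ** 2
        # The next term vanishes here because eps_xx == eps_yy.
        + 0.75 * b ** 2 * (eps_xx - eps_yy) ** 2
        + d ** 2 * eps_xy ** 2
    )
    return hydrostatic + splitting, hydrostatic - splitting

# Placeholder material parameters (illustrative only).
PARAMS = dict(a=-8.0, b=-1.8, d=-4.5, c11=11.3, c12=5.4)

pseudomorphic = band_gap_shift(0.0141, 0.0141, **PARAMS)   # case (i)
uniaxial = band_gap_shift(0.0, 0.0141, **PARAMS)           # case (ii)
relaxed = band_gap_shift(0.0, 0.0, **PARAMS)               # case (iii)
```

For equal strains along the two ⟨110⟩ directions the shear component ε_xy vanishes and the d term drops out, so the splitting is purely tetragonal; any asymmetry between the two directions brings in the rhombohedral term.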

The deviation between the experimental peak positions
and pseudomorphic calculation increases as the channel
thickness increases, consistent with an increased strain relaxation
of the In0.2Ga0.8As channel. The use of the measured
strains of Table I resulted in a better agreement between the
experimental and calculated e1-hh1 transition energies, as
shown in Fig. 3(b). The CL imaging and plan-view TEM
revealed an increase in the dislocation density with increasing
channel thickness as shown in Table I. It is apparent that
even for the case of the largest channel width of 300 Å, the
largest dislocation density yields a strain of ε_110 ≈ 0.0118, and
the film is still ~84% strained in the [110] direction. Therefore,
the TMM calculations, when incorporating the observed
strain relaxation, explain the salient features of the CL
peak energies for varying channel thicknesses. Further, the
deviation between the experiment and calculations for the
partially relaxed case [dots and squares, respectively, in Fig.
3(b)] is likely due to a variation in the In composition x. A
calculation of the e1-hh1 energy for a fully strained
In0.21Ga0.79As QW (short-dashed lines) is shown in Fig. 3(b),
indicating that a ~10 meV decrease in the calculated e1-hh1
energy will occur for cases (i) and (iv) above if x = 0.21 is
used instead of x = 0.20 in these calculations.

Fig. 5. (a) CL intensity and (b) CLWI images of the same regions in the
In0.2Ga0.8As HEMT sample with 150 Å channel thickness.


B.  Local  CL  energy  and  polarization  variations  in  close 
proximity  to  dislocations 

Recently, we have established that there are significant
energy and polarization variations in the optical transitions in
close proximity to misfit dislocations, in addition to the nonradiative
behavior of DLDs. The previous systems examined
were partially relaxed InxGa1-xAs/GaAs films which
had linear dislocation densities greater than ~1×10^4 cm^-1,
or one dislocation per μm. Since the carrier diffusion length
is ~1 μm, defect densities greater than 1×10^4 cm^-1 result in
DLDs that are composed of bunches of dislocations, which
cannot be resolved individually with conventional CL imaging.
Bunches of dislocations that are formed with very narrow
dislocation spacings can result in nearly complete strain
relaxation along one ⟨110⟩ direction with a partial strain remaining
in the orthogonal direction. This can further result in
a quasi-uniaxial stress leading to a large polarization
anisotropy and a reduction in the excitonic transition energy,
as previously reported.

We have pursued a similar analysis here for the case of
dislocations separated by a length on average greater than the
carrier diffusion length, so as to examine the effects of isolated
dislocations. Figures 5 and 6 show (a) CL intensity and
(b) CLWI images of samples with 150 and 185 Å channel
thicknesses, respectively. The false-color scale represents the
wavelength position of the peak CL intensity. A particularly
striking feature is observed in the CLWI images. The
wavelength of emission is found to decrease near the DLD
position, showing a blue shift correlated with a defect-induced
CL intensity reduction. Figures 7 and 8 each show a
line scan analysis for an arbitrary line scan taken perpendicular
to the [11̄0] dislocation direction and illustrate a one-to-one
correlation between the blue shift and the DLD position.
A maximum energy increase of ~5 and ~10 meV is seen at the
DLD centers for the 150 and 185 Å samples, respectively, in
Figs. 5-8. This behavior appears contradictory to the previously
observed red shift measured near DLD positions in
InxGa1-xAs samples exhibiting a greater strain relaxation
and greater dislocation-induced reductions in the
strain. This apparent discrepancy is, however, explained
by the influence of the dislocations and associated point defects
on the electron density in the channel. It is our hypothesis
that these defects create localized deep levels and traps
that reduce the effective electron density in the channel near
DLDs, thereby simultaneously reducing the field in the channel.
The reduction in both the electron density and field will
concomitantly reduce the Stark shift, thereby resulting in a
local increase in the e1-hh1 emission energy. A similar behavior
was observed for defect-induced reductions in the
electric field and Stark shifts for nipi-doped
In0.2Ga0.8As/GaAs MQWs. The larger blue shift in the e1-hh1
transition energies for the 185 Å sample (Figs. 6 and 8)
is evidently due to the enhanced dislocation density. Dislocation
bunching and an enhanced point defect density are
more prevalent for larger In0.2Ga0.8As channel thicknesses,
which should result in a greater local depletion of the electron
density in the channel.

Fig. 6. (a) CL intensity and (b) CLWI images of the same regions in the
In0.2Ga0.8As HEMT sample with 185 Å channel thickness.

Fig. 7. Line scan analysis for an arbitrary [110]-oriented line for the images
of Fig. 5 (150 Å In0.2Ga0.8As channel width) showing the CL intensity and
CLWI position correlations. Dashed and solid vertical lines are used to show
correlations between a reduced CL intensity (DLDs) and a blue shift in the
e1-hh1 transition energy.

The local strain relaxation appears to minimally affect the
e1-hh1 transition energy. As shown in Fig. 3, the curve for a
uniaxial strain condition is ~50 meV lower than that for the
pseudomorphic case. A defect-induced reduction in the quantum-confined
Stark effect could also cause a blue shift of
≲50 meV in the e1-hh1 transition energy, as seen from Fig.
4. That is, strain relaxation and defect-induced reductions in
the field cause the e1-hh1 energy to shift in opposite directions,
thereby possibly masking the effects caused by strain
relaxation. However, from the CL polarization results discussed
below, we show that the average strain tensor within
the ~1 μm carrier diffusion length near DLDs is still well
described by a biaxial strain with ε_110 ≈ ε_11̄0, for the 150 and
185 Å samples analyzed in Figs. 5-8.

Fig. 8. Line scan analysis for an arbitrary [110]-oriented line for the images
of Fig. 6 (185 Å In0.2Ga0.8As channel width) showing the CL intensity and
CLWI position correlations. Dashed and solid vertical lines are used to show
correlations between a reduced CL intensity (DLDs) and a blue shift in the
e1-hh1 transition energy.

Linearly  polarized  monochromatic  CL  images  were  taken 
with  the  polarizer  rotated  to  detect  emission  of  light  with 
E_L[110]  and  El|[110]  detection  orientations  at  wavelengths 
of  962  and  972  nm  for  the  150  and  185  A  samples,  respec¬ 
tively.  In  order  to  emphasize  the  polarization  variations,  the 
ratio  of  these  images  is  displayed  in  Figs.  9  and  10.  The 
pixels  in  the  ratio  image  at  a  (x,y)  position  are  presented  as 
log[/j^(x,y)//||(x,y)],  where  and  /||  are  the  pixel  intensi¬ 
ties  under  ElL[110]  and  E||[110]  detection  orientations.  The 
bright  and  dark  bands  present  in  the  LPCL  ratio  images  ex¬ 
hibit  a  local  polarization  anisotropy,  which  indicates  the 
presence  of  /zm-scale  variations  in  These  bands  corre¬ 
late  with  the  peaks  and  dips  in  the  CL  intensity  image,  as 
shown  in  the  line  scan  analysis  of  Fig.  11  for  the  185  A 
sample.  The  maximum  polarization  anisotropies  (minimum 
ratios)  are  /^//[|^0.95  and  /^//||'^0.85  for  the  150  and  185 
A  samples,  respectively.  The  spatially  averaged  /^//||  ratios 
for  the  LPCL  images  of  Figs.  9(b)  and  10(b)  are  0.98  and 
0.95,  respectively.  From  the  four-band  k-p  calculation  dis¬ 
cussed  above,  we  have  calculated  /j^//||  using  the  dipole  ap¬ 
proximation  in  Fermi’s  golden  rule,  i.e., 
where  is  the  electron  wave  func¬ 
tion,  is  the  wave  function  of  the  uppermost  hole  state,  and 
p  is  the  linear  momentum  operator.  Both  and  Uf^  include 
the  envelope  wave  functions  and  zone-center  Bloch  func¬ 
tions  for  the  5-  and  ;?-type  conduction-  and  valence-band 
states,  respectively.  The  polarization  ratio  was  calculated  for 
a  fixed  6ifo  =  0.0141  and  variable  to  simulate  the  effect 
of  a  transition  from  uniaxial  to  biaxial  (pseudomorphic) 
strain.  The  results  are  shown  in  Fig.  12  for  the  150  and  185 
A  samples,  where  we  also  show  the  calculated  el-hhl  tran¬ 
sition  energy  as  a  function  of  Eho-  From  the  measured  aver¬ 
age  polarization  anisotropy  ratios  above  and  the  calculation 
in  Fig.  12,  the  estimated  values  for  are  0.0138  and 
0.0132  (i.e.,  98%  and  94%  strained)  for  the  150  and  185  A 
samples,  respectively.  This  is  in  reasonable  agreement  with 
the  low  strain  relaxation  and  values  observed  in  Table  I 
from  the  linear  dislocation  densities  for  these  samples.  These 
results  contrast  with  the  situation  previously  studied  for 
highly  relaxed  In^Gai-^^^As  samples  with  dislocation  densi¬ 
ties  greater  than  —1X10^  cm“\  where  dislocation  bunching 
led to a quasiuniaxial strain and a larger polarization
anisotropy. Thus, for individual dislocations studied
here, within the minority carrier diffusion length of ~1 μm,
the  presence  of  dislocations  with  densities  less  than  ~1  X 10"^ 
leads to a measured luminescence behavior still well described
by a biaxial strain, $\epsilon_{110} \approx \epsilon_{1\bar{1}0}$.
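The quoted "98% and 94% strained" figures can be checked with a short calculation. The sketch below assumes that "percent strained" means the ratio of the estimated strain along [110] to the fixed pseudomorphic strain of 0.0141; that reading is an assumption made here because it reproduces the quoted percentages, and the function and variable names are illustrative only.

```python
# Strain values taken from the text (dimensionless).
EPS_FIXED = 0.0141                      # fixed strain along [1-10]
EPS_110 = {"150 A": 0.0138, "185 A": 0.0132}  # estimated strain along [110]


def percent_strained(eps_110, eps_fixed=EPS_FIXED):
    """Percentage of the pseudomorphic strain retained along [110].

    Assumption: 'percent strained' is the simple ratio eps_110 / eps_fixed.
    """
    return 100.0 * eps_110 / eps_fixed


for sample, eps in EPS_110.items():
    print(f"{sample} sample: {percent_strained(eps):.1f}% strained")
```

Rounded to the nearest percent, both values agree with the percentages given in the text.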
C. The CL temperature dependence and spatial
variations in the activation energy

The integrated CL intensity $I_I$ of the e1-hh1 transition in
the In0.2Ga0.8As channel and the excitonic luminescence of
the GaAs/Al0.25Ga0.75As MQW were measured as a function
of temperature for selected local regions in close proximity
to and away from dislocations, labeled D and B, respectively,
to denote dark and bright regions in the In0.2Ga0.8As CL imaging.
The results are shown in Fig. 13 for both the 150 and
185 Å samples in a log $I_I$ vs 1000/T plot. The CL intensities
decrease as the temperature increases from 87 to 250 K. This
reduction corresponds to the increase of thermally activated
nonradiative recombination, which causes an Arrhenius dependence
in the high-temperature range. In previous reports
of the temperature-dependent luminescence efficiency,
the linear region of the Arrhenius behavior in the high-
temperature range was characterized by one or two thermally
activated nonradiative recombination processes. We use the
following fitting equation for the temperature dependence of
the In0.2Ga0.8As QW CL intensity:

$I_I = R\eta$,  (2)

where $\eta = 1/(1 + R_{nr}/R_r)$ is the quantum efficiency, $R$ is
a coefficient which depends on the generation rate of
electron-hole pairs and the relative weights of monomolecular
and bimolecular recombination, $R_r$ is the radiative recombination
rate which is assumed to be temperature independent,
and $R_{nr}$ represents the rate for nonradiative
recombination and is assumed to have the following temperature
dependence:

$R_{nr} = R_{nr1} + R_{nr0} \exp(-E_a/kT)$,  (3)

where $R_{nr0}$ and $E_a$ are the temperature-independent prefactor
and the thermal activation energy, respectively, and $R_{nr1}$
is  the  rate  for  nonthermally  activated  nonradiative  recombi¬ 
nation  (i.e.,  independent  of  temperature).  The  temperature- 
dependent  exponential  term  is  due  to  the  enhancement  of  the 
capture  cross  section  of  nonradiative  recombination  centers 
as  seen  by  carriers  as  the  temperature  increases.  The  model 
of  Eq.  (3)  is  motivated  by  the  existence  of  DLDs  in  the  CL 
imaging  at  the  lowest  temperatures,  indicating  that  there  are 
nonradiative  channels  which  are  independent  of  any  thermal 
activation. Therefore,

$I_I = \dfrac{\theta}{1 + \beta \exp(-E_a/kT)}$,  (4)

where $\theta = R/(1 + R_{nr1}/R_r)$ and $\beta = R_{nr0}/(R_r + R_{nr1})$ are independent
of temperature and depend on the density of nonthermally
and thermally activated nonradiative recombination
centers. At the low-temperature limit, $\eta$ saturates since
thermally activated nonradiative recombination vanishes and
$I_I \to \theta$. The spatial variation in $\theta$ therefore accounts for the
difference in $I_I$, the e1-hh1 emission intensity, when
$T \lesssim 100$ K, as shown in Figs. 9, 10, and 13. The solid lines in
Fig. 13 show a fit of Eq. (4) to the experimental CL data for
the e1-hh1 transition at the corresponding B (bright) and D
[Figs. 9 and 10 panel labels: (a) CL (In0.2Ga0.8As), T = 87 K; (b) LPCL log($I_\perp/I_\parallel$); (c) $E_{a1}$ (In0.2Ga0.8As); (d) CL (GaAs/AlGaAs MQW), T = 87 K, λ = 776 nm; (e) CL (GaAs/AlGaAs MQW), T = 180 K, λ = 794 nm; (f) $E_{a2}$ (GaAs/AlGaAs MQW).]

Fig. 9. CL imaging of the 150 Å In0.2Ga0.8As HEMT sample showing (a) spectrally integrated CL intensity images for the e1-hh1 emission, (b) LPCL $I_\perp/I_\parallel$
ratio images, (c) activation energy $E_{a1}$ images for the In0.2Ga0.8As QW luminescence, monochromatic CL images for GaAs/Al0.25Ga0.75As MQW at (d) T = 87
K and (e) 180 K, and (f) activation energy $E_{a2}$ images for the GaAs/Al0.25Ga0.75As MQW emission.

Fig. 10. CL imaging of the 185 Å In0.2Ga0.8As HEMT sample showing (a) spectrally integrated CL intensity images for the e1-hh1 emission, (b) LPCL $I_\perp/I_\parallel$
ratio images, (c) activation energy $E_{a1}$ images for the In0.2Ga0.8As QW luminescence, monochromatic CL images for GaAs/Al0.25Ga0.75As MQW at (d) T = 87
K and (e) 180 K, and (f) activation energy $E_{a2}$ images for the GaAs/Al0.25Ga0.75As MQW emission.

Fig. 11. Line scan analysis (intensity vs distance in μm) for an arbitrary [110]-oriented line for the images
of Fig. 10 (185 Å In0.2Ga0.8As channel width) showing (a) the CL
intensity for the e1-hh1 emission, GaAs/Al0.25Ga0.75As MQW emission intensity
at (b) T = 180 K and (c) 87 K, (d) LPCL $I_\perp/I_\parallel$ ratio, and activation
energies (e) $E_{a2}$ and (f) $E_{a1}$.

(dark) regions. The activation energies $E_{a1}$ were determined
for each pixel position by fitting all 640×480 pixel intensity
values for the monochromatic CL images of the e1-hh1 transition
taken at 12 different temperatures. The results of $E_{a1}$
for each (x,y) position are shown in Figs. 9(c) and 10(c), for


Fig. 12. Calculation of the e1-hh1 emission ratio $I_\perp/I_\parallel$ and In0.2Ga0.8As
QW transition energy vs $\epsilon_{110}$ for a fixed $\epsilon_{1\bar{1}0} = 0.0141$ at room temperature.

Fig. 13. CL intensity vs 1000/T (upper axis: temperature in K) for the same local bright (B) and dark
regions (D) in the In0.2Ga0.8As e1-hh1 and GaAs/Al0.25Ga0.75As MQW
monochromatic CL images. The solid lines running through the curves are a
fit of Eq. (4) to the data to determine the activation energies $E_{a1}$ and $E_{a2}$ for
the In0.2Ga0.8As QW and GaAs/Al0.25Ga0.75As MQW luminescence, respectively.


the 150 and 185 Å samples, respectively. A line scan analysis
is also shown in Fig. 11(f) for the 185 Å sample.
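The step from Eqs. (2) and (3) to the single-activation-energy form of Eq. (4) can be checked numerically. The sketch below evaluates the intensity both ways and confirms they coincide; the prefactor and the coefficient of the exponential are called `theta` and `beta` here, and every rate value used is a hypothetical illustration, not a fitted value from this study.

```python
import math

K_B = 8.617333e-5  # Boltzmann constant in eV/K


def intensity_from_rates(T, R, R_r, R_nr1, R_nr0, E_a):
    """Eqs. (2)-(3): I = R * eta, eta = 1 / (1 + R_nr / R_r),
    with R_nr = R_nr1 + R_nr0 * exp(-E_a / kT)."""
    R_nr = R_nr1 + R_nr0 * math.exp(-E_a / (K_B * T))
    return R / (1.0 + R_nr / R_r)


def reduced_parameters(R, R_r, R_nr1, R_nr0):
    """Temperature-independent parameters of the reduced form."""
    theta = R / (1.0 + R_nr1 / R_r)
    beta = R_nr0 / (R_r + R_nr1)
    return theta, beta


def intensity_quenching(T, theta, beta, E_a):
    """Eq. (4): I = theta / (1 + beta * exp(-E_a / kT))."""
    return theta / (1.0 + beta * math.exp(-E_a / (K_B * T)))


# Hypothetical rates (arbitrary units) and activation energy (eV).
R, R_r, R_nr1, R_nr0, E_a = 1.0, 1.0, 0.5, 1.0e4, 0.10
theta, beta = reduced_parameters(R, R_r, R_nr1, R_nr0)

for T in (87.0, 120.0, 180.0, 250.0):
    full = intensity_from_rates(T, R, R_r, R_nr1, R_nr0, E_a)
    reduced = intensity_quenching(T, theta, beta, E_a)
    print(f"T = {T:5.1f} K  full = {full:.6e}  reduced = {reduced:.6e}")
```

At low temperature the exponential term vanishes and the intensity approaches `theta`, which is why spatial differences in the low-temperature intensity map onto differences in that prefactor.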

For the luminescence originating from the
GaAs/Al0.25Ga0.75As MQW, $I_G$, a low-temperature saturation
of its intensity was not reached for the 87 K minimum in this
study (as shown in Fig. 13). We have also fit the CL images
of $I_G$, taken at the 12 different temperatures, with the model
of Eqs. (2)-(4), obtaining the activation energy, $E_{a2}$, of the
GaAs/Al0.25Ga0.75As MQW excitonic luminescence. The results
of two fits are shown by the solid lines in the log plot of
Fig. 13 for the same regions, B and D, as indicated for the
e1-hh1 In0.2Ga0.8As luminescence. We further observe a
change in the relative intensity of the GaAs/Al0.25Ga0.75As
MQW emission near features which correspond to DLDs in
the CL imaging of the e1-hh1 In0.2Ga0.8As QW emission. At
180 K, for both the 150 and 185 Å samples, the imaging
of the GaAs/Al0.25Ga0.75As MQW emission [Figs. 9(e) and
10(e)] results in DLDs which correlate, one to one, with those
of the e1-hh1 imaging, as also shown in the line scan data of
Fig. 11. As the temperature is lowered, a reversal in the relative
intensity of the GaAs/Al0.25Ga0.75As MQW emission occurs
near defects, resulting in bright lines in the CL images,
as shown in Figs. 9(d) and 10(d). This striking contrast reversal
is also illustrated in the line scan analysis of Figs.
11(b) and 11(c). That is, high-temperature DLDs in the
GaAs/Al0.25Ga0.75As MQW emission become bright line defects
(BLDs) at lower temperatures. The activation energy
for this emission, $E_{a2}$, increases near these defects, as indicated
by the imaging and line scan analysis. This is in contrast
to the decrease in $E_{a1}$ near DLDs.

This  behavior  reflects  salient  differences  in  the  thermal 


JVST B - Microelectronics and Nanometer Structures

2932    Rich et al.: Effect of interface defect formation


activation of carriers which are in close proximity, but on
opposite sides, of the In0.2Ga0.8As/GaAs interface. The $E_{a1}$
energy represents the activation energy for thermal reemission
of carriers out of the In0.2Ga0.8As QW, as has been observed
for similar QW and MQW systems. Once out of
the In0.2Ga0.8As QW, the carriers can be recaptured by the
QW, recombine in the GaAs barrier, or diffuse to the
GaAs/Al0.25Ga0.75As MQW where recombination can occur.
An extremely weak GaAs near-band-edge luminescence was
detected relative to emissions from the In0.2Ga0.8As QW and
GaAs/Al0.25Ga0.75As MQW, indicating the very low impurity
concentration and high quality of the GaAs barrier layer.
Carriers which are generated in the GaAs barrier will then
primarily diffuse to the underlying GaAs/Al0.25Ga0.75As
MQW or In0.2Ga0.8As QW, where a higher carrier capture
rate in these layers is expected, owing to quantum capture.
The 3000 Å thickness of the GaAs barrier is less than the
μm-scale ambipolar diffusion length for high-quality and low-
impurity GaAs, thereby enabling the GaAs barrier to serve as
a conduit for efficient transport of electrons and holes to and
from the GaAs/Al0.25Ga0.75As MQW and In0.2Ga0.8As QW.

The defects created by the misfit dislocations further introduce
other recombination channels for carriers in the
In0.2Ga0.8As QW. These defects enhance the probability for
thermally assisted nonradiative recombination for carriers already
residing in the In0.2Ga0.8As QW, thereby resulting in a
decrease in $E_{a1}$ near DLDs. The analysis in Figs. 9(c), 10(c),
and 11(f) shows that there is a maximum decrease in $E_{a1}$ of
~60 and ~80 meV near DLDs for the 150 and 185 Å
samples, respectively. The greater reduction in $E_{a1}$ for the 185 Å sample
reflects the enhanced defect density.

For carriers recombining in the GaAs/Al0.25Ga0.75As
MQW, the misfit dislocations, likewise, introduce additional
thermally assisted nonradiative channels which are accessible
at higher temperatures. These channels may compete
with carrier capture by the In0.2Ga0.8As QW and subsequent
radiative recombination. However, at lower temperatures,
carrier capture by these defects is substantially reduced and
simultaneously results in a reduced transfer of carriers from
the GaAs/Al0.25Ga0.75As MQW to the In0.2Ga0.8As QW at
defect sites, thereby resulting in a local increase of $E_{a2}$.
These defect sites also appear to act as a barrier to transport
into the In0.2Ga0.8As QW at low temperatures, thereby enhancing
the relative GaAs/Al0.25Ga0.75As MQW emission
near dislocations and resulting in BLDs correlated with the
In0.2Ga0.8As DLDs. The spatially averaged values of $E_{a2}$
[from Figs. 9(f) and 10(f)] are 32.3 and 35.3 meV for the 150
and 185 Å samples, respectively. The larger value in the latter sample
again likely reflects the larger relaxation-induced defect density
in that sample.

D.  Spatial  variations  in  the  carrier  diffusion  length 

The diffusion length experiment was performed by scanning
the e beam along the $[1\bar{1}0]$ direction (parallel to the
DLDs) and recording the intensity of the e1-hh1 luminescence
as a function of the distance x from the mask edge (as
illustrated in Fig. 1). Typical scans are shown for the 150 and


Fig. 14. CL intensity vs beam position x for different line scans (labeled
a-d) parallel to the $[1\bar{1}0]$ dislocation line direction near and between DLDs
for both the 150 and 185 Å In0.2Ga0.8As HEMT samples.


185 Å In0.2Ga0.8As samples at four different regions in each
sample, labeled a-d, in Fig. 14. These regions are further
identified in the CL line scan analysis of Figs. 15 and 16,
which show the In0.2Ga0.8As QW luminescence intensity versus
distance (bottom scan) along [110] (i.e., perpendicular to
the dislocation line direction). The dips in the CL intensity
scan, again, represent the DLDs. For both samples, regions a
and b are away from DLDs, while c and d are near the center
of DLDs. From Fig. 14 it is apparent that a reduction in CL
intensity at these regions is also accompanied by a greater
negative slope, resulting in a smaller ambipolar diffusion
length. This correlation is illustrated in Figs. 15 and 16 for
several diffusion length measurements performed in varying
proximity to DLDs at different temperatures. For both
samples, regions near and far from the DLDs correspond to
regions of smaller and larger diffusion lengths, respectively.
A wide variation in $L_D$ is observed from about 0.5 to 3 μm in
Fig. 17, showing that defect regions can substantially influence
the transport. No clear systematic variations in $L_D$ with
temperature are observed.
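The link between a steeper negative slope and a shorter diffusion length can be made concrete with a short fit. The sketch below assumes the CL intensity decays as a single exponential, I(x) ∝ exp(-x/L), with beam position x; that decay model is a common simplification adopted here for illustration and is not stated explicitly in the text. The data are synthetic and the function name is hypothetical.

```python
import math


def diffusion_length(xs, intensities):
    """Estimate L from a least-squares line through ln(I) vs x.

    Assumes I(x) is proportional to exp(-x / L), so the slope is -1/L.
    The result has the same length unit as xs.
    """
    ys = [math.log(i) for i in intensities]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return -1.0 / slope


# Synthetic scans (x in micrometres): one with a short decay length,
# standing in for a region near a DLD, and one with a long decay length.
xs = [0.5 * i for i in range(8)]
near_dld = [math.exp(-x / 0.5) for x in xs]
between_dlds = [math.exp(-x / 3.0) for x in xs]

print("near DLD:     L =", round(diffusion_length(xs, near_dld), 3), "um")
print("between DLDs: L =", round(diffusion_length(xs, between_dlds), 3), "um")
```

On a semilog plot the scan with the steeper slope returns the smaller length, which is the trend described for regions c and d relative to a and b.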

From  previous  Hall  measurements  of  these  samples,  the 
Ino.2Gao  gAs  channel  contains  a  large  electron  concentration 
of  —3x10^^  cm~^  at  77  The  Lq  measured  here  there¬ 
fore  reflects  the  diffusion  of  the  minority  carriers,  i.e.,  holes. 
Fig. 15. CL intensity and diffusion length plot vs distance along [110]
(perpendicular to the dislocation direction) for various temperatures for the
150 Å sample. The dashed and solid vertical lines show the spatial correlations
between the DLD positions and a reduced diffusion length.

This hole diffusion length will involve an interplay between
local changes in mobility, $\mu_p$, and minority-carrier recombination
lifetime, $\tau_p$, as $L_D = \sqrt{D_p \tau_p}$, where $\mu_p$ and $D_p$ are
related by the Einstein relation $\mu_p = eD_p/kT$. The presence
of defects will evidently reduce both $\mu_p$ and $\tau_p$ as a result of
enhanced scattering near defects and introduction of nonradiative
recombination channels. As discussed in Sec. III A,
the enhanced defect density near dislocations is expected to
locally reduce the electric field. The reduction in the field is
also expected to result in a decrease in $\tau_p$ since the electron
and hole envelope wave-function overlap will increase
with a decrease in the field [see Fig. 3(a)]. However, without
a quantitative measurement of the lifetime and its variations
we refrain from attempting to deconvolve $L_D$ into separate
$\mu_p$ and $\tau_p$ terms here.
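The interplay between mobility and lifetime can be illustrated by combining the Einstein relation with the standard expression for a diffusion length, L = sqrt(D·τ). The sketch below is illustrative only: the mobility and lifetime values are hypothetical placeholders, not measurements from this work, and the function names are invented for the example.

```python
import math

K_B_OVER_E = 8.617333e-5  # k/e in volts per kelvin


def diffusivity(mobility_cm2_per_vs, temperature_k):
    """Einstein relation: D = mu * k * T / e, in cm^2/s."""
    return mobility_cm2_per_vs * K_B_OVER_E * temperature_k


def diffusion_length_um(mobility_cm2_per_vs, lifetime_s, temperature_k):
    """L = sqrt(D * tau), converted from cm to micrometres."""
    d = diffusivity(mobility_cm2_per_vs, temperature_k)
    return math.sqrt(d * lifetime_s) * 1.0e4


# Hypothetical hole mobility (cm^2/Vs) and lifetime (s) at 87 K.
mu_p, tau_p, temperature = 300.0, 1.0e-9, 87.0

reference = diffusion_length_um(mu_p, tau_p, temperature)
both_halved = diffusion_length_um(mu_p / 2.0, tau_p / 2.0, temperature)
print("reference L      =", reference, "um")
print("mu and tau halved =", both_halved, "um")
```

Because L depends only on the product of mobility and lifetime, a measured reduction in L near a defect cannot by itself say which of the two quantities changed, which is the reason given above for not separating them.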

IV.  SUMMARY  AND  CONCLUSIONS 

A detailed study of fundamental optical and transport
properties of two In0.2Ga0.8As HEMT samples has been performed
for channel thicknesses that just exceed the
In0.2Ga0.8As critical thickness. This facilitated a study of the
excitonic luminescence polarization and wavelength, thermally
activated nonradiative recombination, and carrier diffusion
in a strain relaxation regime where the optical and
transport properties near individual dislocations were examined.
The dislocations and DLD network were found well
separated on a μm scale and oriented primarily along one of
the ⟨110⟩ directions. A plan-view TEM and CL imaging
analysis was used to determine the linear dislocation densities
for a greater range of samples with In0.2Ga0.8As channel
thicknesses below and well beyond the critical thickness.
Theoretical calculations using a four-band k·p and TMM
were used to calculate the band profile and eigenstates of the
HEMT samples. The electric field in the channel and the
electron and hole eigenstates were treated self-consistently.
CL wavelength imaging showed a striking blue shift of ~5
and ~10 meV in the 150 and 185 Å samples, respectively,
contrary to a previous red shift observed in undoped samples
which exhibited a greater strain relaxation, as reported in
Refs. 14-16. We hypothesize that a defect-induced reduction
in the field occurs and leads to a reduced Stark shift of the
electron and hole eigenstates.

Fig. 16. CL intensity and diffusion length plot vs distance along [110]
(perpendicular to the dislocation direction) for various temperatures for the
185 Å sample. The dashed and solid vertical lines show the spatial correlations
between the DLD positions and a reduced diffusion length.

A small polarization anisotropy is observed, consistent
with the level of strain relaxation in each sample and theoretical
calculations of the polarization ratio. The activation
energies for the In0.2Ga0.8As QW and GaAs/Al0.25Ga0.75As
MQW luminescence were obtained by temperature-dependent
CL measurements. Spatial variations in the activation
energies are found to correlate with DLDs. These defects
were found to lower the activation energy for thermal
reemission of carriers from the In0.2Ga0.8As QW and increase
the activation energy for carrier transfer into the In0.2Ga0.8As
QW from the underlying GaAs barrier and
GaAs/Al0.25Ga0.75As MQW. The ambipolar diffusion length
of carriers in the In0.2Ga0.8As QW was quantified with a
noncontact optical diffusion length measurement which utilized
a one-dimensional line scan during monochromatic CL
detection and a Ag mask which covered part of the sample.
The diffusion lengths of holes parallel to the dislocation line
direction were found to decrease by as much as a factor of
~5 in close proximity to DLDs, thereby revealing that an
important transport parameter is deleteriously affected by the
dislocation formation. Future time-resolved CL measurements
involving lifetime determinations on a μm scale
should help clarify the competing roles of mobility and the
minority-carrier lifetime in determining the diffusion lengths.
This study revealed that the optical and transport properties
of In$_x$Ga$_{1-x}$As HEMTs are linked on a μm scale and are
both intimately tied to the presence of misfit dislocations and
dark line defects which form during strain relaxation.

Fig. 17. Diffusion length vs temperature (K) for various bright (a and b) and
dark (c and d) regions in both 150 and 185 Å samples.
We calculate with the ab initio pseudopotential method the valence-band offset (VBO) of GaAs/GaP
lattice-mismatched heterojunctions pseudomorphically grown on GaAs$_x$P$_{1-x}$(001) substrates
($0 \le x \le 1$). The resulting value of 0.76 eV for the GaP substrate supports the highest among the
experimental  data  available.  We  show  that  the  VBO  for  any  substrate  can  be  easily  obtained  from 
that  calculated  for  one  substrate  following  standard  band-structure  theory  since  the 
interface-dependent  contribution  to  the  VBO,  which  is  the  electrostatic  potential  lineup,  has  no 
significant  variation  with  the  substrate.  Such  variation,  however,  can  be  taken  into  account 
accurately  by  a  simple  analytic  scaling  law,  which  is  demonstrated  here.  Finally,  we  show  that  the 
VBO  of  GaAs/GaP  (001)  is  due  to  structural  and  chemical  effects,  and  we  propose  an  operative 
method  for  identifying  and  separating  the  two  contributions,  whose  dependence  on  strain  is  also 
discussed.  ©  1996  American  Vacuum  Society. 


I.  INTRODUCTION 

At  variance  with  the  case  of  lattice-matched  (LM)  inter¬ 
faces  for  which  a  general  theory  has  been  definitely 
established,^  the  problem  of  valence-band  offsets  (VBOs)  at 
lattice-mismatched  (LMM)  heterojunctions  under  arbitrary 
strain  is  far  from  being  generally  understood.  With  respect  to 
LM  heterojunctions  where  VBO  tuning  is  possible  only  via 
local  chemical  modifications  at  the  interface,  LMM  hetero¬ 
junctions  offer  the  additional  degree  of  freedom  of  varying 
the  strain — and  hence  the  band-edge  energies — of  the  two 
constituents,  by  changing  the  substrate.  A  theoretical  com¬ 
prehension  of  the  mechanisms  which  are  responsible  for  the 
VBO  in  these  systems  and  its  variation  with  strain  is  there¬ 
fore  necessary. 

To  this  aim  we  study  a  prototypical  case  of  LMM  hetero¬ 
junctions,  and  we  choose  the  isovalent,  heteropolar, 
common-cation  GaP/GaAs  (001)  heterostructure,  which  is 
used  in  optoelectronic  devices^  and  for  which  a  few  (not  very 
accurate)  experimental  measurements^’"^  and  only  one  ab  ini¬ 
tio  theoretical  investigation^  have  been  reported. 

We use the ab initio pseudopotential method, within the
framework of density-functional theory in the local-density
approximation (LDA). As in Ref. 1, we divide the VBO into
two contributions: VBO = $\Delta E_v + \Delta V$. The band-structure
term $\Delta E_v$ is the difference between the energies of the two
valence-band edges measured with respect to the average
electrostatic potential in each bulk material. It can thus be
obtained from two separate calculations for each (appropriately
strained) bulk material. The second term, $\Delta V$, is the
electrostatic potential lineup generated by the valence-
electron charge distribution and by the ionic cores. It contains
all interface-specific details and requires therefore the
calculation of the self-consistent electronic density of the
heterojunction. The latter is modeled with a periodically repeated
supercell of 16 atoms, corresponding to a
(GaP)4/(GaAs)4 (001) superlattice. Other technical details of
the calculations are described in Ref. 6.


II. RESULTS

In order to study the variation of the VBO with strain we
consider three different substrates: (i) GaP, (ii) GaAs0.5P0.5
corresponding to an in-plane lattice constant $a_\parallel$ equal to the
average between the bulk lattice parameters $a_i$ of the constituents
($i$ denotes GaAs or GaP), and (iii) GaAs. The subscripts
∥ and ⊥ indicate quantities parallel or perpendicular
to the interface plane. We consider only pseudomorphic
growth conditions where $a_\parallel$ is constant throughout the structure
and equals the substrate lattice parameter, and the overlayer
material contracts or expands along the growth direction
for accommodating the stress induced by the lattice
mismatch. The theoretical calculations give for the latter a
value of 3.7% in good agreement with the experimental
value of 3.5%. For a given substrate, the equilibrium value
of $a_\perp$ for each material (see Table I) is determined according
to the macroscopic theory of elasticity. We found that for
GaAs/GaP the latter describes very accurately the equilibrium
position of the atoms also in the interface region.
Table I. Perpendicular lattice constants $a_\perp$ (a.u.), their average value
$\langle a_\perp \rangle$ (a.u.), and parallel ($\epsilon_\parallel$) and perpendicular ($\epsilon_\perp$) components of the macroscopic
strain tensor $\epsilon$ for strained GaAs/GaP(001) heterostructures grown on
the three different substrates. The $a_\perp$'s are calculated using the macroscopic
theory of elasticity. The change of the substrate has the effect of a uniform
macroscopic strain throughout the structure.

$a_\parallel$ (a.u.)                 10.20     10.40     10.60
$a_\perp$(GaAs)                      10.94     10.77     10.60
$a_\perp$(GaP)                       10.20     10.03      9.86
$\langle a_\perp \rangle$            10.57     10.40     10.23
$\epsilon_\parallel$                 -0.019     0        +0.019
$\epsilon_\perp$                     +0.016     0        -0.016


The band-structure term is obtained from bulk calculations
for GaP and GaAs in the proper strain configurations.
Table II shows that $\Delta E_{v,ave}$, which is the band-structure term
relative to the averages of the valence band-edge manifolds,
has a very small variation (less than 0.05 eV) when the substrate
changes from GaAs to GaP. Conversely, variations of
about 0.4 eV are found for $\Delta E_v$, which is the band-structure
term relative to the topmost valence states, whose shifts with
respect to the average result from both strain and spin-orbit
effects (see Refs. 6 and 7 for details).

The potential lineup $\Delta V$, which must be added to $\Delta E_v$ to
get the total VBO, is obtained from supercell calculations,
using the technique of double macroscopic average to get
the profile of the charge density and of the corresponding
electrostatic potential along the superlattice growth direction.
Figure 1 shows the profile of the electrostatic potential originated
by the valence-electron density and by the ionic cores,
for the three strain configurations considered in this work.
Electronic and ionic potentials are opposite in sign and give
rise to a total electrostatic potential $V$ which is almost two
orders of magnitude smaller. Table II shows that, within our
numerical accuracy, the electrostatic potential lineup $\Delta V$ has
the same value of 0.16 eV for the three configurations. We
conclude  that  the  possibility  of  tuning  the  VBO  with  strain 
relies  on  the  strain  dependence  of  the  band-structure  term  via 
the  energy  splittings  produced  by  the  strain  at  the  valence- 


[Fig. 1 horizontal axis: atomic planes Ga As Ga As Ga P Ga P Ga P Ga P Ga As Ga As Ga along the growth direction.]

Fig. 1. Macroscopic average of the total electrostatic potential together with
the two separate contributions due to electrons and ions, scaled by a factor
$10^{-2}$. Dashed, solid, and dotted lines give the results for GaP,
GaAs0.5P0.5, and GaAs substrates, respectively.


Table II. Valence-band offset at the GaP/GaAs(001) interface for different
substrates. $\Delta V$ is the potential-lineup contribution. $\Delta E_{v,ave}$ is the band-
structure term relative to the average of the valence-band manifolds while
$\Delta E_v$ is the same term but relative to the topmost valence states with spin-
orbit effects included; VBO = $\Delta E_v + \Delta V$ and VBO$_{ave}$ = $\Delta E_{v,ave} + \Delta V$. All
energies are in eV. Positive energy values indicate that the GaAs level is
higher than that of GaP. An energy correction of about +0.15 eV must be
added to the VBO values reported here in order to take into account self-energy
corrections.

$a_\parallel$ (a.u.)         10.20     10.40     10.60
$\Delta V$                    0.16      0.16      0.16
$\Delta E_v$                  0.45      0.24      0.02
VBO                           0.61      0.40      0.18
$\Delta E_{v,ave}$            0.20      0.17      0.16
VBO$_{ave}$                   0.36      0.33      0.32


band top of the two materials. The above conclusion is consistent
with the findings of Ref. 8, thus indicating a general
trend for lattice-mismatched interfaces. The results for the
total VBO, calculated both from the topmost valence states
(VBO = $\Delta E_v + \Delta V$) and from the average of the valence-
band manifolds (VBO$_{ave}$ = $\Delta E_{v,ave} + \Delta V$), are also summarized
in Table II. Assuming that strain does not affect considerably
the self-energy corrections to the LDA energies of
GaAs and GaP, a correction of about +0.15 eV (Ref. 9) has
to be added to our ab initio results for a meaningful comparison
with the experimental data. The final theoretical estimate
of the VBO is 0.76 eV for $a_\parallel$ = 10.20 a.u., 0.55 eV for
$a_\parallel$ = 10.40 a.u., and 0.33 eV for $a_\parallel$ = 10.60 a.u. Our result for
the GaP substrate supports the highest value [0.80 eV (Ref.
4)] among the available experimental data.
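The arithmetic behind the final estimates can be reproduced directly from the Table II entries: the band-structure term and the potential lineup are summed, and the self-energy correction of about +0.15 eV is then added. The short sketch below does only that; the variable names are illustrative.

```python
# Values in eV from Table II, ordered by substrate:
# GaP (a_par = 10.20 a.u.), GaAs0.5P0.5 (10.40 a.u.), GaAs (10.60 a.u.).
SUBSTRATES = ("GaP", "GaAs0.5P0.5", "GaAs")
DELTA_V = (0.16, 0.16, 0.16)      # potential lineup
DELTA_E_V = (0.45, 0.24, 0.02)    # band-structure term, topmost valence states
SELF_ENERGY_CORRECTION = 0.15     # approximate correction quoted in the text


def vbo(delta_e_v, delta_v, correction=0.0):
    """Valence-band offset: band-structure term + lineup (+ optional correction)."""
    return round(delta_e_v + delta_v + correction, 2)


for name, d_e, d_v in zip(SUBSTRATES, DELTA_E_V, DELTA_V):
    lda_value = vbo(d_e, d_v)
    corrected = vbo(d_e, d_v, SELF_ENERGY_CORRECTION)
    print(f"{name:12s} LDA VBO = {lda_value:.2f} eV, corrected = {corrected:.2f} eV")
```

Since the lineup is the same for all three substrates, the whole substrate dependence of the offset in this calculation comes from the band-structure term.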

In the present system the potential lineup is practically the
same for the three configurations considered. A general study
of the strain dependence of potential lineups has never been
done as opposed to the case of the $\Delta E_v$ term. Since $\Delta V$ is
the contribution to the VBO which requires the strongest
numerical effort, it would be of considerable importance to
predict analytically its variation with the substrate. We will
focus on this problem in what follows.

III. ANALYTIC SCALING LAW FOR $\Delta V$

Let us consider the case (ii) as the reference configuration.
When the substrate is GaAs0.5P0.5, we have
$a_\parallel = \langle a \rangle = (a_1 + a_2)/2$, and the average perpendicular lattice
constant $\langle a_\perp \rangle = (a_{1\perp} + a_{2\perp})/2$ is also equal to $\langle a \rangle$ (see Table
I). Let $\Delta V$ be the lineup for this reference configuration. We
will indicate with the superscript ' the quantities referring to
a different configuration, corresponding to a different substrate,
and, consequently, to a different macroscopic strain
tensor $\epsilon$ of the supercell. The relevant $\epsilon$ components are
those parallel and perpendicular to the interface plane, which
are respectively $\epsilon_\parallel = (a'_\parallel/\langle a \rangle) - 1$ and $\epsilon_\perp = (\langle a'_\perp \rangle/\langle a \rangle) - 1$.
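The two strain components defined above follow directly from the lattice constants listed in Table I. The sketch below evaluates them for the three substrates, taking the reference value ⟨a⟩ = 10.40 a.u. from case (ii); the dictionary layout and function name are illustrative choices.

```python
A_REFERENCE = 10.40  # <a> in a.u., the GaAs0.5P0.5 reference configuration

# (in-plane lattice constant, average perpendicular lattice constant) in a.u.,
# as listed in Table I for each substrate.
CONFIGURATIONS = {
    "GaP": (10.20, 10.57),
    "GaAs0.5P0.5": (10.40, 10.40),
    "GaAs": (10.60, 10.23),
}


def strain_components(a_parallel, a_perp_average, a_reference=A_REFERENCE):
    """Return (eps_parallel, eps_perp) relative to the reference configuration."""
    eps_parallel = a_parallel / a_reference - 1.0
    eps_perp = a_perp_average / a_reference - 1.0
    return eps_parallel, eps_perp


for substrate, (a_par, a_perp) in CONFIGURATIONS.items():
    eps_par, eps_perp = strain_components(a_par, a_perp)
    print(f"{substrate:12s} eps_par = {eps_par:+.3f}  eps_perp = {eps_perp:+.3f}")
```

To three decimal places the results match the strain entries of Table I, with the two components always opposite in sign.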

We show that, if the change of the substrate has the effect
of a uniform strain throughout the structure, i.e.,
$a'_{i\perp} = (1 + \epsilon_\perp)\, a_{i\perp}$, as it is indeed the present case (see Table
I), then the lineup for the new configuration is

$\Delta V' = \dfrac{1}{(1 + \epsilon_\parallel)^2}\, \Delta V$.  (1)



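In fact, if z is the direction perpendicular to the interface, we
can write

$$\Delta V = \lim_{z \to +\infty} V(z) - \lim_{z \to -\infty} V(z) = \lim_{k_z \to 0^+} V(k_z) - \lim_{k_z \to 0^-} V(k_z),$$

where $V(z)$ is the macroscopic average of the electrostatic
potential, $V(k_z)$ the corresponding Fourier transform,
$V(k_z) = 4\pi e\, n(k_z)/k_z^2$, and $n(k_z)$ the Fourier transform of the
macroscopic average of the total charge density (i.e., electronic
plus ionic) of the interface. Thus,

$$\Delta V = 4\pi e \left[ \lim_{k_z \to 0^+} \frac{1}{k_z^2} \left( n(0) + k_z \frac{dn(k_z)}{dk_z} \right) - \lim_{k_z \to 0^-} \frac{1}{k_z^2} \left( n(0) + k_z \frac{dn(k_z)}{dk_z} \right) \right]$$

$$= 4\pi e \left[ \lim_{k_z \to 0^+} \frac{1}{k_z} \frac{dn(k_z)}{dk_z} - \lim_{k_z \to 0^-} \frac{1}{k_z} \frac{dn(k_z)}{dk_z} \right].$$
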

The former step is valid up to linear order in $k_z$ and the last
step is justified since $n(0) = 0$, i.e., the total charge of the
system vanishes. A similar expression can be found for
$\Delta V'$. If $\lim_{k'_z \to 0^\pm} dn'(k'_z)/dk'_z = (V_{cell} k_z / V'_{cell} k'_z) \lim_{k_z \to 0^\pm} dn(k_z)/dk_z$,
then $\Delta V'/\Delta V = V_{cell} k_z^2 / V'_{cell} k_z'^2$, where $V'_{cell} = (1 + \epsilon_\parallel)^2 (1 + \epsilon_\perp) V_{cell}$,
$V_{cell}$ is the supercell volume of the reference
configuration, and $k'_z = (1 + \epsilon_\perp)^{-1} k_z$. Substituting these
expressions gives the factor $(1 + \epsilon_\perp)/(1 + \epsilon_\parallel)^2$ of Eq. (1). A uniform strain
throughout the structure affects the position of the ionic
cores, such that the above condition on $dn'/dk'_z$ is verified
exactly for the ionic part. If the strain is small, we expect that
the electronic charge distribution follows rigidly the ionic
cores, and the condition is fulfilled also for the electronic
charge density.
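The rigid-scaling argument can be checked numerically on a toy model. The sketch below represents the interface as a neutral stack of charged sheets, for which the potential step is proportional to the dipole moment per unit area, and applies a uniform strain to it. The sheet positions, sheet charges, and strain values are hypothetical, and the scaling factor $(1 + \epsilon_\perp)/(1 + \epsilon_\parallel)^2$ is the one that follows from the volume and wave-vector relations quoted above; the overall sign and units of the potential step are conventional and drop out of the ratio.

```python
import math


def lineup_from_sheets(sheets):
    """Potential step 4*pi*sum(sigma_i * z_i) of a neutral stack of sheets.

    `sheets` is a list of (z, sigma) pairs: position along the growth axis
    and charge per unit area (Gaussian-like units, illustrative only).
    """
    total = sum(sigma for _, sigma in sheets)
    if abs(total) > 1e-12:
        raise ValueError("the stack must be neutral, i.e. n(0) = 0")
    return 4.0 * math.pi * sum(z * sigma for z, sigma in sheets)


def strain_sheets(sheets, eps_par, eps_perp):
    """Uniform strain: z -> (1 + eps_perp) z, area -> (1 + eps_par)**2 area."""
    return [((1.0 + eps_perp) * z, sigma / (1.0 + eps_par) ** 2)
            for z, sigma in sheets]


def scaled_lineup(delta_v, eps_par, eps_perp):
    """Scaling law: dV' = dV * (1 + eps_perp) / (1 + eps_par)**2."""
    return delta_v * (1.0 + eps_perp) / (1.0 + eps_par) ** 2


# ASSUMPTION: invented sheet charges and strain components.
sheets = [(-1.5, +0.2), (-0.5, -0.3), (0.5, +0.4), (1.5, -0.3)]
eps_par, eps_perp = 0.018, -0.016

dv_ref = lineup_from_sheets(sheets)
dv_direct = lineup_from_sheets(strain_sheets(sheets, eps_par, eps_perp))
dv_scaled = scaled_lineup(dv_ref, eps_par, eps_perp)

print(dv_direct, dv_scaled)  # the two values agree to rounding error
```

For point-like sheets the agreement is exact, which mirrors the statement that the condition holds exactly for the ionic part; for the electronic charge it holds only to the extent that the density follows the ions rigidly.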

The  predictions  of  Eq.  (1)  for  GaAs/GaP  (001)  are  in 
excellent  agreement  with  the  ab  initio  results,  within  the  nu¬ 
merical  precision  of  the  latter.  Equation  (1)  also  holds  sepa¬ 
rately  for  the  electronic  and  ionic  contributions,  which  are 
about  two  orders  of  magnitude  larger  than  the  total  value.  We 
stress  that  the  scaling  law,  Eq.  (1),  can  be  extended  to  other 
systems,  but  we  expect  that  it  will  not  be  valid  for  interfaces 
without  common  ions  where  local  interface  strains  can  be 
important,  and  contribute  significantly  to  the  potential  lineup. 


IV.  STRUCTURAL  AND  CHEMICAL 
CONTRIBUTIONS  TO  THE  POTENTIAL  LINEUP 

The $\Delta V$ term contains both chemical and structural contributions,
which originate from the chemical difference between
the two constituents and their different strain. More
insight into the dependence of $\Delta V$ on strain will be possible
if we succeed in separating these two contributions. Keeping
in mind the case of lattice-matched (LM) heterojunctions, where the potential
lineup is mainly due to the chemical difference between the
elements constituting the junction (with a few exceptions for
heterojunctions without common ions due to important local
interface strains), it will be reasonable to define a structural
term which vanishes in such a case.

For the sake of clarity let us focus on one strain configuration,
say (iii), identified by a given $a_\parallel$ and the corresponding
values $a_{1\perp}$ and $a_{2\perp}$. We consider the virtual crystal GaX,
which is a fictitious crystal whose anions X are described by
a pseudopotential which is the arithmetic average between
those of As and P. The perpendicular lattice constant of this
virtual crystal equals $\langle a_\perp \rangle = (a_{1\perp} + a_{2\perp})/2$, while its parallel
lattice constant is $a_\parallel$. This reference crystal has an electronic
charge density $n_0$ whose macroscopic average is constant
everywhere, and it can be seen as a homojunction with a
uniform strain and zero potential lineup. Starting from this
reference homojunction we can build up the real strained
interface following two different two-step paths which differ
in the initial step: (I) keeping the virtual anions, but introducing
the non-uniform strain of the real heterojunction; (II)
keeping the uniform strain of the virtual crystal, but transforming
the homojunction into the heterojunction by replacing
the virtual anions with the real ones.

In the non-uniformly strained homojunction obtained as
intermediate system along the former path, the electronic
charge density $n_I$ has the same average asymptotic values,
far from the interface, as the electronic charge density $n$ of
the real heterojunction. By construction $n_I$ and $n_0$, and consequently
their difference $\Delta n_{struct} = n_I - n_0$, do not contain
any term arising from chemical differences between the two
constituents. Chemical effects instead are present in $n$, and in
the difference $\Delta n_{ch} = n - n_I$, which has a vanishing asymptotic
value far from the interface on both sides. The electron
density of the real heterojunction can thus be written as
$n = n_0 + \Delta n_{struct} + \Delta n_{ch}$. The upper panel of Fig. 2 shows the
macroscopic average of the electronic charge densities
$\Delta n_{ch}(z)$ (dashed line) and $\Delta n_{struct}(z)$ (dotted line). The normalization
volume (average cell volume) is the average of
the unit-cell volumes of GaAs and GaP under the proper
strain.
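The bookkeeping of this decomposition can be illustrated on a grid. The sketch below uses three invented one-dimensional profiles standing in for the macroscopic averages of $n_0$, $n_I$, and $n$; they are assumptions chosen only to have the qualitative properties stated in the text (constant $n_0$, a step in $n_I$, and a chemical term localized at the interface), not results of the calculation.

```python
import math

# Grid along the growth direction z (arbitrary units), interface at z = 0.
z = [i * 0.5 - 5.0 for i in range(21)]

# ASSUMPTION: toy profiles, not ab initio data.
# n_0: uniformly strained virtual crystal, constant macroscopic average.
n_0 = [1.0 for _ in z]
# n_I: non-uniformly strained homojunction, different bulk values per side.
n_I = [1.0 + (0.05 if zi > 0 else -0.05) for zi in z]
# n: real heterojunction, same asymptotes as n_I plus a localized term.
n = [ni + 0.02 * zi * math.exp(-zi * zi) for ni, zi in zip(n_I, z)]

# Structural and chemical contributions along path I.
dn_struct = [a - b for a, b in zip(n_I, n_0)]
dn_ch = [a - b for a, b in zip(n, n_I)]

# The three pieces add back up to the full density at every grid point.
rebuilt = [a + b + c for a, b, c in zip(n_0, dn_struct, dn_ch)]
max_error = max(abs(r - full) for r, full in zip(rebuilt, n))
print(max_error)
```

In this toy model `dn_ch` is negligible at both ends of the grid, while `dn_struct` carries the difference in bulk values between the two sides.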

The potential lineups generated by the electronic charge
densities $\Delta n_{ch}$ and $\Delta n_{struct}$ can be identified as the chemical
($\Delta V_{ch}$) and structural ($\Delta V^{el}_{struct}$) electronic contributions to
the total electronic potential lineup $\Delta V^{el}$. The total potential
lineup $\Delta V$ contains also the ionic contribution, which is entirely
of structural type. The inhomogeneous ionic core distributions
corresponding to $n$ and $n_I$ are equal and therefore
the electronic chemical term does not have an ionic counterpart,
whereas the ionic contribution corresponding to
$\Delta n_{struct}$ gives a nonvanishing dipole. The two terms $V_{ch}(z)$
and $V_{struct}(z) = V^{el}_{struct}(z) + V^{ion}_{struct}(z)$ are displayed in the
lower panel of Fig. 2 together with the total potential $V(z)$.

We can similarly define a chemical and a structural contribution
to the total electronic charge density $n$ by following
the second path. We find that the lineups are equal to
those obtained following path I. We can thus univocally
define the chemical contribution $\Delta V_{ch}$ and the strain contribution
$\Delta V_{struct}$ (including the ionic core contribution, which
is the same in the two paths), whose sum gives the total
potential lineup $\Delta V$. We stress that $\Delta V_{ch}$ may also, in
principle, depend on strain and hence be different for different
configurations. Actually, $\Delta V_{ch}$ and $\Delta V_{struct}$ vary
slightly with $a_\parallel$: e.g., $\Delta V_{struct}$ for configurations (i), (ii), and
(iii) is 0.89, 0.88, and 0.87 eV, respectively. The variation is
even smaller in the total lineup.

Fig. 2. Upper panel: macroscopic average of structural [$\Delta n_{struct}(z)$, dotted
line, right scale] and chemical [$\Delta n_{ch}(z)$, dashed line, left scale] contributions
to the electronic charge density of the (GaP)4/(GaAs)4 superlattice
with GaAs substrate. The charge density is normalized with respect to the
average of the unit-cell volumes of GaAs and GaP under the proper strain.
Lower panel: macroscopic average of structural ($V_{struct}$, dotted line) and
chemical ($V_{ch}$, dashed line) contributions to the total potential ($V = V_{ch}
+ V_{struct}$, solid line) of the (GaP)4/(GaAs)4 superlattice with GaAs substrate.
The results of both panels have been obtained following path I.

V.  CONCLUSIONS 

We have presented self-consistent pseudopotential calculations
of the VBO at strained GaP/GaAs (100) heterojunctions.
We have shown that the VBO can be tuned from
0.33 to 0.76 eV by varying the substrate from GaAs to GaP,
and that this is a bulk effect rather than an interface feature.
The value $\mathrm{VBO}_{ave}$, calculated between the averages of the
valence-band manifolds, is, on the contrary, practically insensitive
to the strain. These results confirm previous findings
indicating a general trend for lattice-mismatched heterojunctions.

We have demonstrated that once the potential lineup
(which is the interface-dependent and computationally expensive
contribution to the VBO) is known for one substrate,
its value for any other substrate can be calculated analytically,
with very high accuracy, by a simple scaling law
which is expected to be valid for the whole class of isovalent
common-ion heterojunctions. For GaAs/GaP, this law predicts
that the variation of the potential lineup with the in-plane
lattice constant is within the numerical accuracy of the
ab initio calculations ($\sim$20 meV). Therefore the potential
lineup can be considered as a constant and can be calculated
for one strain configuration only. Conversely, in order to
reproduce the strain dependence of the VBO, it is important
to take into account the large strain dependence of the bulk
band-structure term, which however can be calculated easily
with standard band-structure theory.

Finally, we have presented a method for separating chemical
and structural contributions to the potential lineup $\Delta V$,
and we have shown that, similarly to $\Delta V$, both contributions
vary negligibly for GaAs/GaP (001) with the in-plane lattice
constant.

We present cross-sectional scanning tunneling microscopy (STM) studies of the atomic-scale
structural and electronic properties of InAs/InAs1-xSbx superlattices grown by modulated
molecular-beam epitaxy. In this technique the Group V composition is controlled by rapid
modulation of the Group V beams rather than adjustment of the Group V flux ratios. A superlattice
sample was grown at 475 °C consisting of 52 Å InAs1-xSbx alternating with 172 Å InAs for 30
periods on a GaSb (001) substrate. The InAs1-xSbx alloy layers consisted nominally of 7.8 Å InAs
alternating with 5.2 Å InSb for four periods. X-ray diffraction was used to determine an average
composition of InAs0.76Sb0.24 for the alloy layers. Constant-current STM images of the superlattice
exhibit clear, electronically induced contrast between the InAs layers and InAs0.76Sb0.24 layers, and
also reveal ordering within the InAs0.76Sb0.24 layers. Interfaces between the InAs layers and the
InAs0.76Sb0.24 layers appear sharp, though there is evidence of more atomic intermixing at the
InAs-on-InAs0.76Sb0.24 interface. Significant variation in composition within individual
InAs0.76Sb0.24 alloy layers and apparent Sb incorporation from the InAs0.76Sb0.24 layers into the
surrounding InAs layers are also observed. © 1996 American Vacuum Society.


I. INTRODUCTION

III-V semiconductor material systems containing both arsenide
and antimonide compounds offer a wide range of possibilities
for designing heterostructure devices, including
resonant-tunneling diodes, infrared detectors, and infrared
lasers. However, differences in heterointerface structure
and composition arising from different growth techniques
and parameters can have significant effects on
heterostructure device performance. For example,
Ga1-xInxSb/InAs superlattices grown with InSb-like interfaces
are found to have device characteristics superior to
those grown with Ga1-xInxAs-like interfaces. Spontaneous
ordering in III-V ternary semiconductors has been
shown to have substantial effects on the structural and optical
characteristics of the materials. Atomic ordering has
also been observed in Group V alloys such as InAsSb (Refs.
14 and 15) and InAsP. However, precise control over
Group V composition in ternary compounds containing both
As and Sb has proven to be extremely difficult to achieve in
conventional solid-source molecular-beam epitaxy
(MBE).

Recently, dramatic improvements in compositional control
and crystalline quality of AlAsxSb1-x, GaAsxSb1-x,
and InAsxSb1-x ordered alloys have been achieved using
solid-source modulated molecular-beam epitaxy
(MMBE). In this technique, the Group V composition is
controlled by rapid modulation of the As2 and Sb2 beams
rather than adjustment of the Group V flux ratios. Modulation
of the Group V beams is expected to lead to the growth
of ordered rather than random alloys. The electronic and optical
properties of heterostructures incorporating such layers
will therefore depend significantly on the detailed structure
of the alloys and of the heterojunction interfaces.

In this article we report cross-sectional scanning tunneling
microscopy (STM) investigations of the structural and electronic
properties of InAs/InAs1-xSbx superlattices grown by
MMBE. InAs/InAs1-xSbx heterojunctions grown on InAs
substrates exhibit a Type-II band alignment, and have shown
potential for midwave infrared applications. High-resolution
constant-current images obtained by STM show
that MMBE produces superlattices with sharp interfaces between
the InAs1-xSbx layers and the InAs layers, and confirm
that the MMBE technique leads to the growth of ordered
rather than random alloys.

II. EXPERIMENT

The superlattice sample used in these experiments was
grown in a solid-source VG V80 molecular beam epitaxy
system using the MMBE growth technique. In this technique,
the Group V composition during alloy growth is controlled
by rapid modulation of the As2 and Sb2 beams rather
than adjustment of the Group V flux ratios. The superlattice
sample consisted of 52 Å InAs1-xSbx alternating with 172 Å
InAs for 30 periods, and was grown on top of a GaSb buffer
on a p-type GaSb (001) substrate. Figure 1 shows a schematic
diagram of the epitaxial layer structure and the Type II
band-edge alignment expected in this system. The
InAs1-xSbx layers consisted nominally of 7.8 Å InAs alternating
with 5.2 Å InSb for four periods. However, because
the MMBE shutter times used were very short (4-6 s), excess
As remaining in the growth chamber from the growth of
the InAs layers was likely to have been incorporated into the
growth of the InSb layers, resulting in InSb(As) layers; x-ray
diffraction measurements yielded an average composition of
InAs0.76Sb0.24 in the ternary alloy layers. A growth temperature
of 475 °C and a growth rate of 1.3 Å/s were employed.
Figure 2 shows a schematic diagram of the shutter sequence
used to grow the InAs0.76Sb0.24 alloy layers and the surrounding
InAs layers. Further details of the sample growth have
been described elsewhere.

Fig. 1. (a) Schematic diagram of the epitaxial layer structure of the MMBE
superlattice sample. (b) Schematic diagram of the Type II band-edge alignment
expected for this structure. The conduction band, valence band, and the
InAs0.76Sb0.24 heavy-hole and InAs electron energy levels are indicated.
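The nominal layer thicknesses and the growth rate quoted above fix the shutter timing and the overall structure dimensions. The short sketch below works through that arithmetic using only numbers stated in the text; the variable names are illustrative, and the thickness-based InSb fraction is a rough nominal estimate (an assumption, since it ignores the different monolayer spacings of InAs and InSb).

```python
# Values quoted in the text (angstroms, angstroms per second).
GROWTH_RATE = 1.3
INAS_SUBLAYER = 7.8
INSB_SUBLAYER = 5.2
SUBPERIODS = 4
INAS_SPACER = 172.0
PERIODS = 30


def deposition_time(thickness: float, rate: float = GROWTH_RATE) -> float:
    """Time in seconds needed to deposit `thickness` at a constant rate."""
    return thickness / rate


t_inas = deposition_time(INAS_SUBLAYER)  # As shutter open, about 6 s
t_insb = deposition_time(INSB_SUBLAYER)  # Sb shutter open, about 4 s

# Thickness of one alloy layer and of one full superlattice period.
alloy_thickness = SUBPERIODS * (INAS_SUBLAYER + INSB_SUBLAYER)
period = alloy_thickness + INAS_SPACER
total_thickness = PERIODS * period

# Rough nominal InSb fraction by thickness (ASSUMPTION: thickness ratio
# used as a stand-in for composition).
nominal_insb_fraction = INSB_SUBLAYER / (INAS_SUBLAYER + INSB_SUBLAYER)

print(f"shutter times: {t_inas:.1f} s (InAs), {t_insb:.1f} s (InSb)")
print(f"alloy layer {alloy_thickness:.0f} A, period {period:.0f} A")
print(f"nominal InSb fraction {nominal_insb_fraction:.2f} vs measured 0.24")
```

The computed deposition times reproduce the 4-6 s shutter times mentioned in the text, and the gap between the nominal thickness fraction and the measured Sb content of 0.24 is in line with the suggested incorporation of excess As into the InSb layers.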


Fig. 2. Schematic diagram of the MMBE shutter sequence used to grow the
InAs0.76Sb0.24 alloy layers and the surrounding InAs layers. The filled bars
represent intervals when the corresponding shutters are open.


Fig. 3. High-resolution x-ray diffraction spectrum of the superlattice
sample (horizontal axis in degrees, 28.5 to 31.5). The presence of many sharp
satellite peaks confirms the overall high quality of the sample structure.

Sample growth and quality were monitored using high-resolution
x-ray diffraction and in situ reflection high-energy
electron diffraction (RHEED). The x-ray spectrum of the
sample, shown in Fig. 3, exhibits many sharp satellite peaks,
confirming the overall high quality of the sample structure.
This x-ray spectrum was used to calculate the average period
and composition of the epilayer region. RHEED data obtained
during sample growth were similar to those observed in
growth of AlAsxSb1-x ordered alloys using the MMBE
technique. During growth of the InAs0.76Sb0.24 layers, a clear,
streaky 1×3 reconstruction was seen when the Sb shutter
was opened, but only a mixed 2×4 reconstruction was seen
when the As shutter was opened. The RHEED data were
inconclusive regarding interface abruptness in the ordered
structures because the shutter times used (4-6 s) were comparable
to the RHEED transient time.

Samples  used  for  the  STM  studies  were  cleaved  in  ultra- 
high  vacuum  at  a  base  pressure  of  <2X  10“^^  Torr  along  the 
(110)  cross-sectional  plane,  yielding  an  atomically  flat,  elec¬ 
tronically  unpinned  surface  on  which  tunneling  experiments 
were  performed.  Commercially  available  Pt-Ir  tips  cleaned 
in  situ  by  electron  bombardment  were  used  for  these  studies. 
Since  all  images  presented  here  were  obtained  at  negative 
sample  bias,  the  features  seen  correspond  to  the  Group  V 
sublattice  on  the  (110)  cross-sectional  surface. 

III.  RESULTS 

Figure 4 shows a high-resolution constant-current image
of the MMBE sample, obtained at a sample bias voltage of
-2.4 V and a tunneling current of 0.1 nA. Electronically
induced contrast between the bright InAs0.76Sb0.24 layers and
the darker InAs layers can clearly be seen in the image. The
interfaces between the InAs layers and the outer InSb(As)
layers of the InAs0.76Sb0.24 alloy layers appear sharp. Lateral
variations in the InAs0.76Sb0.24 structure over length scales of
approximately 100 Å can be seen. The image also shows
contrast arising from the ordered structure along the growth
direction within the InAs0.76Sb0.24 layers. Features corresponding
to the four-period InAs/InSb(As) superlattice structure
of the InAs0.76Sb0.24 layers are visible. The electronically
induced contrast within the InAs0.76Sb0.24 layers is
asymmetric along the growth direction, suggesting some
nonuniformity in composition.

Fig. 4. High-resolution constant-current image of the MMBE sample. Electronically
induced contrast between the InAs0.76Sb0.24 layers and the InAs
layers can be seen. Also visible is the ordered structure within the
InAs0.76Sb0.24 layers.

Figure 5(a) shows a high-resolution constant-current image
of a single InAs0.76Sb0.24 layer surrounded by InAs,
taken at a sample bias voltage of -2.4 V and a tunneling
current of 0.1 nA. As in Fig. 4, contrast between the InAs
layers and the InAs0.76Sb0.24 alloy layer can be seen clearly
in the image. The growth direction is indicated in the figure.
Figure 5(b) shows an averaged topographic line scan of the
InAs0.76Sb0.24 layer and the surrounding InAs layers, taken
over 200 individual scan lines covering an area approximately
200 Å × 170 Å shown in the image. The interfaces
between the outer InSb(As) layers of the InAs0.76Sb0.24 alloy
layer and the surrounding InAs appear sharp, though a more
gradual change in contrast can be seen at the InAs-on-InAs0.76Sb0.24
interface, suggesting a larger degree of atomic
intermixing at that interface than at the
InAs0.76Sb0.24-on-InAs interface. Interface asymmetry has
also been observed in other arsenide/antimonide systems,
including InAs/GaSb, InAs/Ga1-xInxSb, and
InAs/AlSb. Also visible in the image is contrast caused by
the ordered structure within the InAs0.76Sb0.24 layer produced
by the MMBE growth technique. These features were observed
consistently with different cleaved samples and tips.

The averaged line scan makes more apparent the asymmetric
appearance of the InAs0.76Sb0.24 layers in the constant-current
images. Differences in topographic height, caused by
differences in electronic structure, between the short-period
superlattices and the darker InAs layers become larger as
more InAs0.76Sb0.24 is grown. The maximum contrast in
height (corresponding to maximum Sb content) is consistently
observed at the next-to-last (i.e., the third) grown
InSb-like layer in the InAs0.76Sb0.24 alloy region. This nonuniformity
in the electronic structure in the InAs0.76Sb0.24
layers suggests the presence of atomic cross-incorporation
within the InAs0.76Sb0.24 layers, possibly due to growth incorporation
of residual material in the growth chamber during
the MMBE shutter switching sequences.

Finally, the apparent width of the InAs0.76Sb0.24 layer in the image is
approximately 52 Å. If there were no atomic intermixing
present in the InAs0.76Sb0.24 layer (i.e., in the four InAs/InSb
short-period superlattices comprising the InAs0.76Sb0.24
layer), then only the four InSb layers and the three InAs
layers between them should lead to visible contrast; the
fourth InAs layer in the InAs0.76Sb0.24 structure would be
indistinguishable from the surrounding InAs layers. This
would lead to an apparent width of 44 Å for the InAs0.76Sb0.24
layer in the image. The fact that the full 52 Å width is
seen suggests that some Sb is incorporated into the surrounding
InAs layers, though this incorporation appears limited to
areas within 1-2 monolayers of the InSb-like layers. From
the averaged line scan, the Sb incorporation appears more
prominent at the InAs-on-InSb(As) interface than at the
InSb(As)-on-InAs interface. Taking into consideration the
growth direction indicated at the top of Figure 5, this would
suggest Sb riding up into grown InAs layers, a behavior
which is consistent with that observed in growth of InAs/GaSb
superlattices.
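The two apparent widths compared above follow directly from the nominal sublayer thicknesses. The sketch below reproduces that arithmetic; the function name is illustrative, and the only inputs are the 7.8 Å InAs and 5.2 Å InSb sublayer thicknesses quoted in the text.

```python
INAS_SUBLAYER = 7.8  # angstroms, nominal
INSB_SUBLAYER = 5.2  # angstroms, nominal


def apparent_width(n_insb: int, n_inas_visible: int) -> float:
    """Width of the region expected to show contrast, in angstroms."""
    return n_insb * INSB_SUBLAYER + n_inas_visible * INAS_SUBLAYER


# No intermixing: four InSb layers plus the three InAs layers between them.
width_no_intermixing = apparent_width(4, 3)

# Full alloy layer: all four InAs/InSb sub-periods show contrast.
width_full = apparent_width(4, 4)

# The difference is one InAs sublayer, i.e. a few monolayers.
extra_width = width_full - width_no_intermixing

print(f"{width_no_intermixing:.1f} A without intermixing")
print(f"{width_full:.1f} A for the full layer, {extra_width:.1f} A extra")
```

The first value rounds to the 44 Å quoted in the text and the second matches the observed 52 Å, so the excess contrast corresponds to roughly one nominal InAs sublayer.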

IV.  CONCLUSIONS 


Fig. 5. (a) Constant-current image of a single InAs0.76Sb0.24 alloy layer and
the surrounding InAs layers. (b) Topographic line scan of the
InAs0.76Sb0.24 alloy layer and the surrounding InAs layers, averaged over an
area approximately 200 Å × 170 Å shown in the image.


We have used cross-sectional scanning tunneling microscopy
(STM) to investigate the structural and electronic properties
of InAs/InAs0.76Sb0.24 superlattices grown by modulated
molecular-beam epitaxy (MMBE). High-resolution constant-current
images of the superlattice structure reveal electronically
induced contrast between the InAs layers and the
InAs0.76Sb0.24 layers. The interfaces between the InAs
layers and the InAs0.76Sb0.24 layers appear sharp, though
there is evidence of more atomic intermixing at the
InAs-on-InAs0.76Sb0.24 interface. Contrast corresponding to
the four-period InAs/InSb(As) superlattice structure can be
seen in the InAs0.76Sb0.24 layers. The contrast within the
InAs0.76Sb0.24 layers appears asymmetric along the growth
direction; the maximum contrast in height is consistently observed
at the third grown InSb-like layer in the
InAs0.76Sb0.24 alloy regions. This nonuniformity in contrast
suggests the presence of atomic cross-incorporation within
the InAs0.76Sb0.24 layers, possibly caused by growth incorporation
of residual material in the growth chamber during the
MMBE shutter switching sequences. Further analysis of the
data suggests Sb incorporation into areas within 1-2 monolayers
of the InSb-like layers, possibly arising from Sb riding
up into subsequently grown InAs layers. The STM data demonstrate
that MMBE can be used to attain superior compositional
control and crystalline quality in arsenide/antimonide
ordered alloys, and also confirm that MMBE leads to the
growth of ordered rather than random alloys.

We present the first results on In0.29Al0.71As which conclusively show a large number of traps that
are not in equilibrium with the conduction band at low temperatures, the signature of a DX
centerlike defect. From low-temperature gated resistivity, Hall, and Shubnikov-de Haas
measurements of relaxed In0.29Al0.71As/In0.3Ga0.7As modulation-doped heterostructures, we find a
significant negative shift in the threshold voltage as the cooling bias is shifted to more
negative values. Simultaneously, we observe a decrease in the mobility measured at the
two-dimensional electron gas as the cooling bias is decreased, an indication of the "negative U"
behavior of DX centers. Finally, we observe Fermi level pinning which we attribute to the
occupation of a DX center level. From simulation of the heterostructure at large positive gate biases,
and the change in the threshold voltage, we are able to calculate the donor ionization energy of this trap
to be 320 meV below the InxAl1-xAs conduction-band edge, over twice the depth of the DX center
in AlxGa1-xAs. © 1996 American Vacuum Society.


I. INTRODUCTION

DX centers, a donor related point defect, have been found
in many compound semiconductors, including the well-known
AlAs/GaAs ternary alloy system. These defects always
move into the gap at some given alloy composition as
if they were tracking a non-Γ conduction-band minimum. As
suggested by Tachikawa, this alloy dependence can be used
to predict the appearance of DX centerlike behavior in materials
with direct to indirect crossovers. InxAl1-xAs, a high
band-gap ternary alloy, has such a direct to indirect transition
at x≈0.30. Following this suggestion, DX centerlike behavior
would be expected at aluminum compositions between
x=0 and x=0.45.

This prediction is difficult to verify because of a lack of a
suitable substrate at all alloy compositions. At one composition,
x=0.48, InxAl1-xAs is lattice matched to InP. The
availability of high quality material has allowed extensive
work on this issue at this important composition. Although
Nakashima et al. believed they had found DX centerlike
behavior, other researchers, using a combination of techniques,
concluded that the traps found are not DX centers.
Furthermore, Calleja et al. subjected the material to 12 kbar
of hydrostatic pressure, roughly equivalent to the band structure
of In0.40Al0.60As. They found no evidence of DX centerlike
behavior. For the non-lattice-matched case we could only
find one reference. In that work, Hong et al. examined thick
In0.43Al0.57As layers grown with a dislocation-reducing
buffer on InP. Measuring the material by deep-level transient
spectroscopy (DLTS), a weak signal corresponding to an
emission energy of 0.37 eV and strongly dependent on doping
was found.

Considering the binary alloys, for Si-doped AlAs, the DX
center level seems to merge with the shallow hydrogenic
level 57 meV below the X band. At the other extreme, no
DX center has been reported for InAs, as would be the case
if the DX center followed one of the non-Γ conduction-band
minima. To the best of our knowledge, that is the extent of
the known data at the moment with regard to DX centers in
n-type InxAl1-xAs. By determining the alloy compositions
where these defects might occur, we can find the "useful
range" of this material for electronic and optoelectronic applications.

II.  EXPERIMENT 

Figure 1 shows a cross-sectional schematic of the structure
investigated in this article. The In0.29Al0.71As/In0.3Ga0.7As
heterostructures were grown by molecular beam
epitaxy on a semi-insulating (100) GaAs substrate cut 2°
toward the nearest (110) plane. The active layers are a
modulation-doped field-effect-transistor (MODFET) type
structure δ-doped with silicon to 6×10^12 cm^-2. Between the
active layers and the substrate is a 0.8 μm step-graded
InxGa1-xAs buffer used to relieve the lattice mismatch. Further
details of the crystal growth are described elsewhere.

From the material, eight-armed Hall bridges were photographically
defined using standard wet chemical techniques.
The ohmic contacts were made by thermal evaporation of a
AuGe alloy with a Ni overlayer, which was subsequently
annealed at 420 °C. The gate was composed of 300 nm of
thermally evaporated aluminum. To reduce errors in the electrical
measurements, the length to width ratio was 2400 μm/200 μm,
or 12/1.

The charge transport properties of these structures
have been initially described in Chen et al. In that work,
the specimens were cooled to either 77 or 1.6 K, whereupon
their electrical properties were measured. At 1.6 K, both
low-magnetic-field ($B < 0.15$ T) and high-field
Shubnikov-de Haas oscillatory magnetoresistance measurements
were obtained as a function of gate bias. This enabled
the properties of the individual subbands to be characterized
separately. In this work, a gate bias is applied at room temperature
to alter the distribution of free charge when the system
is nominally in equilibrium. Then the sample is cooled
to $T = 1.6$ K, and the charge transport properties are remeasured
both at low and high magnetic fields.

Layer sequence as listed in Fig. 1:
40 nm In0.3Ga0.7As
25 nm undoped
δ-doped 6.0×10^12 cm^-2
3 nm undoped In0.29Al0.71As
200 nm undoped In0.3Ga0.7As
800 nm buffer
SI-GaAs (001) substrate

Fig. 1. Schematic diagram of the δ-doped In0.29Al0.71As/In0.3Ga0.7As heterostructure
under study.

III.  RESULTS  AND  DISCUSSION 

Figure 2 shows the gate voltage dependence of the free electron
density, n_s, of the same specimen for different, fixed values of
the cooling bias potential, V_c. Each data point was obtained from
a fast Fourier transform of the Shubnikov-de Haas oscillations in a
magnetic field up to 7.5 T. While a digital filter could have been
used subsequently on the transform, it was decided not to use it in
this case due to a slight shifting of the peak of the second
subband near zero. The


Fig. 2. Free carrier density vs gate bias (V) for several values of
the cooling gate bias V_c. Each value of the charge density is the
sum of the two subbands as determined from a fast Fourier transform
of the Shubnikov-de Haas oscillations.

Fig. 3. Conduction-band edge, electron density, and the Fermi level
calculated from a self-consistent solution of the
Schrödinger-Poisson equations. The free charge is measured by
integration of the charge density over the whole structure.


peak was considered strong enough that the filter was not
necessary. Each of the curves derived from these data has an
initially linear n_s(V_g) dependence and a tendency to saturate at
higher gate voltages. The gate voltage dependent sum of the
electron subbands is in good agreement with the respective charge
densities derived from Hall measurements. This indicates that the
mobilities of the electrons in the two subbands are similar.
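
The conversion from a Shubnikov-de Haas frequency to a subband density can be sketched as follows. For a spin-degenerate two-dimensional subband the oscillation frequency f in 1/B (units of tesla) corresponds to n = 2ef/h. This is only an illustrative sketch on synthetic data: it uses a direct Fourier sum instead of an FFT, and the synthetic density, the 1.5-7.5 T field window, and all function names are assumptions of the example, not the authors' analysis.

```python
import cmath
import math

E_CHARGE = 1.602176634e-19  # elementary charge (C)
PLANCK = 6.62607015e-34     # Planck constant (J s)


def sdh_frequency_to_density(freq_tesla):
    """Sheet density (m^-2) of one spin-degenerate subband from its SdH frequency."""
    return 2.0 * E_CHARGE * freq_tesla / PLANCK


def dominant_sdh_frequency(inv_b, signal, f_min, f_max, f_step):
    """Scan trial frequencies and return the one with the largest Fourier power.

    inv_b  : samples of 1/B (1/T), uniformly spaced
    signal : oscillatory magnetoresistance at those points
    """
    mean = sum(signal) / len(signal)
    best_f, best_power = f_min, -1.0
    n_steps = int(round((f_max - f_min) / f_step))
    for i in range(n_steps + 1):
        f = f_min + i * f_step
        acc = sum((s - mean) * cmath.exp(-2j * math.pi * f * x)
                  for x, s in zip(inv_b, signal))
        power = abs(acc) ** 2
        if power > best_power:
            best_f, best_power = f, power
    return best_f


# Synthetic single-subband trace (assumed values, for illustration only).
n_true = 2.0e16                                  # m^-2, i.e. 2.0e12 cm^-2
f_true = PLANCK * n_true / (2.0 * E_CHARGE)      # SdH frequency in tesla
n_pts = 512
x_lo, x_hi = 1.0 / 7.5, 1.0 / 1.5                # 1/B window for 1.5 T to 7.5 T
inv_b = [x_lo + (x_hi - x_lo) * k / (n_pts - 1) for k in range(n_pts)]
signal = [math.cos(2.0 * math.pi * f_true * x) for x in inv_b]

f_est = dominant_sdh_frequency(inv_b, signal, 5.0, 100.0, 0.25)
n_est = sdh_frequency_to_density(f_est)
print(f"estimated density: {n_est * 1e-4:.3e} cm^-2")
```

With two occupied subbands, as in the measurements described here, the same conversion would be applied to each spectral peak and the two densities summed.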

Figure 3 shows a representative plot of the calculated position of
the conduction-band edges of the heterostructure obtained by means
of Schrödinger-Poisson simulations. The simulated total n_s(V_g)
was found to be in fair agreement with the experimentally measured
values in the linear regime. However, our simulation does not
include possible DX centerlike defects, only the shallow ionized
donors. Because our measurements were made at 1.6 K, with a
comparatively small displacement of the Fermi level produced by the
applied gate voltages, the emission of electrons from DX centers is
considered negligible. However, we have introduced explicitly in
the simulations other deep levels found in In_xAl_{1-x}As. For
modeling purposes, we assume in
Ino.29Alo.71As  a  dominant  0.6  eV  deep  level  trap  whose  den¬ 
sity  is  considered  to  be  1X10^^  cm”^. 

If there are DX centers in the In0.29Al0.71As, there will be a
characteristic freeze-in temperature, T_f, below which the system
will no longer be in equilibrium. As the heterostructure is cooled
under the different gate biases, different amounts of free charge
would be trapped in the In0.29Al0.71As, the result being a change
in the ionized donor density, N^+. By extrapolating the charge
density in the linear regime of Fig. 2 to zero charge density, the
threshold voltage, V_t, expressed as

\[ V_t = \phi_b - \Delta E_c + E_{F0} - \frac{q N^{+}}{C_i} \tag{1} \]

can be obtained. In Eq. (1), φ_b = 1.25 eV is the Schottky barrier
height of In0.29Al0.71As, ΔE_c is the conduction-band offset
relative to InGaAs, and the parameter E_F0 = 0.04 eV is the Fermi
level at threshold, which was determined from the experimental data
for V_c = 0. The capacitance C_i can be computed as the dielectric
constant of In0.29Al0.71As (ε = 11.2ε_0) divided by the effective
distance between the gate and the center of the delta doping layer,
similar to the analysis in Tian et al. This is in reasonable
agreement with the capacitance C_i determined from Fig. 2 as the
slope of the curves in the linear regime.

Fig. 4. The threshold voltage and the calculated ionized impurity
density as deduced from Eq. (1) as a function of the cooling bias
V_c.

Using the capacitance and the threshold voltages determined from
Fig. 2, the ionized donor density can be calculated from Eq. (1).
Figure 4 shows a plot of both the measured threshold voltage and
the calculated ionized donor density as a function of the cooling
bias V_c. Because the total impurity density is considered to be
the sum of the shallow donors plus the charges trapped on these DX
centers, the decrease in N^+ with increasing V_c, as shown in
Fig. 4, is attributed here to the increase in the trapped charge on
the DX centers. If the total doping density is 6×10^12 cm^-2, then
we can see from Fig. 4 that about 28% of the charge has been
"frozen in" by the DX centerlike trap.
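
The inversion of the threshold-voltage relation for the ionized donor density can be sketched in a few lines. This is a minimal illustration, assuming the form V_t = φ_b − ΔE_c + E_F0 − qN^+/C_i with a parallel-plate capacitance C_i = ε/d. The values of φ_b, E_F0, ε, and the doping density come from the text; the conduction-band offset (0.7 eV) and the gate-to-doping distance (30 nm) are hypothetical placeholders, as are all function names.

```python
Q = 1.602176634e-19       # elementary charge (C)
EPS0 = 8.8541878128e-12   # vacuum permittivity (F/m)


def gate_capacitance(eps_r, distance_m):
    """Parallel-plate capacitance per unit area (F/m^2)."""
    return eps_r * EPS0 / distance_m


def threshold_voltage(n_plus, c_i, phi_b, delta_ec, e_f0):
    """Threshold voltage (V) for an ionized sheet donor density n_plus (m^-2)."""
    return phi_b - delta_ec + e_f0 - Q * n_plus / c_i


def ionized_donor_density(v_t, c_i, phi_b, delta_ec, e_f0):
    """Invert the threshold relation for the ionized donor density (m^-2)."""
    return (phi_b - delta_ec + e_f0 - v_t) * c_i / Q


def frozen_fraction(n_plus, n_doping):
    """Fraction of the doping charge that is not ionized (trapped)."""
    return 1.0 - n_plus / n_doping


PHI_B = 1.25        # eV, Schottky barrier height (from the text)
E_F0 = 0.04         # eV, Fermi level at threshold (from the text)
EPS_R = 11.2        # relative permittivity of In0.29Al0.71As (from the text)
N_DOPING = 6.0e16   # m^-2, i.e. 6e12 cm^-2 (from the text)
DELTA_EC = 0.7      # eV, HYPOTHETICAL conduction-band offset
DISTANCE = 30e-9    # m, HYPOTHETICAL gate-to-doping-layer distance

c_i = gate_capacitance(EPS_R, DISTANCE)
n_plus_example = 0.72 * N_DOPING   # illustrative: 28% of the charge trapped
v_t = threshold_voltage(n_plus_example, c_i, PHI_B, DELTA_EC, E_F0)
n_plus_back = ionized_donor_density(v_t, c_i, PHI_B, DELTA_EC, E_F0)
print(f"V_t = {v_t:.2f} V, trapped fraction = "
      f"{frozen_fraction(n_plus_back, N_DOPING):.2f}")
```

The round trip only shows that the two functions are mutually consistent; actual threshold voltages would have to be read from the measured curves.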

The saturation of n_s(V_g) implies Fermi level pinning or, in other
words, the Fermi level has risen to the lowest active donor energy
in the system. In this case, we believe it is caused by the thermal
activation energy of a DX center in a manner similar to that
observed in AlGaAs/GaAs heterostructures. From the conduction-band
diagram in Fig. 3, we can define this energy as

\[ E_t = \Delta E_c - E_F(n_s) - V_p, \tag{2} \]

where V_p is the potential difference between the bottom of the
δ-doping well and the heterointerface. In order to calculate this
quantity, we need a relation between the Fermi level and the charge
density. In the work by Chen et al., just such a relation was found
from the linear regime of a figure similar to Fig. 3. They found
E_F(n_s) = 5.84×10^-14 n_s + 0.04, with E_F in eV and n_s in cm^-2.
Inserting the measured saturation value of 3.4×10^12 cm^-2, we
obtain a value of E_F = 240 meV.

Now for the calculation of V_p, the normal component of the
displacement vector is continuous across the heterojunction

\[ \varepsilon_1 E_{(\mathrm{InGaAs})}(280^{+}) = \varepsilon_2 E_{(\mathrm{InAlAs})}(280^{-}) \tag{3} \]
Fig. 5. Mobility vs free-carrier density for several values of the
cooling potential bias V_c. As V_c becomes more positive, more
electrons become trapped in the In0.29Al0.71As layer as the
structure cools. If the scattering sites are correlated in a
DX^- - d^+ configuration, then the mobility measured at the 2DEG
interface will be higher than for uncorrelated defects. If there
are correlated DX centers in the In0.29Al0.71As, the mobility will
increase with increasing cooling bias, as shown in the figure.


and the electric field on the InGaAs side of the heterojunction is
E_(InGaAs)(280^+) = q n_s/(ε_(InGaAs) ε_0). Therefore the electric
field on the InAlAs side of the junction is, in this approximation,

\[ E_{(\mathrm{InAlAs})}(280^{-}) = \frac{q n_s}{\varepsilon_{(\mathrm{InAlAs})}\,\varepsilon_0}. \tag{4} \]

By inserting the saturated charge density n_s,sat and integrating
the electric field over the thickness of the spacer, d = 3 nm, we
obtain V_p = 0.165 eV. Now placing the values of V_p and E_F back
into Eq. (2), we obtain for the thermal ionization energy E_t a
value of 0.32 eV ± 25%.
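
The two numbers entering the pinning-energy estimate can be checked with a short calculation. This is a sketch under stated assumptions: the linear Fermi-level relation is taken as E_F = 5.84×10^-14 n_s + 0.04 eV (n_s in cm^-2), the spacer field is taken as uniform and equal to q n_s/(ε ε_0) with ε = 11.2, and the function names are purely illustrative.

```python
Q = 1.602176634e-19       # elementary charge (C)
EPS0 = 8.8541878128e-12   # vacuum permittivity (F/m)


def fermi_level_ev(n_s_cm2):
    """Fermi level (eV) from the assumed linear relation, n_s in cm^-2."""
    return 5.84e-14 * n_s_cm2 + 0.04


def spacer_potential_drop(n_s_cm2, spacer_m, eps_r):
    """Potential drop (V) across the spacer for a uniform field q*n_s/(eps_r*eps0)."""
    n_s_m2 = n_s_cm2 * 1.0e4            # convert cm^-2 to m^-2
    field = Q * n_s_m2 / (eps_r * EPS0)  # V/m
    return field * spacer_m


def thermal_ionization_energy(delta_ec_ev, e_f_ev, v_p_ev):
    """E_t = Delta E_c - E_F - V_p; the band offset must be supplied by the user."""
    return delta_ec_ev - e_f_ev - v_p_ev


N_SAT = 3.4e12   # cm^-2, measured saturation density (from the text)
e_f = fermi_level_ev(N_SAT)
v_p = spacer_potential_drop(N_SAT, 3e-9, 11.2)
print(f"E_F = {e_f * 1e3:.0f} meV, V_p = {v_p:.3f} eV")
```

Both values reproduce the figures quoted in the text (about 240 meV and 0.165 eV); the final ionization energy additionally depends on the conduction-band offset, which is left as an input here.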

To further strengthen the argument that we are seeing Fermi level
pinning due to a DX centerlike defect, it is useful to consider the
arguments in Buks et al. in support of the negative-U behavior of
DX centers in AlGaAs/GaAs heterostructures. This behavior is
related to the capture of two electrons by the center in its
relaxed state. If such a state exists, then, upon cooling the
heterostructure through the freeze-in temperature, the correlation
between the negatively charged DX^- state and the positively
charged substitutional donor state would result in an increase in
overall mobility for a given charge density as the cooling
potential increases. This is precisely what they measured in
Fig. 4(a) of that work. In Fig. 5, we plot the mobility in our
InAlAs/InGaAs versus the charge density in an identical fashion,
and we too see an increase in the mobility as V_c becomes more
positive. By reversing this argument, we believe the mobility
dependence implies a negative-U behavior of this defect, i.e., a DX
center.

Figure 6 is a plot of the different conduction-band minima of
In_xAl_{1-x}As as a function of alloy composition. On this plot, we
have put the thermal ionization level of the DX
Fig. 6. Conduction-band diagram of the In_xAl_{1-x}As alloy system
along with the thermal ionization energies of the DX centers.


center in AlAs, the alloy region where Calleja et al. claim not to
find DX centers, and the depth of the level measured in this work.
While Hong et al. measured an emission energy of 0.37 eV and a
photocapacitance turn-on at 0.61 eV, it is unclear where to place
their data, since they did not publish a thermal ionization energy.
If the DX center activation energy for In0.29Al0.71As is, in
accordance with our measurements, located 0.32 eV below the L
valley minimum, then the extrapolated linear relation in Fig. 6
suggests that no DX centers would be occupied in lattice-matched
InAlAs/InP, because the energy level would be resonant with the
conduction band. That no DX centers were found under pressure by
Calleja et al. could be due to the need to dope the material
more  heavily  than  1X10^^  cm“^. 

Relevant to a theory of the universality of the DX center is the
work by Walukiewicz regarding the position of native deep levels in
III-V compounds, all being referenced to the Fermi level pinning
energies, which are themselves referenced relative to the vacuum
level. In the case of In_xAl_{1-x}As in the indirect region of the
alloy system, the conduction-band (L band) edge moves up, while the
valence band moves down as the amount of indium increases.
Therefore, while the overall band gap of In_xAl_{1-x}As decreases
relative to AlAs, the conduction-band edge moves higher relative to
the Fermi level pinning energy. If one considers the DX center as a
kind of "native defect" of the doped material, the free energy of
formation of the defect remains constant relative to the vacuum
level. It is then the prediction of this theory that, as the L band
moves higher, the DX center energy level will move into the gap.
Once the alloy material moves into the direct region, the
conduction-band edge (Γ band) quickly drops relative to the Fermi
pinning energy, and the DX center should move back into the
conduction band. Therefore, this theory predicts DX centers should
be at their deepest at the direct-indirect transition (x = 0.3).
In_xAl_{1-x}As lattice matched to InP (x = 0.52) would not show any
evidence of DX centers, because the energy level moves back into
the conduction band. Further work on precisely these issues,
including the optical response of the heterostructures, is
currently in progress.

IV.  CONCLUSION 

In summary, from the cooling bias dependence of the threshold
voltage and the mobility, we conclude that a DX centerlike defect
has been found in heavily Si-doped In0.29Al0.71As. From a measure
of the ionized donor concentration and the known doping density, we
find up to 28% of the electrons are trapped in this defect. From
the pinning of the charge density in the two-dimensional electron
gas (2DEG), we derive a thermal ionization energy for the DX center
of 0.32 eV ± 25% below the conduction-band edge, twice the depth of
the DX center in AlGaAs. At this time, further studies of the
doping dependence and the photoconductive properties of new
heterostructures are in progress to determine other characteristics
of this DX-like energy level.

A set of δ-GaN_yAs_{1-y}/GaAs strained-layer superlattices grown on
GaAs (001) substrates by electron cyclotron resonance (ECR)
microwave plasma-assisted molecular beam epitaxy (MBE) was
characterized by ex situ high resolution x-ray diffraction (HRXRD)
to determine the nitrogen content y in the nitrided GaAs monolayers
as a function of growth temperature T. A first order kinetic model
is introduced to quantitatively explain this y(T) dependence in
terms of an energetically favorable N-for-As anion exchange and
thermally activated N surface desorption and surface segregation
processes. The nitrogen surface segregation process, with an
estimated activation energy E_s ≈ 0.9 eV, appears to be significant
during the GaAs overgrowth of GaN_yAs_{1-y} layers, and is shown to
be responsible for the strong y(T) dependence. © 1996 American
Vacuum Society.


I. INTRODUCTION

In the past few years we have seen a flurry of activity in the
development of visible light emitters based on the wide band gap
refractory semiconductor GaN, related alloys, and heterostructures.
Most of the research was concentrated on epitaxial growth
techniques for nitrides and on the growth of device structures
(LEDs and lasers) on various substrates, while comparatively little
work was done on other materials of potential technological
application, such as the mixed-anion nitride/arsenide system.
Although limited by a small solubility, even small amounts of N in
GaAs can cause unexpected band gap bowing toward the infrared,
while small amounts of As in GaN can substantially increase the
valence band spin-orbital splitting, which can significantly
influence operation of the active layer in a laser structure due to
the higher recombination rate. One of the main difficulties in
growing such nitride/arsenide structures, for example, by electron
cyclotron resonance plasma-assisted molecular beam epitaxy
(ECR-MBE), is controlling the structural and chemical properties of
the interface, which is only possible if the important microscopic
growth processes are fully understood. We have recently shown that
it is possible to produce high quality, fully commensurate
GaN_yAs_{1-y}/GaAs strained-layer superlattices (SLS) grown on a
GaAs substrate. Initial high resolution x-ray diffraction (HRXRD)
and reflection high energy electron diffraction (RHEED)
measurements done on such structures revealed the existence of
several distinct thermally activated processes but, limited by the
unavailability of a suitable quantitative model against which
these separate experimental observations could be tested

a)Electronic mail: zzbssdp.caltech.edu
b)Electronic mail: rjh@hyper.phy.okstate.edu


consistently, only semi-quantitative considerations of kinetics and
tentative identification of processes have been possible to date.

In this article, we introduce a quantitative model, based on first
order kinetic theory, to examine the initial nitridation (surface
N-for-As exchange), N surface desorption, and N surface segregation
processes observed in ECR-MBE grown GaN_yAs_{1-y}/GaAs
heterostructures characterized by ex situ HRXRD. Through
application of our model to an unusually strong growth temperature
dependence of y, obtained from x-ray experiments, the existence of
a thermally activated N surface segregation mechanism is
conclusively established and quantitatively confirmed. Although
previously suggested, the thermally activated N surface segregation
mechanism could not be conclusively established without an adequate
analytical model, which is necessary to describe the concurrent,
thermally activated N surface desorption and segregation processes.
Activation energies of both the surface segregation and desorption
processes are numerically estimated, and the impact of the
compositional-profile smearing of N, due to the strong N surface
segregation effect, on GaN/GaAs heteroepitaxy and interface quality
is analyzed and discussed.

II.  EXPERIMENTAL  DETAILS 

A set of 36-period δ-GaN_yAs_{1-y}/GaAs superlattices, with a
superlattice period consisting of one GaN_yAs_{1-y} monolayer (ML)
and 75 GaAs monolayers, was grown on a GaAs (100) substrate as a
function of temperature (540-580 °C) in an ECR-MBE system.
GaN_yAs_{1-y} monolayers were produced through brief (4 s) N2
plasma exposure (nitridation) of an As-stabilized GaAs surface.
Nitridation was immediately followed by GaAs overgrowth at a fixed
growth rate (0.75 ML/s) for 100 s. The surface of GaAs is then
As-stabilized through exposure to an As2 flux for only 30 s
(As-soak). In addition, one superlattice was grown with 6 s
nitridation at 550 °C. Further details about the growth system and
procedures can be found elsewhere.

Fig. 1. HRXRD around (004) substrate reflections for
δ-GaN_yAs_{1-y}/GaAs superlattices. Results indicate a strong
dependence of the rocking curve profiles on growth temperature.

To determine the strain and concentration profile, the entire
sample set is then characterized by ex situ HRXRD using a Philips
Materials Research Diffractometer (MRD). The standard ω/2θ scans
are performed in the 4-crystal mode using Ge (220) reflections,
around the substrate (004) reflections, while the reciprocal space
scans (area scans) around the (115) reflections are done in the
same mode, with additional use of a Bonse-Hart collimator in front
of the detector. The area scans confirmed that all samples are
coherently strained. The resultant ω/2θ rocking curves for the 4 s
nitridation SLS are presented in Fig. 1 (the 550 °C and 570 °C
scans have been omitted for clarity). The results reveal a strong
dependence of the epitaxial structure on growth temperature T,
indicating the presence of thermally activated microscopic
processes. Two-dimensional equivalent alloy compositions (y) are
determined from peak positions, and are confirmed through dynamical
simulations using the Philips High Resolution Simulation (HRS)
software package. The agreement typically obtained between the
experimental rocking curve and the simulation is presented in
Fig. 2 for the example of the superlattice grown at 550 °C.
Figure 3 shows an Arrhenius plot of the resultant compositional
dependence on growth temperature, y(T), for 4 s (circles) and 6 s
(diamond) nitridations, including experimental uncertainties in
temperature, which represent the measured variation over a given
wafer (i.e., temperature nonuniformity) due to unintended
differences in sample mounting. Although not included in the plot,
the absolute errors associated with the GaN composition can be
estimated to be ±0.5%. These errors are a result of a Vegard's law
approximation and uncertainties in the numerical values of the
elastic properties of the GaN_yAs_{1-y} layer.

[Horizontal axis: ω/2θ (degrees), 32.6 to 33.2]

Fig. 2. Typical agreement between experimental rocking curve and
HRS simulated rocking curve.

These results suggest the existence of two regimes: (i)
dose-limited at low temperatures; (ii) kinetically limited at high
temperatures, with an unexpectedly strong fall-off in y with
increasing growth temperature.

[Horizontal axis: 1000/T (K^-1)]

Fig. 3. Growth-kinetics models to explain the strong y(T)
dependence observed for ECR-MBE grown δ-GaN_yAs_{1-y}/GaAs
strained-layer superlattices. Experimental data points are obtained
through HRXRD measurements for 4 s (circles) and 6 s (diamond)
nitridations. The dashed curve represents the model neglecting N
surface segregation [Eq. (2)], while the solid curve represents the
model including segregation effects [Eq. (8)].

III.  KINETICAL  MODELING 

For the purpose of understanding the physical origin of this y(T)
dependence, we have developed a first order kinetic model which
explicitly incorporates the three phases of our particular SLS
growth sequence: (i) nitridation, (ii) overgrowth, and (iii)
As-soak, as described in Sec. II.

A.  Nitridation 

The net rate of surface nitridation is determined by two physical
processes which have been previously identified: (i) N-for-As
surface anion exchange (N gain), and (ii) surface N desorption (N
loss). These processes can be analytically modeled, to first order
in the N surface composition y, as

\[ \frac{dy}{dt} = \sigma\Phi\,(1 - y) - \frac{y}{\tau_d}. \tag{1} \]

The first term on the right hand side of Eq. (1) corresponds to the
N gain on the As-stabilized GaAs surface due to the energetically
favorable anion exchange. This process is modeled as being
proportional, to first order, to the flux Φ of incident physically
activated N, to the fraction of available As surface sites to be
exchanged with N, and to an overall efficiency factor σ (units of
area), which in general might depend on surface chemistry, strain,
and temperature, but is approximated as constant over the
temperature and composition range of interest, for the purpose of
our simplified model. The second term in this equation, -y/τ_d,
represents the N loss due to surface desorption. It can be simply
modeled in terms of a thermally activated rate constant
τ_d^-1 = τ_0d^-1 exp(-E_d/kT), with activation energy E_d. Such a
process has been directly observed in our RHEED measurements of
nitrided GaAs surfaces, and estimates for E_d (~2.1 eV) have been
previously reported.

If we define an "effective dosing rate" r = σΦ, the solution to
Eq. (1) for an initially N-free (As-stabilized) surface can be
written as

\[ y(t_{\mathrm{exp}}) = y_{ss}\left[1 - \exp(-t_{\mathrm{exp}}/\tau)\right], \tag{2} \]

where y_ss = rτ, τ^-1 = r + τ_d^-1, and t_exp is the nitrogen
plasma exposure time. If Eq. (2) is considered in the limit of low
temperatures so that the desorption rate τ_d^-1 is negligible, then
y_ss → 1 would be obtained. This reflects the assumption that As
atoms from the As-enriched (100) GaAs surface, when exposed to N
plasma, will be completely exchanged for N atoms in the steady
state, since the N terminated surface has a lower surface free
energy, due to a much stronger Ga-N than Ga-As bond (heats of
formation 6.81 eV and 5.55 eV, respectively). Although the
essential nitridation kinetics is well described by Eq. (2) without
unnecessary mathematical complexity, it ignores the subsequent
stages of growth of our superlattices, and therefore cannot be used
to model our experimental results directly. This can be seen in
Fig. 3, where the dashed curve represents the plot of Eq. (2)
taking t_exp = 4 s and using our RHEED based experimental
desorption parameters, E_d = 2.1 eV and τ_d = 1 s at 592 °C. In
addition, the dosing rate r is estimated to be approximately 0.1 Hz
from experimental values of y at 550 °C for 4 s and 6 s
nitridations, such that Eq. (2) properly describes the dose-limited
(low temperature) regime of growth.
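
The nitridation-only curve can be sketched numerically as follows. This is a minimal illustration, assuming the first-order solution y = y_ss[1 − exp(−t_exp/τ)] with 1/τ = r + 1/τ_d and y_ss = rτ, and an Arrhenius desorption time anchored at τ_d = 1 s at 592 °C with E_d = 2.1 eV and r = 0.1 Hz. The function names and the chosen temperatures are assumptions of the example.

```python
import math

K_B = 8.617333262e-5   # Boltzmann constant (eV/K)


def desorption_time(temp_c, e_d=2.1, tau_ref=1.0, temp_ref_c=592.0):
    """Arrhenius desorption time (s), anchored at tau_ref at temp_ref_c."""
    t_k = temp_c + 273.15
    t_ref_k = temp_ref_c + 273.15
    return tau_ref * math.exp(e_d / K_B * (1.0 / t_k - 1.0 / t_ref_k))


def nitrided_fraction(temp_c, t_exp=4.0, dose_rate=0.1):
    """N surface fraction after t_exp seconds of plasma exposure (no segregation)."""
    rate = dose_rate + 1.0 / desorption_time(temp_c)   # 1/tau
    y_ss = dose_rate / rate                            # steady-state coverage
    return y_ss * (1.0 - math.exp(-t_exp * rate))


for temp_c in (540.0, 550.0, 560.0, 570.0, 580.0):
    print(f"T = {temp_c:.0f} C  ->  y = {nitrided_fraction(temp_c):.3f}")
```

In the low-temperature limit the result tends to 1 − exp(−r t_exp), which is the dose-limited regime; the decrease with temperature comes only from desorption in this sketch.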


B.  GaAs  overgrowth — N  segregation 

Inspection of Fig. 3 clearly reveals that the processes of anion
exchange and N surface desorption alone result in the weak fall-off
of the model (dashed) curve, and thus cannot account for the strong
y(T) dependence in the kinetically limited growth regime. Also,
even N surface desorption during the short monolayer deposition
time (t_ML) required to bury (and freeze in) the nitrided
GaN_yAs_{1-y} layer cannot result in significant N loss. To
overcome this, we hypothesize that a thermally activated N surface
segregation process, described by the rate constant
τ_s^-1 = τ_0s^-1 exp(-E_s/kT), is occurring concurrently with N
desorption during GaAs overgrowth. This assumption is supported
qualitatively by our experimental observation that the GaN-related
(3×3) surface reconstruction proceeds faster to the GaAs (2×4)
reconstruction at a lower substrate temperature during GaAs
overgrowth, which is consistent with freeze-out of a thermally
activated N segregation process.

Let us assume that (1) segregation occurs from the first subsurface
layer upward to the current surface layer; (2) desorption occurs
from all currently exposed surfaces, i.e., the current growth
surface and the current partially exposed first subsurface layers;
and (3) the segregation and desorption processes are statistically
independent, and are occurring concurrently. Also, let S_n
represent the relative GaN mole fraction in monolayer n, where by
definition the content of the nitrided layer at the end of the
nitridation process is normalized to unity, S_0(t = 0) = 1. Under
these assumptions, we can relate the N content of monolayers n and
n + 1 through

\[ S_{n+1}(0) = \left[S_n(0) - S_n(t_{ML})\right] \frac{\tau_s^{-1}}{\tau_s^{-1} + \tau_d^{-1}}. \tag{3} \]

Equation (3) describes the fraction of lost N,
[S_n(0) - S_n(t_ML)], which segregates rather than desorbs, and
becomes the nitrogen content of the next surface layer. Since layer
n loses its N content due to both N desorption and segregation,
during the time interval 0 ≤ t ≤ t_ML, S_n(t) decays exponentially
as exp(-t/τ_tot), where τ_tot^-1 = τ_s^-1 + τ_d^-1. Using this
expression in Eq. (3) leads us to

\[ S_{n+1}(0) = S_n(0)\left[1 - \exp(-t_{ML}/\tau_{tot})\right] \frac{\tau_{tot}}{\tau_s}. \tag{4} \]

It is obvious from Eq. (4) that S_n(0) represents a geometrical
progression. Now, similarly to Eq. (3), we can define D_n as the
amount of initial material lost through desorption from layer n:

\[ D_n = \left[S_n(0) - S_n(t_{ML})\right] \frac{\tau_{tot}}{\tau_d}. \tag{5} \]
Therefore, the total loss of N from one superlattice period during
overgrowth would be

\[ L = \sum_{n=0}^{\infty} D_n, \tag{6} \]

which is simply a sum of a geometric progression. Finally, the
total amount of the original GaN content retained within one period
of a superlattice is 1 - L, or straightforwardly

\[ R = 1 - L = \frac{\exp(-t_{ML}/\tau_{tot})}{1 - (\tau_{tot}/\tau_s)\left[1 - \exp(-t_{ML}/\tau_{tot})\right]}. \tag{7} \]

It is important to note that the expression for R given in Eq. (7)
gives the correct limiting behavior in describing the loss of N
during the GaAs overgrowth, in the limits of no segregation or no
desorption. If there is desorption, but no segregation, then the
only opportunity for N loss would be before the nitrided layer is
completely buried in GaAs; i.e., R = exp(-t_ML/τ_d), which is
obtained directly from Eq. (7) in the limit τ_s → ∞. Similarly, if
there is segregation, but no desorption, then R must be unity,
since the only possible mechanism for N loss is the desorption
process. In that case, where τ_d → ∞, one would obtain R → 1, as it
should be.
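
The layer-by-layer bookkeeping behind the retained fraction R can be sketched as follows. This is a hedged illustration: it assumes that during each monolayer time t_ML a layer loses the fraction 1 − exp(−t_ML/τ_tot) of its nitrogen, that the lost nitrogen is split between segregation and desorption in the ratio of their rates, and that t_ML = 1/0.75 s for the stated growth rate. The time constants used below (τ_s = 1 s, τ_d = 4 s) and all function names are hypothetical.

```python
import math


def retained_fraction(t_ml, tau_s, tau_d, n_layers=75):
    """Track N through successive overgrown layers.

    Returns (R, profile): R is the fraction of the initial N not desorbed,
    profile[n] is the N frozen into layer n once it is buried.
    """
    inv_tot = 1.0 / tau_s + 1.0 / tau_d
    lost_per_layer = 1.0 - math.exp(-t_ml * inv_tot)  # fraction leaving a layer
    p_seg = (1.0 / tau_s) / inv_tot                   # share that segregates
    p_des = (1.0 / tau_d) / inv_tot                   # share that desorbs
    s_n = 1.0            # S_0(0) = 1 by normalization
    desorbed = 0.0
    profile = []
    for _ in range(n_layers):
        desorbed += s_n * lost_per_layer * p_des
        profile.append(s_n * (1.0 - lost_per_layer))
        s_n *= lost_per_layer * p_seg                 # content of the next layer
    return 1.0 - desorbed, profile


def retained_fraction_closed_form(t_ml, tau_s, tau_d):
    """Closed form of the same sum for infinitely many overgrown layers."""
    tau_tot = 1.0 / (1.0 / tau_s + 1.0 / tau_d)
    decay = math.exp(-t_ml / tau_tot)
    return decay / (1.0 - (tau_tot / tau_s) * (1.0 - decay))


T_ML = 1.0 / 0.75   # s per monolayer at 0.75 ML/s
r_num, profile = retained_fraction(T_ML, tau_s=1.0, tau_d=4.0)
r_closed = retained_fraction_closed_form(T_ML, tau_s=1.0, tau_d=4.0)
print(f"R (75 layers) = {r_num:.4f}, R (closed form) = {r_closed:.4f}")
print("first layers of the N profile:", [round(s, 4) for s in profile[:4]])
```

The `profile` list is the compositional smearing along the growth direction implied by these assumptions; very large values of `tau_s` or `tau_d` recover the no-segregation and no-desorption limits discussed above.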


IV. RESULTS AND DISCUSSION

Using the "segregation correction factor" R from Eq. (7), it is
clear that in order to consistently incorporate N surface
segregation into our model, Eq. (2) must be replaced by

\[ y(t_{\mathrm{exp}}) = R\, y_{ss}\left[1 - \exp(-t_{\mathrm{exp}}/\tau)\right]. \tag{8} \]

Thus, Eq. (8) represents the final result of our model and
quantitatively describes the growth sequence
(nitridation/overgrowth/soak) of our δ-GaN_yAs_{1-y}/GaAs
superlattices, taking into account the microscopic processes of
anion exchange, desorption, and segregation, and is plotted as the
solid curve in Fig. 3. This curve is calculated using the same
dosing (r) and desorption (E_d and τ_0d) parameter values that were
used in Eq. (2) to obtain the dashed curve, while adjusting at the
same time the two segregation parameters, E_s and τ_0s. A nonlinear
least-squares fit to the experimental data points for 4 s
nitridations finally results in an estimate for the N surface
segregation energy equal to 0.9 eV ± 30%. From Fig. 3, it is seen
that with the corrected Eq. (8) very good agreement with the
experimental data is obtained, which confirms our N surface
segregation hypothesis.

Additionally, the compositional-profile smearing of N along the
growth direction would be expected as a natural consequence of the
segregation process, which is confirmed in preliminary cross
sectional scanning tunneling microscopy (STM) images of one of our
superlattices. Unfortunately, such profile smearing effects would
tend to limit the abruptness of the GaN/GaAs interface. However, it
is possible that smearing effects might be controlled through some
of the MBE techniques which have been successfully used in the
problem of dopant-profile smearing in MBE-grown Si. Also,
cross-sectional STM as a direct imaging technique might provide
another way to determine E_s, since our model predicts the
distribution of nitrogen in the superlattice period, S_n(0), as a
function of N surface activation energy and growth temperature T.

V. SUMMARY

In conclusion, a simple but plausible kinetic model has been
developed to conclusively establish and quantitatively explain some
of the microscopic processes observed in ECR-MBE grown
GaN_yAs_{1-y}/GaAs strained-layer superlattices. In particular, the
strong y(T) dependence, obtained through ex situ HRXRD of our
superlattices, is fully understood in terms of energetically
favorable surface anion exchange and combined, thermally activated
N surface segregation and surface desorption. Finally, our model
predicts for the first time numerical estimates of the kinetic
parameters associated with the N surface segregation process, which
appears to be significant under typical GaN/GaAs ECR-MBE growth
conditions.

A study of strain relaxation in GaN_xP_{1-x} epilayers grown by
chemical beam epitaxy, using a rf-plasma nitrogen radical beam
source, tertiarybutylphosphine, and triethylgallium, is reported.
Microcracks are observed in GaN_xP_{1-x} epilayers grown on GaP
when the nitrogen composition is greater than 1.6%. Transmission
electron microscopy results show that the tensile strain in
GaN_xP_{1-x} epilayers is initially relieved by microcrack
formation without misfit dislocations. These microcracks penetrate
through the interface, degrading the crystallinity of the GaP
substrate. Microcrack formation cannot be alleviated by adjusting
the growth rate, growth temperature, V/III ratio, or forward plasma
power, but it can be eliminated by reducing the growth area of the
GaP substrate. © 1996 American Vacuum Society.


I.  INTRODUCTION 

Recently, high-brightness blue-light-emitting diodes and
high-temperature electronics have been successfully
demonstrated using the group III nitrides. However, since their
lattice constants are much smaller than that of Si, their
integration with high-speed Si electronics is difficult. GaNxP1-x
alloys, which can be lattice matched to Si substrates, are
promising for potential optoelectronic applications, but
problems including antiphase domains, pregrowth preparation,
and differences in the linear thermal expansion coefficient
also make direct growth on Si difficult. To overcome
some of these problems, GaP/Si can be used as a substrate.
Therefore, we are investigating the growth of GaNxP1-x on
GaP.

A variety of methods have been implemented to grow
GaNxP1-x alloys on GaP using high-pressure ammonia as
the N source, co-injected with phosphine or
tertiarybutylphosphine (TBP). In this study, the GaNxP1-x samples
were grown by chemical beam epitaxy (CBE) using a rf-plasma
N radical beam source. This N source is highly reactive
and has been successfully used to grow GaN and
GaNxAs1-x, and to dope ZnSe p-type for blue-light-emitting
laser diodes. Recently, we have also demonstrated
its use for the growth of GaNxP1-x, where up to 16% N
was incorporated. This is the highest N content reported to
date for this alloy.

The lattice constant of GaNxP1-x alloys is smaller than
that of the GaP substrate; therefore, GaNxP1-x epilayers are
under tensile stress. Whereas compressively strained material
systems have been extensively studied, there are few reports
on tensile-strain relaxation. In this study we report strain
relaxation and interface microstructure in GaNxP1-x/GaP.
We found that the tensile misfit strain between GaNxP1-x
and GaP is accommodated by microcrack formation rather
than dislocations. Our investigations of the effects of the
growth parameters on the surface morphology of GaNxP1-x
epilayers are also reported.

a) Electronic mail: nli@sdcc3.ucsd.edu

b) Present address: Department of Materials Science and Mineral Engineering,
University of California, Berkeley, CA 94720.


II.  EXPERIMENT 

The CBE system used in this work was modified from a
Perkin-Elmer 425B MBE system. Triethylgallium (TEGa),
nitrogen (N2), and TBP were used as the gallium, nitrogen,
and phosphorus sources, respectively. TEGa and TBP were
introduced into the chamber without any carrier gas, and the
flow rates were controlled by mass-flow controllers. The
temperature of the TBP cracker was 800 °C. Ultrahigh-purity
nitrogen was introduced into an Oxford Applied Research
rf-activated plasma source, with the flow rate controlled by a
mass-flow controller. The rf-plasma N radical beam source
was operated at 13.56 MHz with a forward power of 300 W.
The N2 flow rate was varied from 0.375 to 1.5 sccm, and the
chamber  pressure  was  about  4X10~^  Torr  during  growth. 
The substrates used in this study were nominally undoped
(100) GaP. Before being loaded into the CBE chamber, they
were chemically etched in HCl:HNO3:H2O (4:4:5) for 5 min
and finally given a deionized (DI) water rinse.

In this study, GaNxP1-x (0.007 ≤ x ≤ 0.102) samples,
0.5-0.8 μm thick, were grown with different growth parameters to
study the strain relaxation under tensile stress. Typically, the
growth rate and the growth temperature were varied from
0.56 to 0.85 monolayers per second (ML/s) and from 400 to
690 °C, respectively. The gallium and phosphorus incorporation
rates were determined by group III- and group
V-induced reflection high-energy electron diffraction
(RHEED) intensity oscillations, respectively. The V/III
(P/Ga) incorporation ratio was about 1.3. The (P+N)-induced
incorporation rate cannot be determined from group V-induced
RHEED intensity oscillations because the N radical beam
cannot be completely blocked by the shutter. RHEED shows
a (2×1) streaky pattern during growth. The surface morphology
and strain relaxation of GaNxP1-x epilayers were characterized
by atomic force microscopy (AFM), scanning electron
microscopy (SEM), Nomarski microscopy, high-resolution
double-crystal x-ray diffraction, and transmission
electron microscopy (TEM).

Fig. 1. The x-ray FWHM of the GaP substrates as a function of the N composition. The inset shows (400) x-ray rocking curves of a 0.8-μm-thick
GaN0.025P0.975 and a 0.5-μm-thick GaN0.1P0.9 on GaP using a rf-activated plasma N radical beam source.
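
As a rough guide to what the quoted growth rates mean in practice, the following sketch converts a layer thickness and a rate in ML/s into a growth time. It assumes that one monolayer along [100] is half the cubic lattice constant and uses an assumed literature value for the GaP lattice constant; the small change in lattice constant caused by N incorporation is ignored, and the function names are illustrative only.

```python
# Minimal sketch: growth time for a given thickness and RHEED-calibrated rate.
# Assumption: 1 ML along [100] = a/2, with a taken as the GaP lattice constant.

A_GAP_ANGSTROM = 5.4505  # assumed literature value for GaP, in angstroms


def monolayer_thickness(a=A_GAP_ANGSTROM):
    """Thickness of one (100) monolayer (one Ga plane plus one P plane)."""
    return a / 2.0


def growth_time_s(thickness_um, rate_ml_per_s, a=A_GAP_ANGSTROM):
    """Time in seconds to grow thickness_um micrometers at rate_ml_per_s."""
    monolayers = thickness_um * 1.0e4 / monolayer_thickness(a)  # 1 um = 1e4 A
    return monolayers / rate_ml_per_s


if __name__ == "__main__":
    for rate in (0.56, 0.85):
        for thickness in (0.5, 0.8):
            minutes = growth_time_s(thickness, rate) / 60.0
            print(f"{thickness} um at {rate} ML/s: about {minutes:.0f} min")
```

Under these assumptions, the layers in this study correspond to growth runs of a few tens of minutes to about an hour and a half.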

III.  RESULTS  AND  DISCUSSION 

(400) x-ray rocking curves of GaNxP1-x samples are
shown in the inset of Fig. 1. If the film were completely relaxed,
the (400) peak splitting (dashed line) would indicate a N composition
of 4.9%, using literature values for the GaP and cubic
GaN elastic constants and Vegard's law. However, analysis
of additional asymmetric {511} x-ray rocking curves
shows that the epilayer is still highly strained, only 2.5
± 0.2% relaxed in both the [011] and [01̄1] in-plane directions.
Therefore, the N content is determined to be 2.5% (misfit =
−0.47%). Similar analysis indicates that the film corresponding
to the solid line in the inset of Fig. 1 is 58.3% relaxed
(misfit = −1.8%) and has a N composition of 10.2%.
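
The link between N composition and misfit used above can be sketched with Vegard's law, which interpolates the alloy lattice constant linearly between the binary end points. The lattice constants below are assumed literature values, not numbers taken from this paper, so the results are close to, but not necessarily identical with, the misfits quoted in the text.

```python
# Minimal sketch of Vegard's law for GaN(x)P(1-x) on GaP.
# Assumptions: both lattice constants are approximate literature values.

A_GAP = 5.4505  # angstrom, GaP (assumed)
A_GAN = 4.50    # angstrom, cubic (zinc-blende) GaN (assumed)


def vegard_lattice_constant(x):
    """Relaxed lattice constant of GaN(x)P(1-x) by linear interpolation."""
    return (1.0 - x) * A_GAP + x * A_GAN


def misfit(x):
    """Misfit (a_film - a_substrate) / a_substrate; negative means tensile."""
    return (vegard_lattice_constant(x) - A_GAP) / A_GAP


if __name__ == "__main__":
    for x in (0.025, 0.046, 0.102):
        print(f"x = {x:.3f}: misfit = {100.0 * misfit(x):+.2f}%")
```

The negative sign reflects the smaller lattice constant of the alloy, which is why the epilayers are under tension.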


Figure 1 shows the full width at half-maximum (FWHM)
of the GaP substrate (400) x-ray peak as a function of the N
composition. Compared to a normal GaP substrate, with a FWHM
of 20 arcsec, a much broader substrate peak with a FWHM
of 102 arcsec is observed for 0.5-μm-thick GaN0.1P0.9 grown
on GaP. Figure 2 shows a cross-section TEM image of a
0.8-μm-thick GaN0.025P0.975 epilayer. A microcrack is observed in
the epilayer that propagates into the substrate, ending in a
stacking fault. A similar TEM result was also observed for
the sample with a N composition of 4.6% (misfit = −0.81%).
No misfit dislocations were observed, at least up to 4.6% N.
This is surprising since, based on the Matthews and Blakeslee
model, the critical layer thickness is about 200 Å. With
increasing lattice mismatch between GaNxP1-x and GaP,
microcracks apparently form and propagate into the substrate,
degrading the crystalline quality of both the GaP substrate
and the epilayer. We found that the higher the N
composition or the thicker the epilayer, the broader the GaP
substrate x-ray peak.

Fig. 2. Bright-field cross-section TEM micrograph of GaN0.025P0.975 grown at
640 °C. A microcrack extending from the surface through the epilayer into
the substrate is visible.
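
For orientation, the order of magnitude of the critical thickness can be reproduced with a short numerical sketch. It assumes one commonly quoted single-layer form of the Matthews and Blakeslee expression for 60° dislocations, h_c = b(1 − ν cos²α) / [8π f (1 + ν) cos λ] × [ln(h_c/b) + 1], with an assumed GaP lattice constant and Poisson ratio. The paper does not state which variant or which parameter values were used, so this is an illustration rather than a reconstruction of the authors' calculation.

```python
# Minimal sketch: Matthews-Blakeslee critical thickness by fixed-point iteration.
# Assumptions: single-layer form with an 8*pi prefactor, 60-degree dislocations
# (cos(alpha) = cos(lambda) = 0.5), and approximate GaP material parameters.
import math


def critical_thickness(misfit, a=5.4505, nu=0.31,
                       cos_alpha=0.5, cos_lambda=0.5, iterations=100):
    """Return the critical layer thickness in angstroms.

    misfit: lattice misfit as a fraction (sign is ignored).
    a: substrate lattice constant in angstroms (assumed GaP value).
    nu: Poisson ratio (assumed value).
    """
    f = abs(misfit)
    b = a / math.sqrt(2.0)  # Burgers vector magnitude of an a/2<110> dislocation
    prefactor = (b * (1.0 - nu * cos_alpha ** 2)
                 / (8.0 * math.pi * f * (1.0 + nu) * cos_lambda))
    h = 1000.0 * b  # start from a large thickness so the iteration converges
    for _ in range(iterations):
        h = prefactor * (math.log(h / b) + 1.0)
    return h


if __name__ == "__main__":
    for f in (0.0047, 0.0081):
        print(f"misfit {100.0 * f:.2f}%: h_c is about {critical_thickness(f):.0f} A")
```

With these assumptions, the misfit of 0.47% gives a value in the low hundreds of ångströms, far below the 0.5-0.8 μm layers studied here, which is why the absence of misfit dislocations is notable.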

The surface morphology of the GaNxP1-x epilayers is
optically specular, but, consistent with the TEM and x-ray
diffraction results, microcracks are visible by eye or with an
optical microscope when the N composition exceeds 1.6%. Figure
3 shows an AFM image of the GaN0.025P0.975 epilayer. Microcracks
with narrow openings (50 nm) are observed along both the [011]
and [01̄1] directions, and one with a wider opening of 350
nm is also observed. Between the microcracks, the surface
roughness of the GaN0.025P0.975 epilayer is about 15 Å,
indicating a smooth surface morphology. Using SEM, wide
microcracks in the range 150-550 nm are observed with a density
of <1000/cm. We also noticed, through a viewport of the CBE
system, large cracks forming during growth when the thickness of
the epilayer was about 0.3 μm. Figure 4 shows the percentage
of strain relaxation in 0.5-μm-thick GaNxP1-x epilayers
as a function of the N composition. The GaNxP1-x epilayers
with N compositions of 2.5%, 4.6%, 5.7%, 7.8%, and 10.2%
have relaxed 2.4%, 3.2%, 11.9%, 33.8%, and 58.3%, respectively.
The percentage of strain relaxation in GaNxP1-x epilayers
increases dramatically when the N composition is
higher than 4.6%. This may be associated with the onset of
misfit dislocation formation; however, TEM still needs to be
carried out to confirm this.

Fig. 3. AFM image of a 0.8-μm-thick GaN0.025P0.975 grown at 640 °C. Microcracks are observed in both the [011] and [01̄1] directions.

Fig. 4. The percentage of strain relaxation of GaNxP1-x (0.007 ≤ x ≤ 0.102)
in both the [011] and [01̄1] directions as a function of N composition.
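
To see what these relaxation percentages imply for the strain that remains in the layers, one can combine them with the misfits quoted in the text. The sketch below assumes the usual definition of the degree of relaxation R, so that the residual in-plane strain is approximately −f(1 − R) for a misfit f. Only the three samples whose misfit is stated explicitly in the text are included, and the helper name is illustrative.

```python
# Minimal sketch: residual in-plane strain from misfit and degree of relaxation.
# Assumption: residual strain ~ -f * (1 - R), with f the misfit and R in [0, 1].


def residual_strain(misfit, relaxation):
    """Residual in-plane strain of the epilayer (positive means tensile)."""
    return -misfit * (1.0 - relaxation)


# (N composition, misfit, fraction relaxed) as quoted in the text.
SAMPLES = [
    (0.025, -0.0047, 0.024),
    (0.046, -0.0081, 0.032),
    (0.102, -0.018, 0.583),
]

if __name__ == "__main__":
    for x, f, r in SAMPLES:
        strain = residual_strain(f, r)
        print(f"x = {x:.3f}: residual tensile strain = {100.0 * strain:.2f}%")
```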

Based on the above results, we conclude that when the
GaNxP1-x epilayer thickness exceeds a certain thickness, the
strain is relieved initially by microcracks rather than
dislocations. By taking asymmetric {511} x-ray rocking curves in
the two perpendicular in-plane <110> directions, we found
that the strain relaxation via microcracks in GaNxP1-x
(x ≤ 0.1) epilayers is isotropic. Microcrack densities are
symmetric in the [011] and [01̄1] directions. This result
differs from the anisotropic strain relaxation via dislocations
in most III-V compounds grown with compressive strain.
It also differs from results for other tensile-strained material
systems. Gao et al. reported that the distribution of
microcracks in 4.5-μm-thick GaAs/Si is anisotropic. The majority
of the microcracks run in the [011] direction and only a few
run in the [01̄1] direction, which is possibly related to the
substrate misorientation. Woodbridge et al., however,
found that GaAs/Si has no clear preference for microcracks
in either <011> direction, which is similar to our result. Olsen
et al. also reported asymmetric microcracks in epitaxial
InGaP on GaAs when the tensile misfit exceeded −0.15%, and
the asymmetric cracking was explained by invoking an asymmetry
in the distribution of dislocations involved in the microcrack
formation.


Several groups have reported the growth of GaNxP1-x by
gas-source molecular beam epitaxy (GSMBE) and
organometallic vapor phase epitaxy (OMVPE). Microcracks,
however, are not observed for 0.5-μm-thick GSMBE-grown
GaNxP1-x samples with N compositions up to 16%.
Microcracks are observed in our study when the N composition
is higher than 1.6%, which is much smaller than 16%.
This discrepancy may be due to the different growth
mechanisms of CBE, GSMBE, and OMVPE. Furthermore, the
C impurity concentration might play an important role in the
tensile-strain relaxation process. In our previous growth
study of GaNxP1-x, secondary-ion mass spectroscopy
shows  that  the  C  impurity  has  a  concentration  of  7X10^^ 
cm^-3 in the epilayer, mainly from the dissociation of TBP
cracked at 800 °C. Due to the smaller covalent radius of
carbon, a lattice contraction should be expected in GaP,
which can introduce an extra tensile stress in the epilayer.
Also, the unintentional p-type C doping in GaNxP1-x probably
increases the activation energy for nucleation and/or
mobility of dislocations, facilitating tensile-strain relaxation
via microcracks. Further studies are necessary to confirm an
effect of the C impurity on the tensile-strain relaxation in
GaNxP1-x.

To eliminate microcrack formation in the GaNxP1-x epilayers,
growth parameters such as growth rate, V/III ratio,
growth temperature, and forward plasma power were varied.
We found that microcracks in GaNxP1-x epilayers cannot be
alleviated by adjusting these growth parameters, but the surface
morphology can be improved significantly by reducing
the growth area. A patterned GaP substrate with SiO2 as a
mask was used to reduce the growth area of the substrate for
GaNxP1-x growth. Our preliminary results show that microcracks
are not observed in 0.5-μm-thick GaNxP1-x epilayers
with N compositions up to 10.2% grown in the
small area (300 μm × 300 μm) of a patterned GaP substrate.
This result implies that strain relaxation via microcrack
formation in GaNxP1-x epilayers can be effectively eliminated
by reducing the growth area. This effect is still under
investigation, and further results will be reported in detail
elsewhere.


IV.  SUMMARY 

AFM, Nomarski microscopy, SEM, double-crystal x-ray
diffraction, and TEM were used to study the microstructure
of GaNxP1-x epilayers on GaP grown by CBE using a
rf-plasma N radical beam source. We found that even a small N
composition of 1.6% in the GaNxP1-x epilayer leads to the
formation of microcracks in this tensile-strained material system.
From TEM investigations, for N compositions up to at
least 4.6%, no threading or misfit dislocations are found in
GaNxP1-x/GaP epilayers. The tensile misfit strain between
GaP and GaNxP1-x grown by CBE is first accommodated by
microcrack formation rather than misfit dislocations. The
strain relaxation in GaNxP1-x/GaP, up to 60%, is isotropic in
the [011] and [01̄1] directions.

We have doped boron carbide with nickel during growth by plasma-enhanced chemical vapor deposition. The
source gas closo-1,2-dicarbadodecaborane (ortho-carborane) was used to grow the boron carbide,
while nickelocene [Ni(C5H5)2] was used to introduce nickel into the growing film. The doping with
nickel transformed a p-type B5C material, relative to lightly doped n-type silicon, into an n-type
material. Both n-n heterojunction diodes and n-p heterojunction diodes were constructed, using as
substrates n- and p-type Si(111), respectively. With sufficient partial pressures of nickelocene in the
plasma reactor, diodes with characteristic tunnel diode behavior can be successfully fabricated.


I. INTRODUCTION

Through the decomposition of cluster borane molecules
by plasma-enhanced and synchrotron radiation-induced
chemical vapor deposition, heterojunction devices of boron
and boron carbide have been successfully fabricated. These
techniques have not only been used to construct simple diodes,
but have also successfully yielded field effect
transistors. Until now, no attempts have been made to
intentionally dope the intrinsic films of boron carbide using the
technique of plasma-enhanced chemical vapor deposition
(PECVD).

The introduction of metal dopants into boron carbide is
not a trivial process. Initial attempts to dope molecular films
of the precursor cluster molecule closo-1,2-dicarbadodecaborane
(C2B10H12), otherwise known as orthocarborane,
with the common dopant mercury were
unsuccessful. Mercury was found to segregate to the
molecular film-substrate interface, which is indicative of the weak
interaction between Hg and orthocarborane. While these results
were initially disappointing, subsequent attempts to dope
molecular films of orthocarborane did prove to be successful: such
molecular films can be doped with sodium. The possibility of
doping boron carbide with Hg cannot be excluded based on
these results. This work, nonetheless, did suggest that doping
of this material may be a complex process. This is particularly
true since the suitability of orthocarborane (C2B10H12)
for the chemical vapor deposition of B5C films has been
established and because this molecule is very similar to
the "building block" of boron carbide.

Nickel, however, is a very promising dopant for the boron
rich solids. A molecular nickel carborane complex has been
synthesized by inorganic chemists, and the inclusion of
nickel in other boron rich solids is well established. Nickel is
a common component in the boron carbide superconductors,
and the reactions of nickel with boron phosphide have been
investigated. Nickelocene, Ni(C5H5)2, has been shown
to be a suitable source compound for the deposition of
nickel-containing thin films. Nickelocene is volatile and far less
toxic than nickel tetracarbonyl, though a number of other
nickel-containing organometallic compounds may be
suitable. Since both orthocarborane and nickelocene are
easily sublimed from the solid, introduction of suitable mixtures
into the plasma reactor can be readily accomplished. In
this paper we present the device characteristics of diodes
constructed with nickel doped boron carbide grown by
PECVD. To the best of our knowledge, this is the first
successful demonstration of metal doping of boron carbide films
grown by PECVD. Both high and low levels of Ni doping of
boron carbide have been investigated. The diode devices
have markedly different behaviors depending upon the nickel
doping.


II. EXPERIMENT

The p-n and n-p heterojunctions were formed by depositing
boron carbide thin films on n-type and p-type Si(111)
substrates, respectively, following procedures described in
detail elsewhere. Deposition of films was performed in a
custom designed parallel plate 13.56 MHz radio-frequency
PECVD  reactor  used  in  previous  studies. The  silicon  sub¬ 
strates  were  doped  to  7XlO^"^/cm^.  The  Si(lll)  substrates 
surfaces were prepared by Ar+ ion sputtering in the plasma
reactor. The source gas closo-1,2-dicarbadodecaborane
(orthocarborane, C2B10H12) was used as the
source compound for growing the boron carbide, while
nickelocene [Ni(C5H5)2], introduced into the plasma reactor
simultaneously with the orthocarborane, was used to introduce
nickel into the growing film.

Fig. 1. The I-V characteristics and the corresponding schematic diagrams of the diodes from (a) B5C/n-Si(111), (b) lightly Ni doped B5C on n-Si(111), and
(c) heavily Ni doped B5C on n-Si(111). The shift in polarity of the diode demonstrates the relative n-type behavior following nickel doping and the
characteristic behavior with sufficiently high levels of nickel doping (see the text).


III.  RESULTS  AND  DISCUSSION 

Typical B5C/n-type silicon heterojunctions have been routinely
formed by this technique. An example of one such
diode device is presented in Fig. 1(a). An examination of the
I-V curve in Fig. 1(a) shows the excellent diode characteristics
of devices built using the technique of PECVD. These
devices typically have onsets of 1 eV with very little leakage
current (less than 5 μA at 25 °C). Figure 1(a) also demonstrates
the p-type character of undoped PECVD boron carbide
in this geometry.

The inclusion of Ni in the boron carbide films with the
introduction of nickelocene into the plasma reactor has been
verified with Auger electron spectroscopy (AES). The signature
of Ni in the Auger spectra suggests that the Ni uptake is
large  and  that  these  films  are  highly  doped  (>1X10^^).  The 

Fig. 2. The I-V characteristics and corresponding schematic diagrams of the diodes from (a) lightly Ni doped B5C on p-Si(111) and (b) heavily Ni doped B5C
on p-Si(111).


I-V curves of two diodes constructed with Ni-doped boron
carbide grown on n-type Si(111) are displayed in panels (b)
and (c) of Fig. 1. The diode constructed with a "low" level
of Ni doping corresponded to a partial pressure ratio of
nickelocene to orthocarborane of <0.1 during film growth. The
“high”  doping  corresponded  to  a  relative  partial  pressure 
ratio  ^9,  respectively. 

With the inclusion of Ni, the boron carbide films, which
are normally p-type relative to n-type silicon, become an
n-type material. This is evident from the device characteristics
of the diode shown in Fig. 1(b). This results in the
formation of rectifying diodes with reversed polarity. Thus the
nickel doped boron carbide in the heterojunction diodes appears
n-type relative to the lightly doped n-type silicon substrate. This is
consistent with the fabrication of n-p heterojunction diodes
on p-type silicon, again by including nickelocene with the
orthocarborane as an additional source gas, as seen in Fig. 2
for the nickel doped boron carbide.

With the "higher" nickel doping, a negative differential
resistance, or a valley in the current, occurs in the effective
forward bias direction for diodes formed on both n-type silicon
and p-type silicon substrates, as seen in Figs. 1(c) and
2(b). This behavior is characteristic of a tunnel diode and is
consistent with degenerate doping of a state pinned relative
to the conduction band edge. Certainly at the higher doping
levels, sufficient nickel from nickelocene is incorporated into
the boron carbide during film growth to provide degenerate
doping concentrations.

Fig. 3. The I-V characteristics and corresponding schematic diagram of a
trilayer diode fabricated from a PECVD Ni doped B5C layer deposited on a
PECVD undoped B5C layer deposited on a PECVD rhombohedral boron
layer placed on an aluminum substrate.
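
To illustrate what a negative differential resistance looks like in an I-V curve, the sketch below uses a generic textbook-style empirical tunnel diode model: a tunneling term that peaks at a voltage v_peak plus an ordinary exponential diode term. All parameter values and function names are hypothetical and chosen only for illustration; they are not fitted to the devices in this paper.

```python
# Minimal sketch of a generic empirical tunnel-diode I-V model.
# Assumptions: every parameter value here is hypothetical, not measured.
import math


def tunnel_diode_current(v, i_peak=1e-3, v_peak=0.1,
                         i_sat=1e-9, v_thermal=0.0259):
    """Current (A) at forward bias v (V): tunneling term plus diode term."""
    tunnel = i_peak * (v / v_peak) * math.exp(1.0 - v / v_peak)
    diffusion = i_sat * (math.exp(v / v_thermal) - 1.0)
    return tunnel + diffusion


def ndr_window(v_max=0.6, steps=600):
    """Return (start, end) of the bias range where the current falls with
    increasing voltage, found by scanning a uniform grid, or None."""
    dv = v_max / steps
    voltages = [k * dv for k in range(steps + 1)]
    currents = [tunnel_diode_current(v) for v in voltages]
    falling = [voltages[k] for k in range(steps) if currents[k + 1] < currents[k]]
    if not falling:
        return None
    return falling[0], falling[-1] + dv


if __name__ == "__main__":
    window = ndr_window()
    print("negative differential resistance between (V):", window)
```

In this model the current first rises to a peak, then falls through a valley before the ordinary diode current takes over; the falling segment is the negative differential resistance region.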

The hump in the current occurs at a larger bias voltage for
the tunnel diodes fabricated on the p-type silicon. This is
consistent with an n-type Ni-doped B5C layer and the formation
of heterojunction n-p diodes. In particular, this behavior
suggests that the nickel states are pinned to the conduction
band edge. States pinned to one band edge have been proposed
for boron carbide and have been identified as pinned to
the conduction band edge. It may be that such states are
occupied or filled through nickel doping. Due to the location
of the gap states near the conduction band edge, much larger
applied voltages should be needed to observe the negative
differential resistance for diodes constructed with p-type
Si(111), as compared with diodes constructed with n-type
Si(111). This is exactly what is observed from the I-V
curves of the n-n and n-p diodes in Figs. 1(c) and 2(b),
respectively.

PECVD fabricated B5C-boron heterojunctions can also
form diodes on substrates other than silicon, such as
aluminum. Since the construction of diodes through the
doping of a layer of B5C is now potentially possible, we feel
that this advance in the doping of PECVD boron carbide
holds the prospect that rhombohedral boron rich semiconductor
homojunctions can be formed from this material
without the need for a silicon substrate. A preliminary diode
of this type has been fabricated by doping a layer of B5C with
nickel, as seen in Fig. 3.


IV.  SUMMARY 

In summary, nickelocene can be used to dope PECVD
boron carbide. The mildly p-type boron carbide, B5C, relative
to n-type silicon, is strongly n-type following the doping
with nickel. With sufficient doping levels, the resulting
heterojunction diode acts like a tunnel diode. All evidence
suggests that nickel populates states within the gap of this
normally highly resistive material.

We have measured charge states deep within the ZnSe band gap and localized near the ZnSe/GaAs
interface as a function of annealing temperature by means of photoluminescence spectroscopy, as
well as cathodoluminescence from the cross section of a ZnSSe/GaAs heterostructure with a
scanning electron microscope. Annealing produces new emissions at 1.9 eV and 2.25 eV which
increase with annealing temperature and which vary with different growth conditions.
Cathodoluminescence spectra show two similar features which originate within ZnSe and whose
relative emission intensities vary with depth. Since the 2.25 eV feature appears only under Zn-rich
growth conditions, these results indicate a role of both interface diffusion and crystal growth in the
formation of this defect. We observe a proportional increase of the 1.9 eV and 0.9 eV peak
intensities with annealing, indicating their complementary nature with respect to a common deep
level. Finally, a detailed analysis of the relative intensities of ZnSe and GaAs features due to internal
photoemission across the heterointerface suggests that the heterojunction band offset changes with
formation of these new deep levels. All these results emphasize that multiple deep levels form near
the buried interface with annealing which can dominate the heterojunction electronic properties.


I. INTRODUCTION

The ZnSe/GaAs heterojunction is a prototypical system
for studying II-VI/III-V compound semiconductor interfaces
due to its small lattice mismatch (0.27%), which permits
high quality epitaxial growth. An understanding of its
electronic and chemical properties is valuable for fundamental
research and technological applications. Specifically,
modification of the corresponding band offsets could have a
substantial impact on the performance of II-VI-based blue-green
lasers, and several authors have argued that the band
alignment across the junction may depend on the local atomic
structure of the interface, which, in turn, affects free carrier
transport across heterojunction interfaces and their device
applications. Our focus is on the presence of charge
states localized near the heterojunction interface, their
change with annealing, and their effect on carrier transfer
across the junction.

Nicolini et al. have demonstrated a connection between
atomic interface structure and band alignment. They showed
that variations in the beam pressure ratio (BPR) of Zn to Se
during growth of the first few monolayers of the ZnSe epilayer on
GaAs in a molecular beam epitaxy (MBE) system could be
used to modify the Zn/Se relative concentration at the interface.
As a result, the local atomic bonding alters the ZnSe/GaAs
heterojunction band offsets over a wide energy range, from
below 0.6 eV to above 1.2 eV. Their photoemission measurements
showed that specimens with BPR>1 (Zn-rich)
have large valence band offsets between GaAs and
ZnSe, while specimens with BPR<1 (Se-rich) exhibit valence
band offset values more than 0.6-0.8 eV lower.
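
To put this range of valence band offsets in context, the corresponding conduction band offset can be estimated with simple arithmetic. The sketch below assumes a straddling (type-I) alignment, in which the GaAs gap lies inside the ZnSe gap, and uses approximate room-temperature band gaps that are assumed literature values rather than numbers from this paper.

```python
# Minimal sketch: conduction band offset implied by a valence band offset.
# Assumptions: type-I alignment; band gaps are approximate 300 K literature values.

EG_ZNSE_EV = 2.70  # assumed ZnSe band gap
EG_GAAS_EV = 1.42  # assumed GaAs band gap


def conduction_band_offset(valence_band_offset_ev):
    """Return Delta E_c = (Eg_ZnSe - Eg_GaAs) - Delta E_v, in eV."""
    return (EG_ZNSE_EV - EG_GAAS_EV) - valence_band_offset_ev


if __name__ == "__main__":
    for delta_ev in (0.6, 1.2):
        delta_ec = conduction_band_offset(delta_ev)
        print(f"Delta E_v = {delta_ev:.1f} eV -> Delta E_c = {delta_ec:.2f} eV")
```

Under these assumptions, moving the valence band offset across the reported range shifts nearly the entire gap difference from one band edge to the other, which is why the interface composition can strongly affect carrier transfer across the junction.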

Deep levels also appear to play a major role in determining
the behavior of ZnSe/GaAs interfaces. We have extended
the low energy cathodoluminescence and photoluminescence
spectroscopy techniques developed earlier for studying
metal-semiconductor interfaces to heterojunction interfaces.
These techniques have revealed optical emissions related
to dislocation-induced defects deep within the band
gap, injection and recombination of free carriers across
heterointerfaces, and deep level luminescence near the
ZnSe/GaAs heterointerface. Recently, we have measured near
band edge (NBE) emissions of GaAs and ZnSe as a function
of the Zn to Se BPR used to grow the ZnSe epilayers. We
found systematic variations in the NBE relative intensity
with BPR due to internal photoemission, which are consistent
with the band offset variations reported earlier.

We have now performed photoluminescence spectroscopy
studies of the ZnSe/GaAs heterojunction interfaces by using
HeCd laser excitation as a function of annealing temperature.
With post-growth annealing, new deep level emissions appear
which depend on initial growth conditions. Our results
provide additional evidence of internal photoemission across
the interface from GaAs to ZnSe. We identify complementary
transitions due to formation of new states deep within
the ZnSe band gap. We also find changes in internal
photoemission and heterojunction band offset with annealing
processes which correlate with deep level formation.

II.  EXPERIMENTS 

The  specimens  in  our  study  were  ZnSe/GaAs  heterostruc¬ 
tures  which  were  grown  by  MBE.  The  MBE  system  included 
interconnected  growth  chambers  and  an  analysis  chamber 
with  monochromatic  x-ray  photoemission  spectroscopy 
(XPS)  capabilities.  The  ZnSe/GaAs  growth  has  been  de¬ 
scribed  previously.^  The  Zn  to  Se  BPR  was  monitored  by 
means of a nude ion gauge, and the corresponding near-interface
composition was determined by means of XPS using
the integrated emission intensity of Zn and Se core levels
as  a  function  of  ZnSe  thickness  for  the  first  few  monolayers 
of ZnSe growth. In this article, the specimen structure,
from substrate to top layer, consists of a GaAs (100) wafer
(Si-doped, n = 7-9 × 10^^ cm^-3), a GaAs buffer layer (0.5
μm thick, Si-doped, n = 10^^ cm^-3) grown at 580 °C, a
ZnSe epilayer (0.15 μm thick, nominally undoped) grown at
290 °C, and an amorphous Se cap layer (0.5-1 μm thick)
grown  at  25  °C.  The  latter  protects  the  epitaxial  ZnSe  film 
from  ambient  contamination  during  transfer  in  air  from 
growth  chamber  to  other  experimental  chambers.  We  have 
used  specimens  with  various  BPR,  i.e.,  BPR=0.1,  0.2,  0.5,  1, 
2,  10.  Overlayers  grown  with  various  BPR  exhibited  differ¬ 
ent  reflection  high  energy  electron  diffraction  (RHEED) 
patterns, which  depended  upon  the  initial  growth  condi¬ 
tions  and  which  were  consistent  with  the  literature.^^ 

We  performed  luminescence  experiments  under  ultrahigh 
vacuum  (UHV)  conditions  in  order  to  minimize  any  effects 
of  surface  contamination  or  surface  recombination  and  to 
permit  low  temperature  (at  90  K)  photoluminescence  mea¬ 
surements.  The  optical  excitation  source  for  the  emission 
spectra  was  a  HeCd  laser  with  photon  energy  2.808  eV 
(λ = 4416 Å). Our photodetectors were an S-1 photomultiplier
and a cooled North Coast Ge photodiode. A CaF2 lens
collects  the  emitted  light  and  focuses  it  through  a  sapphire 
viewport  into  a  Leiss  prism  monochromator  with  a  spectral 
resolution  better  than  0.05  eV.  Both  CaF2  and  sapphire  are 
transparent  in  our  measuring  energy  range.  We  annealed 
ZnSe/GaAs heterojunction specimens with different Zn-to-Se
BPR in 5 min annealing cycles at increasing temperatures
from 250 °C to as high as 500 °C. After each anneal,
the specimens were cooled to 90 K for the photoluminescence
measurement at that annealing stage.^^ Using low energy
electron  diffraction  (LEED)  to  detect  the  surface  reconstruc¬ 
tion,  we  found  that  the  temperature  for  completely  desorbing 
the  Se  cap  layer  was  between  200  and  290  °C.  All  specimens 
displayed a predominant (2×1) mixed with a c(2×2) LEED
pattern  with  the  BPR=10  specimen  showing  the  sharpest 
pattern. 
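
The photon energy quoted for the HeCd line can be cross-checked against its wavelength. The short sketch below assumes the standard relation E = hc/λ with hc ≈ 12398.42 eV·Å (a textbook constant, not given in the text); the function names are illustrative only.

```python
# Convert between photon wavelength (Angstrom) and photon energy (eV).
# HC_EV_ANGSTROM is the standard value of h*c; it is an assumption of this
# sketch and does not appear in the article.
HC_EV_ANGSTROM = 12398.42


def photon_energy_ev(wavelength_angstrom: float) -> float:
    """Photon energy in eV for a vacuum wavelength given in Angstrom."""
    return HC_EV_ANGSTROM / wavelength_angstrom


def photon_wavelength_angstrom(energy_ev: float) -> float:
    """Vacuum wavelength in Angstrom for a photon energy given in eV."""
    return HC_EV_ANGSTROM / energy_ev


if __name__ == "__main__":
    # HeCd excitation line quoted in the text (4416 Angstrom).
    print(f"HeCd line: {photon_energy_ev(4416.0):.3f} eV")
    # NBE emission energies quoted in the text, expressed as wavelengths.
    for label, energy in (("ZnSe NBE", 2.79), ("GaAs NBE", 1.5)):
        print(f"{label}: {photon_wavelength_angstrom(energy):.0f} Angstrom")
```

The 4416 Å line comes out within 1 meV of the quoted 2.808 eV, i.e., only slightly above the 2.79 eV ZnSe NBE emission.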


We  have  also  used  a  scanning  electron  microscope  (SEM) 
operating  at  3  keV  with  a  nominal  electron  beam  diameter  of 
10-20  nm  to  produce  cross-sectional  luminescence  spectra 
from  a  ZnSe/ZnSeS/GaAs  superlattice  at  5  K.  This  particular 
specimen  was  grown  by  metal-organic  vapor  phase  epitaxy 
(MOVPE)  at  340  °C  and  consists  of  a  GaAs  (001)  wafer  as  a 
substrate,  120  periods  of  ZnSe  layer  (10  nm  thick)  and 
ZnSeS  layer  (10  nm  thick),  and  a  10  nm  thick  ZnSe  cap 
layer.  It  was  subsequently  annealed  for  2.5  h  at  580  °C.  The 
SEM  measurements  were  made  with  the  electron  beam  either 
near  the  GaAs  substrate  or  close  to  the  ZnSe/ZnSeS  free 
surface.  In  both  cases,  the  signals  are  averages  from  multiple 
ZnSe/ZnSeS  layers  since  the  beam  exceeds  the  superlattice 
period,  even  without  blooming. 


III.  RESULTS 

Figures  1  and  2  display  photoluminescence  spectra  of 
ZnSe/GaAs  heterojunctions  with  HeCd  laser  excitation.  Fig¬ 
ure  1  shows  a  set  of  photoluminescence  spectra  measured  at 
90  K  from  three  ZnSe/GaAs  heterojunction  interfaces  with 
various  BPR  as  a  function  of  annealing  temperature  with  an 
S-1  photomultiplier.  The  S-1  photomultiplier  response  cov¬ 
ered  the  energy  range  of  both  ZnSe  NBE  (2.79  eV)  and 
GaAs NBE (1.5 eV). Figure 1(a) shows spectra for a specimen
with BPR=1 (i.e., the beam pressures of Zn and Se are
balanced). There are four major features (1.3, 1.5, 2.79 eV,
and the laser line at 2.808 eV) in these spectra at temperatures
lower than 350 °C, which is the thermal threshold for
this ZnSe/GaAs heterostructure (the temperature at which the
new deep level emission starts to increase is called the thermal
threshold). These low temperature spectra suggest that
some  ZnSe  defects  exist  near  the  heterojunction  even  before 
annealing.  The  ZnSe  emission  features  have  relatively  high 
intensities  for  the  HeCd  laser  excitation  employed  here  (see 
Sec. IV). We also observed two new deep levels (1.9 eV and
2.25 eV) forming in the ZnSe band gap at annealing temperatures
of 350 °C and above. Their emission intensities increase
with  increasing  annealing  temperature,  dominating  the  spec¬ 
tra  at  the  highest  temperature  investigated. 

Figure  1(b)  presents  analogous  spectra  for  a  specimen 
with  BPR  =  10.  These  spectra  exhibit  deep  level  emission  and 
NBE  features  similar  to  those  in  Fig.  1(a).  However,  the 
intensity  of  the  new  feature  at  2.25  eV  is  more  pronounced 
relative  to  the  GaAs  NBE.  Figure  1(c)  shows  the  spectra  for 
a specimen with BPR=0.1 (i.e., Se-rich growth conditions).
Note  that  only  the  1.9  eV  feature  is  evident  in  these  spectra 
with  increasing  annealing  temperature.  The  feature  at  2.25 
eV  is  undetectable.  We  also  measured  other  specimens  with 
BPR=0.2 and 0.5 (not shown). All samples exhibited similar
spectral features, with relative intensities that depended
strongly  on  the  BPR  employed  during  ZnSe  growth.  Specifi¬ 
cally  the  feature  at  2.25  eV  was  detected  and  found  to  in¬ 
crease  with  high  temperature  annealing  only  in  samples 
grown  in  Zn-rich  conditions,  indicating  a  role  of  both  inter¬ 
face  diffusion  and  crystal  growth  in  formation  of  this  defect. 

Fig. 1. Deep level emission spectra for 150 nm ZnSe/GaAs heterojunction
structure with a HeCd laser excitation and S-1 photomultiplier detection as
a function of annealing temperature for specimens with (a) BPR=1, (b)
BPR=10, and (c) BPR=0.1. Pronounced new deep level emissions appear
at higher annealing temperatures.

We also acquired lower energy photoluminescence spectra
with a Ge photodiode to complement the S-1 spectra. The
S-1 photomultiplier is sensitive in the range of 1.1 eV to 4
eV, while the Ge photodiode responds within a narrower and
lower energy range of spectral sensitivity, i.e., 0.7 eV to
~2.0 eV. The response of this Ge detector is heavily
weighted toward the near infrared, which allows us to detect
emissions from states deep within the semiconductor band
gaps. Figure 2 shows a set of photoluminescence spectra
measured at 90 K by the Ge photodiode from the ZnSe/GaAs
heterojunction interfaces with various BPR as a function of
annealing temperature. We observed multiple deep level
transitions with energies less than the band gap energy of
GaAs (1.5 eV) as shown in Fig. 2. The spectral features
resemble those of similar specimens reported previously.

Fig. 2. Deep level emission spectra for the same specimens as in Fig. 1 [(a)
BPR=1, (b) BPR=10] with a HeCd laser excitation and Ge photodiode
detection as a function of annealing temperature. The intensities of both
features at 0.9 eV and 1.9 eV increase with annealing temperature.

Evidence of new defect creation appears at annealing
temperatures of 350 °C and above, but the feature at 2.25 eV
shown in the S-1 spectra of Fig. 1 did not appear in these
spectra because of the much lower response of the Ge photodiode
at energies above 2 eV. Figure 2(a) shows spectra for
the same BPR=1 specimen measured in Fig. 1(a) but with
Ge photodiode detection. Among low energy features, the
emission intensity of a peak at 0.9 eV grows with annealing
temperature and its intensity increases proportionally with
that of the 1.9 eV peak after completely desorbing the Se
cap. These intensities increase by factors of 2-7 for the 0.9 eV
peak and 5-10 for the 1.9 eV peak from the threshold temperature
to 425 °C. Figure 2(b) shows spectra for the same
BPR=10 specimen measured in Fig. 1(b). The spectral features
are similar to those shown in Fig. 2(a), but the intensity
of the 1.9 eV feature relative to the GaAs NBE emission
increases less than that of the specimen with BPR=1. In
general, the emission intensities of the lower energy peaks
relative to the 1.9 eV peak are larger for Zn-rich growth
conditions, consistent with previous results.

Fig. 3. The cathodoluminescence spectra from ZnSSe/GaAs structures in
cross section using an SEM. The top spectrum corresponds to a spot near the
buried ZnSe0.9S0.1/GaAs interface (the lower spot in the inset). The bottom
spectrum corresponds to a spot near the free ZnSe surface (the upper spot in
the inset). The higher energy peak component appears to increase near the
GaAs interface.

In  order  to  characterize  further  the  nature  of  the  1.9  eV 
and  2.25  eV  features,  we  obtained  luminescence  spectra  from 
similar  heterojunctions  in  cross  section  using  an  SEM  at 
10  K.  Figure  3  shows  cathodoluminescence  from  a 
ZnSe/ZnSe0.9S0.1/GaAs superlattice at two spatial locations
relative to the interface (see inset). Two similar features
originate within the ZnSeS, whose relative emission intensities
vary with depth. The bottom spectrum is measured near
the free surface (upper spot in the inset), while the top spectrum
corresponds to a position near the buried
ZnSe0.9S0.1/GaAs interface (lower spot in the inset). The
SEM-induced  luminescence  from  the  GaAs  substrate  alone 
exhibits  no  such  emission  in  the  energy  range  from  1.8  to  2.4 
eV.  While  the  spectral  features  are  largely  unresolved,  the 
differences  between  the  two  spectra  indicate  at  least  two 
peaks.  Subtraction  of  one  spectrum  from  the  other  minimizes 
the  background  and  indicates  peak  positions  at  roughly  2.06 
eV  and  2.16  eV.  The  relative  intensity  of  the  2.16  eV  feature 
appears  to  increase  near  the  GaAs  interface. 

IV.  DISCUSSION 

Previous work on deep level formation near the ZnSe/GaAs
interface with post growth annealing has shown a
dramatic dependence on initial epitaxial growth conditions.^^
We  now  find  that  annealing  produces  new  pronounced  emis¬ 
sion  features  at  several  energies.  Most  prominent  are  features 
at  both  1.9  eV  and  2.25  eV,  whose  intensities  increase  rapidly 
with  annealing  temperature  and  which  differ  for  structures 
fabricated with Se- versus Zn-rich growth. The data in Figs.
1-3 shed light on the nature of the thermally-induced deep
levels.  The  spectra  in  Fig.  1  show  that  the  2.25  eV  peak  is 
most  intense  for  Zn-rich  (BPR=  10)  conditions.  In  general, 
the  2.25  eV  feature  only  appears  under  the  BPR-balanced  or 
Zn-rich  growth  conditions  with  increasing  annealing  tem¬ 
perature.  Overall,  previous  studies  provide  no  well-accepted 
identification  of  the  2.25  eV  luminescence  peak  feature. 
Originally,  this  feature  was  termed  “Cu-green”  lumine¬ 
scence,^^  but  subsequent  studies^^  provide  evidence  for  such 
emission  in  the  absence  of  Cu  impurities.  Similarly,  our  re¬ 
sults  show  the  variation  of  this  emission  depends  on  the  bal¬ 
ance  of  Zn  to  Se  during  epitaxial  overlayer  growth.  Alterna¬ 
tively,  the  emission  may  arise  from  a  Ga-Se  interfacial 
complex. Zahn et al. have reported various luminescence
peaks in the 2-2.5 eV range for different Ga-Se bulk material.
Likewise, Skromme et al. found that a similar 2.27 eV
peak  is  stronger  when  the  Ga  is  deposited  in  a  sheet  on  a 
Se-rich  ZnSe  surface  than  when  bulk-  or  sheet-doped  on  a 
Zn-rich  surface.  Such  Ga-Se  compound  emission  is  counter 
to  our  observations,  since  Zn-rich  interfacial  conditions  are 
likely  to  suppress  any  Ga-Se  bonding. 

More  is  known  about  the  1.9  eV  peak  emission,  which  is 
the  most  intense  feature  in  all  of  our  specimens  at  elevated 
annealing  temperature.  This  feature  can  be  associated  with 
self-activated emission due to Zn vacancies (V_Zn) or a complex
defect center involving a Zn vacancy and a substitutional
Ga atom on a Zn site^^’^^’^^ (i.e., Ga_Zn-V_Zn). The nature
of this defect is consistent with the emission increase
with increasing annealing temperature: as described
previously, Ga outdiffusion occurs at elevated temperatures
from GaAs to ZnSe, resulting in formation of defect
complexes such as Ga_Zn, especially in samples fabricated
under  Se-rich  growth  conditions. 

The SEM cathodoluminescence spectra from the
ZnSe0.9S0.1/GaAs cross section in Fig. 3 are consistent with
our ZnSe/GaAs photoluminescence results. The peak energies
of ~2.06 eV and ~2.16 eV for this ZnSe0.9S0.1/GaAs
heterojunction appear related to the 1.9 eV ZnSe feature but
shifted 0.16 and 0.26 eV, respectively, to higher energy. The
lower temperature can account for ~0.1 eV of this shift of
both peaks.^^ The ternary alloy's larger band gap can account
for the 0.1 eV difference in energy of the two unresolved
peaks. Thus the two peaks in ZnSe/ZnSe0.9S0.1/GaAs and the
1.9 eV peak in ZnSe/GaAs display a good correspondence.
The  broad  features  indicate  at  least  two  peaks,  whose  inten¬ 
sities  appear  to  vary  with  depth.  The  relative  variation  of  the 
component  intensities  within  the  broad  feature  certainly  in¬ 
dicates  that  more  than  one  deep  level  is  involved.  Whether 
these  variations  are  due  to  defect  generation  proceeding  from 
the  buried  interface  or  from  the  free  ZnSe  surface  remains  to 
be  established.  Recombination  and  defect  states  at  the  cross 
sectional  surface  may  well  affect  both  spectra.  However,  the 
primary  effect  appears  to  be  the  proximity  of  the  GaAs  layer. 
The absence of such emission within the GaAs alone demonstrates
the II-VI origin of these emissions. This latter result
also demonstrates that electrons in the GaAs do not diffuse
laterally near the cross sectional surface and recombine hundreds
of nanometers away in the neighboring ZnSe. This
result in fact emphasizes the importance of near-interface
excitation for gauging internal photoemission.

Fig. 4. Intensity increase of deep level emissions at 0.9 eV and 1.9 eV as a
function of annealing temperature for 150 nm thick ZnSe/GaAs structures
(BPR=1). The two peaks exhibit a threshold for defect creation and increase
proportionately.

The  coupled  growth  of  deep  level  features  with  annealing 
underscores  the  growth  of  deep  rather  than  shallow  level 
densities with annealing. From Fig. 1 and Fig. 2, we observed
that the intensities of both 0.9 eV and 1.9 eV features
increase proportionally with annealing temperature. We extracted
their intensities from the spectra by using Gaussian
distributions to deconvolve the multiple peak structures. Figure
4 shows the intensities of 0.9 eV and 1.9 eV peaks versus
annealing temperature for the specimen with BPR=1. Both
emission intensities are low and constant below the annealing
threshold, but increase rapidly above the threshold temperature.
Similar behavior appears in specimens grown at all
BPR. This result is consistent with our previous data, which
also show these two features increasing together with
annealing. Furthermore, the 0.9 eV and 1.9 eV peak energies
add up to the ZnSe band gap, indicating that these two
transitions  are  complementary,  i.e.,  the  emissions  correspond 
to  transitions  into  and  out  of  the  same  deep  level  in  the  band 
gap.  This  supports  the  contention  that  the  increase  in  the 
intensity  of  the  0.9  and  1.9  eV  features  upon  annealing  re¬ 
flects  a  variation  in  the  spatial  concentration  of  the  same 
deep  level,  as  opposed  to  unrelated  shallow  levels. 
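
The two steps in this argument, extracting peak intensities with Gaussian line shapes and checking that two transition energies sum to the band gap, can be sketched as follows. This is a minimal illustration rather than the fitting procedure actually used: it assumes the peak centers and widths are already known, so that only the amplitudes are fitted by linear least squares, and the peak widths, the synthetic spectrum, and the 0.05 eV tolerance are hypothetical choices made for the example.

```python
import math


def gaussian(x, center, sigma):
    """Unit-height Gaussian line shape."""
    return math.exp(-0.5 * ((x - center) / sigma) ** 2)


def fit_two_amplitudes(energies, counts, c1, s1, c2, s2):
    """Least-squares amplitudes of two Gaussians with fixed centers/widths.

    Solves the 2x2 normal equations of the model
    counts ~ a1 * g1(E) + a2 * g2(E) by Cramer's rule.
    """
    g1 = [gaussian(e, c1, s1) for e in energies]
    g2 = [gaussian(e, c2, s2) for e in energies]
    s11 = sum(a * a for a in g1)
    s22 = sum(b * b for b in g2)
    s12 = sum(a * b for a, b in zip(g1, g2))
    b1 = sum(a * y for a, y in zip(g1, counts))
    b2 = sum(b * y for b, y in zip(g2, counts))
    det = s11 * s22 - s12 * s12
    return (b1 * s22 - b2 * s12) / det, (b2 * s11 - b1 * s12) / det


def is_complementary(e_a, e_b, gap, tol=0.05):
    """True if two transition energies sum to the gap within tol (eV)."""
    return abs(e_a + e_b - gap) <= tol


if __name__ == "__main__":
    # Hypothetical overlapping peaks at 0.9 eV and 1.3 eV (widths assumed).
    energies = [0.70 + 0.01 * i for i in range(81)]
    spectrum = [3.0 * gaussian(e, 0.9, 0.08) + 1.5 * gaussian(e, 1.3, 0.10)
                for e in energies]
    a1, a2 = fit_two_amplitudes(energies, spectrum, 0.9, 0.08, 1.3, 0.10)
    print(f"fitted amplitudes: {a1:.3f}, {a2:.3f}")
    # Energies from the text: 0.9 eV + 1.9 eV vs the 2.79 eV ZnSe NBE.
    print(is_complementary(0.9, 1.9, 2.79))
    print(is_complementary(0.9, 2.25, 2.79))
```

With the energies quoted in the text, the 0.9 eV and 1.9 eV pair passes the complementarity check against the 2.79 eV ZnSe NBE energy, whereas the 0.9 eV and 2.25 eV pair does not.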

We  have  also  used  the  relative  intensities  of  ZnSe  and 
GaAs  NBE  emissions  in  Fig.  1  to  infer  changes  in  the  ZnSe/ 
GaAs  heterojunction  band  offset  with  annealing.  Previously, 
we  showed  that  the  unusually  large  ZnSe  recombination  with 
HeCd  excitation  at  90  K  was  the  result  of  free  carrier  injec¬ 
tion  across  the  interface  due  to  internal  photoemission.  If 
we take into account the relative absorption of HeCd radiation
by ZnSe versus GaAs and the relative response of the
S-1 detector at their NBE emission energies,^^ the intensity
of the ZnSe NBE (2.79 eV) emission should be 300-600
times smaller (depending on the precise temperature) than
the GaAs NBE (1.5 eV) emission in Fig. 1(a) for free carrier
recombination without diffusion. Internal photoemission can
explain the large intensity of the ZnSe NBE. In such a process,
free carriers are photoexcited in GaAs near the metallurgical
interface at energies below the ZnSe band gap, and
cross the junction to recombine in the ZnSe. We have already
shown that specimens known to have different band offsets
displayed strong changes in the ratio of ZnSe to GaAs NBE
emission, indicating the sensitivity of this internal photoemission
to the interfacial barriers.

Fig. 5. Intensity ratio of ZnSe to GaAs NBE emissions and intensity of new
deep level emission at 1.9 eV vs annealing temperature (BPR=1). The bottom
curve represents the intensity of the 1.9 eV feature which increases with
annealing. At the threshold temperature, when the new deep level emission
starts to increase, the ZnSe/GaAs NBE emission intensity ratio also shows a
pronounced change.

Annealing  causes  additional  changes  in  internal  photo¬ 
emission.  Figure  5  shows  a  comparison  of  the  intensity  ratio 
of  ZnSe  to  GaAs  NBE  emissions  (top  curve)  and  the  new 
deep  level  emission  at  1.9  eV  (bottom  curve)  as  a  function  of 
annealing  temperature.  At  the  threshold  temperature,  the 
ZnSe/GaAs  NBE  emission  intensity  ratio  also  exhibits  a  pro¬ 
nounced  change.  Specimens  grown  under  different  BPR  con¬ 
ditions  also  show  pronounced  changes  at  the  threshold 
temperature  for  1.9  eV  defect  creation.  Therefore,  this  pro¬ 
nounced  change  in  the  relative  intensities  of  NBE  emissions 
at  the  defect  threshold  temperature  suggests  that  deep  level 
creation affects the heterojunction band offset. When new
deep levels start to form, the valence band offset appears to
increase, so that there is a lower barrier to movement of free
electrons across the interface from GaAs to ZnSe. Thus, the
intensity  of  internal  photoemission  increases,  as  evidenced 
by  free  carrier  recombination  not  only  across  the  ZnSe  band 
gap,  but  also  into  the  ZnSe  deep  levels.  These  results  are  a 
strong  indication  that  near-interface  defects  and  epitaxial 
growth  conditions  are  critical  factors  in  forming  the  hetero¬ 
junction  band  offsets. 
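
The link between a larger valence band offset and a lower electron barrier can be made explicit with a simple band-edge budget. The sketch below assumes a straddling alignment in which the band gap difference is shared between the two offsets, ΔE_c = ΔE_g − ΔE_v, and uses the NBE energies quoted above (2.79 eV for ZnSe, 1.5 eV for GaAs) as stand-ins for the 90 K band gaps. The valence band offsets passed in are illustrative inputs, not values measured in this work.

```python
# Band-edge budget for a straddling heterojunction:
#   delta_Ec = (Eg_wide - Eg_narrow) - delta_Ev
# The gap values are the NBE energies quoted in the text, used here as
# approximate band gaps (an assumption of this sketch).
EG_ZNSE_EV = 2.79
EG_GAAS_EV = 1.5


def conduction_band_offset(valence_offset_ev,
                           gap_wide_ev=EG_ZNSE_EV,
                           gap_narrow_ev=EG_GAAS_EV):
    """Conduction band offset (eV) implied by a given valence band offset."""
    return (gap_wide_ev - gap_narrow_ev) - valence_offset_ev


if __name__ == "__main__":
    # Illustrative valence band offsets (eV) spanning low to high values.
    for dev in (0.6, 0.9, 1.2):
        dec = conduction_band_offset(dev)
        print(f"dEv = {dev:.2f} eV -> electron barrier dEc = {dec:.2f} eV")
```

Under these assumptions, raising the valence band offset from 0.6 eV to 1.2 eV lowers the barrier seen by electrons crossing from GaAs into ZnSe from about 0.69 eV to about 0.09 eV, which is the direction of change invoked to explain the enhanced internal photoemission.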

Besides  internal  photoemission,  other  processes  may  af¬ 
fect  the  ZnSe/GaAs  emission  ratio.  For  several  of  the  speci¬ 
mens  investigated,  including  BPR=1,  the  increase  in  ZnSe/ 
GaAs  NBE  ratio  is  due  primarily  to  a  decrease  in  GaAs  NBE 
intensity  (not  shown).  Such  a  decrease  could  be  due  to:  (a) 
the  transfer  of  photoexcited  carriers  out  of  GaAs  into  ZnSe 
(i.e.,  internal  photoemission),  (b)  an  increase  in  sub-band  gap 
recombination within the GaAs (i.e., “non-radiative” recombination),
or (c) a change in GaAs band bending which increases
the “dead” layer within the depletion width. (This
“dead” layer separates free electrons and holes and reduces
NBE recombination near the GaAs interface.)

No  increase  in  GaAs  deep  level  recombination  [case  (b)] 
is  evident  in  the  near-IR  spectra  of  Figs.  2(a)  and  2(b),  at 
least  for  energies  above  0.7  eV.  A  change  of  band  bending 
[case  (c)]  due  to  creation  of  new  near-interface  states  could 
indeed  affect  GaAs  NBE  since  the  HeCd  absorption  depth  of 
50  nm  in  GaAs  is  within  the  buffer  layer’s  depletion  width 
(ca. 140 nm for 10^^ cm^-3 doping and 0.7 eV nominal band
bending).  “Dead”  layer  effects  may  therefore  account  for 
part  of  the  ZnSe/GaAs  NBE  changes  with  defect  formation. 
However,  previous  photoluminescence  measurements  of 
similar  ZnSe/GaAs  specimens  comparing  HeCd  versus 
HeNe  spectra  show  only  a  factor  of  2  increase  for  excitation 
well  within  versus  well  beyond  (HeNe  absorption  depth  of 
200  nm)  the  depletion  region.  Therefore  these  effects  do 
not  account  for  the  orders-of-magnitude  enhancement  of 
ZnSe  NBE  emission  relative  to  GaAs  before  annealing  based 
on  absorption  coefficients.  Further  investigation  of  possible 
band bending effects on ZnSe/GaAs intensity ratios with annealing
is in progress. In any case, the thermally induced
diffusion and bonding which produce new electronic states
lead to changes in the electric potential gradients at the
ZnSe/GaAs heterojunction on a microscopic scale.
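
The comparison between absorption depth and depletion width used in this argument can be reproduced with the abrupt-junction depletion approximation, W = sqrt(2 ε V / (q N)). The sketch below is an illustration under stated assumptions: a GaAs relative permittivity of 12.9 (a textbook value not given in the text) and doping densities chosen as example inputs. Only the 0.7 eV band bending and the 50 nm and 200 nm absorption depths are taken from the text.

```python
import math

EPS0 = 8.8541878128e-12   # vacuum permittivity, F/m
Q = 1.602176634e-19       # elementary charge, C
EPS_R_GAAS = 12.9         # assumed static relative permittivity of GaAs


def depletion_width_nm(doping_cm3, band_bending_v, eps_r=EPS_R_GAAS):
    """Depletion width (nm) in the abrupt-junction depletion approximation."""
    n_m3 = doping_cm3 * 1e6  # cm^-3 -> m^-3
    width_m = math.sqrt(2.0 * eps_r * EPS0 * band_bending_v / (Q * n_m3))
    return width_m * 1e9


if __name__ == "__main__":
    band_bending = 0.7  # V, nominal value quoted in the text
    absorption_depths_nm = {"HeCd": 50.0, "HeNe": 200.0}  # from the text
    # Example doping densities (cm^-3); illustrative inputs only.
    for doping in (5e16, 1e17):
        w = depletion_width_nm(doping, band_bending)
        print(f"N = {doping:.0e} cm^-3 -> W = {w:.0f} nm")
        for laser, depth in absorption_depths_nm.items():
            where = "within" if depth < w else "beyond"
            print(f"  {laser} absorption depth {depth:.0f} nm is {where} W")
```

For either example doping the 50 nm HeCd absorption depth falls inside the depletion region and the 200 nm HeNe depth falls beyond it, which is the configuration assumed in the HeCd versus HeNe comparison.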

V.  CONCLUSIONS 

We  have  performed  photoluminescence  spectroscopy 
studies  of  ZnSe/GaAs  heterojunctions  in  a  wide  energy  range 
by  means  of  HeCd  laser  excitation  and  complementary  pho¬ 
todetection  by  two  detectors  with  different  response  sensi¬ 
tivities.  Using  this  technique,  we  have  observed  new  deep 
level  emissions  which  appear  within  the  ZnSe  energy  band 
gap  (1.9  eV  and  2.25  eV)  with  post  growth  annealing.  The 
thermally  induced  defect  formation  depends  on  specimen 
growth conditions. Annealing permitted us to identify the
complementary nature of the deep level emissions at 0.9 eV
and 1.9 eV, which correspond to transitions into and out of
the same mid-gap state. The
dramatic  variations  in  internal  photoemission  with  thermally- 
induced  defect  formation  emphasize  that  annealing  processes 
lead  to  the  creation  of  multiple  deep  levels  near  the  buried 
interface,  and  that  such  deep  level  formation  can  affect  het¬ 
erojunction  band  offsets. 

ZnSe/GaAs(001) interfaces were fabricated through molecular beam epitaxy by predosing the
GaAs(001)2×4 surface with ordered monolayers of Se or Zn, prior to ZnSe growth in
stoichiometric growth conditions. We performed the predosing at room temperature (RT) and
achieved ZnSe epitaxial overgrowth at RT to minimize desorption of the adsorbed species and
minimize interdiffusion. In situ synchrotron radiation photoemission studies indicate that
Se-predosing results in valence band offsets as low as 0.40±0.05 eV, and that Zn-predosing yields
valence band offsets as high as 1.01±0.05 eV. © 1996 American Vacuum Society.


I.  INTRODUCTION 

The  possibility  of  controlling  the  band  discontinuities  at 
semiconductor  heterojunctions  by  modifying  the  local  inter¬ 
face  environment  has  attracted  much  attention  in  the  past  ten 
years. Methods to change the band offsets would implicitly
clarify the microscopic mechanisms that determine the
band alignment, and would have a substantial technological
impact. Semiconductor heterojunctions are extensively employed
in modern optoelectronic and high speed technology,
and  tuning  the  band  offset  to  a  desired  application  could 
optimize  carrier  confinement  or  injection  in  devices  ranging 
from  lasers  to  heterojunction  bipolar  transistors. 

Recent theoretical developments have identified heterovalent
heterojunctions with polar orientation as the most promising
candidates for offset tuning through modification of the
local interface environment.'^”^ In particular, linear response
theory (LRT) results have shown that while in isovalent
heterojunctions the band offsets for abrupt interfaces should be a
property of the bulk semiconductor constituents, in heterovalent
heterojunctions with polar orientation, the band alignment
should depend on the details of the atomic structure of
the interface.^ Furthermore, ideally abrupt heterovalent
heterojunctions with polar orientation would be charged, so that
for  these  interfaces  there  is  a  substantial  thermodynamic 
driving  force  toward  the  formation  of  mixed  local  atomic 
configurations  to  yield  neutral  interfaces. 

Attempts  to  exploit  molecular  beam  epitaxy  (MBE)  to 
achieve  different  local  atomic  configurations  and  different 
band offsets have met with success in the case of the
ZnSe/GaAs(001) heterojunction. This prototypical, lattice-matched
(0.27% in-plane mismatch) heterovalent heterojunction
is a crucial component of most blue-green solid state
lasers  demonstrated  to  date,^^"^^  and  changes  in  the  corre¬ 
sponding  band  offsets  could  have  an  important  impact  on 
device  performance. 
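
The quoted in-plane mismatch can be checked from the bulk lattice constants. The sketch below assumes commonly quoted room-temperature values of about 5.6533 Å for GaAs and 5.6687 Å for ZnSe; these numbers are not given in the text and vary slightly between sources.

```python
# Lattice mismatch of an epilayer relative to its substrate, in percent.
# Lattice constants are commonly quoted room-temperature values (assumed).
A_GAAS_ANGSTROM = 5.6533
A_ZNSE_ANGSTROM = 5.6687


def mismatch_percent(a_layer, a_substrate):
    """Relative lattice mismatch (a_layer - a_substrate) / a_substrate in %."""
    return 100.0 * (a_layer - a_substrate) / a_substrate


if __name__ == "__main__":
    f = mismatch_percent(A_ZNSE_ANGSTROM, A_GAAS_ANGSTROM)
    print(f"ZnSe on GaAs mismatch: {f:.2f} %")
```

With these inputs the mismatch is close to the 0.27% quoted above, with the ZnSe epilayer under slight in-plane compression.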

The growth parameter successfully employed to modify
the ZnSe/GaAs band offset has been the Zn/Se beam pressure
ratio (BPR) employed during the early stages of ZnSe
MBE on GaAs surfaces.^’^ Heterojunctions grown with low
BPR (BPR~0.1, Se-rich growth conditions) have shown
relatively low valence band offsets (ΔE_v~0.6 eV) as compared
to junctions grown with high BPR (BPR~10,
ΔE_v~1.2 eV), in samples probed in situ by photoemission
spectroscopy,^’^ and ex situ by optical techniques, and in
fully functional heterojunction diodes examined by low-temperature
tunneling measurements.^^

The  mechanism  through  which  the  BPR  change  affects 
the  band  alignment  is  more  difficult  to  probe  experimentally. 
Monochromatic  x-ray  photoemission  (XPS)  studies  of  the 
Zn  3d/Se  3d  intensity  ratio  from  ZnSe/GaAs  interfaces  as  a 
function  of  ZnSe  thickness  indicate  that  high  (low)  BPRs 
yield  a  Zn-rich  (Se-rich)  composition  near  the  interface.^  The 
local nature of the mechanism is further supported by evidence
that the offset change is also observed when the nonstoichiometric
growth conditions (i.e., high or low BPR) are
confined to a thin (~2 nm thick) ZnSe layer near the
interface,^ while the rest of the II-VI epilayer is grown with
BPR=1. Calculations examining some of the simplest possible
mixed atomic configurations that would lead to neutral
interfaces predict ΔE_v = 0.62 eV for interfaces comprised of
a single Se-As mixed plane, and ΔE_v = 1.59 eV for interfaces
comprised of a single Zn-Ga mixed plane,^’^^ but no
studies  relating  the  BPR  employed  to  the  formation  of  dif¬ 
ferent  chemical  bonds  at  the  interface  have  been  reported  to 
date. 

Here  we  take  a  different  approach  to  band  offset  modifi¬ 
cation  in  ZnSe/GaAs(001)  interfaces  with  the  goal  of  ascer¬ 
taining  if  the  presence  of  Zn  or  Se  just  at  the  interface  is 
sufficient  to  induce  the  band  offset  changes,  and  probing 
more  closely  the  interfacial  bonds  that  are  established  in  the 
two cases. In the MBE growth procedure employed, the initial
GaAs(001)2×4 surface was exposed to an elemental flux
of  Zn  or  Se  prior  to  ZnSe  growth.  The  surface  predosing  was 
used  to  achieve  an  initial  ordered  Zn  or  Se  coverage  of  the 
surface.  Since  the  formation  of  a  Zn  ordered  monolayer  on 
GaAs in our experimental conditions required lower sample
temperatures  than  normally  employed  for  ZnSe  MBE,  we 
elected  to  perform  both  types  of  predosing  at  room  tempera¬ 
ture  (RT).  To  avoid  Zn  desorption,  and  minimize  atomic  in¬ 
terdiffusion,  we  successfully  implemented  epitaxial  regrowth 
of  ZnSe  at  RT  on  the  predosed  surface.  We  examined  the 
resulting  heterojunction  band  offsets  and  interfacial  chemical 
bond  formation  in  situ  by  means  of  synchrotron  radiation, 
soft-x-ray  photoemission  spectroscopy  (SXPS),  which  yields 
substantially  higher  energy  resolution  and  surface  sensitivity 
relative  to  conventional  XPS. 

II.  EXPERIMENTAL  DETAILS 

Si-doped (N_D = 1 × 10^^ cm^-3), 500-nm thick
GaAs(001)2×4 buffer layers were grown at 580 °C on
n+-GaAs(001) wafers in a solid source MBE facility following
the procedures described elsewhere. A micron-thick As
cap  layer  was  deposited  in  situ  onto  the  samples  to  protect 
them  during  transfer  in  air  to  the  Synchrotron  Radiation  Cen¬ 
ter  of  the  University  of  Wisconsin-Madison.  The  samples 
were introduced into a miniature II-VI MBE chamber connected
to the photoelectron spectrometer, and the As-cap
layer was removed by heating the samples to 450 °C in about
30  min,  resulting  in  a  sharp  2X4  reflection  high  energy  elec¬ 
tron  diffraction  (RHEED)  pattern,  and  a  surface  rms  rough¬ 
ness  comparable  to  that  of  as-grown  surfaces. 

Predosing of the GaAs(001)2×4 surface with Se or Zn
was performed using elemental solid sources, and atomic
fluxes of the order of 3×10^^ atoms/cm^2 s. In these conditions
it was found that monolayer or multilayer adsorption of
Se could be readily achieved at the typical ZnSe growth
temperature of 290 °C, but that only traces of Zn could be
detected on the surface. Although a systematic study of Zn
condensation on GaAs as a function of temperature and exposure
time is beyond the scope of this paper, and will be
presented elsewhere, we mention that submonolayer to
monolayer Zn saturation coverages were observed at 290 °C
on GaAs only following hour-long exposures at Zn fluxes 25
times as high as those available here.

To increase the Zn sticking coefficient, we performed the Zn
predosing (and, for consistency, the Se predosing) at RT. Under these
conditions monolayer coverages of Zn could be achieved in a few
minutes, and gave rise to the removal of the initial 2×4 surface
reconstruction and the emergence of a sharp 1×1 RHEED pattern. The
coverage was estimated using the reading of a quartz thickness
monitor located near the sample. Estimates derived from the SXPS
attenuation of the Ga 3d and As 3d core levels were within 20%-30% of
the thickness monitor reading. Se exposures also removed the initial
2×4 reconstruction, giving rise to a 1×1 pattern. However, for
coverages above 1 Å the pattern became increasingly diffuse, and
disappeared at about 2 Å. We therefore restricted our studies to a
coverage of 1 Å for both Zn and Se at RT.
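
As an illustration of the attenuation-based coverage estimate mentioned above, the following Python sketch applies the standard exponential attenuation law, I = I0 exp[-d/(λ cos θ)], to a substrate core-level intensity. It is only a sketch under stated assumptions: a uniform overlayer is assumed, and the function name, the escape depth, and the intensities used in the example are hypothetical and are not values from this work.

```python
import math


def overlayer_thickness(i_clean, i_covered, escape_depth_A, emission_angle_deg=0.0):
    """Estimate the overlayer thickness (angstroms) from the attenuation of a
    substrate core-level intensity.

    Assumes a uniform overlayer and I = I0 * exp(-d / (lambda * cos(theta))),
    where theta is the emission angle measured from the sample normal.
    All inputs are illustrative; the escape depth must be supplied by the user.
    """
    if i_clean <= 0 or i_covered <= 0:
        raise ValueError("intensities must be positive")
    if i_covered > i_clean:
        raise ValueError("covered-surface intensity exceeds clean-surface intensity")
    cos_theta = math.cos(math.radians(emission_angle_deg))
    return escape_depth_A * cos_theta * math.log(i_clean / i_covered)


# Hypothetical example: escape depth of 5 angstroms, normal emission,
# substrate signal attenuated by a factor exp(-0.2).
thickness = overlayer_thickness(1.0, math.exp(-0.2), escape_depth_A=5.0)
```

An estimate of this kind can then be compared with the thickness monitor reading, as done in the text.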

To minimize desorption and redistribution of the preadsorbed species,
as well as reduce interdiffusion across the interface, ZnSe growth
after predosing was also implemented at RT, using a compound ZnSe
cell. For relatively low growth rates of 74 Å/h, epitaxy could be
achieved even at such a low temperature, admittedly with some degree
of three-dimensional (3D) growth, especially in the case of
Zn-predosed surfaces, as indicated by the RHEED patterns (see next
section).

In  what  follows  the  Zn,  Se,  or  ZnSe  coverage  is  given  in 
angstroms  or  monolayers.  The  former  values  reflect  the 
thickness  monitor  readings,  and  assume  an  overlayer  density 
equal  to  that  of  bulk  Zn,  Se,  or  ZnSe. 

At selected stages, the growth was interrupted, and the samples were
cooled to RT and transferred to the photoelectron spectrometer. We
used synchrotron radiation from the 800 MeV electron storage ring
Aladdin, dispersed by a 6-m toroidal grating monochromator, and a
commercial hemispherical analyzer to record angle-integrated
photoelectron energy distribution curves (EDCs) from the Ga 3d,
As 3d, Zn 3d, and Se 3d core levels with an overall energy resolution
(electrons + photons) of 0.2 eV for a photon energy of 120 eV.

Most measurements reported here were obtained by collecting the
photoelectrons at an average emission angle of -20° from the sample
normal. On occasion, to further enhance the surface sensitivity,
photoelectrons were collected at about -80° from the sample normal.

When necessary, a line shape analysis of the different core levels
was performed in terms of the superposition of up to three Voigt
doublets, using a standard nonlinear least-squares minimization
routine (Refs. 20-22).
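
The kind of model function used in such a line shape analysis can be sketched as follows. This is an illustration only, not the fitting routine used in this work: a pseudo-Voigt profile (a weighted sum of a Gaussian and a Lorentzian of equal width) is substituted here for the true Voigt convolution, the least-squares minimization itself is not shown, and every name and parameter value (`center`, `splitting`, `branching`, `fwhm`, `eta`) is a hypothetical choice made for the example.

```python
import math


def pseudo_voigt(x, center, fwhm, eta):
    """Unit-area pseudo-Voigt: eta * Lorentzian + (1 - eta) * Gaussian,
    both with the same full width at half maximum (an approximation to a
    true Voigt profile)."""
    sigma = fwhm / (2.0 * math.sqrt(2.0 * math.log(2.0)))
    gauss = math.exp(-0.5 * ((x - center) / sigma) ** 2) / (sigma * math.sqrt(2.0 * math.pi))
    gamma = fwhm / 2.0
    lorentz = gamma / (math.pi * ((x - center) ** 2 + gamma ** 2))
    return eta * lorentz + (1.0 - eta) * gauss


def doublet(x, center, area, splitting, branching, fwhm, eta):
    """Spin-orbit doublet on a kinetic-energy axis: the main component sits at
    `center`, the secondary one at `center - splitting`, and their areas are in
    the ratio `branching` : 1. The total area is `area`."""
    main = area * branching / (1.0 + branching)
    secondary = area / (1.0 + branching)
    return (main * pseudo_voigt(x, center, fwhm, eta)
            + secondary * pseudo_voigt(x, center - splitting, fwhm, eta))


def model(x, doublets):
    """Superposition of several doublets; `doublets` is a list of dicts holding
    the keyword arguments of `doublet`."""
    return sum(doublet(x, **d) for d in doublets)
```

In a fit, the parameters of up to three such doublets would be varied to minimize the squared residuals between `model` and the measured spectrum.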

III.  RESULTS  AND  DISCUSSION 
A.  Epitaxy 

In Fig. 1(a) we show the RHEED pattern (10 keV primary energy, [110]
sample azimuth) from a ZnSe/Se/GaAs(001) heterostructure fabricated
by MBE at RT. The initial Se predosing of the surface (1 Å) was
followed by the deposition of 70 Å of ZnSe at RT. The streaky pattern
suggests the presence of a highly ordered overlayer. The 2×1
reconstruction is that observed during ZnSe growth at 290 °C with a
single compound effusion cell, corresponding to a BPR ~ 1 (Ref. 20).
As discussed in the previous section, the initial 2×4 GaAs
reconstruction (not shown) is rapidly replaced by a sharp 1×1 pattern
(not shown) upon Se predosing, and then by the streaky 2×1 pattern at
a ZnSe coverage of about 5 Å.

In Fig. 1(b) we show the RHEED pattern (10 keV primary energy, [11̄0]
sample azimuth) from a ZnSe/Zn/GaAs(001) heterostructure deposited by
MBE at RT. The initial Zn predosing of the surface (1 Å) was followed
by the deposition of 18 Å of ZnSe at RT. A well-defined diffraction
pattern is still visible in Fig. 1(b), indicating crystallinity of
the deposits, although the oval-shaped features in the pattern
suggest a gradual transition to the spotty pattern indicative of 3D
growth. The presence of the oval features complicates the
identification of the surface symmetry which, however, shows several
analogies with the initial 1×1 pattern observed after Zn predosing.

Fig. 1. Top: RHEED pattern (10 keV electrons) from a 70-Å-thick ZnSe
layer deposited at room temperature (RT) on a GaAs(001)2×4 surface
predosed with 1 Å of Se at RT to yield a 1×1 pattern. Sample azimuth:
[110]. Bottom: RHEED pattern from an 18-Å-thick ZnSe layer deposited
at room temperature on GaAs(001)2×4 predosed with 1 Å of Zn at RT to
yield a 1×1 pattern. Sample azimuth: [110].


B.  Se  predosing 

Representative SXPS results showing the effects of predosing and ZnSe
overgrowth are summarized in Figs. 2 and 3 for Se predosing, and in
Figs. 4 and 5 for Zn predosing. In Fig. 2 we show the evolution of
the Se 3d (left) and As 3d (right) emission following predosing of
the GaAs surface with 1 Å of Se (bottom), and upon deposition of 5 Å
(center) and 18 Å (top) of ZnSe, all at RT. For selected spectra we
also show, superimposed on the data (solid circles), the result of a
best fit (solid line) of the overall line shape in terms of the
superposition of two or three Voigt doublets (also shown, solid
line). The corresponding results for the Ga 3d and Zn 3d core levels
are shown in Fig. 3.

After Se predosing, the Se 3d emission in Fig. 2 (bottommost
spectrum) is comprised of two inequivalent doublets, shifted by
0.69 eV relative to each other. This type of line shape is consistent
with that reported following Se passivation of GaAs surfaces at RT.
In that passivation study the high-kinetic-energy component was
associated with the formation of As-Se bonds, and the
low-kinetic-energy component with Se that overlays Se bonded to As.
Note that Se predosing seems to induce little change in the Ga 3d
emission in Fig. 3 (bottom), while the As 3d line shape in Fig. 2
(bottom right) clearly shows the emergence of a broad reacted
component, shifted by 0.80 eV to lower kinetic energy. A smaller
high-kinetic-energy contribution to the As 3d line might be, at least
initially, related to some residual surface emission from regions not
yet saturated with Se (the Se coverage is only 1 Å in Figs. 2 and 3).

Fig. 2. As 3d and Se 3d core level spectra from ZnSe epilayers of
increasing thickness (center and top) deposited at RT onto a
GaAs(001) surface predosed with 1 Å of Se (bottom). The photon energy
employed was 120 eV. The vertical lines mark the positions of the
centroids of the bulk-related As 3d and Se 3d doublets. The resulting
interface core level separation is consistent with a valence band
offset $\Delta E_v$ = 0.48 ± 0.06 eV.

The lack of any Se-induced change in the Ga 3d line shape, and the
formation of a chemically shifted, high-binding-energy contribution
to the As 3d emission, are also consistent with the results of
Ref. 23, and have been tentatively explained by preferential Se-As
bond formation. The direction and order of magnitude of the As 3d
chemical shift are consistent with previous reports on the formation
of As-Se bonds. Simeca et al. observed a component at 1.23 eV higher
binding energy relative to the bulk As 3d doublet upon Se exposure of
GaAs(001) at RT. Since the intensity of the reacted component
decreased relative to the bulk component upon annealing at 250 °C,
the authors attributed the shifted doublet to As segregated on the
Se-terminated GaAs(001) surface.

We use the same interpretation for the low-kinetic-energy, reacted
As 3d component in Fig. 2 based on three considerations. First, upon
ZnSe growth on top of the Se-predosed surface, the reacted component
rapidly becomes the dominant As 3d feature when the ZnSe overlayer
thickness becomes comparable with the photoemission sampling depth,
and shifts to 1.18 eV below the substrate line. Second, measurements
at grazing emission angles (not shown) yield a dramatic increase of
the intensity of this component relative to all other As 3d features.
Third, upon annealing the overall ZnSe/Se/GaAs heterostructures at
400 °C (not shown), the component is selectively removed, leaving the
other As 3d features unchanged. We therefore suggest that Se
predosing displaces a number of As atoms from GaAs, which segregate
on top of the Se-terminated overlayer surface, and that such atoms
account for the high-binding-energy reacted As feature in Fig. 2.
Having extracted the segregated As-Se contribution from the As 3d
line shape in Fig. 2, we associate the remaining main contribution at
74 eV of kinetic energy with emission from the GaAs substrate.

Fig. 3. Ga 3d and Zn 3d core level spectra from ZnSe epilayers of
increasing thickness (center and top) deposited at RT onto a
GaAs(001) surface predosed with 1 Å of Se (bottom). The photon energy
employed was 120 eV. The vertical lines mark the positions of the
centroids of the bulk-related Ga 3d and Zn 3d doublets. The resulting
interface core level separation is consistent with a valence band
offset $\Delta E_v$ = 0.40 ± 0.06 eV.

With increasing ZnSe deposition, the initially complex Se 3d line
shape in Fig. 2 converges rapidly to the single doublet expected for
ZnSe (see also Fig. 4). Therefore it seems feasible to use the main
Se 3d doublet at a ZnSe coverage of 5-18 Å, and the substrate-related
As 3d feature in Fig. 2, to estimate the valence band offset. From
the measured separation of the centroids in Fig. 2 (13.00 ± 0.05 eV,
vertical solid lines), and the known position of each doublet
relative to the valence band maximum in bulk ZnSe and GaAs, we
calculated $\Delta E_v$ = 0.48 ± 0.05 eV.

In Fig. 3 we show the evolution of the Ga 3d (left) and Zn 3d (right)
emission following predosing of the GaAs surface with 1 Å of Se
(bottom), and upon deposition of 5 Å (center) and 18 Å (top) of ZnSe,
all at RT. The Ga 3d emission shows no line shape change following Se
predosing, but upon deposition of ZnSe onto the predosed surface, a
low-kinetic-energy doublet becomes increasingly visible 0.75 eV below
the main line. The direction and order of magnitude of the shift
would be consistent with the formation of Ga-Se bonds; however, its
origin is more ambiguous. Since the intensity of this feature becomes
gradually more important relative to the main Ga 3d line with
increasing ZnSe coverage, it is likely to reflect Ga atoms located at
the overlayer surface or within the overlayer, rather than at the
interface. Measurements at grazing emission (not shown) also tend to
emphasize this feature. Upon annealing at 400 °C, however, this
feature is not selectively removed, as its As-related counterpart in
Fig. 2 is, but rather is rapidly enhanced until it becomes the
dominant feature.

Fig. 4. As 3d and Se 3d core level spectra from ZnSe epilayers of
increasing thickness (center and top) deposited at RT onto a
GaAs(001) surface predosed with 1 Å of Zn (bottom). The photon energy
employed was 120 eV. The vertical lines mark the positions of the
centroids of the bulk-related As 3d and Se 3d doublets. The resulting
interface core level separation is consistent with a valence band
offset $\Delta E_v$ = 0.97 ± 0.06 eV.

We therefore propose that the reacted Ga 3d component derives from
interdiffusion across the ZnSe/GaAs interface, i.e., from Ga atoms
outdiffusing from the substrate into the overlayer. Such outdiffusion
is enhanced by annealing, and possibly by the presence of Zn
vacancies, which are likely to accompany ZnSe regrowth on a Se-rich
substrate surface. Enhanced Ga outdiffusion in Se-rich ZnSe/GaAs(001)
interfaces has indeed been observed upon annealing, with the
formation of characteristic deep level emission at 2.0 eV which
reflects Ga_Zn-V_Zn complexes. We emphasize that this is quite unlike
the mechanism proposed to explain the reacted As component in Fig. 2,
which derives from a fixed number of As atoms that are displaced
during the initial Se predosing and segregate at the overlayer
surface. Upon annealing, the segregated As atoms simply desorb from
the surface, while Ga atoms continue to outdiffuse into the
overlayer.

The Zn 3d line shape in Fig. 3 remains unchanged throughout the
coverage range explored, so that one can use the Zn 3d line and the
substrate-related Ga 3d feature in Fig. 3 to estimate the valence
band offset. From the measured separation of the centroids in Fig. 3
(9.30 ± 0.04 eV), and the known position of each doublet relative to
the valence band maximum in bulk ZnSe and GaAs, we found
$\Delta E_v$ = 0.40 ± 0.05 eV, in good agreement with the value
derived from the anion core levels in Fig. 2. This low value for the
valence band offset is quantitatively consistent with that determined
in ZnSe/GaAs(001) heterojunctions grown by MBE at 290 °C with a low
Zn/Se BPR. This compelling similarity has two important implications.
First, it supports the contention that the effect of using a low BPR
in the early growth stages of ZnSe is to drive the system toward the
formation of a Se-terminated interface. Second, since interdiffusion
at the interface is likely to be minimized at RT relative to 290 °C
(as indicated by the increase in Ga outdiffusion which we observe in
our sample upon annealing) and we still measure a comparable offset
in the two cases, this argues against a substantial role of
interdiffusion-related doping dipoles in determining the experimental
band offsets.

Fig. 5. Ga 3d and Zn 3d core level spectra from ZnSe epilayers of
increasing thickness (center and top) deposited at RT onto a
GaAs(001) surface predosed with 1 Å of Zn (bottom). The photon energy
employed was 120 eV. The vertical lines mark the positions of the
centroids of the bulk-related Ga 3d and Zn 3d doublets. The resulting
interface core level separation is consistent with a valence band
offset $\Delta E_v$ = 0.97 ± 0.06 eV.

C.  Zn  predosing 

In Fig. 4 we show the evolution of the Se 3d (left) and As 3d (right)
emission following predosing of the GaAs surface with 1 Å of Zn
(bottom), and upon deposition of 5 Å (center) and 18 Å (top) of ZnSe,
all at RT. The corresponding results for the Ga 3d and Zn 3d core
levels are shown in Fig. 5, where we also show, superimposed on the
data (solid circles) for selected Ga 3d spectra, the result of a best
fit (solid line) of the overall line shape in terms of the
superposition of two or three Voigt doublets (also shown, solid
line).

The line shape of the Se 3d and As 3d doublets remains remarkably
constant in Fig. 4. In particular, we observe no qualitative changes
in the As 3d line shape after Zn predosing and ZnSe regrowth, in
sharp contrast with Se predosing. We can therefore directly use the
position of the two centroids to gauge the valence band offset. From
the measured separation of the centroids in Fig. 4 (13.53 ± 0.02 eV),
and the known position of each doublet relative to the valence band
maximum in bulk ZnSe and GaAs, we obtained
$\Delta E_v$ = 1.01 ± 0.05 eV, i.e., a value which is 0.6 eV higher
than that observed for interfaces fabricated with Se predosing.

The Ga 3d and Zn 3d core levels in Fig. 5 exhibit substantial changes
following predosing and ZnSe regrowth. Upon Zn predosing (bottom),
the Ga 3d line shape shows a definite high-kinetic-energy
contribution, together with a smaller low-kinetic-energy contribution
that could derive from residual surface-related emission. Although
for the moment our attempts at obtaining a consistent line shape
analysis for the Zn 3d doublet at different emission angles and
photon energies have been less than satisfactory, qualitatively it
seems clear that the Zn 3d line shape following predosing also shows
an unresolved high-kinetic-energy contribution relative to the main
line.

The formation of Ga-Zn bonds at the surface might explain the
presence of high-kinetic-energy contributions to both the Ga 3d and
Zn 3d line shapes following predosing, in view of the
electronegativity difference between Ga, Zn, and As. However, upon
deposition of 5 Å of ZnSe the high-kinetic-energy component of the
Ga 3d line is no longer detected, so that it is not clear if Ga-Zn
bonds at the interface play a role in the final heterojunction. The
Zn 3d line shape at 5 Å ZnSe coverage (center, Fig. 5) still shows an
unresolved high-kinetic-energy contribution, suggesting that this
might be the case, but the same contribution might be due to
segregated Zn atoms displaced from the interface. Since no reacted
As 3d contribution is visible in Fig. 4, as opposed to Fig. 2, i.e.,
no As segregation is observed, we are forced to conclude either that
any Zn-Ga bond formation occurs without displacing the As from the
interface region, leading to the formation of a complex ternary
phase, or that the displaced As atoms are desorbed from the surface.
The latter would be somewhat surprising in view of the high stability
of Zn-As bonds, and of the low temperature at which the experiment
was performed.

The emergence of an important low-kinetic-energy contribution 0.81 eV
below the main Ga 3d line in Fig. 5 upon deposition of 5 and 18 Å of
ZnSe also raises some puzzling issues. This feature is substantially
broader and has a somewhat larger chemical shift as compared to the
corresponding feature in Fig. 3. In the absence of data concerning
the dependence of the intensity of this feature on the escape depth
and on annealing time and temperature, at this stage we cannot
discriminate between a reacted interface contribution and a
surface-segregated contribution. In both cases, the direction and
order of magnitude of the corresponding chemical shift point to Ga-Se
bond formation.

From the measured separation (8.73 ± 0.05 eV) between the centroids
of the Zn 3d line at high ZnSe coverage and of the substrate-related
Ga 3d doublet in Fig. 5 (vertical solid lines), and the known
position of each doublet relative to the valence band maximum in bulk
ZnSe and GaAs, we found $\Delta E_v$ = 0.97 ± 0.06 eV, in very good
agreement with the value derived from the anion core levels in
Fig. 4.
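
The arithmetic behind the four offsets quoted in Secs. III B and III C can be sketched in a few lines of Python. This is a consistency check under a stated assumption, not the authors' procedure: the two bulk reference separations below are not taken from the cited reference but are back-calculated from the Se-predosing numbers given in the text (13.00 - 0.48 eV and 9.30 + 0.40 eV), and the names `BULK_ANION`, `BULK_CATION`, and `band_offsets` are hypothetical.

```python
# Interface core-level separations reported in the text (eV).
SEPARATIONS = {
    "Se predosing": {"Se3d-As3d": 13.00, "Ga3d-Zn3d": 9.30},
    "Zn predosing": {"Se3d-As3d": 13.53, "Ga3d-Zn3d": 8.73},
}

# ASSUMPTION: bulk reference separations inferred from the Se-predosing
# values quoted in the text, not read from the cited reference.
BULK_ANION = 12.52   # (Se 3d to VBM in ZnSe) minus (As 3d to VBM in GaAs)
BULK_CATION = 9.70   # (Ga 3d to VBM in GaAs) minus (Zn 3d to VBM in ZnSe)


def band_offsets(interface):
    """Return (anion-derived, cation-derived) valence band offsets in eV.

    The anion-derived offset grows with the Se 3d-As 3d separation, while
    the cation-derived offset shrinks as the Ga 3d-Zn 3d separation grows.
    """
    sep = SEPARATIONS[interface]
    from_anions = round(sep["Se3d-As3d"] - BULK_ANION, 2)
    from_cations = round(BULK_CATION - sep["Ga3d-Zn3d"], 2)
    return from_anions, from_cations


for name in SEPARATIONS:
    print(name, band_offsets(name))
```

With reference values inferred from the Se-predosed interface alone, the same two constants reproduce the 1.01 and 0.97 eV offsets quoted for the Zn-predosed interface, which is the internal consistency the text relies on.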

IV.  CONCLUSIONS 

Ordered monolayer coverages of Zn and Se were obtained on the
GaAs(001)2×4 surface at room temperature prior to ZnSe MBE.
Adsorption removed the initial 2×4 surface reconstruction and yielded
a 1×1 surface periodicity in both cases. On top of these adsorbed
layers, we achieved ZnSe epitaxial overgrowth at room temperature.
Low temperature was required to minimize Zn desorption, and suppress
the Zn indiffusion and the Ga outdiffusion which are known to take
place across the ZnSe/GaAs(001) interface at higher growth
temperatures. Arsenic segregation on top of the ZnSe/Se/GaAs(001)
structure was nevertheless observed, as a result of the displacement
of As atoms from the interface during Se predosing. This type of
interface reaction was not observed following Zn predosing.

Although ZnSe epitaxial growth was performed with identical Zn/Se
beam pressure ratios for both types of interfaces, widely different
band offsets were observed for the two interfaces. Se predosing
resulted in valence band offsets as low as 0.40 ± 0.05 eV, while Zn
predosing yielded valence band offsets as high as 1.01 ± 0.05 eV.
Such values are compellingly similar to those observed for
ZnSe/GaAs(001) heterojunctions grown with low and high Zn/Se BPR,
respectively.

Our results support the contention that the effect of using a low
(high) BPR in the early growth stages of ZnSe is to drive the system
toward the formation of a Se-terminated (Zn-terminated) interface.
Furthermore, since interdiffusion across the interface is minimized
at RT relative to typical growth temperatures, and comparable band
offsets are observed in the two cases, interdiffusion-related doping
dipoles are unlikely to have a substantial role in determining the
experimental band offset.

The in-plane anisotropy ($\epsilon_x \neq \epsilon_y$) and the
off-plane anisotropy of ZnSe/GaAs interfaces formed under conditions
that promote the formation of either Zn-As or Ga-Se bonds are studied
by reflectance difference spectroscopy. Two resonance lines, one at
2.70 eV and the other around 3.0 eV, have been observed. The in-plane
anisotropy is along the [110] and the [11̄0] principal axes. The
dependence of the resonances on interface formation conditions, the
results of photoreflectance spectroscopy, and the annealing
experiments all suggest that the anisotropy is not due to the
electro-optic effect resulting from an interface electric field. The
reflectance difference spectroscopy results are consistent with the
assumptions that the anisotropy is an intrinsic property of the
ordered ZnSe/GaAs heterovalent interface, that the resonance at
2.70 eV is associated with the interface state of Zn-As bonds, and
that the resonance near 3.0 eV is associated with the interface state
of Ga-Se bonds. The presence of a thin layer of S at the ZnSe/GaAs
interface results in a third resonance at 3.2 eV, probably due to
Ga-S bonds. The resonance line shape changes as the top layer
thickness varies. Such a change can be well explained by a
three-layer system with conventional Fresnel optics and is understood
as being due to the ZnSe top layer. It is our hope that our results
will stimulate more interest in the theoretical study of this
interface. © 1996 American Vacuum Society.


I. INTRODUCTION

The properties of semiconductor interfaces are an important topic of
research for both their fundamental and practical significance. As
the technology of epitaxial thin film growth advances, it becomes
clear that the quality of a thin film grown on a substrate depends
critically on the formation of the film/substrate interface.
Experimentally, however, a buried solid interface is difficult to
probe. Current surface diagnostic techniques such as electron beam
diffraction have limited penetration depth and can only probe
interfaces covered by a very thin layer several nanometers thick.
However, the interfaces may change as the epilayers grow thicker, as
in the case of strain relaxation. Interfaces buried hundreds of
nanometers beneath the surface are out of reach of the current
surface diagnostic techniques, and the development of new techniques
becomes necessary. Optical spectroscopy has a much larger penetration
depth, and so far the most successful optical technique applied to
surfaces and interfaces has been reflectance difference/anisotropy
spectroscopy (RDS/RAS). RDS measures the difference between the
reflectances of light polarized along two perpendicular directions as
a function of photon energy. When light is normally incident on a
surface/interface sample and the bulk optical response of the sample
is isotropic, the only contribution to the reflectance difference
(RD) comes from the surface/interface. RDS is therefore a
surface/interface sensitive technique, and has been widely employed
in the study of surface electronic properties of III-V
semiconductors. RDS has also been applied to study Si/SiO2 interfaces
buried under a layer of transparent SiO2, and the sulfur-passivated
GaAs surface.

ZnSe is an important material for light emitting devices in the blue
color region. Continuous wave lasing at room temperature has been
demonstrated and the life span of the devices has been extended to
over 100 h. At present, most ZnSe-based devices are grown on GaAs
substrates because of their high quality and relatively low cost. The
quality of the ZnSe/GaAs interface is one of the key factors, if not
the most important factor, that limits the device life span.
Meanwhile, a perfect ZnSe/GaAs interface is expected to contain a
single layer of Zn-As or Ga-Se bonds, depending on how the substrate
surface is treated prior to growth, and these bonds point along
either the [110] or the [11̄0] direction for a (001) oriented
substrate. As a result, the ZnSe/GaAs interface is expected to
exhibit optical anisotropy of the same magnitude as a clean GaAs
surface.

One problem to be solved in studying the ZnSe/GaAs interface using
optical techniques such as RDS is to separate the interface signal
from that of the ZnSe surface. Although this has not been clearly
demonstrated, a well-ordered ZnSe surface is nevertheless expected to
exhibit sizable dielectric anisotropy and contribute significantly to
the RDS signal. To study the ZnSe/GaAs interface, it is therefore
necessary to passivate the ZnSe surface to make it optically
isotropic.

In our previous work, we reported an RDS study of ZnSe/GaAs
interfaces buried hundreds of nanometers below the ZnSe surface.
Sharp resonances in the RDS spectra at 2.70 and 3.0 eV were observed.
Efforts were made to identify the origin of the resonances, and it
was concluded that RD due to the electro-optic effect was not the
likely cause because no interface electric field had been observed.
It was suggested that the Ga-Se and/or the Zn-As bonds might be
related to the resonances. In this article, we report a systematic
study of the optical anisotropy of ZnSe/GaAs(001) interfaces prepared
under different conditions. The surfaces of the GaAs substrates were
pretreated to promote the formation of either Zn-As bonds or Ga-Se
bonds. More evidence is obtained that suggests the absence of the
electro-optic effect in our samples. Within the interface plane the
anisotropy occurs along the [110] and [11̄0] principal axes. In
addition, with the employment of a special RDS configuration, the
optical anisotropy off the interface plane is also studied. Both the
in-plane and the off-plane anisotropy vary for interfaces formed
under different conditions. Efforts to identify the origin of the
interface optical anisotropy are made and the results are reported.


II.  EXPERIMENT 

The RDS optical setup is essentially the same as described in Ref. 1
except that the polarization of the incident light is now at 45° to
the incidence plane. In such a configuration, both the anisotropic
and isotropic interface states can be observed. The samples can be
rotated within the surface plane. The incidence angle is 7° off the
normal direction of the sample surface. The difference between the
complex reflectances of light polarized in the two directions s and
p, the conventional polarization directions in the oblique incidence
case, is measured. The samples are undoped ZnSe single crystal films
grown at 250 to 300 °C by molecular beam epitaxy (MBE) on (001) GaAs
semi-insulating substrates. Planar and cross-section transmission
electron micrographs show that the interfaces are sharp and flat,
with no sign of corrugated quantum-wire type structures. The density
of stacking faults and dislocations is in the range of 10^… to the
low 10^… per cm^2. The surfaces of the ZnSe layers are covered with a
layer of Se at the end of growth to prevent oxidation when exposed to
air. The typical full width at half maximum (FWHM) of the x-ray
diffraction peak of the films is a few hundred arc seconds. Epilayers
with thicknesses ranging from 18 Å to more than 1 μm have been
studied. All RDS measurements are done in air at room temperature.

The measured RD spectrum, $\Delta\rho = 2(r_s - r_p)/(r_s + r_p)$,
contains three parts, namely the anisotropic part $\Delta\rho_a$, which
is due to the interface dielectric anisotropy, the isotropic part
$\Delta\rho_i$, which is due to the off-plane anisotropy of the interface,
and the featureless isotropic part $\Delta\rho_F$ of the conventional
Fresnel RD due to the nonzero incidence angle. $\Delta\rho_a$ has a
$\sin 2\theta$ dependence on the angle $\theta$ between the incidence
polarization and the [110] principal axis, while $\Delta\rho_i$ and
$\Delta\rho_F$ have no such angular dependence. By measuring
$\Delta\rho(\theta)$ as a function of $\theta$, one can then separate
$\Delta\rho_a$ from the isotropic parts $\Delta\rho_i + \Delta\rho_F$, i.e.,
$\Delta\rho_a = [\Delta\rho(0^\circ) - \Delta\rho(90^\circ)]/2$ and
$\Delta\rho_i + \Delta\rho_F = [\Delta\rho(0^\circ) + \Delta\rho(90^\circ)]/2$.
The isotropic parts can further be confirmed by the
$\Delta\rho(45^\circ)$ spectra.

Fig. 1. (a) The complex in-plane RD spectra of sample B1. The solid curve
represents the real part of RD, while the dashed curve represents the imaginary
part. The thickness of the ZnSe epilayer on the GaAs(100) substrate is
about 1000 Å. (b) The complex off-plane RD spectra of sample B1. The
solid curve represents the real part of RD, while the dashed curve represents
the imaginary part.
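
The separation of the anisotropic and isotropic parts described in Sec. II amounts to a half-difference and a half-sum of two spectra. A minimal Python sketch follows, assuming that the spectra at the two sample orientations are available as equal-length lists of complex values on a common photon-energy grid. The function name `separate_rd` and the synthetic numbers are illustrative only and are not taken from the measurements.

```python
def separate_rd(rd_0deg, rd_90deg):
    """Split RD spectra into anisotropic and isotropic parts.

    rd_0deg, rd_90deg: complex RD values measured at sample rotation
    angles of 0 and 90 degrees on the same photon-energy grid.
    Returns (anisotropic, isotropic); the isotropic list is the sum of
    the off-plane and Fresnel contributions.
    """
    if len(rd_0deg) != len(rd_90deg):
        raise ValueError("spectra must share the same energy grid")
    anisotropic = [(a - b) / 2 for a, b in zip(rd_0deg, rd_90deg)]
    isotropic = [(a + b) / 2 for a, b in zip(rd_0deg, rd_90deg)]
    return anisotropic, isotropic


# Synthetic check (hypothetical values): the anisotropic part changes
# sign on a 90 degree rotation, while the isotropic part does not.
true_aniso = [0.010 + 0.002j, 0.020 - 0.004j]
true_iso = [0.001 + 0.000j, 0.003 + 0.001j]
rd_0 = [a + i for a, i in zip(true_aniso, true_iso)]
rd_90 = [-a + i for a, i in zip(true_aniso, true_iso)]
aniso, iso = separate_rd(rd_0, rd_90)
```

The sketch only assumes what the half-difference formula itself implies, namely that the anisotropic contribution reverses sign between the two orientations.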


III.  RESULTS  AND  DISCUSSIONS 

An undoped ZnSe single crystal (100) substrate about 1
mm thick was measured first to test the RDS apparatus. The
obtained  Ap  is  featureless  and  is  of  the  order  of  10~^,  largely 
due to the Fresnel RD of p and s polarization. The bulk ZnSe
contribution to RD can therefore be neglected for our purposes.

A.  Effects  of  surface  treatments 

The five samples, all with the same nominal ZnSe layer
thickness of 1000 Å but with different GaAs substrate surface
treatments, are presented first. The samples were all
grown by molecular beam epitaxy (MBE) at 250 °C on (001)
GaAs substrates. The substrate for sample B1 was first
treated with 5H2SO4+H2O2+H2O and heated in vacuum at
600 °C for 10 min. Growth of ZnSe was then initiated by
simultaneous exposure to Zn and Se fluxes. Figures 1(a) and 1(b)
show the in-plane and the off-plane RD spectra of sample
B1. A strong resonance line around 3.1 eV is clearly seen
in the in-plane RD spectrum. A much sharper but much
weaker resonance line is also observed at 2.70 eV [see the
inset of Fig. 1(a)], with a FWHM of only 20 meV. The off-plane
RD spectrum shows a pronounced resonance near 3.0
eV in the imaginary part of $\Delta\rho$ ($=\Delta r/r$) and a
derivative-like feature near the same energy in the real part of $\Delta\rho$.

Fig. 2. The complex in-plane RD spectra of samples B2 through B5. The
solid curves represent the real parts of RD, while the dashed curves represent
the imaginary parts of RD.

The substrates for samples B2 and B4 were first treated
with (NH3)S_x and heated to 380 °C for 4 min in vacuum. The
substrates for samples B3 and B5 were first treated with
S2Cl2 and heated to 380 °C for 4 min. The chemical pretreatment
left a layer of S on the GaAs surface. The substrates of
B2  and  B3  were  exposed  to  Se  flux  (partial  pressure 
3.2X10“'^  Pa)  for  1  min  prior  to  growth,  while  those  of  B4 
and  B5  were  exposed  to  Zn  flux  (partial  pressure  1.5X10“'^ 
Pa)  for  1  min  before  growth.  Their  in-plane  and  off-plane  RD 
spectra are shown in Figs. 2 and 3, respectively. Several
important points can be made by examining these spectra.

Fig. 3. The complex off-plane RD spectra of samples B2 through B5. The
solid curves represent the real parts of RD, while the dashed curves represent
the imaginary parts of RD.

The in-plane RD spectra of all five samples are qualitatively
the same. Two resonance lines, one at 2.70 eV (R-1)
and the other near 3.0 eV (R-2), are present in all the spectra.
The energy of 2.70 eV is slightly above the $E_0$ band gap
energy of 2.67 eV at room temperature, while the energy of
3.0 eV is near the $E_0+\Delta_0$ gap of ZnSe and the $E_1$ gap of
GaAs. The fact that the resonance energies are close to the
critical energy points of bulk ZnSe and GaAs suggests that
the resonances are caused by perturbation of bulk electronic
states. The relative intensities of the two resonances, however,
depend on the substrate surface exposure to molecular
beams prior to growth. For sample B1, which was grown
without any pregrowth beam exposure, the intensity of resonance
line R-2 at 3.0 eV is about 60 times the intensity of
R-1 at 2.7 eV. For B2 and B3, which were grown on Se-rich
substrate surfaces, the ratio of R-2 to R-1 intensities is
about 15, and for B4 and B5, grown on Zn-rich substrate
surfaces, the ratio is only about two. The off-plane RD spectra
of samples grown on Zn-rich surfaces (B4 and B5) are
also distinctly different from those grown on Se-rich surfaces
(B2 and B3) or on a bare GaAs surface (B1). A resonance in
the off-plane RD spectra at nearly the same energy as R-2 of
the in-plane RD spectra is observed for samples B1, B2, and B3,
while there are no resonances for B4 and B5. Resonances in
off-plane RD spectra indicate the presence of interface states,
because only states localized at the interface, i.e., of
two-dimensional nature, can cause off-plane anisotropy.

A third resonance, R-3, is observed for samples B2, B4,
and B5 (although barely seen in Fig. 2, the resonance is well
defined when a proper vertical scale is used). The resonance,
however, is not observed in samples without S coverage.

Different chemical treatments of the substrate surface seem to
have different effects on the overall magnitude of the RD
spectra. The RD magnitude of B5 is about five times that
of B4. Both samples were grown on Zn-exposed surfaces.
The substrate surface of B5 was pretreated with S2Cl2, while
that of B4 was pretreated with (NH3)S_x. Both chemical treatments
leave an S-covered GaAs surface with a much reduced
density of surface states, with the S2Cl2 treatment resulting in
stronger Ga-S bonds. For samples B2 and B3 grown on Se-exposed
surfaces the effect of chemical pretreatment is opposite. The
RD magnitude of B2, which was (NH3)S_x treated, is about two
times that of B3.

Other samples such as ZnSe-7, ZnSe-8, ZnSe-41, and
ZnSe-66 were grown on 5H2SO4+H2O2+H2O treated GaAs
surfaces. ZnSe-7, ZnSe-8, and ZnSe-66 were grown on a Zn-rich
GaAs surface, and ZnSe-41 was grown on a Se-covered
2×3 reconstructed GaAs surface resulting from prolonged Se
beam exposure. The interface defect density of ZnSe-41 is
much lower than that of the others grown on the 2×1 reconstructed
surface.  For  example  the  stacking  faults  density  of  ZnSe-41 
is  in  the  low  10"^  cm^,  close  to  the  high  quality  samples 
grown in two-chamber MBE systems. The preparation procedures
of all the samples along with their RDS results are
summarized in Table I.

Table I. Substrate surface pretreatments, RDS resonance intensity ratio, RDS resonance peak-to-peak values, and the thickness of the ZnSe overlayer of the
ZnSe/GaAs samples.

Sample   Chemical dip      Preheat           Flux exposure  RHEED    R2/R1  RDS value     ZnSe layer
                                                            pattern  ratio  peak-to-peak  thickness (Å)
B1       5H2SO4+H2O2+H2O   10 min at 600 °C  No             None     60     0.02          1000
B2       (NH3)S_x          4 min at 380 °C   Se             None     15     0.03          1000
B3       S2Cl2             4 min at 380 °C   Se             None     15     0.015         1000
B4       (NH3)S_x          4 min at 380 °C   Zn             None     2      0.004         1000
B5       S2Cl2             4 min at 380 °C   Zn             None     2      0.02          1000
ZnSe-7   5H2SO4+H2O2+H2O   3 min at 380 °C   Zn             2×1      6      0.005         1200
ZnSe-8   5H2SO4+H2O2+H2O   3 min at 380 °C   Zn             2×1      6      0.005         1000
ZnSe-41  5H2SO4+H2O2+H2O   3 min at 380 °C   Se (prolonged) 2×3      60     0.04          325
ZnSe-66  5H2SO4+H2O2+H2O   3 min at 380 °C   Zn             2×1      6      0.04          18
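
The magnitude comparisons quoted in Sec. III A can be checked directly against the entries of Table I. The short Python sketch below transcribes the flux exposure, the R-2/R-1 ratio, and the peak-to-peak RDS value of each sample from the table; the dictionary layout and the helper names `magnitude_ratio` and `ratios_by_flux` are illustrative choices, not part of the original analysis.

```python
# (flux exposure, R-2/R-1 intensity ratio, RDS peak-to-peak value),
# transcribed from Table I.
TABLE_I = {
    "B1": ("none", 60, 0.02),
    "B2": ("Se", 15, 0.03),
    "B3": ("Se", 15, 0.015),
    "B4": ("Zn", 2, 0.004),
    "B5": ("Zn", 2, 0.02),
    "ZnSe-7": ("Zn", 6, 0.005),
    "ZnSe-8": ("Zn", 6, 0.005),
    "ZnSe-41": ("Se", 60, 0.04),
    "ZnSe-66": ("Zn", 6, 0.04),
}


def magnitude_ratio(sample_a, sample_b):
    """Ratio of the peak-to-peak RDS values of two samples."""
    return TABLE_I[sample_a][2] / TABLE_I[sample_b][2]


def ratios_by_flux():
    """Group the distinct R-2/R-1 ratios by pregrowth flux exposure."""
    grouped = {}
    for flux, ratio, _ in TABLE_I.values():
        grouped.setdefault(flux, set()).add(ratio)
    return {flux: sorted(values) for flux, values in grouped.items()}


b5_over_b4 = magnitude_ratio("B5", "B4")  # text: "about five times"
b2_over_b3 = magnitude_ratio("B2", "B3")  # text: "about two times"
```

Grouping by flux exposure makes the trend discussed in the text visible at a glance: the Zn-exposed samples have the smallest R-2/R-1 ratios, and the Se-exposed and unexposed samples the largest.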

B.  Line  shape 

It is noted in the spectra shown in Figs. 1-3 that the real
part of $\Delta\rho$ is closely related to the imaginary part of $\Delta\rho$:
if the real part of a resonance has a line shape,
then the corresponding imaginary part is derivative-like, or
vice versa, as in the case of the R-1 resonance of B4 and
B5 and the R-2 resonance of B2 and B4. In other cases, the
resonances are neither line-like nor derivative-like, but a mixture
of the two, as in the case of the R-1 and R-2 resonances of
B3. As will be seen in the following text, the line shape of
the resonances depends on the ZnSe epilayer thickness, and a
single interface anisotropic dielectric function can produce
an RD resonance that is line-like, derivative-like, or a
mixture of the two, depending on the ZnSe overlayer thickness.
The difference in line shape of the resonances of these
five samples indicates that there is a small variation of ZnSe
thickness from sample to sample, even though nominally
they are all the same. The effect of finite overlayer thickness
on interface RD spectra has been effectively treated in Ref.
5. Here we take a slightly different approach to demonstrate
how drastically the resonance line shape is affected by the top
ZnSe layer in the energy range where ZnSe is highly absorbing.

Rather than extracting $\Delta\rho(d=0)$ for zero overlayer thickness
from the measured $\Delta\rho(d)$ using the approach in Ref. 5,
we simulate the interface with an anisotropic thin layer with
a Lorentz-type anisotropic dielectric function resonant at
2.72 eV, and show the calculated RD spectra for various
ZnSe overlayer thicknesses. For clarity, only the R-1 portion of
the spectra is shown. The three-layer optical system consists
of a ZnSe top layer, an interface layer of 50 Å, and
an infinitely thick GaAs substrate. The reflectance of the
system is calculated by applying the electromagnetic field
boundary conditions at each interface. The dielectric functions
of bulk ZnSe and GaAs are taken from Refs. 9 and 10,
respectively. Only the results at normal incidence are presented
here, as a small tilt angle of 7° in this model merely
results  in  a  featureless  RD  background  of  the  order  of  10”^. 
The upper curves of Figs. 4(a) and 4(b) show the calculated
complex $\Delta\rho$ for two ZnSe thicknesses. For a
950-Å-thick ZnSe top layer there is a sharp line in the Re($\Delta\rho$)
spectrum and a derivative-like feature in the Im($\Delta\rho$) spectrum.
The experimental RD spectrum of ZnSe-8 with a 1000 Å top layer is
shown in the lower part of the figure for comparison. The
thickness of ZnSe-8 was determined from the interference
fringes of the x-ray diffraction curve. For a 1150-Å-thick
ZnSe layer the calculated line shapes of Re($\Delta\rho$) and Im($\Delta\rho$)
are reversed, i.e., Im($\Delta\rho$) is line-like and Re($\Delta\rho$) is
derivative-like, and they resemble those of sample ZnSe-7
with a 1200 Å ZnSe layer. Simulated RD spectra at layer
thicknesses between 1000 and 1200 Å are a mixture of the
two shown in Fig. 4, which explains the shape of the RD
spectra of sample B3. Over a wider thickness range the simulated
spectra evolve periodically from line shape to derivative-like
and back to line shape, consistent with the RD
spectra of other ZnSe/GaAs samples with corresponding
overlayer thicknesses. It is important to account for the epilayer
phase factor in the RD spectra. Otherwise, one would
come to the wrong conclusion that the ZnSe/GaAs interface
anisotropy changes with the thickness of the ZnSe layer.

Fig. 4. Simulated RD spectra with (a) a 950 Å ZnSe layer and (b) a 1150 Å
ZnSe layer. The experimental RD spectra of sample ZnSe-8 with a 1000 Å
ZnSe layer and sample ZnSe-7 with a 1200 Å ZnSe layer are shown for
comparison.

Fig. 5. In-plane RD spectra of sample ZnSe-66 with an 18-Å-thick ZnSe
overlayer. The solid curve represents the real part of RD, while the dashed
curve represents the imaginary part of RD.
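
The three-layer calculation described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the calculation used for Fig. 4: the bulk dielectric functions of Refs. 9 and 10 are replaced by constant placeholder refractive indices, the oscillator strength and the isotropic background of the interface layer are hypothetical, and the names `lorentz_eps`, `stack_reflectance`, `rd_signal`, and `quarter_turn_thickness` are introduced here for convenience. Only the 2.72 eV resonance energy and the 50 Å interface thickness come from the text; the 20 meV linewidth is borrowed from the measured FWHM of R-1.

```python
import cmath
import math

HC_EV_ANGSTROM = 12398.42  # photon energy (eV) times vacuum wavelength (angstrom)


def lorentz_eps(energy_ev, eps_background, strength, e0, gamma):
    """Lorentz-oscillator dielectric function (energies in eV)."""
    return eps_background + strength / (e0**2 - energy_ev**2 - 1j * gamma * energy_ev)


def stack_reflectance(energy_ev, layers, eps_substrate):
    """Normal-incidence amplitude reflectance of thin layers on a substrate.

    layers: list of (eps, thickness_angstrom), ordered from top to bottom.
    The ambient is vacuum. Interfaces are combined from the bottom up
    with the standard single-film (Airy) formula.
    """
    wavelength = HC_EV_ANGSTROM / energy_ev
    n = [1.0 + 0j] + [cmath.sqrt(eps) for eps, _ in layers] + [cmath.sqrt(eps_substrate)]
    r = (n[-2] - n[-1]) / (n[-2] + n[-1])  # lowest interface
    for j in range(len(layers) - 1, -1, -1):
        n_above, n_layer = n[j], n[j + 1]
        thickness = layers[j][1]
        # Round-trip phase factor of layer j (decays if the layer absorbs).
        phase = cmath.exp(4j * math.pi * n_layer * thickness / wavelength)
        r_top = (n_above - n_layer) / (n_above + n_layer)
        r = (r_top + r * phase) / (1 + r_top * r * phase)
    return r


def rd_signal(energy_ev, znse_thickness, strength=0.5):
    """RD = 2(r_x - r_y)/(r_x + r_y) for an anisotropic interface layer."""
    eps_znse = (2.8 + 0.1j) ** 2  # placeholder, NOT the data of Ref. 9
    eps_gaas = (4.6 + 0.6j) ** 2  # placeholder, NOT the data of Ref. 10
    # Hypothetical interface layer: resonant along x only, ZnSe-like along y.
    eps_x = lorentz_eps(energy_ev, eps_znse, strength, 2.72, 0.02)
    eps_y = lorentz_eps(energy_ev, eps_znse, 0.0, 2.72, 0.02)
    r_x = stack_reflectance(energy_ev, [(eps_znse, znse_thickness), (eps_x, 50.0)], eps_gaas)
    r_y = stack_reflectance(energy_ev, [(eps_znse, znse_thickness), (eps_y, 50.0)], eps_gaas)
    return 2 * (r_x - r_y) / (r_x + r_y)


def quarter_turn_thickness(energy_ev, n_real):
    """Thickness change that shifts the round-trip phase by pi/2."""
    return HC_EV_ANGSTROM / energy_ev / (8 * n_real)


# Complex RD near the resonance for the two thicknesses of Fig. 4.
rd_950 = [rd_signal(2.60 + 0.01 * k, 950.0) for k in range(25)]
rd_1150 = [rd_signal(2.60 + 0.01 * k, 1150.0) for k in range(25)]
```

The round-trip phase factor of the top layer is $\exp(4\pi i n d/\lambda)$, so a thickness change of $\lambda/8n$ rotates it by $\pi/2$ and roughly interchanges the roles of the real and imaginary parts. With the assumed index of about 2.8 at 2.72 eV, `quarter_turn_thickness(2.72, 2.8)` gives roughly 200 Å, comparable to the 950 to 1150 Å step between Figs. 4(a) and 4(b).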

C.  Origin  of  anisotropy 

We now turn to the discussion of possible causes of the
observed resonances. First, the observed features are not
from the ZnSe surface layer, because the RD spectra of ZnSe
bulk samples show no resonance features, and the RD spectra
of thick (>5000 Å) samples are featureless in the energy
range above the band gap of ZnSe. This is consistent with the
fact that the surface oxide layer is amorphous and isotropic.
Second, the observed RD is not a bulk effect. Apart from the
evidence from the results of thick samples, no obvious
thickness dependence of the RD resonance intensity
has been found in the dozens of samples investigated so
far for thicknesses below 2000 Å. The RDS resonance intensity
varies from sample to sample with nearly the same ZnSe
thickness, the cause of which will be discussed later in this
article. Strong resonances comparable to that of B5 were
observed for samples with ZnSe layers as thin as 18 Å, as is
shown in Fig. 5. Nor is it due to a GaAs bulk effect, because
the observed resonance energies shift with alloy composition
in the same way as the energy gap $E_0$ for ZnCdSe
and ZnSSe alloy films. The only remaining source of anisotropy
is therefore the epilayer/substrate interface.

One possible origin of the anisotropy is the electro-optic
effect due to the presence of an interface electric field. It
is an attractive assumption because the observed resonance
energies are close to the critical energies of bulk ZnSe. The
field could be due to extended interdiffusion of Ga and Zn
atoms, which serve as dopants in the adjacent layers, over
dozens of atomic layers at the interface, due to Fermi level
pinning at an interface state, or due to an abrupt ZnSe/GaAs
interface. The electric field in the first two cases extends
over dozens to hundreds of atomic layers (extended
field), while in the last case the field exists only within several
atomic layers of the interface (confined field). However,
according to the following observation, the extended
electric field is not strong enough to produce anisotropy of
such magnitude. Photoreflectance (PR) experiments were performed,
but the Franz-Keldysh oscillation, which signifies
the presence of an extended interface electric field, is absent
in  the  PR  spectra  of  all  the  samples.  To  produce  RD  reso¬ 
nance  of  the  order  of  10“^  as  observed  in  our  samples,  the 
electric  field  must  be  comparable  to  the  one  produced  by  a 
doping  level  of  the  order  of  10^^/cm^,^^  and  should  easily  be 
observed  as  the  Franz-Keldysh  oscillation  in  the  PR  spectra. 
Furthermore, the electro-optic effect transfers oscillator
strength between the $E_1$ and $E_1+\Delta_1$ transitions, and is likely to
do the same to the $E_0$ and $E_0+\Delta_0$ transitions, so that the
resonances R-1 and R-2 should have opposite signs but comparable
amplitudes. It is therefore impossible for a single
electric field, extended or confined, to cause such a drastic
change of the relative intensities of R-1 and R-2, from 2 to
60. Moreover, in most of the samples studied so far, including
the ones with ZnSe layers so thin that the ZnSe layer
plays no role in the resultant RD spectra, the two resonances
have the same sign. Since the electro-optic effect is more
or less a bulk effect, the RD intensity should depend on the
layer thickness for overlayers thinner than 1000 Å, which
contradicts what has been observed. Extended interdiffusion
can be ruled out, as the RD spectra of samples after 10 h
of annealing at 300 °C are virtually the same as those
before annealing. We therefore conclude that neither the extended
nor the confined interface electric field can
account for the two resonances observed in the RD spectra.

As the possibility of an interface electric field is ruled out,
there remain two possible causes for the observed resonances,
namely that (i) they are a direct consequence of interface
defects such as stacking faults, or (ii) they reflect the intrinsic
properties of well-ordered II-VI/III-V heterovalent semiconductor
interfaces, because bonds at the interface are between
Group II and Group V or Group III and Group VI
elements, which are different from the II-VI bonds in bulk
ZnSe or the III-V bonds in bulk GaAs. From the few
samples examined by planar-view TEM, we have found no
obvious trend between the density of stacking faults or other
defects and the RDS resonance intensity. The mechanism by which
stacking faults or other interface defects would produce anisotropy
is also an open question. However, based on the available data, we
cannot rule out this possibility either. Further work is under
way to examine planar-view TEM micrographs of all the samples, and
the results will be duly reported. Here we concentrate our
discussion on the second possibility, that the two resonances
reflect the intrinsic properties of the ZnSe/GaAs interface.

We first consider an ideal abrupt As (Ga) terminated interface
at which only Zn-As (Ga-Se) bonds are present. Due to
the presence of these bonds the interface loses its $D_{2d}$ symmetry
and becomes anisotropic along the two principal axes
$[110]$ and $[\bar{1}10]$. In reality the surface is partially Ga and
partially As terminated after the oxide has been removed.
With certain treatments prior to ZnSe growth, such as preexposure
to Zn (Se) flux, one can create, to a large extent, a
Zn (Se)-rich surface, resulting in an interface dominated by
Zn-As or Ga-Se bonds. Intermixing within several monolayers,
which may occur because it is energetically favored,
will change the numbers of Zn-As (Ga-Se) bonds. This is
illustrated by the following sequence:

Ga-As-Ga-As-[Ga]-As-[Zn]-Se-Zn-Se-Zn-Se.

After the locations of the bracketed Zn atom at the As-Zn
interface and the bracketed Ga atom to the left of the As
atom are exchanged, the resultant Zn-As bond and the
As-Zn bond cancel each other in RDS because the projections
of the two bonds onto the interface plane are perpendicular
to each other, while a new Ga-Se bond is created. The
same result holds if the intermixing extends to several monolayers.
The few-monolayer intermixing occurs
mainly because it reduces the interface formation
energy, so that once the interface is formed there is no
further mixing. This is probably the reason why postgrowth
annealing at the growth temperature did not change the RDS
spectra. Stacking faults of an odd number of monolayers will
reduce the total number of effective II-V and III-VI bonds,
because the bonds in the two adjacent atomic planes separated
by a stacking fault are perpendicular to each other, so
that the contribution to RDS of the bonds in the atomic plane
of smaller area cancels part of that of the bonds in the larger plane,
resulting in a reduction of RDS intensity. Interfaces with
a high stacking fault density will then possess diminished optical
anisotropy, while well-ordered interfaces will have large
optical anisotropy.

Based on the above argument we assign the R-1
resonance to a Zn-As bond related interface state, and the
R-2 resonance to a Ga-Se bond related interface state. The
intensity ratio of these two resonances reflects the relative
number of Ga-Se bonds and Zn-As bonds near the interface,
and their overall intensity reflects the degree of order,
or quality, of the interface. With such an assignment we can
explain how the resonance intensities change with the pretreatment
of the GaAs substrates.

(1) Sample B1 was kept at 600 °C for 10 min, resulting
in an As-deficient surface and, after the overlayer was
grown, an interface rich in Ga-Se bonds with
few Zn-As bonds. This is consistent with the
fact that the R-2 to R-1 intensity ratio is 60 for B1.

(2) Samples B2, B3, and ZnSe-41 were exposed to Se
flux prior to growth, resulting in a Ga-Se bond-rich
interface. The exposure time for ZnSe-41 was much
longer than that of B2 and B3, resulting in an almost
completely Se-covered surface and an R-2/R-1 intensity
ratio of 60. The ratio of R-2 to R-1 intensities
for B2 and B3 is 15. That the R-1 resonance can still
be observed for Se-rich interfaces indicates that there
were still some Zn-As bonds near the interface, probably
due to intermixing and, in the case of B2 and B3,
incomplete coverage of Se.

(3) The substrates of samples ZnSe-7, ZnSe-8, ZnSe-66,
B4, and B5 were exposed to Zn flux prior to growth,
resulting in a somewhat Zn-rich interface. The pregrowth
exposure, however, was not enough to form a
large Zn coverage at 300 °C, since the growth temperature
for a complete layer of Zn on GaAs is much
lower. The sizable R-2 resonance of these samples
indicates that only a fraction of the substrate surface
was covered with Zn, and that it is much easier for Se to
stick to a GaAs surface.

(4) The R-2 to R-1 intensity ratio is about six for samples
ZnSe-7 and ZnSe-8, which had no S coverage prior to
ZnSe growth, as compared to a ratio of two for
samples B4 and B5, indicating that the Zn coverage
was less in the former group of samples. This suggests
that an S-terminated GaAs surface is more receptive
to Zn than to Se.

(5) The interface of B2, which was (NH3)S_x treated, is better
ordered than that of B3, which was S2Cl2 treated. The
interface of B5, with S2Cl2 pretreatment, is better ordered
than that of B4, which was (NH3)S_x pretreated. This
suggests that stronger Ga-S bonding will produce
a better ordered Zn-rich interface and a less ordered
Se-rich interface. This is understandable because, for
a Ga-Se bond to form, a Ga-S bond must first be broken. This
is harder if the Ga-S bonds are strong, as in
the case of the S2Cl2 pretreated surface, and not so hard
for a weakly bonded S-terminated surface created by
(NH3)S_x pretreatment. The best ordered interfaces,
however, are those of samples with no S coverage.

(6) A third resonance, R-3, is observed only for samples
B2 through B5, i.e., samples with S at the ZnSe/GaAs
interface. It is probably due to the Ga-S bonds.

IV. CONCLUSIONS

We have studied in detail the optical properties of the ZnSe/
GaAs interface by using reflectance difference spectroscopy.
Both the in-plane anisotropy ($\epsilon_x \neq \epsilon_y$) and the off-plane
anisotropy are studied. Three resonance lines at 2.70,
3.0, and 3.2 eV have been observed in the in-plane RD spectra.
It is confirmed that the observed RD spectra are not due
to the ZnSe surface in contact with air, or to a bulk effect of
either the ZnSe epilayer or the GaAs substrate, but are rather due to
the ZnSe/GaAs interface. The in-plane anisotropy is along
the $[110]$ and the $[\bar{1}10]$ principal axes. The dependence of the
resonances on interface formation conditions, the results of
the photoreflectance spectroscopy, and the annealing experiments
all suggest that the anisotropy is not due to the electro-optic
effect resulting from an interface electric field. The
results are consistent with the assumption that the observed
optical anisotropy is an intrinsic property of the ZnSe/GaAs
interface, caused by the III-VI and the II-V bonds at the
interface, which are different from the III-V bonds of the
substrate or the II-VI bonds of the epilayer. The resonance
R-1 at 2.7 eV is associated with the interface state of Zn-As
bonds, the resonance R-2 at 3.0 eV is associated with the
interface state of Ga-Se bonds, and the resonance R-3 at 3.2
eV is associated with the interface state of Ga-S bonds, as
the intensity of R-1 increases with the Zn coverage, the intensity
of R-2 increases with the Se coverage at the ZnSe/
GaAs interface, and R-3 exists only in samples with S at the
ZnSe/GaAs interface. Theoretical work along this direction
is very much desired.

The resonance line shape changes drastically as the top
layer thickness varies. This is well explained by a three-layer
system with conventional Fresnel optics and is understood as
due to the extra phase factor introduced by the ZnSe top
layer. Such a phase factor changes the line shapes of the measured
RD resonances. Both the real and the imaginary parts
of the RD spectra must be measured to make the phase correction.

The application of RDS to semiconductor interfaces buried
deeply underneath a surface has revealed interface properties
that have never been discovered before. When correlated
with well-documented initial growth conditions, RDS
could very well be a powerful tool in studying interface electronic
properties, even in the energy region where the top
layer is absorbing. It is hoped that the development of this
new experimental technique will stimulate more interest in
the theoretical study as well.


We report here a systematic study of the adsorption of Zn on GaAs(100)2×4 prior to ZnSe growth.
At the typical ZnSe growth temperature of 290 °C, the Zn coverage was found to increase only
logarithmically with increasing exposure time, so that only submonolayer coverages of Zn can be
adsorbed and remain stable on the GaAs substrates. At room temperature, and in general at
temperatures below 150-200 °C, a drastic increase in the Zn sticking coefficient makes monolayer
and multilayer adsorption possible. However, subsequent annealing at the ZnSe growth temperature
removes most of the Zn atoms from the surface, so that only submonolayer coverages are found on
the initial growth surface. We discuss the possible role of Zn as an interface control layer between
ZnSe and GaAs in the light of our results. © 1996 American Vacuum Society.


I. INTRODUCTION

Blue-green  semiconductor  laser  diodes  capable  of  con¬ 
tinuous  wave  operation  at  room  temperature  (RT)  potentially 
are  key  devices  for  use  in  high  density  optical  recording,  full 
color  displays,  and  many  other  applications.  Most  blue-green 
solid  state  lasers  demonstrated  to  date  are  comprised  of 
wide-gap  II-VI  epilayers  grown  by  molecular  beam  epitaxy 
(MBE)  on  III-V  substrates. 

The lifetime of such devices is currently limited primarily
by microstructural mechanisms that generate dark-line defects
in the active layer. These defects cause nonradiative
recombination and rapidly reduce the gain of the active region
to the point where stimulated emission ceases. Evidence
exists that the degradation originates from the presence of
crystalline defects, primarily stacking faults, that nucleate at
the II-VI/III-V heterovalent interface and give rise to dislocation
sources at the intercepts with the active layer.

Some evidence points to the initial growth mode of the
II-VI epilayer on the III-V substrates as an important factor
in determining the stacking fault concentration. Two-dimensional
(2D) growth of low-roughness wide-gap II-VI
epilayers has been associated with the effect of exposing the
GaAs(001) growth surface to a Zn flux prior to II-VI MBE.
Such Zn predosing reportedly gives rise to reduced stacking
fault densities. However, the optimum parameters to be
used for this pretreatment, the amount of Zn actually present
at the interface, and the microscopic mechanism responsible
for defect suppression are still unclear, and no detailed in situ
study has been conducted for the relevant interfaces, which
include ZnSe/GaAs(001), ZnSe/InGaAs(001), and
ZnMgSeS/GaAs(001).

Recent studies of ZnSe/GaAs interfaces fabricated by
MBE in the presence of a Zn overpressure have also shown
that a resulting Zn-rich composition near the interface yields
changes in the band offsets, suppresses atomic interdiffusion
across the interface and the related deep-level
formation, and modifies the optical and structural quality
of the II-VI epilayer. These effects have been tentatively
associated with the presence of monolayer amounts of Zn at
the interface, yielding a Zn-terminated interface, but the
mechanism by which the Zn overpressure determines the Zn-rich
interface composition, and the very nature of the local
deviation from stoichiometry, remain to be determined.

For  the  above  reasons,  a  systematic  investigation  of  Zn 
adsorption on GaAs(100)2X4 surfaces as a function of substrate
temperature and exposure time seems warranted at this
stage.  We  used  MBE  protocols  successfully  employed  for  the 
fabrication  of  blue-green  lasers,  and  examined  the  results  of 
Zn-predosing  by  means  of  monochromatic  x-ray  photoemis¬ 
sion  spectroscopy  (XPS)  and  reflection  high  energy  electron 
diffraction  (RHEED)  in  situ. 


II.  EXPERIMENTAL  DETAILS 

All  of  the  results  presented  here  were  obtained  in  a  facil¬ 
ity  that  includes  MBE  chambers  for  the  growth  of  III-V  and 
II-VI  semiconductors,  and  an  analysis  chamber  with  mono¬ 
chromatic  XPS  capabilities.  All  chambers  are  interconnected 
via  ultrahigh- vacuum  (UHV)  transfer  modules.  The  sample 
surface  periodicity  was  monitored  in  situ  by  RHEED,  with  a 
primary  electron  energy  of  10  keV. 

Si-doped (N_D ≈ 1 × 10^[…] cm^-3), 500-nm-thick GaAs(001)
buffer layers were grown on n-GaAs(100) wafers at 580 °C
in  the  III-V  MBE  growth  chamber  following  the  methods 
described  in  Refs.  15  and  16.  During  the  growth,  the  surfaces 
exhibited  an  As-stabilized  2X4  surface  reconstruction  as  de¬ 
termined  by  RHEED.  Both  Zn  predosing  and  ZnSe  growth 
were  performed  in  the  II-VI  MBE  chamber.  Elemental  Zn 
and  Se  effusion  cells  with  a  Zn/Se  beam  pressure  ratio  (BPR) 
of  1  were  employed  for  ZnSe  deposition,  with  a  typical 
growth rate of 0.36 μm/h at 290 °C. For Zn predosing, the
GaAs(001)2X4  surface  was  exposed  to  the  same  Zn  flux 
employed for ZnSe growth, i.e., an equivalent beam pressure
of 1.6 × 10^[…] Torr as measured by means of a nude ion gauge
at  the  sample  position.  The  Zn  flux  onto  the  surface  was 
switched  on  and  off  in  a  few  tenths  of  a  second  using  the  cell 
shutter  of  the  Zn  source  in  the  MBE  system. 

Zn predosing, and on occasion ZnSe regrowth, were performed
at a variety of substrate temperatures in the RT to
400 °C range. Substrate temperatures were measured using
an  infrared  optical  pyrometer  in  the  temperature  range  above 
250  °C  and  by  means  of  a  thermocouple  calibrated  using 
known  melting  points  in  the  temperature  range  below 
250  °C.  The  estimated  error  in  the  sample  temperature  was 
of  the  order  of  ±  10  K. 

At  selected  stages,  the  growth  was  interrupted,  and  the 
samples  were  quenched  to  RT  and  transferred  to  the  analysis 
chamber. Desorption of Zn was negligible at temperatures below
100-200 °C, so that this treatment minimized unintentional
desorption prior to XPS measurements of the Zn coverage.
Emission from the Zn 3d, Zn 2p3/2, Ga 3d, and As 3d
core levels was monitored by means of a monochromatic
XPS spectrometer that uses Al Kα radiation and a crystal
monochromator to achieve a typical energy resolution of 0.7
eV. The photoelectron escape depth employed was λ = 15 Å,
having taken into consideration the average photoelectron
collection angle.

To  monitor  the  Zn  coverage,  we  used  the  variation  with 
exposure  of  the  integrated  intensities  of  the  different  core 
levels. For exposures performed at RT, we observed an identical
exponential attenuation of the Ga 3d and As 3d emission
with increasing exposure, and no detectable change in
the Ga 3d/As 3d emission ratio. This would support a
layer-by-layer growth mode with constant sticking coefficient.
Assuming a layer-by-layer growth mode, the ratio of the
integrated photoemission intensity I(θ) of the Zn and Ga core
levels can be used to determine the amount of Zn adsorbed
on the surface. In the multilayer coverage range, the coverage
θ can be approximately derived from:


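$$\theta = \lambda \, \ln\!\left[ 1 + \frac{I_{\mathrm{Ga}}(0)\, I_{\mathrm{Zn}}(\theta)}{I_{\mathrm{Zn}}(\infty)\, I_{\mathrm{Ga}}(\theta)} \right] \qquad (1)$$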


where $I_{\mathrm{Ga}}(0)$ is the integrated intensity of the Ga core
level emission prior to any deposition, and $I_{\mathrm{Zn}}(\infty)$ is the
integrated intensity of the Zn core level emission from a thick
(quasi-infinite) Zn film.
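The conversion from measured intensities to a coverage can be sketched in a few lines of Python. This is an illustration only, not the authors' analysis code. It assumes the ideal layer-by-layer model, in which the substrate signal decays as exp(-θ/λ) and the overlayer signal grows as 1 - exp(-θ/λ); whether the printed Eq. (1) has exactly this form is an assumption. The function and argument names are hypothetical; only the escape depth of 15 Å is taken from the text.

```python
import math

ESCAPE_DEPTH_A = 15.0  # photoelectron escape depth quoted in the text (angstroms)


def zn_coverage(i_zn, i_ga, i_zn_thick, i_ga_clean, escape_depth=ESCAPE_DEPTH_A):
    """Equivalent Zn thickness (angstroms) from integrated core-level intensities.

    i_zn, i_ga   : Zn and Ga intensities measured after exposure
    i_zn_thick   : Zn intensity from a thick (quasi-infinite) Zn film
    i_ga_clean   : Ga intensity from the clean surface before deposition
    Assumes ideal layer-by-layer growth (hypothetical helper, not from the paper).
    """
    x = (i_ga_clean * i_zn) / (i_zn_thick * i_ga)
    return escape_depth * math.log1p(x)  # log1p(x) = ln(1 + x)


def ideal_intensities(theta, escape_depth=ESCAPE_DEPTH_A):
    """Normalized (Zn, Ga) intensities of an ideal film of thickness theta."""
    attenuation = math.exp(-theta / escape_depth)
    return 1.0 - attenuation, attenuation
```

Feeding the output of `ideal_intensities` back into `zn_coverage` returns the input thickness, which is a convenient self-consistency check before the function is applied to measured ratios.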

For  substrate  temperatures  higher  than  RT,  and  in  particu¬ 
lar  above  200  °C,  the  attenuation  of  the  substrate  core  level 
emission  and  the  increase  in  the  Zn  emission  were  found  to 
be  much  slower  than  those  encountered  at  RT,  although  no 
change in the Ga 3d/As 3d intensity ratio was observed at
any  of  the  temperatures  examined.  This  is  likely  to  reflect  a 
decrease  in  the  sticking  coefficient  that  gives  rise  to  a  finite 
(submonolayer)  saturation  coverage.  In  these  conditions,  Eq. 
(1)  is  not  strictly  appropriate,  since  the  individual  core  level 
intensities  would  vary  linearly  with  coverage,  but  we  still 
elected  to  use  Eq.  (1)  to  obtain  a  rough  estimate  of  the  cov¬ 
erage.  The  resulting  equivalent  thickness  of  the  Zn  films  is 
given  in  Angstroms  throughout  the  article.  Assuming  the 

Fig. 1. Angle-integrated photoemission spectra showing the Zn 2p3/2 (left)
and Ga 3d (right) emission from GaAs(001)2X4 surfaces exposed to a Zn
flux.  The  zero  of  the  energy  scale  corresponds  to  the  position  of  the  Fermi 
level  of  the  spectrometer.  Top:  spectra  recorded  following  exposure  to  the 
Zn  flux  at  290  °C  (typical  ZnSe  growth  temperature)  for  120  s,  in  relative 
units.  Center:  exposure  to  the  Zn  flux  at  room  temperature  (RT)  for  90  s. 
Bottom:  exposure  to  the  Zn  flux  at  room  temperature  (RT)  for  90  s,  followed 
by  5  min  annealing  at  290  °C. 


atomic  density  in  the  adsorbed  film  to  be  equal  to  that  of 
bulk Zn, one could also express the film thickness in monolayers
(ML; 1 ML = 2.48 Å). We caution the reader that in
terms of the As (or Ga) atomic density on the GaAs(100)
surface, 1 monolayer of Zn atoms would rather correspond to
about 1 Å coverage.
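The two monolayer conventions can be checked with a short Python sketch. It is illustrative only: the bulk Zn density (7.14 g/cm³), the Zn molar mass (65.38 g/mol), and the GaAs lattice constant (5.6533 Å) are standard handbook values assumed here, not numbers from the text, and the function names are hypothetical.

```python
AVOGADRO = 6.02214076e23      # 1/mol
ZN_DENSITY_G_CM3 = 7.14       # bulk Zn density (assumed handbook value)
ZN_MOLAR_MASS_G_MOL = 65.38   # Zn molar mass (assumed handbook value)
GAAS_LATTICE_A = 5.6533       # GaAs lattice constant in angstroms (assumed)
ZN_ML_THICKNESS_A = 2.48      # 1 ML of bulk-like Zn, as defined in the text


def angstrom_to_zn_ml(thickness_a):
    """Convert an equivalent thickness in angstroms to bulk-Zn monolayers."""
    return thickness_a / ZN_ML_THICKNESS_A


def gaas100_monolayer_in_angstrom():
    """Equivalent thickness of one Zn atom per As (or Ga) site on GaAs(100)."""
    zn_atoms_per_cm3 = ZN_DENSITY_G_CM3 / ZN_MOLAR_MASS_G_MOL * AVOGADRO
    a_cm = GAAS_LATTICE_A * 1e-8
    sites_per_cm2 = 2.0 / a_cm**2      # two atoms per a^2 in a (100) plane
    thickness_cm = sites_per_cm2 / zn_atoms_per_cm3
    return thickness_cm * 1e8          # back to angstroms
```

With these inputs, `gaas100_monolayer_in_angstrom()` gives a value just below 1 Å, in line with the "about 1 Å" quoted above.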

III.  RESULTS  AND  DISCUSSION 

Figure  1  illustrates  the  drastic  changes  in  Zn  coverage  that 
correspond  to  different  predosing  procedures.  In  the  topmost 
panel  of  Fig.  1  we  show  in  relative  units  angle-integrated 
photoemission spectra from the Zn 2p3/2 level (left) and
Ga 3d doublet (right) observed after exposure of a
GaAs(001)2X4 surface at 290 °C to the Zn flux for 120 s.
The relatively small Zn-related emission is in sharp contrast
with the dominant Zn 2p3/2 feature observed following Zn
predosing  at  RT  for  90  s  (center  panel).  Multilayer  adsorp¬ 
tion  in  this  case  largely  attenuates  the  substrate  emission 
(right).  However,  when  heating  the  sample  for  5  min  at  the 
typical  ZnSe  growth  temperature  of  290  °C  (bottom-most 
panel),  most  of  the  Zn  atoms  desorb,  and  the  residual  emis¬ 
sion  intensity  in  Fig.  1  is  similar  to  that  observed  following 
predosing  at  290  °C  (topmost  panel). 

A  more  systematic  comparison  of  the  three  types  of  pre¬ 
dosing  procedures  just  described  is  illustrated  in  Figs.  2-4, 
using the normalized integrated intensities of the Zn 2p3/2
and Ga 3d5/2 core levels, and Eq. (1). In Fig. 2, we show the
Zn 2p3/2 core level photoemission intensity relative to the
Ga 3d5/2 core level photoemission intensity as a function of
the Zn exposure time (in a logarithmic scale) at a sample

Fig. 2. XPS-derived Zn 2p3/2/Ga 3d5/2 core level intensity ratio as a function
of the logarithmic Zn exposure time (in minutes) from GaAs(001)2X4 surfaces at
290 °C. The right-hand scale shows the corresponding estimated Zn coverage.


temperature of 290 °C. The right-hand scale shows the corresponding
Zn coverage, estimated from Eq. (1). The logarithmic
dependence of the Zn film thickness on the exposure
time points in the direction of a competition between
adsorption and re-evaporation of Zn from the surface during
exposure. At the longer exposure times examined (~100
min) θ asymptotically approaches a saturation coverage of
the order of 0.7-0.9 Å, i.e., 0.3-0.4 ML in terms of the Zn
atomic surface density, or 1 monolayer in terms of the As (or
Ga) surface density.

In  the  exposure  range  examined  in  Fig.  2  the  RHEED 
pattern  changed  from  the  original  2X4  to  a  1X1.  First,  after 
10  min,  the  2X  streaks  faded  out  and  disappeared.  Next, 
after  45  min,  the  4X  streaks  became  weaker  and  disap¬ 
peared.  Finally,  after  60  min,  a  sharp  1X1  pattern  was  ob¬ 
served.  This  pattern  did  not  change  at  least  for  Zn-exposure 
times  up  to  90  min. 


Fig. 3. XPS-derived Zn 2p3/2/Ga 3d5/2 core level intensity ratio as a function
of the Zn exposure time (in seconds). The GaAs(001)2X4 substrate was kept at RT.
The right-hand scale shows the corresponding estimated Zn coverage.

Fig. 4. XPS-derived Zn 2p3/2/Ga 3d5/2 intensity ratio from GaAs(001)2X4
surfaces following exposure to a Zn flux at RT followed by 5 min annealing
cycles in ultrahigh vacuum at increasing temperatures. The right-hand scale
shows the corresponding estimated Zn coverage. Solid circles: 5 s exposure
at RT, yielding an initial Zn coverage of ~0.6 ML (1 ML = 2.48 Å). Open
symbols: 90 s exposure at RT, yielding an initial Zn coverage of 5 ML. Open
circles and squares correspond to two different experimental runs (sample
series A and B).


We  emphasize  that  the  relatively  large  scatter  in  the  data 
of  Fig.  2  is  well  outside  the  error  margins  of  the  XPS  analy¬ 
sis,  and  reflects  true  coverage  differences  from  sample  to 
sample.  We  propose  that  small,  unintended  variations  of  the 
sample  temperature  may  introduce  relatively  large  variations 
in  coverage.  In  what  follows  we  will  show  that  for  sample 
temperatures  near  290  °C,  the  Zn  coverage  indeed  depends 
critically  on  the  sample  temperature. 

The scatter in the data is dramatically reduced in Fig. 3,
which depicts the corresponding exposure-time dependence of
the Zn 2p3/2/Ga 3d5/2 intensity ratio following Zn
predosing at RT. As before, the right-hand scale shows the
corresponding Zn coverage, estimated from Eq. (1). At RT,
the thickness of the adsorbed Zn layer is found to depend
linearly on the exposure time. The implication is that the
sticking coefficient of Zn is large and adsorption dominates.
Already after a 90 s exposure, a Zn film thickness of 12 Å is
obtained, corresponding to a multilayer (~5 ML) coverage.

As explained in the previous section, upon RT exposure
the intensity of both the Ga 3d and As 3d doublets decreases
exponentially with identical attenuation length with increasing
coverage, suggesting a layer-by-layer growth mode. The
RHEED patterns, however, show only mixed indications of
the presence of long range order. After a 5 s exposure (θ ~ 0.6
ML), the substrate-related 2X4 reconstruction is still observed,
even though it becomes increasingly diffuse. The
2X4 pattern disappears completely following a 10 s exposure
(θ ~ 1 ML). For longer exposure times a RHEED pattern
comprised of a high background and a superposition of
circles and streaks is observed, suggesting the presence of
epitaxial domains in a polycrystalline matrix.

The  monolayer  or  multilayer  Zn  adsorbates  fabricated  at 
RT  on  GaAs  do  not  remain  stable  at  typical  ZnSe  growth 
temperatures.  In  Fig.  4  we  show  that  when  surfaces  on  which 
a  0.6  ML  (5  s  exposure,  solid  circles)  or  5  ML  (90  s  expo¬ 
sure,  open  circles  and  squares)  Zn  overlayer  was  condensed 

Fig. 5. XPS-derived Zn 2p3/2/Ga 3d5/2 intensity ratio from GaAs(001)2X4
surfaces exposed to a Zn flux for the same exposure time of 10 min, but at
different temperatures in the 150-400 °C range. The right-hand scale shows
the corresponding estimated Zn coverage.


at  RT  are  annealed  for  5  min  cycles  in  UHV  at  increasing 
temperatures, substantial desorption takes place. The evolution
of the Zn 2p3/2/Ga 3d5/2 intensity ratio for multilayer
samples (open symbols) clearly indicates that in the 150-200 °C
temperature range the samples revert to the submonolayer
coverage stage. The XPS data for the multilayer and
submonolayer samples coincide for temperatures above
200 °C, although the corresponding RHEED patterns differ.
The initial partially polycrystalline pattern of the multilayer
sample is replaced by a 1X1 pattern at about 300 °C, while
the diffuse 2X4 initial pattern of the submonolayer sample
becomes sharper above 250 °C. Further gradual desorption at
higher temperatures makes the Zn 2p signal all but disappear
at 450-500 °C.

The data in Figs. 2-4 suggest that the fabrication of full
monolayers of Zn on GaAs prior to ZnSe regrowth, to explore
the effect of Zn-terminated interfaces on the defect
density, band offsets, interdiffusion and deep levels,
and optical quality of the II-VI overlayers, will require
growth temperatures well below those normally employed
for ZnSe. We explored more quantitatively the acceptable
growth temperature range by predosing GaAs for a constant
exposure time of 10 min at a number of different temperatures
in the 150-400 °C range. The results are summarized
in Fig. 5. The Zn 2p3/2/Ga 3d5/2 intensity ratio in Fig. 5
decreases monotonically with increasing substrate temperature,
so that a 10 min exposure time is seen to give rise to a Zn
coverage of 0.7 ML at 150 °C, but only ~0.05 ML at
350 °C. We note that the typical ZnSe growth temperature
of 290 °C is in a range that corresponds to the strongest
dependence of the sticking coefficient on temperature. This
may explain the scatter of the data in Fig. 2 as compared to
Figs. 3 and 4. Small unintentional temperature variations
from sample to sample may be the culprit.

ZnSe  regrowth  on  top  of  the  adsorbed  Zn  layer  was  also 
examined  for  a  variety  of  temperatures.  Elsewhere,  we  report 
on  studies  of  ZnSe  regrowth  at  RT,  and  the  effect  of  the 
adsorbed Zn layer on the ZnSe-GaAs band offsets. Results


Fig. 6. Zn 3d/Se 3d integrated photoemission intensity ratios from ZnSe/
Zn/GaAs structures fabricated by Zn adsorption and ZnSe regrowth on
GaAs at the same temperature. The ratio was determined by XPS in the
early ZnSe growth stage (3-5 Å) and is plotted in arbitrary units versus the
adsorption/regrowth temperature. The apparent intensity ratio reflects the
emission from the stoichiometric, thin ZnSe overlayer plus that of the
preadsorbed Zn below the overlayer.


for Zn adsorption and ZnSe regrowth in the 150-320 °C
temperature range are summarized in Fig. 6. At each temperature,
the sample was exposed to the Zn flux for 10 min,
and then the Se shutter was opened to start ZnSe growth with
BPR = 1 at the same temperature. In Fig. 6 we plot the apparent
Zn 3d/Se 3d integrated intensity ratio (in arbitrary
units) in the early stages of growth (ZnSe coverage 3-5 Å).
The  intensity  ratio  reflects  the  emission  from  the  stoichio¬ 
metric,  thin  ZnSe  overlayer  plus  that  of  the  preadsorbed  Zn 
below  the  overlayer. 

The  monotonic  decrease  in  the  Zn  3d/Se  3d  intensity  ra¬ 
tio  in  Fig.  6  with  increasing  growth  temperature  primarily 
reflects  the  different  initial  Zn  coverage  of  the  predosed  sur¬ 
face  (see  Fig.  5).  This  figure  proves  that  the  adsorbed  Zn  is 
not  desorbed  during  the  deposition  of  ZnSe.  This  is  the  main 
reason  why  increasingly  Zn-rich  interfaces — in  the  sense  em¬ 
ployed  in  Refs.  9  and  10 — are  observed  with  decreasing  tem¬ 
peratures  in  Fig.  6.  This  illustrates  how  the  results  of  Figs. 
1-5  might  be  employed  to  determine  the  predosing  param¬ 
eters  that  yield  a  desired  interface  composition,  and  how  the 
same  results  offer  a  new  approach  to  engineer  ZnSe/GaAs 
interfaces. 


IV.  CONCLUSIONS 

Adsorption of Zn on GaAs(001)2X4 surfaces prior to
ZnSe growth has been examined using different types of predosing
procedures. Exposure to Zn was performed at the
typical ZnSe growth temperature of 290 °C, at room temperature,
at several intermediate temperatures, and through
room-temperature deposition followed by annealing. We
found that only submonolayer saturation coverages can be
obtained at 290 °C, and that, in general, full monolayer or
multilayer adsorption and stability require adsorption and
processing temperatures below 150-200 °C.


JVST  B  -  Microelectronics  and  Nanometer  Structures 


2984 


Heun  etal.:  Zn  preadsorption  on  GaAs(100)2x4 

In particular, the predosing procedure (2 min at 280 °C)
employed by Kuo et al. to enhance 2D growth and reduce
stacking fault formation at II-VI/III-V interfaces produces
Zn coverages of the order of 0.1 ML rather than a full Zn
termination of the interface. Any model that aims to explain
the enhanced 2D growth will have to take this into account.
In this coverage regime the Zn preadsorbed layer is
unlikely to affect atomic interdiffusion or the band
offsets.

The  fabrication  of  Zn-terminated  interfaces  using  Zn  pre¬ 
dosing  of  the  III-V  substrates  requires  growth  temperatures 
well below those normally employed to grow high quality
II-VI epilayers. Evidence that Zn-rich interface compositions
can  be  achieved  instead  at  290  °C  using  high  BPR-values  in 
the  early  growth  stages  suggests  that  codeposition  of  ZnSe 
and  excess  Zn  is  the  essential  ingredient  that  makes  this  al¬ 
ternate  procedure  work.  We  speculate  that  the  immediate 
growth  of  ZnSe  on  top  of  the  adsorbed  Zn  layer  is  essential 
to  stabilize  the  Zn  atoms  at  the  surface  and  avoid  re¬ 
evaporation. 

The electronic properties of metal-semiconductor or Schottky contacts are characterized by their
barrier  heights.  At  ideal  contacts,  they  are  determined  by  the  continuum  of  metal-induced  gap  states. 
Extrinsic  interface  defects  and  dipoles  as  well  as  interlayers  may  be  present  in  real  contacts  and  will 
modify  the  barrier  heights.  The  zero-charge-transfer  barrier  height  and  a  slope  parameter  describe 
the  chemical  trends  of  the  barrier  heights  of  Schottky  contacts  on  a  specific  semiconductor.  The 
zero-charge-transfer  barrier  heights  may  be  calculated  by  using  the  empirical  tight-binding  method 
with  universal  parameters  and  the  width  of  the  dielectric  band  gaps  and  the  slope  parameters  are 
given  by  the  optical  dielectric  constants  of  the  semiconductors.  This  is  verified  by  comparison  with 
numerical  data  from  well-established  theoretical  approaches.  Hydrogen  doping  of  metal-diamond 
and  metal-silicon  interfaces  changes  their  barrier  heights  with  opposite  sign.  This  is  explained  by 
a different orientation of H-C and H-Si interface dipoles. Ag and Pb/Si(111) contacts may be
prepared with a "7X7" and a 1X1 interface structure. The difference of their barrier heights is
explained by the electric dipole correlated with the stacking fault which is a characteristic of 7X7
reconstructions. Interlayers will also alter the barrier heights of Schottky contacts. Typical examples
are SiO2 and Si3N4, i.e., MOS and MNS structures. Previous investigations found their barrier
heights to vary as a function of the metals used. The chemical trends of these two data sets are
described by the slope parameters predicted from the optical dielectric constants of SiO2 and Si3N4.
©  1996  American  Vacuum  Society. 


I. INTRODUCTION

The  chemical  compositions  and  the  atomic  arrangements 
at  real  semiconductor  interfaces  may  be  quite  complicated. 
Therefore,  physical  explanations  of,  for  example,  the  barrier 
heights at metal-semiconductor contacts, which are the main
subject of this contribution, have to consider more than just
one mechanism. Ideal interfaces, to begin with, are laterally
homogeneous,  intimate,  abrupt,  and  free  of  any  kind  of 
chemical  or  structural  defects.  The  band  alignment  at  such 
ideal interfaces is determined by the continuum of interface-induced
gap states. This theoretical conclusion has since
become widely accepted. These intrinsic interface states
at metal-semiconductor interfaces derive from the virtual gap
states (ViGS) of the complex semiconductor band structure.
At metal-semiconductor interfaces, the metal-induced gap
states or, for short, MIGS, are formed by the wave function
tails of the metal electrons, which decay exponentially into
the semiconductor. This behavior is similar to the exponential
tails of the electron wave functions at clean surfaces
of solids, i.e., at solid-vacuum interfaces. Below the
valence-band maximum of the semiconductor, the wave
functions are bulklike on either side of metal-semiconductor
contacts.

The  continua  of  interface-induced  gap  states  have  branch 
points  where  the  character  of  these  states  changes  from 
mostly  valence-band-  or  donorlike  to  predominantly 
conduction-band- or acceptorlike. If the Fermi level is above,
coincides with, or drops below the branch point, the net


Electronic mail: w.moench@uni-duisburg.de


charge density in interface-induced gap states is negative,
vanishes, or becomes positive, respectively. Therefore,
these  branch  points  are  intuitively  called  the  charge  neutral¬ 
ity  levels  (CNL)  of  the  MIGS.  The  energy  position  of  the 
Fermi  level  at  semiconductor  interfaces  and,  what  is  equiva¬ 
lent,  the  barrier  heights  of  metal-semiconductor  contacts  or 
the  band-edge  discontinuities  in  semiconductor  heterostruc¬ 
tures  depend  on  the  occupation  of  the  interface-induced  gap 
states.  Space  charges  are  much  smaller  and  may  be  ne¬ 
glected. 

Interfaces  are  electrically  neutral.  Therefore,  the  charge 
densities  on  both  sides  of  an  interface  are  of  equal  size  but 
have  opposite  sign.  A  generalization  of  Pauling’s  model  for 
small  molecules^  may  be  used  to  describe  the  charge  transfer 
at  interfaces  by  the  electronegativity  difference  of  the  solids 
in  contact. 

The current transport across metal-semiconductor interfaces
is governed by their barrier heights. They are the energy
distance between the Fermi level $W_F$ and the respective
majority-carrier band edge right at the interface, which is
the valence-band maximum $W_{vi}$ for p-type and the
conduction-band minimum $W_{ci}$ for n-type semiconductors.
The MIGS-and-electronegativity model predicts barrier
heights to vary as

$$\Phi_{Bn} = \Phi_{cnl} + S_X (X_m - X_s) \qquad (1)$$

and as

$$\Phi_{Bp} = \Phi^{p}_{cnl} - S_X (X_m - X_s) \qquad (2)$$
on semiconductors doped n- and p-type, respectively. The
zero-charge-transfer barrier heights $\Phi_{cnl} = W_{ci} - W_{cnl}$
and $\Phi^{p}_{cnl} = W_{cnl} - W_{vi}$ result when the difference $X_m - X_s$
between the metal and the semiconductor electronegativities is
zero. As a consequence of this, the Fermi level coincides
with the charge neutrality level of the MIGS. For one and the
same metal-semiconductor pair, the barrier heights $\Phi_{Bn}$ and
$\Phi_{Bp}$ add up to the fundamental band gap, i.e.,

$$\Phi_{Bn} + \Phi_{Bp} = W_g. \qquad (3)$$

The MIGS-and-electronegativity model predicts the barrier
heights of ideal metal-semiconductor contacts to be determined
by the energy position of the charge neutrality level,
$W_{cnl}$, of the MIGS and the slope parameter $S_X = d\Phi_B/dX_m$.
Both quantities will be discussed in Sec. II.
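The linear relations (1)-(3) are easy to exercise numerically. The following Python sketch is only an illustration of the bookkeeping: the function name is hypothetical, and the numerical inputs in the usage note are placeholders chosen for demonstration, not values from this article.

```python
def barrier_heights(phi_cnl_n, band_gap, slope, x_metal, x_semiconductor):
    """Barrier heights (eV) on n- and p-type material from the linear model.

    phi_cnl_n       : zero-charge-transfer barrier height W_ci - W_cnl (eV)
    band_gap        : fundamental band gap W_g (eV)
    slope           : slope parameter S_X (eV per electronegativity unit)
    x_metal         : metal electronegativity X_m
    x_semiconductor : semiconductor electronegativity X_s
    """
    # The p-type zero-charge-transfer height follows from W_g = W_ci - W_vi.
    phi_cnl_p = band_gap - phi_cnl_n
    delta_x = x_metal - x_semiconductor
    phi_bn = phi_cnl_n + slope * delta_x   # Eq. (1)
    phi_bp = phi_cnl_p - slope * delta_x   # Eq. (2)
    return phi_bn, phi_bp
```

For example, `barrier_heights(0.76, 1.12, 0.1, 5.0, 4.7)` (placeholder numbers) returns two values whose sum equals the band gap, as Eq. (3) requires, and setting `x_metal == x_semiconductor` returns the zero-charge-transfer barrier heights themselves.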

At  real  metal-semiconductor  contacts,  interface  defects, 
interface  dipoles  due  to  foreign  atoms  or  correlated  with  spe¬ 
cific  interface  structures  of  substrate  atoms,  as  well  as  inter¬ 
layers,  have  to  be  considered  in  addition  to  the  intrinsic  MIG 
states.  Examples  of  such  extrinsic  mechanisms  will  be  pre¬ 
sented  and  discussed  in  Secs.  III-VI. 

Fig. 1. Width of the band gap at the mean-value point of diamond, Si, Ge,
SiC, GaAs, and CdS as calculated in the GW approximation by Rohlfing
et al. (Refs. 16 and 17) plotted vs the width of the dielectric band gap. The
dashed line is a least-squares fit and has a slope parameter of 0.96 ± 0.05.


II.  IDEAL  INTERFACES: 
MIGS-AND-ELECTRONEGATIVITY  MODEL 

A.  Zero-charge-transfer  barrier  height 

At  their  branch  point,  the  MIG  states  pass  over  from  pri¬ 
marily  valence-  to  predominantly  conduction-band  character. 
Tersoff calculated the branch points for as many as 15
semiconductors.  He  defined  a  cell-averaged  real-space 
Green’s  function  and  evaluated  it  by  summing  over  152 
points  in  the  irreducible  wedge  of  the  Brillouin  zone.  The 
Green’s  function  changes  sign  at  the  branch  point  of  the  vir¬ 
tual  gap  states  of  the  complex  band  structure.  Tersoff  calcu¬ 
lated  the  energy  bands  in  the  local  density  approximation 
(LDA).  He  adjusted  the  absolute  gaps  to  the  experimental 
values by rigid shifts of the conduction bands to higher energies.
Tersoff's branch point energies $W_{bp} - W_v(\Gamma)$ with regard
to the valence-band maxima are a most reliable set of data.
However, they require a lot of computational effort.

At their branch point the decay length of the virtual gap
states is shortest. In one dimension, the imaginary part of
the ViGS's wave vector passes through its maximum at
$V_1^2/4W_1$ below midgap, where $2|V_1|$ and $W_1$ are the widths of
the band gap and of the valence band, respectively. With an
average or dielectric band gap $W_{dg}$ ≈ … eV and an average
width $W_1 \approx 11$ eV of the valence bands, one estimates
$(W_{dg}/2)^2/4W_1 \approx 0.04\,W_{dg}/2$ for three dimensions. The branch
point of the ViGS is thus slightly below the midpoint of the
dielectric band gap.

Cardona and Christensen took advantage of Baldereschi's
concept of mean-value points in the Brillouin zone.
For the face-centered cubic lattice, the coordinates of the
mean-value point are $\mathbf{k}_{mv} = (2\pi/a_0)(0.6223, 0.2953, 0)$,
where $a_0$ is the lattice parameter. The gap at the mean-value
point should have the width of the dielectric band gap in
Penn's model. The branch point of the ViGS is then expected
to be close to the midpoint of the band gap at the
mean-value point of the Brillouin zone. Cardona and Christensen
again calculated the energy bands in the local density
approximation. Instead of applying the scissors operation, as
done by Tersoff, they enlarged the band gap by taking the
averages of the top two valence and the bottom two conduction
bands at the mean-value point in the Brillouin zone.
Their dielectric midgap energies are surprisingly close to
Tersoff's branch point energies.

In the meantime, the LDA band-gap problem has been resolved
by using the GW approximation (GWA). The widths of the band
gaps at the mean-value point of GWA band structures and of
the dielectric band gap should be equal. The results of the
respective GWA calculations by Rohlfing et al., which are
plotted in Fig. 1 as a function of the widths of the respective
dielectric band gaps, confirm this conclusion.

GWA band structures need elaborate computations. However,
the valence bands are quite well approximated by the
empirical tight-binding method (ETB). Figure 2 demonstrates
that at the mean-value point, which is of interest here,
the GWA and the ETB valence-band energies agree excellently.
The valence-band maximum $W_v(\Gamma)$ at Γ in the middle of
the Brillouin zone was taken as reference. In the ETB calculations,
the atomic term values calculated by Fischer and
the universal coefficients $\eta_{ll'm}$ of the four matrix elements
$V_{ss\sigma}$, $V_{sp\sigma}$, $V_{pp\sigma}$, $V_{pp\pi}$ evaluated by Harrison were
used. With the group-II chalcogenides, … $V_{pp\pi}$ were
scaled by a factor of 1.5. These matrix elements vary as

$$V_{ll'm} = \eta_{ll'm} \frac{\hbar^2}{m_0 d_{nn}^2} \qquad (4)$$

with the nearest-neighbor distance $d_{nn} = a_0\sqrt{3}/4$.
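The inverse-square scaling of the ETB matrix elements can be illustrated with a short Python sketch. It assumes Harrison's universal form, V = η ħ²/(m₀ d²) with ħ²/m₀ ≈ 7.62 eV Å², and uses the coefficient set from Harrison's 1980 textbook; which coefficient set the article actually used is not recoverable from the text, so both the numbers and the function names should be read as assumptions.

```python
import math

HBAR2_OVER_M0_EV_A2 = 7.62  # hbar^2 / m0 in eV * angstrom^2

# Universal coefficients (assumed: Harrison's 1980 textbook values).
ETA = {"ss_sigma": -1.40, "sp_sigma": 1.84, "pp_sigma": 3.24, "pp_pi": -0.81}


def nearest_neighbor_distance(a0):
    """Nearest-neighbor distance (angstroms) in a diamond or zinc-blende lattice."""
    return a0 * math.sqrt(3.0) / 4.0


def etb_matrix_elements(a0):
    """Interatomic matrix elements (eV) scaled as 1/d_nn^2 for lattice constant a0."""
    d_nn = nearest_neighbor_distance(a0)
    return {name: eta * HBAR2_OVER_M0_EV_A2 / d_nn**2 for name, eta in ETA.items()}
```

For silicon (a0 = 5.431 Å, a standard value assumed here) this gives a nearest-neighbor distance of about 2.35 Å. Because of the 1/d² dependence, doubling the lattice constant reduces every matrix element by a factor of four.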

GWA data at the mean-value point are available for the
six semiconductors considered in Figs. 1 and 2 only. However,
the ETB and the GWA valence-band data are in good
agreement. Therefore, the energy position $W_{bp} - W_v(\mathbf{k}_{mv})$ of


Fig. 2. Position of the top valence band at the mean-value point relative to
the valence-band maximum: Comparison of ETB data with GWA data from
Rohlfing et al. (Refs. 16-18). The dashed line is a least-squares fit and has
a slope parameter of 1.05 ± 0.05.


the branch point above the valence band at the mean-value
point may be calculated from Tersoff's $W_{bp} - W_v(\Gamma)$
data and the ETB values $W_v(\mathbf{k}_{mv}) - W_v(\Gamma)$ at the
mean-value point. Figure 3 displays the resulting energy position
$W_{bp} - W_v(\mathbf{k}_{mv})$ as a function of the width of the dielectric
band gap for all the semiconductors considered by Tersoff.
The dashed line is a least-squares fit to the data of the compound
semiconductors and has a slope of 0.449 ± 0.008. The
branch point in the continuum of the virtual gap states of the
branch  point  in  the  continuum  of  the  virtual  gap  states  of  the 
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Fig.  3.  Position  of  the  ViGS’s  branch  point  above  the  valence  band  at  the 
mean-value  point  vs  the  dielectric  band  gap.  The  dashed  line  is  a  least- 
squares  fit  and  has  a  slope  parameter  of  0.449  ±0.008. 


complex  band  structure  is  thus  slightly  below  midgap  posi¬ 
tion.  This  finding  is  explained  by  the  above  estimate  from  the 
one-dimensional  model. 
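The construction just described can be sketched in a few lines of Python. The sketch assumes that the least-squares line of Fig. 3 passes through the origin, so that the branch point lies 0.449 times the dielectric gap above the valence band at the mean-value point. The function name and the numerical inputs in the example are hypothetical and serve only as an illustration.

```python
FIG3_SLOPE = 0.449  # slope of the least-squares fit quoted for Fig. 3


def branch_point_above_vbm(w_dg, vbm_minus_mean_value_band):
    """Branch-point energy relative to the valence-band maximum at Gamma, in eV.

    w_dg: width of the dielectric band gap (eV).
    vbm_minus_mean_value_band: W_v(Gamma) - W_v(k_mv) from the ETB bands (eV).
    Assumes a zero intercept for the Fig. 3 fit line.
    """
    return FIG3_SLOPE * w_dg - vbm_minus_mean_value_band


# Hypothetical inputs, not taken from the article.
example = branch_point_above_vbm(w_dg=5.0, vbm_minus_mean_value_band=1.9)
print(f"W_bp - W_v(Gamma) = {example:.3f} eV")
```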


B.  Slope  parameter 

Due  to  the  charge  transfer  across  interfaces,  an  electric 
double  layer  of  atomic  dimensions  exists  at  metal- 
semiconductor  contacts.  The  charge  density  on  their  semi¬ 
conductor  side  is  made  up  of  the  net  charge  density  in  the 
MIGS  continuum 

$$Q_{gs}^{mi} = e_0 D_{gs}^{mi}(\mathrm{CNL})\,(W_{cnl}-W_F) = e_0 D_{gs}^{mi}(\mathrm{CNL})\,(\phi_{Bn}-\phi_{cnl}), \qquad (5)$$

since the space charge may be neglected. The density of
states $D_{gs}^{mi}(\mathrm{CNL})$ of the MIG states is assumed to be constant
around their CNL. The thickness of this intrinsic dipole layer
may be estimated as the decay length $1/2q_{gs}^{mi}(\mathrm{CNL})$ of the
MIG states at their CNL. The factor 2 in the denominator
occurs since the charge distribution rather than the wave
functions of the MIGS is considered. The slope parameter
$S_X = d\phi_B/dX_m$ then results from

$$A_X/S_X - 1 = (e_0^2/\epsilon_i\epsilon_0)\,D_{gs}^{mi}(\mathrm{CNL})/2q_{gs}^{mi}(\mathrm{CNL}), \qquad (6)$$

where $\epsilon_i$ and $\epsilon_0$ are the interface dielectric constant and the
permittivity of vacuum, respectively. The parameter
$A_X = 0.86$ accounts for the use of Miedema's electronegativity
scale.

In one dimension, both the density of states and the decay
length of the ViGS at their branch point vary in inverse
proportion to the width of the band gap. Penn's nearly free
electron model of three-dimensional semiconductors, on
the other hand, defines the width of the dielectric band gap
$W_{dg}$ by

$$\epsilon_\infty - 1 = (\hbar\omega_p/W_{dg})^2, \qquad (7)$$

where $\hbar\omega_p$ is the plasmon energy of the bulk valence electrons
and $\epsilon_\infty$ is the optical dielectric constant of the semiconductor.
Therefore, the slope parameters were supposed to
correlate with the dielectric susceptibility $\epsilon_\infty - 1$ of the
semiconductors. For several metal-semiconductor contacts
the densities of states as well as the charge decay lengths of
the MIG states at their charge neutrality level were calculated
by using different theoretical approaches. According
to Eq. (7), Fig. 4 displays a plot of “theoretical slope
parameters” $(e_0^2/\epsilon_0)\,D_{gs}^{mi}(\mathrm{CNL})/2q_{gs}^{mi}(\mathrm{CNL})$ as a function of
the experimental susceptibilities $\epsilon_\infty - 1$. A least-squares fit to
these data gives

(e^/eo)£>^;(CNL)/2g”;(CNL)  =  0.29(e„-l)‘'^‘-'’-^^. 

(8) 

The interface dielectric constant $\epsilon_i$ was estimated as
approximately 2 and 4 for diamond and III-V Schottky
contacts, respectively. Thus, the theoretical result (8) reproduces the
earlier semiempirical rule

$$A_X/S_X - 1 = 0.1\,(\epsilon_\infty - 1)^2. \qquad (9)$$
As will be demonstrated in Sec. IV, relation (9) also predicts
the slope parameters for metal contacts on SiO2 and Si3N4.

Fig. 4. “Theoretical slope parameters” $(e_0^2/\epsilon_0)\,D_{gs}^{mi}(\mathrm{CNL})/2q_{gs}^{mi}(\mathrm{CNL})$ as a
function of dielectric susceptibilities $\epsilon_\infty - 1$. ○: data from Refs. 2, 9, and 29;
□: data from Ref. 28. From Ref. 23.

Fig. 5. Barrier heights of p-diamond Schottky contacts as a function of the
difference of metal and carbon electronegativities. From Ref. 33.


III.  EXTRINSIC  INTERFACE  DIPOLES 
A.  Atomic  interlayers 

Modifications of barrier heights by means of interface
doping of Schottky contacts have been frequently tried. As
an example, hydrogen interlayers at diamond and silicon
Schottky contacts shall be discussed here. Aoki and
Kawarada deposited metals on hydrogen-terminated {100}-oriented
p-diamond surfaces at room temperature. The barrier
heights reported are plotted in Fig. 5 as a function of the
difference $X_m - X_C$ between the Miedema electronegativities
of the metals used and carbon. The dashed line is a least-squares
fit,

$$\phi_{Bp}(\mathrm{H\text{-}C}) = 0.017 - 0.37\,(X_m - X_C)\ \mathrm{[eV]},$$

to the experimental data. The dash-dotted line, on the other
hand, represents the prediction

$$\phi_{Bp}(\mathrm{C}) = 1.4 - 0.3\,(X_m - X_C)\ \mathrm{[eV]}$$

of the MIGS-and-electronegativity model for ideal diamond
Schottky contacts. The CNL position was adopted from Cardona
and Christensen and the slope parameter was evaluated
by using Eq. (9). This MIGS line reproduces the experimental
data for Al and Au contacts with “clean” interfaces
while the Ba data point deviates by approximately 0.5 eV.
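The two straight lines can be compared numerically. The sketch below simply evaluates both relations, read as 0.017 - 0.37 (X_m - X_C) eV for the fit and 1.4 - 0.3 (X_m - X_C) eV for the prediction, at a few electronegativity differences. The function names and the sample values of X_m - X_C are illustrative assumptions, not data from Fig. 5.

```python
def phi_bp_h_diamond(dx):
    """Least-squares fit for H-terminated p-diamond contacts, in eV."""
    return 0.017 - 0.37 * dx


def phi_bp_ideal_diamond(dx):
    """MIGS-and-electronegativity prediction for ideal diamond contacts, in eV."""
    return 1.4 - 0.3 * dx


for dx in (-2.0, -1.5, -1.0):  # sample values of X_m - X_C (assumed)
    gap = phi_bp_ideal_diamond(dx) - phi_bp_h_diamond(dx)
    print(f"dX = {dx:+.1f}: H-modified {phi_bp_h_diamond(dx):.2f} eV, "
          f"ideal {phi_bp_ideal_diamond(dx):.2f} eV, difference {gap:.2f} eV")
```

For these sample values the predicted barrier heights exceed the fitted ones by somewhat more than 1 eV.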

The MIGS model predicts barrier heights that are larger
by approximately 1 eV than what was observed with
H-modified diamond Schottky contacts. Mönch explained
these findings by H-induced interface dipoles. Recent experimental
data obtained with Pb contacts on H-terminated
Si(111) surfaces support this interpretation. These latter experimental
results will be discussed next.

Figure 6 shows current-voltage (I/V) characteristics of
one diode each prepared by evaporation of lead onto a clean
n-Si(111)-7×7 and a H-terminated p-Si(111):H-1×1 surface
at room temperature. The H-terminated silicon surfaces
were prepared by wet chemical etching in buffered hydrofluoric
acid with pH = 9. Clean 7×7-reconstructed surfaces
were obtained by desorption of the hydrogen adatoms from


Fig. 6. Current-voltage characteristics of Pb/n-Si(111) and Pb/H/p-Si(111)
contacts  at  room  temperature.  The  diode  areas  measure  9.3X10“^  cm^. 
From  Ref.  34. 
such surfaces at approximately 850 °C. The analysis of the
I/V characteristics displayed in Fig. 6 gives barrier heights
of 0.64 and 0.7 eV, respectively. By using relation (3) one
then obtains barrier heights of 0.48 and 0.42 eV for Pb contacts
on p-Si(111)-7×7 and n-Si(111):H-1×1 substrates, respectively.
Experimentally, the latter contacts had ohmic I/V
curves. Such behavior is expected from simulations for barrier
heights lower than 0.5 eV and the increased series resistance
which results from the low doping level of the substrates
used.
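The conversion between n- and p-type barrier heights can be checked with a short sketch. Relation (3) is not reproduced in this section; the code assumes it states that the n- and p-type barrier heights of one and the same contact add up to the band gap, and it assumes a room-temperature silicon gap of 1.12 eV. Both assumptions are consistent with the numbers quoted above but are not taken from the text, and the function name is hypothetical.

```python
SI_BAND_GAP_EV = 1.12  # assumed room-temperature band gap of silicon


def complementary_barrier(phi_b, band_gap=SI_BAND_GAP_EV):
    """Barrier height on the opposite doping type, assuming phi_Bn + phi_Bp = W_g."""
    return band_gap - phi_b


# Measured values quoted in the text: 0.64 eV (n-type, 7x7) and 0.7 eV (p-type, H-terminated).
print(f"p-Si(111)-7x7:   {complementary_barrier(0.64):.2f} eV")
print(f"n-Si(111):H-1x1: {complementary_barrier(0.70):.2f} eV")
```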

Changes of the barrier heights induced by interface
doping or, more generally speaking, interface modification,
will be of the same size but of different sign on n- and
p-type substrates. The barrier heights of interface-modified
metal-semiconductor contacts may then be written as

$$\phi_{Bn}^{mod} = \phi_{Bn} - \delta\phi_B \qquad (10)$$

and

$$\phi_{Bp}^{mod} = \phi_{Bp} + \delta\phi_B, \qquad (11)$$

where $\phi_{Bn}$ and $\phi_{Bp}$ denote the barrier heights of the respective
“clean” interfaces. The experimental results presented in
Figs. 5 and 6 then give

$$\delta\phi_B(\mathrm{H\text{-}C}) \approx -1\ \mathrm{eV} \quad\text{but}\quad \delta\phi_B(\mathrm{H\text{-}Si}) \approx +0.25\ \mathrm{eV}.$$

The hydrogen-induced changes of the barrier heights are thus
of opposite sign at metal-diamond and metal-silicon contacts.

These experimental findings may be easily explained if
only nearest-neighbor interactions between interlayer (ia) and
substrate atoms (s) are considered. Chemical bonds between
monovalent interlayer atoms such as hydrogen and
substrate atoms may be described as diatomic “interface
molecules.” Pauling found a correlation between the partial
ionic character of covalent bonds and the difference of
the atomic electronegativities. The respective values relevant
in the present discussion are $X_H = 2.2$, $X_C = 2.55$, and
$X_{Si} = 1.9$. The H-C and H-Si interface dipoles are thus
oppositely oriented. This finding immediately explains the
experimentally observed reduction of the barrier heights at
H-modified n-silicon but p-diamond contacts.

In a simple point-charge model, the dipole moment of
such “interface molecules” may be approximated by

$$p_i = e_0\,[0.16\,|X_{ia}-X_s| + 0.035\,(X_{ia}-X_s)^2]\,(r_{ia}+r_s), \qquad (12)$$

where $r_{ia}$ and $r_s$ are the covalent radii of the interlayer and
the substrate atoms, respectively. The expression in square
brackets gives the ionic character of single bonds. It is a
revised version of a relation originally proposed by
Pauling. The change of barrier heights by such extrinsic
dipoles may then be described as the voltage drop across an
electric double layer, i.e., by

$$\delta\phi_B = \pm(e_0/\epsilon_i\epsilon_0)\,p_i N_{ia}, \qquad (13)$$

where $N_{ia}$ is the area density of the extrinsic interface dipoles.
If interfaces are doped with atoms having large covalent
radii and thus also large polarizabilities, the mutual
interaction of the interface dipoles will lower their dipole
moment as a function of their area density.

For  H-modified  diamond  and  silicon  Schottky  contacts, 
relations  (12)  and  (13)  give 

eV 

and

$$\delta\phi_B(\mathrm{H\text{-}Si}) = +1.00/\epsilon_i\ \mathrm{eV}.$$

As was already mentioned above, the interface dielectric
constants were estimated as $\epsilon_i \approx 2$ for diamond and
$\epsilon_i \approx 4$ for III-V Schottky contacts. Since the dielectric
constant of silicon does not differ much from the values
reported for the classical III-V compounds, the interface
dielectric constant is also approximated by $\epsilon_i \approx 4$ for silicon
Schottky contacts. The H-induced variations of the barrier
heights then result as

$$\delta\phi_B(\mathrm{H\text{-}C}) \approx -1\ \mathrm{eV} \quad\text{and}\quad \delta\phi_B(\mathrm{H\text{-}Si}) \approx +0.25\ \mathrm{eV}.$$

These  estimates  are  in  very  good  agreement  with  the  experi¬ 
mental  data. 
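The point-charge estimate can be retraced with a short sketch. It evaluates the ionic character 0.16|ΔX| + 0.035 ΔX² of relation (12), multiplies it by the sum of the covalent radii, and converts the resulting dipole moment into a potential step via (e0/ε0) p N, leaving the division by the interface dielectric constant to the reader. The covalent radii (0.032, 0.077, and 0.117 nm for H, C, and Si) and the site densities (one H atom per surface atom of the ideal Si(111) and diamond (100) planes) are assumptions made for this illustration; they are not quoted in the article, so the results only approximately match the prefactors given above. The code returns magnitudes only; the sign follows from the orientation of the dipole as discussed in the text.

```python
E0_OVER_EPS0 = 1.602176634e-19 / 8.8541878128e-12  # e0 / eps0 in V*m


def ionic_character(x_ia, x_s):
    """Ionic character of a single bond, the bracket in relation (12)."""
    dx = abs(x_ia - x_s)
    return 0.16 * dx + 0.035 * dx**2


def shift_times_eps_i(x_ia, x_s, r_ia_nm, r_s_nm, n_ia_per_cm2):
    """|delta phi_B| * eps_i in eV from relations (12) and (13)."""
    dipole_length_m = ionic_character(x_ia, x_s) * (r_ia_nm + r_s_nm) * 1e-9
    n_ia_per_m2 = n_ia_per_cm2 * 1e4
    return E0_OVER_EPS0 * dipole_length_m * n_ia_per_m2


# Assumed radii (nm) and site densities (cm^-2); see the lead-in.
h_si = shift_times_eps_i(2.2, 1.9, 0.032, 0.117, 7.83e14)
h_c = shift_times_eps_i(2.2, 2.55, 0.032, 0.077, 1.57e15)

print(f"H-Si: {h_si:.2f}/eps_i eV, i.e. {h_si / 4:.2f} eV for eps_i = 4")
print(f"H-C:  {h_c:.2f}/eps_i eV, i.e. {h_c / 2:.2f} eV for eps_i = 2")
```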

Quite recently, Saiz-Pardo et al. computed the variations
of barrier heights induced by a hydrogen interlayer in K/Si
contacts. They used a self-consistent LCAO method supplemented
with a local-density prescription to account for
many-body effects. The hydrogen atoms were found to form
a dipole layer at K/H/Si interfaces and to reduce the barrier
height by 0.23 eV with regard to “clean” K/Si contacts. This
theoretical result reproduces the experimental observations
with Pb/H/Si(111) contacts and confirms the conclusions
from the simple point-charge model outlined above.


B.  Influence  of  interface  structure  on  barrier  heights 

The barrier heights of “clean” metal-semiconductor contacts
vary if the interface structures are different. Tung was
the first to report on such a correlation. He discovered that
the barrier heights of epitaxial NiSi2/Si(111) diodes differ by
140 meV, depending on whether the interfaces are of type A
or type B. The lattices are identically aligned on both sides of
type-A contacts but they are rotated by 180° around the surface
normal for the case of type-B interfaces. Further examples
are Ag and Pb contacts on {111}-oriented Si substrates,
which will be discussed here.

The atomic structures of buried Ag/Si and Pb/Si interfaces
were studied with surface x-ray diffraction. First, films
evaporated onto clean Si(111)-7×7 surfaces at room temperature
will be considered. Each 7×7 unit mesh contains 12
adatoms and a stacking fault in one of its triangular subunits
which is surrounded by nine dimers and a corner hole.
After evaporation of thick Ag and Pb films, 7th order spots
were found to persist. From the diffraction intensities measured
with such Ag/Si(111)-“7×7” interfaces, Hong et al.
concluded that the stacking faults and their associated dimers
and corner holes are still present, while the Si-adatoms are
Fig. 7. Current-voltage characteristics of Pb- and Ag/n-Si(111) contacts
with “7×7” and 1×1 interface structures. The diode areas measure
6.14X10“^  cm^  and  6.6X10“^  cm^  respectively.  From  Refs.  46  and  45, 
respectively. 


missing. Similar results were reported by Feidenhans'l
et al. after the deposition of up to one monolayer of Pb on
Si(111)-7×7 surfaces at room temperature.

The Ag/Si(111)-“7×7” interface structure is unstable to
annealing at temperatures above 250 °C. With surface
x-ray diffraction, no superstructure spots were detected. The
resulting contacts are labeled Ag/Si(111)-1×1. Surface x-ray
diffraction studies demonstrated that similar Pb/Si(111)-1×1
interfaces exist after the deposition of 100 nm, i.e., thick Pb
films onto Si(111):Pb-(√3×√3)R30° surfaces at room
temperature. Heslinga et al. and Schmitsdorf et al.
measured current-voltage (I/V) and capacitance-voltage (C/V)
characteristics of Pb and Ag/Si(111) diodes, respectively,
with “7×7” and 1×1 interface structures. Irrespective of the
metal, the latter interfaces were found to have the larger
barrier heights.

Figure 7 displays I/V characteristics recorded with one
Ag and Pb/Si(111) contact each that exhibit a “7×7” and a
1×1 interface structure. The clean 7×7-reconstructed
surfaces to start with were obtained by hydrogen desorption
at approximately 850 °C from Si(111):H-1×1 surfaces prepared
by wet chemical etching. Only with such surfaces were
thermionic I/V characteristics such as those shown in Fig. 7
observed. When clean Si(111)-7×7 surfaces were prepared
by decomposition of SiO2 at high temperatures, nonideal I/V
characteristics were observed. Irrespective of the metal
used, the barrier heights are larger with diodes exhibiting
1×1 rather than “7×7” interface structures. This observation
is attributed to the stacking fault which is a characteristic
of the Si(111)-7×7 structure.

Chou et al. investigated the electronic properties of intrinsic
and extrinsic stacking faults in bulk silicon. The stacking
sequences are ...AA'BB'CC'BB'CC'... and
...AA'BB'AA'CC'AA'BB'CC'..., respectively. At such defects
the charge densities differ from the charge distribution
in the undisturbed bulk. Figure 8 displays the integrated difference
of the charge density over the (111) plane between a


[Figure 8 plot; axis label: charge density difference for the extrinsic stacking fault (ESF).]

Fig. 8. Integrated difference of the charge density over (111) planes between
a Si crystal with a stacking fault and a perfect Si crystal and atomic positions
in the (110) plane (1 a.u. = $a_{Bohr}^3$ = 1.48×10⁻⁴ nm³). After Chou et al.
(Ref. 47).


crystal  with  an  extrinsic  stacking  fault  and  a  perfect  crystal 
and  the  atomic  positions  on  the  (110)  plane.  As  a  whole, 
stacking  faults  are  electrically  neutral  but  they  exhibit  elec¬ 
tric  dipoles. 

The stacking sequence ...AA'BB'AA' is characteristic of
the stacking fault in one of the triangular subunits of the 7×7
clean surface and, by this, also of the “7×7” interface structure.
The computed distribution of the charge density indicates
that the electric dipole layer associated with such stacking
faults has its negative side towards the vacuum at clean
surfaces and towards the metal at “7×7” interfaces. These “7×7”-induced
interface dipoles are thus oriented the same as the
H-induced chemical dipoles considered in the preceding section.
This finding immediately explains the experimentally
observed increase in barrier height on n-type semiconductors
when the “7×7”-induced interface dipoles are removed.

The  respective  change  in  barrier  heights  may  be  estimated 
by  inserting  the  dipole  moment  resulting  from  the  theoretical 
charge  density  difference  plotted  in  Fig.  8  into  relation  (13). 
Numerical  integration  of  these  data  gives  the  dipole  moment 
of  the  ...AA'BB'AA'  stacking  fault  per  unit  area  as 

$4.93\times10^{5}\,e_0\,\mathrm{cm^{-1}}$.

With this value, relation (13) then gives

$$-(e_0/\epsilon_i\epsilon_0)\,p_i N_{ia}/2 = 0.446/\epsilon_i\ \mathrm{eV}.$$

The factor 1/2 accounts for the fact that only one half of the
“7×7” unit mesh contains a stacking fault. With an interface
dielectric constant $\epsilon_i \approx 3$, one obtains

$0.446/3\ \mathrm{eV} \approx 150\ \mathrm{meV}$.

J. Vac. Sci. Technol. B, Vol. 14, No. 4, Jul/Aug 1996

Winfried Mönch: Electronic properties of metal-semiconductor interfaces

This estimate is close to the difference of 170 meV observed
with Pb/Si(111) contacts, but it is larger by approximately
100 meV than what was found with Ag/Si(111) diodes. The
latter difference may be attributed to interface defects. This
will be discussed next.
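The stacking-fault estimate can be retraced numerically. The sketch below converts a dipole moment per unit area into a barrier-height change using the double-layer expression of relation (13), with the factor 1/2 for the faulted half of the unit mesh. The input value of 4.93×10^5 e0 per cm is an assumption: it is the value that reproduces the quoted prefactor of 0.446/ε_i eV, since the number is not fully legible in the source. The function name is hypothetical.

```python
E0_OVER_EPS0 = 1.602176634e-19 / 8.8541878128e-12  # e0 / eps0 in V*m


def stacking_fault_shift_ev(dipole_per_area_e0_per_cm, eps_i, faulted_fraction=0.5):
    """Magnitude of the barrier-height change in eV.

    dipole_per_area_e0_per_cm: dipole moment per unit area in units of e0/cm.
    faulted_fraction: share of the unit mesh that contains the stacking fault.
    """
    dipole_per_area_per_m = dipole_per_area_e0_per_cm * 100.0
    return faulted_fraction * E0_OVER_EPS0 * dipole_per_area_per_m / eps_i


prefactor = stacking_fault_shift_ev(4.93e5, eps_i=1.0)
estimate = stacking_fault_shift_ev(4.93e5, eps_i=3.0)
print(f"prefactor: {prefactor:.3f}/eps_i eV, estimate for eps_i = 3: {estimate * 1e3:.0f} meV")
```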

The Si(111):Pb-(√3×√3)R30° surface structure is due to
the arrangement of adsorbed lead atoms on otherwise (almost)
ideally terminated Si(111) surfaces. The removal
of this reconstruction and, by this, the formation of a 1×1
interface structure under thick Pb layers thus results from a
reordering of the lead atoms at the interface. The coordinates
of the Si interface atoms, on the other hand, will experience
only small shifts parallel and/or normal to the interface.
However, the elimination of the stacking fault and the formation
of an ideally terminated 1×1 interface structure during
annealing treatments of Ag/Si(111)-“7×7” interfaces
needs a supply of eight additional silicon atoms per 7×7 unit
mesh. This is equivalent to 16 percent of the atoms in a bulk
{111} layer. Residual interface vacancies, for example, might
act as interface defects of donor type. They would decrease
the barrier height of metal contacts on n-type semiconductors

(14) 

where  is  the  area  density  of  occupied  interface  donors. 
Approximately 4×10^13 charged donors per cm² already suffice
to lower the barrier heights of Schottky contacts on
n-type silicon by 100 meV. This density corresponds to less
than one third of the additional Si atoms needed in the thermally
activated conversion of the “7×7” to a 1×1 interface
structure at Ag/Si(111) contacts.
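The atom counting in this paragraph can be verified with a few lines of arithmetic. The sketch assumes a silicon lattice constant of 0.5431 nm and an ideal {111} layer with one atom per 1×1 unit mesh; neither value is stated in the article, and the function name is hypothetical.

```python
import math

A0_SI_NM = 0.5431  # assumed Si lattice constant in nm


def site_density_111_per_cm2(a0_nm):
    """Atoms per cm^2 in an ideal {111} layer (one atom per 1x1 unit mesh)."""
    cell_area_nm2 = math.sqrt(3.0) / 4.0 * a0_nm**2
    return 1e14 / cell_area_nm2  # 1 nm^-2 equals 1e14 cm^-2


layer_density = site_density_111_per_cm2(A0_SI_NM)
fraction = 8 / 49                      # eight extra atoms per 7x7 unit mesh
extra_atoms = fraction * layer_density
one_third = extra_atoms / 3.0

print(f"fraction of a {{111}} layer: {100 * fraction:.1f} percent")
print(f"extra Si atoms needed: {extra_atoms:.2e} cm^-2")
print(f"one third of that:     {one_third:.2e} cm^-2")
```

A donor density is "less than one third" of the required Si atoms if it lies below the last printed value.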

As was already mentioned, Heslinga et al. discovered
that the barrier heights of Pb/Si(111) contacts correlate with
the initial reconstruction of the silicon substrate. They deposited
thick Pb layers on clean 7×7- and on Pb-(√3×√3)R30°-reconstructed
n-Si(111) substrates. From the I/V characteristics
of their diodes, they evaluated barrier heights of 0.7
and 0.93 eV, respectively. Their and our data agree for
Pb/Si(111)-“7×7” interfaces while our values are lower by
approximately 60 meV for the Pb/Si(111) interfaces fabricated
by evaporation of Pb onto Pb-(√3×√3)R30°-reconstructed
Si(111) surfaces. This difference is attributed to the nonideal
I/V characteristics reported by Heslinga et al. Our I/V
curves were ideal when we prepared the clean silicon surfaces
to start with by hydrogen desorption from Si(111):H-1×1
surfaces processed by HF dips; however, we obtained
nonideal characteristics with Si surfaces cleaned by thermal
decomposition of SiO2 at elevated temperatures. Heslinga
et al. used the latter preparational method in their studies.

IV.  METAL-INSULATOR  CONTACTS 

Metal-insulator-semiconductor (MIS) structures with
thin insulating interlayers behave electrically like Schottky
contacts. The barrier heights of such devices depend on the
nature of the insulator, its thickness, and the specific metal
used. Deal et al. determined metal-SiO2 barrier heights in
metal-SiO2-silicon (MOS) structures by using internal
photoemission and capacitance-voltage (C/V) measurements.
Sobolewski and Helms evaluated Si3N4-Si barrier heights
Fig. 9. Metal-SiO2 barrier heights as a function of the metal electronegativity.
The dashed line is a least-squares fit to the data and has a slope
parameter of 0.85 eV/Miedema unit. Data from Ref. 51.


from  C/V  characteristics  of  metal-Si3N4-silicon  (MNS) 
contacts.  The  nitride  thickness  varied  between  1.6  and  3  nm. 
Figures  9  and  10  display  the  MOS  and  MNS  data,  respec¬ 
tively.  They  will  be  analyzed  in  the  following. 

Insulators may be described as semiconductors with large
band gaps. Therefore, relation (1) should also apply to metal-insulator
contacts. Figure 9 displays the barrier heights at
metal-SiO2 interfaces, which were reported by Deal et al.,
as a function of the metal electronegativities. The dashed line
is a least-squares fit to the data. Its slope amounts to 0.85
± 0.2 eV per Miedema unit. In the infrared, amorphous SiO2
has an index of refraction n = 1.45 and one obtains
$\epsilon_\infty = n^2 = 2.1$. Relation (9) then predicts a slope parameter
$S_X(\mathrm{SiO_2}) = 0.77$ eV/Miedema unit. This value agrees remarkably
well with the result of the fit to the experimental
data plotted in Fig. 9, although these data exhibit a large scatter.

Figure 10(a) displays the barrier heights at the Si3N4-Si
interfaces of MNS structures as communicated by Sobolewski
and Helms. Irrespective of the contact metal, the
barrier heights vary linearly as a function of the insulator
thickness. The sign of the slope parameters is negative for n-
and positive for p-type silicon, but the least-squares fits to the
data give identical absolute values of the slope parameters
irrespective of the type of doping and the metal used.

MIS structures may be described by a metal-insulator and
an insulator-semiconductor contact in series. Thus, the concept
of interface-induced gap states outlined above may
also be applied to MIS structures. Figure 11 schematically shows
a band diagram of MIS structures in thermal equilibrium.
The different energy contributions add up as

=  +  (15) 
Fig. 10. Barrier heights of metal/Si3N4/Si contacts as a function of the
nitride thickness for a variety of metals (a) and as a function of metal
electronegativity for the same nitride films (b). Data from Ref. 52.


As in semiconductor heterostructures, the charge-neutrality
levels of an insulator and a semiconductor in contact
will align. The conduction-band offsets at such interfaces
are then given by

(16)

where the last term on the right side accounts for dipole
effects due to a nonvanishing difference $X_i - X_s$ of the insulator
and semiconductor electronegativities. The potential
drop $\Delta_i$ across the insulator accounts for a fixed charge:

$$Q_f^i = (\epsilon_i\epsilon_0/e_0)\,\Delta_i/d_i, \qquad (17)$$

where $\epsilon_i$ and $d_i$ are the dielectric constant and the thickness
of the insulator. By combining Eqs. (15) to (17) and considering
Eq. (1), one obtains

^Bn  =  ^cnl)  +  (t>lnr  ^ ^i^o)Q{di  + 

=  S,i{Xi-X,)-{eo/€ieo)Q{di+<f>l„i+D,.  (18) 


Fig. 11. Band diagram for metal/insulator/semiconductor contacts (schematically).

For  p-type  semiconductors,  the  first  two  terms  of  (18)  have 
opposite  sign. 

The barrier height at the insulator-semiconductor interface
will thus decrease proportionally to the thickness $d_i$
provided the charge density $Q_f^i$ is fixed. Relation (18) thus
explains the linear variation of the barrier height as a function
of nitride thickness as plotted in Fig. 10(a). This result is
well-known and by no means surprising.
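The linear thickness dependence follows directly from solving Eq. (17) for the potential drop, which gives a drop of (e0/ε_i ε0) times the fixed charge density times the insulator thickness. The sketch below evaluates this expression; the fixed charge density of 10^12 elementary charges per cm² and the insulator dielectric constant of 7.5 are hypothetical example inputs, not values from the article, and the function name is an assumption.

```python
E0_OVER_EPS0 = 1.602176634e-19 / 8.8541878128e-12  # e0 / eps0 in V*m


def insulator_drop_ev(q_f_per_cm2, d_i_nm, eps_i):
    """Potential drop across the insulator in eV for a fixed charge density.

    q_f_per_cm2: fixed charge density in elementary charges per cm^2.
    d_i_nm: insulator thickness in nm.
    eps_i: dielectric constant of the insulator.
    """
    return E0_OVER_EPS0 * (q_f_per_cm2 * 1e4) * (d_i_nm * 1e-9) / eps_i


# Hypothetical inputs; the thickness range matches the 1.6 to 3 nm quoted for the nitride.
for d_nm in (1.6, 2.0, 3.0):
    print(f"d_i = {d_nm:.1f} nm: drop = {insulator_drop_ev(1e12, d_nm, 7.5) * 1e3:.1f} meV")
```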

For constant thickness of the insulator, Eq. (18) predicts
the barrier heights $\phi_{Bn}$ at the Si3N4-Si interfaces to increase
proportionally to the electronegativity of the metal. Therefore,
the barrier heights displayed in Fig. 10(a) are replotted
in Fig. 10(b) versus the electronegativities of the metals with
the nitride thickness as a parameter. As is to be expected, the
barrier heights increase for n- but decrease for p-type doping
of the silicon substrates. The straight lines shown are least-squares
fits to the data. The average of their slope parameters
amounts  to  — 0.45  ±0.05  eV/Miedema  unit.  The  slope 
parameter  is  determined  by  the  insulator.  In  the  infrared, 
Si3N4 has an index of refraction n = 2.05 so that the optical
dielectric constant results as $\epsilon_\infty = n^2 = 4.2$. Relation (9) then
predicts a slope parameter $S_X(\mathrm{Si_3N_4}) = 0.42$ eV/Miedema
unit. This result excellently confirms the value obtained
above from the experimental data.
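The two slope-parameter predictions of this section can be reproduced with a short sketch. It takes relation (9) in the form A_X/S_X - 1 = 0.1 (ε∞ - 1)², with A_X = 0.86, and obtains the optical dielectric constant as the square of the refractive index, as done in the text. The function names are hypothetical.

```python
A_X = 0.86  # accounts for the use of Miedema's electronegativity scale


def eps_inf_from_index(n):
    """Optical dielectric constant from the index of refraction."""
    return n**2


def slope_parameter(eps_inf):
    """S_X in eV per Miedema unit from A_X/S_X - 1 = 0.1 * (eps_inf - 1)^2."""
    return A_X / (1.0 + 0.1 * (eps_inf - 1.0) ** 2)


for name, n in (("SiO2", 1.45), ("Si3N4", 2.05)):
    eps = eps_inf_from_index(n)
    print(f"{name}: eps_inf = {eps:.1f}, S_X = {slope_parameter(eps):.2f} eV/Miedema unit")
```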

V.  SUMMARY 

The  interface-induced  gap  states  determine  the  band 
alignment  at  ideal  semiconductor  interfaces.  These  intrinsic 
interface  states  derive  from  the  virtual  gap  states  of  the  com¬ 
plex  band  structure  of  the  semiconductor.  The  chemical 
bonds  at  the  interfaces  establish  the  charge  transfer  across 
the  interface  and,  by  this,  the  net  charge  in  the  interface- 
induced  gap  states.  A  generalization  of  Pauling’s  concept  for 
small molecules models the charge transfer by the electronegativity
difference of the solids in contact. Two quantities
then determine the chemical trends of the barrier heights at
metal-semiconductor contacts: the zero-charge-transfer
barrier height and a slope parameter. The zero-charge-transfer
barrier heights may be estimated from the
position  of  the  top  valence  band  at  the  mean-value  point  in 
the  Brillouin  zone  relative  to  the  valence-band  maximum  and 
the  dielectric  band  gap.  The  valence  band  may  be  calculated 
in  the  empirical  tight-binding  approximation  with  universal 
coefficients  for  the  nearest-neighbor  interaction  matrix  ele¬ 
ments.  The  slope  parameters  may  be  directly  calculated  from 
the  optical  dielectric  constant  of  the  semiconductor. 

In  the  present  article,  two  types  of  interface  modifications 
were  considered,  which  are  chemical  interface  doping  and 
differing  interface  structures.  The  experimentally  observed 
variations  of  the  barrier  heights  were  explained  by  extrinsic 
interface  dipoles.  They  are  present  in  addition  to  the  intrinsic 
interface  dipole  layer  which  results  from  the  metal-induced 
gap  states.  Furthermore,  interface  defects  may  exist  that  then 
absorb  charge  transferred  at  the  interface.  By  this,  they  also 
effect  changes  of  barrier  heights. 

Insulators  may  be  regarded  as  semiconductors  with  large 
band  gaps.  Therefore,  the  concept  of  interface-induced  gap 
states  not  only  applies  to  metal-semiconductor  contacts  and 
semiconductor heterostructures but also to metal-insulator
and insulator-semiconductor interfaces. Furthermore, layered
structures may be described as a sequence of interfaces
in  series.  The  thickness  of  an  individual  semiconductor  or 
insulator  layer  has,  of  course,  to  exceed  the  decay  lengths  of 
the  respective  interface-induced  gap  states. 

Silicon layers grown by molecular beam epitaxy in the interface region of Al/n-GaAs(001) Schottky
diodes have been shown to tune the Schottky barrier height in the 0.3-1.1 eV range, provided that
high enough excess fluxes of As or Al are employed during Si deposition. We studied the
incorporation of As and Al in the interface layer for Si growth temperatures of 300 and 400 °C, and
for interlayer thicknesses in the 10-150 Å range. Following deposition under an excess As flux we
found an As atomic concentration C_As = 0.4-0.5 at 300 °C and C_As = 0.2 at 400 °C. The excess Al
flux yielded an Al atomic concentration C_Al = 0.7-0.8 at 300 °C, and C_Al = 0.6-0.7 at 400 °C.
Correspondingly, the lattice structure of the interlayers appears dramatically different from that of
Si. We conclude that the 10-100 Å interface layers used by some authors to change the barrier
height bear little resemblance to degenerate bulk Si, so that several of the mechanisms proposed to
explain Schottky barrier tuning should be reevaluated. © 1996 American Vacuum Society.


I.  INTRODUCTION 

Ohmic  as  well  as  rectifying  contacts  have  a  capital  role  in 
solid  state  electronics.  In  modern  technology,  modulation  of 
the  properties  of  metal/semiconductor  (MS)  junctions  is  typi¬ 
cally  achieved  by  varying  the  doping  of  the  region  of  the 
semiconductor  closest  to  the  metallurgical  junction  to  allow  a 
variable  degree  of  tunneling.^’^  Many  authors,  however,  are 
continuing  the  search  for  reliable  methods  to  engineer  the 
actual  Schottky  barrier  height,  with  a  threefold  motivation.^ 
First,  any  procedure  that  reproducibly  changes  the  barrier 
height  will  shed  light  on  the  true  microscopic  mechanism 
responsible  for  Fermi  level  pinning.  Second,  doping  technol¬ 
ogy  is  sufficiently  advanced  only  for  a  very  small  number  of 
semiconductors.  Many  new  materials  (e.g.,  the  wide  gap  II- 
VI,  III-V,  and  IV  semiconductors  employed  in  short  wave¬ 
length  optoelectronic  applications)  exhibit  doping  limitations 
that  severely  hinder  contact  performance.  Third,  the  perfor¬ 
mance  of  specific  classes  of  devices  may  be  improved 
through  the  incorporation  of  engineered  Schottky  barriers. 
Potential  examples  include  lower  leakage  currents  in  metal- 
semiconductor  field-effect  transistors,  lower  dark  currents  in 
metal-semiconductor-metal  photon  detectors,  and  tunable 
bandpass  for  Schottky  barrier  photon  detectors.^ 

In the area of Schottky barrier tuning, several recent reports
have emphasized the role of heterovalent semiconductor
interface layers, such as Si and Ge in metal/GaAs junctions,
fabricated by molecular beam epitaxy (MBE) in the
interface region of conventional Schottky diodes. Grant and
co-workers initially suggested that the growth of 15-30 Å
thick Si interface layers under Ga flux unpinned the Fermi
level at Au/n-GaAs(001), Cr/n-GaAs(001), and Ti/n-GaAs(001)
contacts, with final Schottky barrier values in the
1.01-1.14 eV range, i.e., considerably higher than those obtained
for the same metal/GaAs contacts in the absence of
the Si interface layer (0.76-0.89 eV). Studies of the surface
position of the Fermi level upon deposition of Si and Ge
under As or Ga flux on GaAs(001) surfaces were used to
derive Fermi level positions near the bottom of the overlayer
conduction bands for overlayers grown under As flux, and to
predict low Schottky barrier values (~0.3 eV) for hypothetical
metal/Si:As/n-GaAs MS structures.

Costa et al. fabricated Al/Si/n-GaAs(001) MS structures
with 6, 30, and 60 Å thick Si layers grown by MBE, and
concluded that the barrier decreased from about 1.0 eV to
0.3-0.4 eV with increasing Si thickness. They associated this
effect with incorporation of residual As in the Si interface
layer. Koyanagi et al. also examined Al/Si/n-GaAs(001)
MS structures grown by MBE, and observed a barrier only
some 0.2 eV lower than for the Al/n-GaAs(001) case when
Si layers 10-40 Å thick were fabricated at the interface under
As flux. Conversely, no substantial change of the
Schottky barrier relative to the Al/n-GaAs(001) case was
reportedly observed in Al/Si/n-GaAs(001) structures in which
Si layers 10-40 Å thick had been grown under Ga flux.

A possible explanation of the above discrepancy was provided
by our recent photoemission and transport results from
Al/Si/GaAs(001) diodes, which showed that the As and Al
fluxes required during Si deposition to produce an effect are
much larger than those commonly employed for Si doping
purposes, i.e., comparable with or higher than the Si flux,
and presumably higher than those employed by Koyanagi
et al.

Most empirical and theoretical models proposed
to explain subsets of the above experimental results consider
the electronic structure of the Si interfacial layer as identical
to that of bulk Si, require the presence of an As-doped (Ga-doped)
n+ (p+) degenerate Si layer of sufficient thickness at
the interface, e.g., to justify tunneling, depletion charge
compensation, or metallic screening of the interface
states, and assume that the band alignment across the Si-GaAs
heterojunction in the Al/Si/GaAs structure is independent
of the Si layer thickness. However, the high dopant
fluxes required during Si deposition, and evidence that all of
the barrier modification is already observed at Si coverages
of the order of 1-2 monolayers, call into question the role
and the very nature of the interface layer. Structural and
compositional information about the Si interlayer is woefully
scarce. To our knowledge, only in one previous case has the
structure of Si epilayers grown on GaAs under As flux been
examined by x-ray photoelectron diffraction and shown to be
consistent with that of crystalline Si.

Here we focus on the composition and structure of the
Si-related species synthesized by MBE on GaAs in conditions
homologous to those employed in Refs. 3-11. In particular,
we fabricated Si layers at 300 and 400 °C in ultrahigh
vacuum (UHV) conditions, as well as under As and Al fluxes
comparable with the Si flux. We emphasize that these were
the same Si deposition conditions which gave rise to
Schottky barrier tunability in Al/Si/GaAs(001) diodes in
Refs. 8 and 9. The composition and structure of the layers
were examined in situ by means of monochromatic x-ray
photoemission spectroscopy (XPS) and reflection high energy
electron diffraction (RHEED). Preliminary secondary ion
mass spectrometry (SIMS) depth profiling studies and
transmission electron microscopy (TEM) results lent further
support to the surprising conclusions of our in situ studies.

II.  EXPERIMENTAL  DETAILS 

All samples were grown by MBE using a facility which
includes interconnected growth chambers and an analysis
chamber with monochromatic XPS capabilities. GaAs
buffer layers 5000 Å thick (n-type, N_D=10^^ cm^-3) were
first grown at 580 °C on GaAs(100) wafers using the
methodology described in Ref. 13. Si was deposited on the buffer
layers at 300 or 400 °C in UHV conditions (As partial
pressures below 5×10“^® Torr), under As flux (As equivalent
beam pressures examined were in the 1-5×10“^ Torr
range), or under Al flux (Al equivalent beam pressures
examined were in the 8-10×10“^ Torr range). The fluxes were
calibrated by exposing a nude ion gauge to the molecular
flux at the sample position, and no correction was included to
take into account the ionization cross sections for the different
species. The Si growth rate and atomic flux were determined
using the procedure described in Ref. 13. The typical Si
growth rate employed was 30 Å/h. The Al growth rate was
measured by depositing thick Al films on substrates kept at
room temperature, and determining the film thickness with a
profilometer. The Al K-cell temperature used in the test was
varied between 1000 and 1150 °C. For the growth of Si with
Al flux the Al K cell was operated at 950 °C, which corresponds
(by linear extrapolation from the measured Al flux
values) to an Al flux of ~2×10^^ atoms/cm² s. For the
growth of Si with As, the As flux employed was ~1×10^^
atoms/cm² s, which was determined by exposing to the flux a
GaAs(001)c(4×4) surface cooled down to -30 °C.


In principle, RHEED intensity oscillations in MBE can be
used to independently determine the Al and As4 fluxes. The
method would involve AlAs growth in cation- or anion-limited
conditions, and measurements of the resulting overall
compound growth rate from RHEED oscillations. This
method yields an estimate of ~1×10^^ atoms/cm² s for the
Al flux at a K-cell temperature of 1010 °C, and an estimated
flux (by linear extrapolation from the measured Al flux values)
of 3×10^^ atoms/cm² s at a K-cell temperature of
950 °C. Unfortunately, the same independent flux estimate
from RHEED oscillations could not be performed satisfactorily
for As.
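The RHEED-based flux estimate described above can be sketched as follows. This is only an illustration under stated assumptions: in cation-limited AlAs growth each oscillation period is taken to complete one monolayer, the monolayer site density is taken as about 6.25×10^14 atoms/cm² (the (001) value used elsewhere in the text), and the oscillation period in the example is hypothetical, not a measured value.

```python
# Areal site density of one (001) monolayer in atoms/cm^2
# (assumed here; matches the monolayer definition used in the text).
SITES_PER_ML = 6.25e14


def flux_from_rheed_period(period_s, sites_per_ml=SITES_PER_ML):
    """Flux of the growth-limiting species in atoms/(cm^2 s).

    Assumes one RHEED intensity oscillation corresponds to the
    completion of exactly one monolayer.
    """
    if period_s <= 0:
        raise ValueError("oscillation period must be positive")
    return sites_per_ml / period_s


# Hypothetical oscillation period, for illustration only.
example_period_s = 2.0
example_flux = flux_from_rheed_period(example_period_s)
print(f"{example_flux:.3e} atoms/cm^2 s")
```

The same relation applies to the anion-limited case, with the As-limited growth rate in place of the Al-limited one.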

We observed a c(4×4) GaAs RHEED pattern at 300 or
400 °C prior to Si deposition. During Si deposition in UHV
conditions at 300 and 400 °C, a diffuse 2×1 RHEED pattern
became increasingly spotty at Si coverages above the critical
thickness (8-10 ML), with an increased background at
~110 ML (150 Å). The nominal Si coverage was determined
in all cases from the Si coverage calibration described in Ref.
13. The 2×1 reconstruction is normally observed during Si
epitaxy, in the absence of any residual As atmosphere.
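As a rough cross-check of the coverage and thickness figures quoted above (about 110 ML corresponding to 150 Å, at a typical growth rate of 30 Å/h), the conversion can be sketched in Python. The sketch assumes that one Si monolayer along [001] has the unstrained bulk spacing a/4, with a = 5.431 Å; this spacing is an assumption introduced here rather than a value stated in the text, and a strained pseudomorphic layer would differ slightly.

```python
# Bulk Si lattice constant in angstrom (assumed, unstrained).
SI_LATTICE_CONSTANT_A = 5.431
# Spacing between successive (001) atomic planes in the diamond lattice.
ML_SPACING_A = SI_LATTICE_CONSTANT_A / 4.0


def coverage_to_thickness(coverage_ml):
    """Convert a nominal Si coverage in monolayers to a thickness in angstrom."""
    return coverage_ml * ML_SPACING_A


def deposition_time_hours(coverage_ml, rate_a_per_h=30.0):
    """Time needed to reach a coverage at a given growth rate (angstrom/hour)."""
    if rate_a_per_h <= 0:
        raise ValueError("growth rate must be positive")
    return coverage_to_thickness(coverage_ml) / rate_a_per_h


thickness = coverage_to_thickness(110)
hours = deposition_time_hours(110)
print(f"110 ML is about {thickness:.0f} angstrom, about {hours:.1f} h at 30 angstrom/h")
```

Under these assumptions 110 ML comes out close to the 150 Å quoted in the text.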

During Si deposition at 400 °C under As flux a somewhat
diffuse 3×1 RHEED pattern became visible at a coverage
of about 0.5 ML (1 ML=6.25×10^14 atoms/cm²). The 3×1
RHEED pattern became increasingly diffuse at coverages
higher than 6 ML, and circles started to appear at ~50 ML.
At ~110 ML the dominant presence of circles in the diffraction
pattern suggested a polycrystalline structure of the epilayer.
The RHEED patterns observed during Si deposition at
300 °C under As flux were similar to those observed during
growth at 400 °C, but a polycrystalline-type diffraction pattern
was observed at substantially lower coverages (~10
ML).

During Si deposition under Al flux a complex and streaky
RHEED pattern was observed for Si growth at 300 and
400 °C, and this pattern became relatively spotty with an
increased background at 110 ML.

Systematic analyses of the integrated emission intensities
from the Si 2p, Ga 3d, Ga 3p, As 3d, and Al 2p core levels
and of the valence band line shape were performed in situ by
XPS to gauge the nature and composition of the Si-related
species. The XPS spectrometer uses Al Kα radiation (1486.6
eV) monochromatized and focused by a bent crystal
monochromator, and was operated with an overall resolution
(electrons and photons) of 0.73 eV and a spot size of 300 μm. The
XPS and RHEED measurements were complemented by
preliminary cross-sectional TEM measurements, and by SIMS
depth profiling studies of thick (110 ML) layers grown under
the same conditions.

To compare the nature of the valence states for the Si-related
species with that of well-characterized bulk standards,
we also fabricated a series of standard samples. A Si
reference sample was obtained by depositing a Si epilayer on
a Si(001) wafer which was oriented 4° off the [001] along
the [110] direction. After chemical removal of the native oxide
the substrate was annealed at 900 °C, and a ~10 ML
thick Si layer was grown at 500 °C. A double-domain 2×1
RHEED pattern and no detectable contaminants by XPS
were observed after growth.

Fig. 1. Si 2p (left-hand side), As 3d (center) core level, and valence band
(right-hand side) emission from bulk Si (top-most panel), GaAs (second
top-most panel), elemental As (second bottom-most panel) standards, and
from a 110 ML thick Si layer (bottom-most panel) deposited under high As
flux at 300 °C onto GaAs(001)c(4×4). The zero of the energy scale
corresponds to the Fermi level of the spectrometer E_F (right-most vertical line).
The spectra shown are angle integrated photoelectron energy distribution
curves (EDCs) obtained with monochromatic x-ray photoemission
spectroscopy (XPS) using Al Kα radiation. Left-most and central vertical lines mark
the positions of the centroids of the Si 2p and As 3d doublets in the
bottom-most spectra.

GaAs and AlAs standards were fabricated as epilayers on
GaAs buffers. Both the GaAs(001)-2×4 epilayer and the 300
Å thick AlAs(001)3×1 epilayer were grown at 580 °C on a
GaAs buffer using the methodology described in Ref. 13.

An elemental As standard was fabricated by condensing a
1 μm thick As layer on a GaAs(001)-c(4×4) surface cooled
down to -30 °C. Diffraction circles in the resulting RHEED
pattern suggested a polycrystalline (as opposed to
amorphous) structure of the surface, and no contaminants
were detected by XPS. The thickness of the resulting adsorbed
layer was determined by cross-sectional lateral force
microscopy. An elemental Al standard was obtained by
depositing 1300 Å of Al on a molybdenum plate at 10 °C. Only
a diffuse diffraction intensity was observed by RHEED,
suggesting the formation of an amorphous film.

III.  RESULTS  AND  DISCUSSION 

In general, the photoelectron valence band and core level
emission from the Si-related species deposited under the
high As and Al fluxes employed here and in Refs. 3-11
exhibits large differences as compared to that of bulk Si
standards, or of Si epilayers deposited in UHV conditions. In
Fig. 1, we compare the valence band and core level emission
observed from a 110 ML thick layer deposited under As flux
(bottom) at 300 °C with that observed from a variety of
standards. The spectra shown correspond to angle-integrated
photoelectron energy distribution curves (EDCs) from the
Si 2p and As 3d core level doublets, and the valence band
emission. The zero of the energy scale corresponds to the
spectrometer Fermi level E_F. The top-most panel in Fig. 1
shows the emission from a bulk Si standard. The three main
features in the valence band correspond to the well-known p,
sp, and s Si density of states (DOS) features. The position
of the centroid of the Si 2p core doublet is found 98.94
±0.08 eV below the linearly extrapolated valence band
maximum (E_v), in agreement with the literature. The emission
from Si epilayers deposited on GaAs in UHV conditions
(not shown) is strikingly similar to that of the bulk standard.

The second top-most panel in Fig. 1 shows the emission
from the GaAs standard. The three main features in the valence
band, qualitatively similar to those observed for Si,
correspond to the well-known As-p, hybrid As-p/Ga-s, and
As-s DOS features. The position of the centroid of the
As 3d doublet is found 40.85±0.08 eV below E_v, in agreement
with the literature.

The second bottom-most panel in Fig. 1 shows the emission
from an elemental As standard. The valence-band DOS
is substantially different from that of Si or GaAs, especially
in the region of the p-s hybrid orbitals, and the centroid of
the As 3d doublet is found 41.45±0.08 eV below the
extrapolated leading edge of the valence band. The 0.6 eV shift
to higher binding energy relative to the GaAs case is consistent
with previous experiments examining monolayer adsorbates
of As on GaAs, and reflects the lower ionicity of the
chemical bond in going from an As-Ga to an As-As type of
coordination. A somewhat larger asymmetry in the core level
line shape may reflect an increased metallic character of the
epilayer, although we hasten to add that there is no evidence
of emission at E_F in the valence band spectrum.

The emission from the Si-related species formed at
300 °C on GaAs under As flux (bottom-most panel) is dramatically
different from that of the bulk Si standard, and
qualitatively more similar to what would be expected from a
mixed Si-As phase. The centroids of the Si 2p and the
As 3d doublets appear 99.47±0.08 and 41.04±0.08 eV,
respectively, below the extrapolated leading edge of the valence
band. This would correspond to a 0.53 eV increase in
the Si 2p binding energy and a 0.41 eV decrease of the
As 3d binding energy relative to the bulk Si and As standards,
respectively. The direction and order of magnitude of the shifts
support the formation of a Si-As mixed phase, in view of the relative
electronegativities of Si, As, and Ga.
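The core level shifts discussed above follow directly from the quoted centroid positions, and the arithmetic can be checked with a short Python sketch. The centroid values and the ±0.08 eV uncertainty are taken from the text; combining the two uncertainties in quadrature is an assumption made here (it treats the two measurements as independent), not a procedure described by the authors.

```python
import math

# Uncertainty quoted in the text for each centroid position (eV).
SIGMA_EV = 0.08

# Centroid positions in eV below the valence band edge, as quoted in the text.
SI_2P = {"bulk Si": 98.94, "Si-As layer": 99.47}
AS_3D = {"GaAs": 40.85, "elemental As": 41.45, "Si-As layer": 41.04}


def core_level_shift(sample_ev, reference_ev, sigma_ev=SIGMA_EV):
    """Return (shift, uncertainty) in eV; positive means higher binding energy.

    Assumption: the two uncertainties are independent and add in quadrature.
    """
    return sample_ev - reference_ev, math.sqrt(2.0) * sigma_ev


si_shift, err = core_level_shift(SI_2P["Si-As layer"], SI_2P["bulk Si"])
as_shift, _ = core_level_shift(AS_3D["Si-As layer"], AS_3D["elemental As"])
as_vs_gaas, _ = core_level_shift(AS_3D["elemental As"], AS_3D["GaAs"])

print(f"Si 2p, layer vs bulk Si:      {si_shift:+.2f} +/- {err:.2f} eV")
print(f"As 3d, layer vs elemental As: {as_shift:+.2f} +/- {err:.2f} eV")
print(f"As 3d, elemental As vs GaAs:  {as_vs_gaas:+.2f} +/- {err:.2f} eV")
```

With the quadrature assumption, each shift carries an uncertainty of roughly 0.11 eV, so the 0.53, 0.41, and 0.6 eV shifts are all well outside the combined error.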

We emphasize that the presence of As impurities incorporated
at or below the percent level in a bulk Si crystal would
be unlikely to give rise to the large changes in the valence
band and core level emission observed when comparing the
bottom-most and the top-most spectra in Fig. 1. The comparison
argues instead for the formation of a concentrated
alloy. The limited width of the core levels also supports the
formation of a true alloy, rather than the segregation of separate
elemental Si and elemental As phases.

In Fig. 2 we compare the valence band and core level
emission observed from a 110 ML thick layer deposited under
Al flux (bottom) at 300 °C with that observed from a
variety of standards. The spectra shown correspond to
angle-integrated EDCs from the Si 2p and Al 2p core doublets,
and the valence band emission. The zero of the energy scale


J.  Vac.  Sci.  Technol.  B,  Vol.  14,  No.  4,  Jul/Aug  1996 




Sorba,  Paggel,  and  Franciosi:  Tunable  Schottky  barriers  and  the  nature 


[Figure 2 plot: Si 2p, Al 2p, and valence band (VB) panels for Si, AlAs, Al, and Si+Al; horizontal axis: Binding Energy (eV).]


Fig. 2. EDCs for the Si 2p (left-hand side), Al 2p (center) core level, and
valence band (right-hand side) emission from bulk Si (top-most panel), AlAs
(second top-most panel), elemental Al (second bottom-most panel) standards,
and from a 110 ML thick Si layer (bottom-most panel) deposited
under high Al flux at 300 °C onto GaAs(001)c(4×4). The zero of the
energy scale corresponds to the Fermi level of the spectrometer E_F (right-most
vertical line). Left-most and central vertical lines mark the positions of
the centroids of the Si 2p and Al 2p doublets in the bottom-most spectra.


corresponds to E_F. The top-most panel in Fig. 2 shows the
emission from a bulk Si standard, as in Fig. 1. The second
top-most panel in Fig. 2 shows the emission from an AlAs
standard. The three main features in the valence band,
qualitatively similar to those observed for Si, correspond to the
well-known As-p, hybrid As-p/Al-s, and As-s DOS
features. The position of the centroid of the Al 2p doublet
is found 72.89±0.08 eV below E_v, in agreement with the
literature.

The second bottom-most panel in Fig. 2 shows the emission
from an elemental Al standard. The well-defined cutoff
at E_F and the marked asymmetry of the Al 2p core line
shape reflect the metallic character of the standard. The
centroid of the Al 2p doublet is found 72.97±0.08 eV below the
Fermi level E_F, consistent with the literature.

The emission from the Si-related species formed at
300 °C on GaAs under the Al flux (bottom-most panel) is
substantially different from that of bulk Si, as shown by the
emission at E_F and the enhanced asymmetry of the Si 2p line shape.
The centroids of the Si 2p and Al 2p doublets appear 99.27±0.08 eV and
73.09±0.08 eV, respectively, below the Fermi level. Evidence
of hybrid Si-Al states 5-6 eV below E_F in the valence
band, and the small increase in the binding energy of
the Al 2p core levels relative to bulk Al, support the formation
of Si-Al alloys. As was the case for the Si-As mixed
phases in Fig. 1, we emphasize that the presence of Al impurities
incorporated at or below the percent level in a degenerate
Si crystal would not give rise to the large changes in
the valence band and core level emission observed when
comparing the bottom-most and the top-most spectra in Fig.
2. The comparison argues for the formation of a concentrated
alloy.

To follow the formation of the Si-related phases, and
gauge their composition, we monitored the attenuation of the
As 3d, Ga 3d, and Ga 3p core level emissions of the GaAs
substrate upon Si deposition under the As and Al flux, and

JVST  B  -  Microelectronics  and  Nanometer  Structures 




Fig. 3. As/(As+Si) ratio observed during deposition of Si under high As
flux at 300 °C onto GaAs(001)c(4×4) surfaces. The ratio was derived from
the integrated As 3d emission intensity (and Si 2p intensity), subtracting the
contribution from the GaAs substrate, so that it is related only to the As
atoms incorporated in the overlayer. While the ratio takes into account the
relative As 3d/Si 2p XPS cross sections, it would correspond to the
actual As atomic concentration only in the absence of any concentration
gradient in the overlayer.


the complementary increase of the Si 2p, Al 2p, or As 3d
core level emission as a function of coverage. We used the
integrated emission intensities after subtraction of a secondary
background multiplied by the tabulated Scofield
factors.

Figures 3 and 4 show results for Si deposition under As
flux at 300 and 400 °C, respectively. From the experimental
overall integrated As 3d emission intensity, the contribution
associated with the GaAs substrate was subtracted using the
experimental Ga 3d residual intensity and the known As 3d/Ga 3d
intensity ratio from the GaAs substrate. The remaining
As 3d intensity is assumed to be related to As atoms
incorporated in the overlayer. The experimental Si 2p intensity
and the measured Si 2p/As 3d cross sections were used
to derive an As/(As+Si) ratio without any attempt to correct


[Figure plot: horizontal axis, Si coverage (ML).]


Fig. 4. As/(As+Si) ratio observed during deposition of Si under high As
flux at 400 °C onto GaAs(001)c(4×4) surfaces. The ratio was derived from
the integrated As 3d emission intensity (and Si 2p intensity), subtracting the
contribution from the GaAs substrate, so that it is related only to the As
atoms incorporated in the overlayer. While the ratio takes into account the
relative As 3d/Si 2p XPS cross sections, it would correspond to the
actual As atomic concentration c_As only in the absence of any concentration
gradient in the overlayer.

Fig. 5. Al/(Al+Si) ratio observed during deposition of Si under high Al flux
at 300 °C onto GaAs(001)c(4×4) surfaces. The ratio was derived from the
experimental overall integrated Al 2p and Si 2p emission intensity, taking
into account the relative Al 2p/Si 2p XPS cross sections. However,
it would correspond to the actual Al atomic concentration c_Al only in the
absence of any concentration gradient in the overlayer.


for the actual spatial distribution of Si and As atoms within
the overlayer. Therefore this experimental ratio would correspond
to the actual As atomic concentration c_As only in the
absence of any concentration gradient in the overlayer.
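The substrate-subtraction procedure described above can be sketched in Python as follows. This is a minimal illustration, not the authors' analysis code: the function name, argument names, and all numerical inputs are hypothetical, and the sketch assumes that the substrate As 3d and Ga 3d signals are attenuated equally by the overlayer, so that their intensity ratio stays at the value measured on clean GaAs.

```python
def overlayer_fraction(i_as_total, i_ga_residual, substrate_ratio,
                       i_si, sf_as, sf_si):
    """Estimate the As/(As+Si) ratio of the overlayer.

    i_as_total      : integrated As 3d intensity (overlayer + substrate)
    i_ga_residual   : residual Ga 3d intensity from the buried substrate
    substrate_ratio : As 3d/Ga 3d intensity ratio measured on clean GaAs
    i_si            : integrated Si 2p intensity
    sf_as, sf_si    : relative cross-section (sensitivity) factors

    No correction is made for concentration gradients in the overlayer.
    """
    if sf_as <= 0 or sf_si <= 0:
        raise ValueError("sensitivity factors must be positive")
    # As 3d signal still coming from the GaAs substrate.
    i_as_substrate = substrate_ratio * i_ga_residual
    # Remaining As 3d signal, attributed to As in the overlayer.
    i_as_overlayer = max(i_as_total - i_as_substrate, 0.0)
    n_as = i_as_overlayer / sf_as
    n_si = i_si / sf_si
    total = n_as + n_si
    if total == 0:
        raise ValueError("no overlayer signal")
    return n_as / total


# Hypothetical intensities and factors, for illustration only.
example = overlayer_fraction(150.0, 50.0, 1.0, 50.0, 2.0, 1.0)
print(f"As/(As+Si) = {example:.2f}")
```

The same function applies to the Al case by passing the Al 2p intensity with a substrate ratio of zero, since the GaAs substrate contributes no Al signal.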

The As/(As+Si) ratio in Fig. 3 is seen to remain remarkably
constant with increasing coverage, at a value that would
correspond to c_As=0.4-0.5 in a homogeneous alloy. In Fig.
4 the ratio also remains relatively constant with increasing
deposition, but at a lower value, consistent with c_As~0.2.
The fact that the As/(As+Si) ratio in Figs. 3 and 4 remains
relatively constant throughout the 2-30 monolayer range
examined would be consistent with the formation of a relatively
homogeneous alloy, and therefore supports the
identification between the As/(As+Si) ratio and c_As. The
formation of an alloy with c_As=0.4-0.5 would also be
quantitatively consistent with the results of Fig. 1. Preliminary
SIMS depth profiling results in 110 ML thick samples grown
at 300 °C also show a fairly homogeneous As content in the
overlayer, with an estimated As concentration close to the
XPS-derived value. In view of the uncertainties
involved in the determination of the XPS and SIMS
elemental cross sections, the similarity between the SIMS
and the XPS-determined As concentration should be considered
very reassuring.

We caution the reader that the As equivalent beam pressures
employed during Si deposition in the present study and
in those of Refs. 8 and 9 (1-5×10“^ Torr) may be substantially
higher than those employed in Ref. 10. Chambers and
Loebs reportedly employed an As4 overpressure of
1-5×10“^ Torr during Si deposition, and showed by x-ray
photoelectron diffraction that the structure of the epilayer
was homologous to that of bulk Si. The resulting degenerate
Si overlayers were found to change the band bending of
GaAs, although no results for Al/Si/GaAs diodes were
reported.

Figures 5 and 6 show results for Si deposition under Al
flux at 300 and 400 °C, respectively. From the experimental
Al 2p and Si 2p integrated intensity, and the measured
Al 2p/Si 2p cross sections, we derived an Al/(Al+Si) ratio


Fig. 6. Al/(Al+Si) ratio observed during deposition of Si under high Al flux
at 400 °C onto GaAs(001)c(4×4) surfaces. The ratio was derived from the
experimental overall integrated Al 2p and Si 2p emission intensity, taking
into account the relative Al 2p/Si 2p XPS cross sections. However,
it would correspond to the actual Al atomic concentration c_Al only in the
absence of any concentration gradient in the overlayer.


without any correction for the actual spatial distribution of Si
and Al atoms within the overlayer. As in the previous case,
this ratio would correspond to the actual Al atomic concentration
only in the absence of any concentration gradient.

The Al/(Al+Si) ratio in Fig. 5 remains constant with increasing
coverage above a few monolayers, at a value that
would correspond to c_Al=0.7 in a homogeneous alloy. In
Fig. 6 the Al/(Al+Si) ratio shows a somewhat stronger variation
with coverage. A saturation value appears to be reached at
coverages above 5-6 ML, when the observed value of the
ratio would be consistent with c_Al=0.6-0.7 in a homogeneous
alloy. The formation of Si-Al alloys with c_Al=0.6-0.7
would also be quantitatively consistent with the results of
Fig. 2.

Unfortunately, the interpretation of SIMS depth profiling
results for Si-Al is hindered by substantial knock-in implantation
effects, which preclude a quantitative analysis of the
Al content in the alloy. We mention, however, that preliminary
selected area diffraction results by means of cross-sectional
TEM support an fcc (as opposed to diamond)
local crystalline structure for the overlayer, as expected for
an Al-rich metallic phase.

Binary equilibrium phase diagrams are only of limited
help in the interpretation of the above results. The Si-As
bulk equilibrium phase diagram contains at least two stoichiometric
compounds (SiAs and SiAs2) in addition to the
Si:As solid solution. The equilibrium solubility of As in
bulk Si at 300 and 400 °C is much smaller than the values of
c_As found in Figs. 3 and 4, but can be greatly enhanced
through nonequilibrium growth techniques. For example,
c_As~0.2 has been observed in Si as a result of ion implantation
and subsequent laser annealing. Nonequilibrium solubility
might, in principle, be consistent with the result of Fig.
4, while the higher value of c_As in Fig. 3 could be explained
by the presence of a mixed-phase interlayer, possibly involving
SiAs and SiAs2 compounds.

The Si-Al bulk equilibrium phase diagram, however,
does not contain compounds or solid solutions with c_Al large
enough to explain the results of Figs. 5 and 6. The maximum
equilibrium solubility of Al in Si is only 0.016 at. % at the
eutectic temperature (577 °C). The maximum solubility of
Si in Al is 1.5 at. % at the eutectic temperature, and it decreases
to 0.05 at. % at 300 °C. Even through nonequilibrium
techniques such as splat quenching from the liquid, only
values as high as 0.11 were observed. Prior to this study,
Al-Si alloys have not been reported to form metastable
intermetallics, unlike the case of Al-Ge alloys.

IV.  CONCLUSIONS 

Growth of Si layers on GaAs at 300 and 400 °C under an
As equivalent beam pressure of 1-5×10“^ Torr or an Al
equivalent beam pressure of 8-10×10“^ Torr gives rise to
the formation of mixed Si-V and Si-III alloys as opposed to
degenerate bulk Si. While RHEED indicates crystallinity of
the deposits up to coverages of 10-50 ML, depending on the
growth temperature, the atomic structure appears different
from that of Si, and group V and III incorporation in the
resulting mixed phases may exceed 50%. The trend is towards
reduced incorporation with increasing growth temperatures,
possibly reflecting a decrease in the sticking coefficient
of the group V and III species relative to Si with
increasing temperature, or a decrease in the bulk solubility
associated with the formation of phases which are closer to
thermodynamic equilibrium. In any case, for both 300 and
400 °C deposition, the resulting intermixed phases bear little
resemblance to degenerate bulk Si.

Since fabrication of Si interface layers under an excess
group V or group III elemental flux has been used to tune the
Schottky barrier height in Al/Si/GaAs(001) diodes, any
model which explains the Schottky barrier tunability with the
presence of a degenerate Si layer with bulk Si electronic
structure, band gap, doping characteristics, as well as equilibrium
values of the bulk, unstrained Si-GaAs heterojunction
band offsets should be reevaluated taking into account
the details of the growth conditions employed for Si deposition.

Using an ab initio pseudopotential approach, we have investigated the electronic structure of ideal
Al/GaAs(100) and Al/Ga1-xAlxAs(100) junctions, and the change of the corresponding Schottky
barrier height versus the alloy composition x and in the presence of ultrathin group-IV atom
interlayers. We find large changes in the Schottky barrier height which agree well with the
experimental data. In order to interpret the observed trends we have analyzed the charge density
response to chemical substitutions near the junction. This allowed us to extend to
metal/semiconductor interfaces a microscopic linear-response theory approach previously employed
to interpret band-offset trends at semiconductor heterojunctions. © 1996 American Vacuum
Society.


I.  INTRODUCTION 

Understanding the microscopic mechanisms which determine the Schottky barrier height in
metal/semiconductor junctions is a key issue in solid state physics and technology. It is by now
generally accepted that "interfacial" states such as defects or metal-induced gap states (MIGS)
play an important role in the barrier formation. The relative importance, however, of MIGS and
defect-related states in determining Schottky barrier trends and the barrier dependence on the
interface geometry and composition are issues which are still widely debated, especially in the
case of metal contacts to III-V semiconductors such as GaAs.

In recent years, there has been a considerable experimental effort targeted at engineering
Schottky barriers with the aim of improving the performance or modulating the characteristics of
metal/semiconductor devices. One way of engineering Schottky barriers is by altering the bulk
semiconductor material, e.g., by using alloys. Schottky barrier heights show, in general, a
stronger dependence on the semiconductor material than on the type of metal used for the contact.
This type of engineering with bulk perturbations in principle also modifies other electronic
parameters of the devices, such as the semiconductor electronic mobility or the band gap. A
different approach, which recently yielded encouraging results, is to control the barrier height
via artificial modifications of the interface. A number of experimental studies showed, in
particular, that the Schottky barrier height at metal/GaAs(100) contacts could be
increased/decreased by as much as 0.4 eV by growing thin Si interlayers under an excess
group-III/group-V atomic flux. The mechanism responsible for these important variations has not
yet been established. Various conflicting empirical models based on the macroscopic properties of
degenerate bulk Si have been proposed to explain different sets of experimental data. Very
recently, however, Cantile et al. have shown that the modified Schottky barrier in Al/GaAs(100)
junctions is already established for Si coverages less than 2 monolayers (ML), thus indicating
that the tuning has a more microscopic origin.

For semiconductor interfaces, a recent microscopic model based on linear-response theory, which
treats the interface as a perturbation relative to a reference virtual crystal, successfully
explained band-offset chemical trends at semiconductor heterojunctions. In addition, a microscopic
interface-dipole model explained band-offset variations induced by thin Si or Ge interlayers at
III-V/III-V heterojunctions. For metal/semiconductor junctions, the complexity of the structure
and the lack of precise information on the inhomogeneous screening near the interface hindered the
application of similar microscopic models to the analysis of Schottky barrier tuning. The strength
of the screening by the metal wave functions of perturbations introduced at or near the interface
is a key issue for metal/semiconductor contacts, and a deeper understanding of this screening on
the atomic scale is essential to develop models with truly predictive capabilities for Schottky
barrier engineering.

In this article we report a first-principles study of the modification of Al/GaAs Schottky
barriers with bulk semiconductor Ga$_{1-x}$Al$_x$As alloys and with ultrathin (0-2 ML thick) Si
interlayers. Our results for ideal epitaxial junctions show that the experimental alloy
composition and Si-doping trends can be fully explained without postulating defect-induced
Fermi-level pinning. We explain the observed trends based on an atomic scale analysis of the
charge density response to chemical substitutions near the interface. Our study builds upon the
linear-response theory work performed for semiconductor heterojunctions, and extends to
metal/semiconductor interfaces microscopic models used to interpret band-offset trends with bulk
and interfacial perturbations.

II.  COMPUTATIONAL  METHOD 

Our calculations were performed within the local-density approximation (LDA) to density functional
theory (DFT), using the pseudopotential plane-wave method. We used Troullier-Martins
pseudopotentials in the Kleinman-Bylander non-local form. The exchange-correlation functional by
Ceperley and Alder was employed. To simulate the isolated Al/GaAs(100) interfaces, we used
supercells consisting of 21 layers of GaAs and 7 layers of Al. The polar Al/GaAs(100) interface
offers two inequivalent configurations, with As-terminated or Ga-terminated GaAs(100), both of
which we consider in this study. The Al [100] direction was made parallel to the GaAs [100] axis,
and the Al fcc lattice was rotated 45° about the [100] axis relative to the GaAs cubic lattice in
order to satisfy epitaxial relations. The small compressive strain (1%) in the Al in-plane lattice
constant was accommodated by a tetragonal elongation (2%) of the Al overlayer, following
macroscopic elasticity theory. The interplanar distance at the interface was taken as the average
between the interplanar distances in the semiconductor and in the metal, consistent with the
result of total energy calculations.
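The epitaxial relation above can be checked with simple arithmetic. The sketch below is only illustrative: it assumes room-temperature experimental lattice constants and approximate Al elastic constants (none of these values are given in the text, and the calculations presumably used theoretical lattice parameters), and it uses the standard biaxial-strain relation for a cubic (001) film, $\epsilon_\perp = -2(C_{12}/C_{11})\,\epsilon_\parallel$.

```python
import math

# Assumed inputs (not from the text): experimental lattice constants in angstrom
# and approximate Al elastic constants in GPa.
A_GAAS = 5.653
A_AL = 4.050
C11_AL = 107.0
C12_AL = 61.0


def al_on_gaas_strain(a_gaas=A_GAAS, a_al=A_AL, c11=C11_AL, c12=C12_AL):
    """Return (in-plane strain, perpendicular strain) of Al rotated 45 deg on GaAs(100)."""
    # After the 45 degree rotation the Al square mesh has to match a_GaAs / sqrt(2).
    a_match = a_gaas / math.sqrt(2.0)
    eps_par = (a_match - a_al) / a_al        # negative value: compressive strain
    # Biaxially strained cubic (001) film, macroscopic elasticity theory.
    eps_perp = -2.0 * (c12 / c11) * eps_par  # positive value: tetragonal elongation
    return eps_par, eps_perp


eps_par, eps_perp = al_on_gaas_strain()
print(f"in-plane strain:      {100 * eps_par:.1f}%")
print(f"perpendicular strain: {100 * eps_perp:.1f}%")
```

With these assumed inputs the in-plane mismatch is compressive at the 1% level and the elongation lies between 1% and 2%, the same order as the values quoted above.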

We simulated Al/Ga$_{1-x}$Al$_x$As(100) interfaces within the virtual crystal approximation,
replacing Ga pseudo-ions with $\langle$Ga$_{1-x}$Al$_x\rangle$ pseudo-ions. Si interlayers were
introduced at the Al/GaAs(100) junctions by replacing with Si an equal number of Ga and As atoms
in the two planes closest to the metal. We thus assumed fully self-compensated Si dopants
distributed over two layers, and simulated the doping for coverage 0<x<2 ML with the virtual
crystal approximation using $\langle$Ga$_{1-x/2}$Si$_{x/2}\rangle$ and
$\langle$As$_{1-x/2}$Si$_{x/2}\rangle$ pseudo-ions.

The supercell calculations were performed with a plane-wave kinetic energy cut-off of 16 Ry. For
the Brillouin zone integrations we used a (6, 6, 2) Monkhorst-Pack grid, corresponding to 6
special k points, and employed the Gaussian broadening scheme with a full width at half maximum of
~0.1 eV. Convergence tests showed that with the above parameters the numerical accuracy on the
barrier height is ~20 meV. Atomic relaxation at the Al/GaAs(100) interfaces was found to have a
relatively small effect (~20 meV) on the Schottky barrier height, and was neglected in the present
study.

To calculate the Schottky barrier height $\phi_p$, we separated the band discontinuity into two
contributions:

$$\phi_p = \Delta E_p + \Delta V. \qquad (1)$$

The band-structure term $\Delta E_p$ is the difference between the Fermi energy of the metal,
$E_F$, and the valence-band edge of the semiconductor, $E_v$, each measured with respect to the
average electrostatic potential of the corresponding crystal. This term does not depend on the
interface, and was determined from standard bulk band-structure calculations for each of the two
materials forming the junction. The second term $\Delta V$ is the electrostatic potential lineup
across the interface. This term contains all interface-specific features. For Schottky barrier
tuning via interfacial perturbations, the change in the potential lineup is the only contribution
to the barrier modification. With bulk-semiconductor perturbations such as alloys, instead, both
$\Delta E_p$ and $\Delta V$ contribute to the barrier change. The potential lineup $\Delta V$ was
derived via


Fig. 1. Macroscopic average of the electrostatic potential in the Al/GaAs(100) As-terminated
junction. The potential lineup $\Delta V$ and the bulk band energies determining the Schottky
barrier are also indicated.


Poisson's equation from the self-consistent supercell charge density using the macroscopic
average technique of Ref. 16. In Fig. 1 we show the macroscopic average of the electrostatic
potential $V$ and the potential lineup $\Delta V$ across the Al/GaAs(100) As-terminated junction.
The energy diagram for the calculation of $\phi_p$ based on Eq. (1) is also shown.

III.  RESULTS 

A. Unperturbed Al/GaAs(100) interfaces

The calculated LDA p-type Schottky barriers including the GaAs spin-orbit splitting are 0.56 eV
for the As-terminated (I) and 0.45 eV for the Ga-terminated (II) Al/GaAs(100) interfaces. These
values are about 0.1 eV smaller than the values obtained by Dandrea and Duke in their ab initio
study of the Al/GaAs(100) junctions. We attribute such differences to the use of different
pseudopotentials together with somewhat different atomic geometries.

Fig. 2. Planar average of the charge density in the Al/GaAs(100) As-terminated junction (solid
line). Comparison with the planar-averaged charge densities of the two bulk materials (dotted
line) forming the junction (a). Contribution of the MIGS to the total charge density (b); the
dotted line indicates the contribution of the states with energies below the GaAs valence band
edge. The lower panels show the macroscopic average of the difference between the interface and
bulk charges (a), which somewhat depends on the arbitrary choice of the interface, and the charge
density contribution of the MIGS (b).

The potential-lineup contribution to $\phi_p$ in Eq. (1) is a ground-state property of the
junction depending only on its charge density. This term is therefore accurately determined, in
principle, within DFT. The LDA band-structure term $\Delta E_p$, instead, should be corrected
using quasi-particle energies. The quasi-particle or many-body corrections to the bulk-band
structures of Al and GaAs have been evaluated in Ref. 32. Corrections of -0.14 eV and -0.36 eV
were reported for the Al Fermi level $E_F$ and for the GaAs valence-band edge $E_v$,
respectively. Using these values to correct the LDA band-structure term $\Delta E_p$, we obtain
$\phi_p^{(I)} = 0.78$ eV and $\phi_p^{(II)} = 0.67$ eV (i.e., $\phi_n^{(I)} = 0.64$ eV and
$\phi_n^{(II)} = 0.75$ eV, using the GaAs room-temperature band-gap value of 1.42 eV). The
estimated uncertainty on the absolute values of the barriers including many-body corrections is
about 0.1 eV.
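The arithmetic behind the corrected barriers can be reproduced directly from the numbers quoted above. The following is a minimal sketch; the function and variable names are illustrative and not part of the original work.

```python
GAAS_GAP_EV = 1.42  # room-temperature GaAs band gap quoted in the text

# LDA p-type barriers (eV): As-terminated (I) and Ga-terminated (II) interfaces.
PHI_P_LDA = {"I": 0.56, "II": 0.45}

# Quasi-particle corrections (eV) quoted from Ref. 32.
DELTA_E_FERMI_AL = -0.14   # shift of the Al Fermi level
DELTA_E_VBM_GAAS = -0.36   # shift of the GaAs valence-band edge


def corrected_barriers(phi_p_lda):
    """Correct phi_p with bulk quasi-particle shifts and derive phi_n = E_gap - phi_p."""
    # The band-structure term is E_F(metal) - E_v(semiconductor), so its correction
    # is the difference of the two bulk shifts.
    shift = DELTA_E_FERMI_AL - DELTA_E_VBM_GAAS
    result = {}
    for label, phi_p in phi_p_lda.items():
        phi_p_qp = phi_p + shift
        result[label] = (round(phi_p_qp, 2), round(GAAS_GAP_EV - phi_p_qp, 2))
    return result


barriers = corrected_barriers(PHI_P_LDA)
for label, (phi_p, phi_n) in barriers.items():
    print(f"type {label}: phi_p = {phi_p:.2f} eV, phi_n = {phi_n:.2f} eV")
```

The net correction to the band term is +0.22 eV, which takes the LDA values 0.56 eV and 0.45 eV to the quoted 0.78 eV and 0.67 eV.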

The higher $\phi_p$ for the As-terminated interface may be understood in terms of the higher work
function of the As- relative to the Ga-terminated GaAs(100) surface. The results for the two
interfaces compare well with the measured values of the n-type barriers in epitaxial Al/GaAs(100)
junctions with As- and Ga-rich surfaces: 0.72 eV and 0.83 eV in Ref. 33 or 0.66 eV and 0.72 eV in
Ref. 34, given the theoretical and experimental uncertainties.

In this work, we will focus on two basic quantities to interpret the Schottky barrier trends with
bulk alloy and interlayer perturbations, namely the charge density and the MIGS. Some of their
characteristics are illustrated in Fig. 2 for the As-terminated Al/GaAs(100) interface. In
Fig. 2(a), we compare the planar average of the charge density in the junction to the
planar-averaged charge densities of the bulk GaAs and Al crystals. Inspection of the figure
indicates that the charge density in the junction reproduces the bulk semiconductor charge density
almost up to the last semiconductor plane. This is an important feature which will allow us to
explain (section III C) the "bulk" character of the barrier trend with the semiconductor alloy
composition, when the alloying does not affect the chemical composition of the last semiconductor
plane. The microscopic bulk behavior of the charge density is related to the local-charge
neutrality emphasized by Tersoff in his model of the charge-neutrality level. In Fig. 2(b), we
show the contribution of the MIGS to the charge density. These states decay within the
semiconductor with a decay length of about 3 Å. The contribution of these states is important to
re-establish the bulk semiconductor charge density [see Fig. 2(b)]. The MIGS-related screening
will be an important issue in the study of the barrier modification with heterovalent interlayers
(section III E).

B.  Semiconductor-composition  dependence 

In Fig. 3, we present the composition dependence of the Al/Ga$_{1-x}$Al$_x$As(100) Schottky
barrier for the two interfaces with cation and anion Ga$_{1-x}$Al$_x$As(100) termination. The
many-body correction on $\phi_p$(Al/AlAs) was included using the quasi-particle correction on
$\phi_p$(Al/GaAs) of Ref. 32 and the difference in quasi-particle band-edge energies for GaAs and
AlAs given in Ref. 35, i.e., ~0.1 eV. A linear interpolation was used for 0<x<1. The theoretical
barriers in Fig. 3 are compared to the experimental current/voltage I(V), capacitance/voltage
C(V), and internal photoemission (IPE) data from Refs. 9 and 10. Taking into account the estimated
theoretical uncertainty of ~0.1 eV on $\phi_p$ and the unknown structural details of the
experimental interfaces (with possibly cation- and anion-terminated domains), good general
agreement is found between theoretical and experimental trends.

Fig. 3. Composition dependence of the Al/Ga$_{1-x}$Al$_x$As(100) Schottky barrier height for the
anion- and cation-terminated Ga$_{1-x}$Al$_x$As(100) surfaces (horizontal axis: composition x,
from GaAs to AlAs). The symbols show the experimental data from Ref. 9 [0 I(V)] and Ref. 10
[△ I(V), ○ C(V), □ IPE].

In our calculations, the transitivity rule between the Al/GaAs, Al/AlAs, and GaAs/AlAs band
discontinuities is satisfied within 0.03 eV with the type I metal/semiconductor interfaces (As
termination), whereas a deviation of -0.21 eV is observed for the type II interfaces (Ga or Al
plane closest to the metal). Experimentally the transitivity is approximately verified: the
difference $\phi_p$(AlAs) $-$ $\phi_p$(GaAs) ranges from 0.45 to 0.6 eV, and the valence-band
offset VBO(GaAs/AlAs) from 0.45 to 0.55 eV.


Fig. 4. Charge (solid line) and potential (dashed line) profiles induced by a single Al or Ga
plane substitution in the Al$_{1/2}$Ga$_{1/2}$As(100) semiconductor homojunction (top panels) and
within the semiconductor near the interface in the Al/Al$_{1/2}$Ga$_{1/2}$As(100) heterojunction
(bottom panels).


C.  Interpretation  based  on  linear-response  theory 

In the case of semiconductor heterojunctions, the linear-response theory model showed that the
band offset in isovalent junctions such as GaAs/AlAs is determined only by the bulk properties of
the two semiconductor constituents. We will show here that the composition dependence of the
Al/GaAlAs Schottky barriers, the transitivity of the Al/GaAs, Al/AlAs, and GaAs/AlAs band
discontinuities for the type I interfaces, and the ~0.2 eV deviation from the transitivity rule
for the type II interfaces can be explained using a similar approach focusing on charge density
response.

Since band terms are bulk properties, our main focus is the potential-lineup term. Within the
linear-response approach, the GaAs/AlAs interface is considered as a perturbation with respect to
the virtual Ga$_{1/2}$Al$_{1/2}$As crystal, and $\Delta V$ derives from the charge density
$\Delta\rho_\pm(z)$ induced by a single cation-plane substitution
$v_c = \pm\tfrac{1}{2}(v_{\rm Al} - v_{\rm Ga})$, changing the virtual ions into real Al (+) or
Ga (-) ions, through

$$\Delta V = -4\pi e^2 (\Delta_+ - \Delta_-)/L, \qquad (2)$$

where $\Delta_\pm$ is the second moment of the charge,

$$\Delta_\pm = \tfrac{1}{2}\int z^2\,\Delta\rho_\pm(z)\,dz, \qquad (3)$$

and $L$ the distance between consecutive cation planes. The charge profiles $\Delta\rho_\pm(z)$
and the corresponding potential profiles, which determine the GaAs/AlAs potential lineup
$\Delta V$, are illustrated in Fig. 4 (upper panels). The $\Delta\rho_\pm(z)$ reflect the bond
polarization associated with the higher electronegativity of Ga relative to Al.
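The link between a second moment of the induced charge and a potential step can be illustrated numerically. The sketch below is not the authors' calculation: it uses a hypothetical model profile (the second derivative of a unit Gaussian, which like an isovalent substitution carries no net charge and no dipole), works in Gaussian units with unit charge so that $\phi'' = -4\pi\rho$, and checks that the integral of the resulting potential equals $-2\pi\int z^2\rho(z)\,dz$.

```python
import math


def integrated_potential_vs_second_moment(n=801, zmax=8.0):
    """Compare the integral of phi(z) with -2*pi*M2 for a model charge profile.

    Gaussian units with unit charge are assumed, so that phi'' = -4*pi*rho.
    """
    dz = 2.0 * zmax / (n - 1)
    z = [-zmax + i * dz for i in range(n)]
    norm = 1.0 / math.sqrt(2.0 * math.pi)
    # Hypothetical model profile: second derivative of a unit Gaussian,
    # with zero total charge and zero dipole moment.
    rho = [(zi * zi - 1.0) * norm * math.exp(-0.5 * zi * zi) for zi in z]
    # One-dimensional Poisson solution via the Green's function -2*pi*|z - z'|.
    phi = [-2.0 * math.pi * dz * sum(abs(zi - zj) * rj for zj, rj in zip(z, rho))
           for zi in z]
    integral_phi = dz * sum(phi)
    second_moment = dz * sum(zi * zi * ri for zi, ri in zip(z, rho))
    return integral_phi, -2.0 * math.pi * second_moment


integral_phi, moment_rule = integrated_potential_vs_second_moment()
print(f"integral of phi: {integral_phi:.4f}")
print(f"-2*pi*M2:        {moment_rule:.4f}")
```

Dividing the integrated potential by the plane spacing $L$ gives the average step produced by a semi-infinite stack of such planes, which is why the lineup depends only on the second moment of the single-plane response.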

Similarly, starting from a reference Al/Ga$_{1/2}$Al$_{1/2}$As system, one may build the Al/GaAs
or Al/AlAs interfaces by introducing cation-plane substitutions $v_c$ in the semi-infinite
semiconductor region. An ideal linear superposition of semiconductor charge responses
$\Delta\rho$ (Fig. 4, upper panel) in the semi-infinite semiconductor region would induce a
potential lineup $\Delta V = -4\pi e^2\Delta/L$ across the interface, which is a bulk quantity and
satisfies the transitivity rule. In our calculations these bulk terms yield (the quadratic
responses cancel out): $\Delta V_{\rm bulk}$(Al/AlAs) $-$ $\Delta V_{\rm bulk}$(Al/GaAs) = 0.41 eV
= $\Delta V$(AlAs/GaAs).

The response of the reference Al/Ga$_{1/2}$Al$_{1/2}$As system to a substitution of the cation
plane closest to the metal, $v_c(1)$, is presented for the type I junction in Fig. 4 (lower
panel). The


Table I. Dipoles in eV induced by Al (+) and Ga (-) single-plane substitutions in
Al/Ga$_{1/2}$Al$_{1/2}$As junctions for the first three semiconductor cation planes from the metal
(the dipoles are negligible beyond the third plane). The index I (II) refers to the anion-
(cation-) terminated interface. $\Delta V_{\rm corr}$ is the deviation from the transitivity rule
for the band discontinuities $\phi_p$(Al/GaAs), $\phi_p$(Al/AlAs), and VBO(GaAs/AlAs) as obtained
from the ab initio calculations.


Interface   d_1^+   d_2^+   d_3^+   d_1^-   d_2^-   d_3^-   Sum_i(d_i^+ - d_i^-)   ΔV_corr
I           +0.01   -0.03   +0.00   -0.03   +0.02   +0.01   -0.02                  -0.03
II          -0.06   -0.03   -0.02   +0.04   +0.01   +0.01   -0.17                  -0.21

Fig. 6. Schottky barrier height of Al/Si/GaAs(100) diodes as a function of Si coverage. The solid
circles are the results of the self-consistent calculations, the open triangles correspond to the
experimental data of Ref. 15 for Si grown under As (△) and Al (▽) fluxes. The solid lines give the
predictions of our model [see text, Eq. (5)]; the dashed lines indicate the model predictions at
low coverage in the linear regime.

Fig. 5. Planar averages of the ionic (dotted lines) and electronic (dashed lines) charge densities
and electrostatic potentials induced by a Si bilayer at 1 ML coverage. The solid line represents
the total potential scaled by a factor of 10. The bare ionic, $\Delta V_b$, and screened,
$\Delta V$, discontinuities across the Si interlayer are also indicated.


presence of the metal can be seen to induce an asymmetry in the response $\Delta\rho_1(z)$
responsible for a dipole $d_1$. The sign of the dipole in Fig. 4 may be understood in terms of the
higher electronegativity of Ga relative to Al. When the virtual crystal plane is replaced by a Ga
(Al) plane the increase (decrease) in electronegativity produces an electronic charge transfer
from (to) the metal beyond the second neighboring plane through the MIGS, which accounts for the
sign of the observed dipole $d_1$. To linear order in the planar perturbations $v_c(i)$, the
deviation from the transitivity rule is given by $\sum_i (d_i^+ - d_i^-)$, where $d_i^\pm$ are the
dipoles induced by the asymmetric $\Delta\rho_i^\pm(z)$ for substitutions $v_c(i)$ in planes close
to the interface. These dipoles $d_i^\pm$ are given in Table I. The sum $\sum_i (d_i^+ - d_i^-)$
yields -0.02 eV and -0.17 eV for the type I and II interfaces, respectively, and accounts for most
of the total deviation $\Delta V_{\rm corr}$ from the transitivity rule (-0.03 eV and -0.21 eV).
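The sums quoted above can be re-derived from the Table I entries. In the sketch below the assignment of the six dipole values per interface to Al (+) and Ga (-) substitutions is an assumed reading of the table layout (first three values for Al, last three for Ga); it is the grouping that reproduces the quoted totals.

```python
# Dipoles (eV) for the first three cation planes from the metal, from Table I.
# The split into "plus" (Al) and "minus" (Ga) columns is an assumed reading.
DIPOLES = {
    "I": {"plus": [0.01, -0.03, 0.00], "minus": [-0.03, 0.02, 0.01]},
    "II": {"plus": [-0.06, -0.03, -0.02], "minus": [0.04, 0.01, 0.01]},
}

# Full ab initio deviation from the transitivity rule (eV), from Table I.
DV_CORR = {"I": -0.03, "II": -0.21}


def transitivity_deviation(dipoles):
    """Linear-order deviation from transitivity: sum over planes of d_i^+ - d_i^-."""
    total = sum(p - m for p, m in zip(dipoles["plus"], dipoles["minus"]))
    return round(total, 2)


deviations = {label: transitivity_deviation(d) for label, d in DIPOLES.items()}
for label, value in deviations.items():
    print(f"type {label}: linear-order sum = {value:+.2f} eV, "
          f"ab initio deviation = {DV_CORR[label]:+.2f} eV")
```

The remaining difference between the linear-order sum and the ab initio deviation (0.01 eV for type I, 0.04 eV for type II) is what the text attributes to contributions beyond those listed.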

D.  Interfacial  perturbation:  Heterovalent  interlayers 

In Fig. 5 we show the ionic and electronic charge densities and electrostatic potentials induced
by a Si interlayer (1 ML coverage) on the As-terminated Al/GaAs junction. Si doping of adjacent
cation and anion planes in GaAs may be viewed as a proton transfer between substituted As and Ga
atoms (indicated by the dashed arrows in Fig. 5). For an arbitrary coverage in the range
$0 \le x \le 2$ the bare dipole layer associated with these point charges produces a potential
discontinuity $\Delta V_b = \pm(\pi e^2/a)x$, where $a$ is the GaAs lattice constant, which
increases (decreases) the barrier at the As- (Ga-) terminated interface. At 1 ML coverage we have
$|\Delta V_b| = 8.15$ eV. The electronic contribution, obtained from the self-consistent
calculations (see Fig. 5), reduces the discontinuity to $\Delta V = 0.22$ eV for the As-terminated
case and $\Delta V = -0.20$ eV for the Ga-terminated case. This corresponds to an effective
screening at the junctions $\epsilon_{\rm eff} = \Delta V_b/\Delta V \approx 40$ for 1 Si ML.
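The bare dipole step and the resulting effective screening follow from a one-line formula. The sketch below assumes Gaussian units with $e^2 \approx 14.40$ eV Å and a lattice constant of 5.55 Å; the latter is not stated in the text and is chosen here because it reproduces the quoted 8.15 eV, which suggests a theoretical rather than experimental value was used.

```python
import math

E2_EV_ANGSTROM = 14.3996  # e^2 in Gaussian units, eV * angstrom
A_GAAS_ASSUMED = 5.55     # angstrom; assumed (theoretical) GaAs lattice constant


def bare_dipole_step(x, a=A_GAAS_ASSUMED):
    """Magnitude of the unscreened step |dV_b| = (pi e^2 / a) x, in eV, for coverage x in ML."""
    return math.pi * E2_EV_ANGSTROM * x / a


def effective_screening(x, screened_step_ev, a=A_GAAS_ASSUMED):
    """Ratio of bare to screened potential step, eps_eff = |dV_b| / |dV|."""
    return bare_dipole_step(x, a) / abs(screened_step_ev)


bare = bare_dipole_step(1.0)
eps_as = effective_screening(1.0, 0.22)    # As-terminated, dV = +0.22 eV
eps_ga = effective_screening(1.0, -0.20)   # Ga-terminated, dV = -0.20 eV
print(f"bare step at 1 ML: {bare:.2f} eV")
print(f"eps_eff (As-terminated): {eps_as:.0f}")
print(f"eps_eff (Ga-terminated): {eps_ga:.0f}")
```

Both terminations give a ratio in the high 30s to low 40s, consistent with the value of about 40 quoted for 1 ML.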

In Fig. 6 we present our results for the Schottky barrier height of the two As- and Ga-terminated
interfaces as a function of Si coverage together with the measured Al/Si/GaAs Schottky barriers
for Si grown under excess As or Al flux. The difference between the barrier heights of the As- and
Ga-terminated interfaces increases with Si coverage, and becomes ~0.8 eV for x = 2 ML, in
excellent agreement with the experimental data. The agreement between theory and experiment in
Fig. 6 supports the contention by Cantile et al. that an excess anion (cation) flux may tend to
favor a cation-anion (anion-cation) site sequence for the Si layers. This interpretation is also
consistent with expected trends from free energy considerations for the As- and Ga-terminated
GaAs(100) surfaces at high As (Ga) chemical potential, i.e., in the presence of excess As (Ga).

The effective screening $\epsilon_{\rm eff}$ changes from about 30 at low Si coverages to about 50
at 2 ML coverage. This large variation in $\epsilon_{\rm eff}$ accounts for the non-linear
coverage dependence of $\phi_p$ in Fig. 6. The screening of the dipole created by Si interlayers
analogously grown at GaAs/GaAs homojunctions has been explained in terms of the GaAs dielectric
constant (at low coverages) and of the Si dielectric constant (at ~2 ML coverage). Here
$\epsilon_{\rm eff}$ is 3 to 5 times larger than the GaAs and Si dielectric constants. There is
therefore a drastic enhancement of the screening related to the presence of the metal.

Fig. 7. Surface density of states at the Fermi energy without Si and with 2 ML of Si (see text).
The solid line (dashed line) corresponds to the As(Ga)-terminated interface. Inset: local density
of states within the semiconductor band gap. The curves correspond to supercell regions indicated
by the labels in the upper part of the figure.


E.  Model  for  the  dipole  screening 

At low Si coverage, the additional polarization related to the presence of the metal can be
described within a linear-response formulation by

$$\Delta V_b = \epsilon_{\rm eff}\,\Delta V = (1 + 4\pi\chi_{sc} + 4\pi\chi_m)\,\Delta V, \qquad (4)$$

where $\chi_{sc} = (\epsilon_\infty^{\rm GaAs} - 1)/4\pi$ is the GaAs electronic dielectric
susceptibility, and $\chi_m$ accounts for the additional polarization due to the metal on the
other side of the junction.

The additional susceptibility $\chi_m$ can be explained in terms of the metal-induced gap states
(MIGS) at the Fermi energy. The central quantities are the local density of states (LDOS) $N(z)$
and the related surface density of states at the Fermi energy
$D_s(z) = \int_{-\infty}^{z} N(z')\,dz'$, where $-\infty$ indicates a position well inside the
semiconductor (where the MIGS vanish) and $z$ is the dipole position. Based on simple
electrostatic arguments an estimate for the MIGS contribution to the potential lineup [Eq. (4)]
induced by a small change $\Delta v$ in the Fermi-level position is
$\sim 4\pi e^2 D_s(z)\,\Delta v\,\delta$, where $\delta$ is a distance of the order of the
exponential decay length $\lambda$ of the MIGS at the Fermi energy. The effective susceptibility
describing the MIGS response to a change in potential lineup is thus $\chi_m = e^2 D_s(z)\delta$.

Fig. 8. Schottky barrier (filled squares, left scale) and effective screening (filled diamonds,
right scale) as a function of the Si bilayer position within the semiconductor. The symbols give
the results of self-consistent calculations for a Si coverage x = 0.5. Their size indicates the
estimated accuracy of the numerical results. The solid lines correspond to the predictions of
Eq. (5) in the linear-coverage regime.

In Fig. 7 we present the surface density of states $D_s(z)$ for the As- and Ga-terminated
interfaces without Si and with 2 Si ML. In the inset we display the LDOS in the gap region for the
As-terminated interface. The figure shows that the $D_s(z)$ for the two terminations are very
similar, and both decay exponentially in the semiconductor with a decay length $\lambda$. At the
position of the Si interlayer ($z = 0$) we find $D_s(0) \approx 0.027$ Å$^{-2}$ eV$^{-1}$. To
model the screening effect we will use $\delta = 3.2$ Å. These values are consistent with the
results of the earlier calculations by Louie, Chelikowsky, and Cohen and Tersoff. At low Si
coverage (in the linear regime) the effective screening is
$\epsilon_{\rm eff} = \epsilon_\infty^{\rm GaAs} + 4\pi e^2 D_s(0)\delta \approx 28$,
corresponding to the dashed lines in Fig. 6, in perfect agreement with experiment and with the
first-principles results.
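The size of the metal-induced term in the low-coverage screening can be evaluated from the two parameters quoted above. This is a minimal sketch assuming Gaussian units with $e^2 \approx 14.40$ eV Å; the GaAs electronic dielectric constant is not quoted in the text, so the code only reports the value implied by the stated total of about 28 rather than assuming one.

```python
import math

E2_EV_ANGSTROM = 14.3996  # e^2 in Gaussian units, eV * angstrom


def migs_screening_term(d_s, delta):
    """Metal-induced contribution 4*pi*e^2*D_s*delta to the effective screening.

    d_s   : surface density of states at the Fermi energy, in 1/(angstrom^2 eV)
    delta : effective screening distance, in angstrom
    """
    return 4.0 * math.pi * E2_EV_ANGSTROM * d_s * delta


migs_term = migs_screening_term(d_s=0.027, delta=3.2)
implied_eps_inf = 28.0 - migs_term  # dielectric constant implied by eps_eff of about 28
print(f"MIGS term: {migs_term:.1f}")
print(f"implied GaAs electronic dielectric constant: {implied_eps_inf:.1f}")
```

The MIGS term alone is larger than the bulk dielectric constant it is added to, which is the quantitative content of the statement that the metal drastically enhances the screening.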

At high Si coverage $x$, Eq. (4) can still be used to describe the macroscopic screening of the Si
bilayer in the GaAs/GaAs homojunction ($\chi_m = 0$) provided the gradual change in the host
susceptibility $\chi_{sc}$ induced by the Si is taken into account (microscopic capacitor model):
$\chi_{sc}(x) = \chi_{sc}^{\rm GaAs} + (x/2)\,[\chi_{sc}^{\rm Si} - \chi_{sc}^{\rm GaAs}]$. Along
the same line, we should also include the change in $\chi_m$ induced by the modification of the
MIGS with Si doping:
$\chi_m(x) = e^2 D_s(z)\delta + e^2 (x/2)\,[D_s'(z) - D_s(z)]\,\delta$, where $D_s'(z)$ is
the  surface  density  of  states  at  the  Fermi  energy  in  presence 
of  2  ML  of  Si  (see  Fig.  7).  The  effective  screening  for  a 
dipole  layer  at  position  z  and  for  a  coverage  0^x^2  is  thus 

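$$\epsilon_{\rm eff}(x,z) = \epsilon_\infty^{\rm GaAs} + 4\pi e^2 D_s(z)\delta + \frac{x}{2}\left\{\epsilon_\infty^{\rm Si} - \epsilon_\infty^{\rm GaAs} + 4\pi e^2\,[D_s'(z) - D_s(z)]\,\delta\right\}. \qquad (5)$$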

The perturbed surface density of states at the position of the Si interlayer (Fig. 7) is
$D_s'(0) \approx 0.050$ Å$^{-2}$ eV$^{-1}$. Using this value in Eq. (5) we obtain the coverage
dependence indicated by the solid line in Fig. 6, in excellent agreement with the results of the
self-consistent calculations. We would like to point out that most of the non-linearity in
$\phi_p(x)$ is due to the large difference between $D_s'(0)$ and $D_s(0)$ in Eq. (5); the
difference between $\epsilon_\infty^{\rm Si}$ and $\epsilon_\infty^{\rm GaAs}$ plays only a minor
role. Finally, based on Eq. (5), the symmetry in the barrier variations of the two interfaces in
Fig. 6 can be understood from the similarity of their $D_s(0)$ and $D_s'(0)$ values (see Fig. 7).

To probe the correlation between the local-dipole screening and the spatial decay of MIGS, which
is explicit in Eq. (5), we moved the Si dipole layer away from the interface to a position z<0
into the semiconductor. Given the exponential decay of $D_s(z)$ in the semiconductor, we expect
$\epsilon_{\rm eff}(x = 0, z)$ to exponentially converge to $\epsilon_\infty^{\rm GaAs}$ with a
similar decay length. We investigated the change in the screening as a function of the position of
the Si dipole layer for a coverage x = 0.5 (we selected this coverage to minimize the numerical
uncertainty on $\epsilon_{\rm eff}$ within the linear regime in Eq. (5)). The results for
$\epsilon_{\rm eff}(z)$ obtained with first-principles calculations are displayed in Fig. 8
together with the corresponding Schottky barrier values. The exponential decay length of
$\epsilon_{\rm eff}(z)$ towards $\epsilon_\infty^{\rm GaAs}$ in Fig. 8 is similar to the decay
length of $D_s(z)$ in Fig. 7. The solid line indicates the position dependence predicted by
Eq. (5) in the linear coverage regime. The very good agreement, in Fig. 8, between the predictions
of the model and the results of the ab initio calculations further confirms the soundness of the
picture proposed here to explain the screening of local interface dipoles at metal/semiconductor
junctions.

IV.  CONCLUSION 

We have studied the change of Al/GaAs and Al/GaAlAs
Schottky barriers versus alloy composition and in the presence
of ultrathin (0-2 ML) Si interlayers using first-principles
calculations. We presented results for ideal, defect-free
Al/GaAlAs interfaces showing that the experimental
composition dependence of the barriers can be explained
without postulating defect-induced Fermi-level pinning. We
also showed that microscopic interface dipoles induced by
the heterovalent nature of Si interlayers quantitatively explain
the experimental barrier tuning at Al/Si/GaAs(100)
junctions.

These results were interpreted within a linear-response
theory approach focusing on the charge density response to
chemical substitutions near the junction. This model approach
quantitatively explained the trends with alloy composition
obtained from the ab initio calculations. We also investigated
the screening of the local dipoles induced by
interlayers on the atomic scale, and presented a model explaining
the effective screening in terms of simple physical
parameters such as the semiconductor dielectric constant and
the local interface density of states at the Fermi energy.

Recent advances in wet-chemical surface treatment of silicon have resulted in unreconstructed,
hydrogen terminated Si(111) surfaces with an extremely low surface recombination velocity of only
0.25 cm/s [E. Yablonovitch et al., Phys. Rev. Lett. 57, 249 (1986)]. This points towards a surface
free of the surface states that normally pin the Fermi level E_F on Si(111) somewhere near midgap,
depending on the surface reconstruction. This expectation is not borne out, however, by experiment.
Using Si 2p core level spectra, we consistently find a surface Fermi level position near midgap for
16 heavily doped n- and p-type samples. Checks for surface photovoltage and ion concentrations
sufficient to induce the observed band bending were negative. All samples attain their bulk Fermi
level position at the surface after annealing to about 400 °C, a temperature at which the surface
hydrogen coverage is unchanged as monitored by ultraviolet excited photoemission. Above 400 °C,
hydrogen starts to desorb from the surface and only with the onset of the 7×7 reconstruction does
E_F move back towards midgap. We suggest that the initial position of E_F near midgap is due to a
hydrogen induced passivation of dopants that leaves an intrinsic surface layer. Annealing at
~400 °C reactivates the dopants by breaking the B-H and P-H complexes, as is known from
plasma passivation. The necessary passivation depth has been calculated as a function of bulk
doping, yielding values of the order of one micrometer. © 1996 American Vacuum Society.


I.  INTRODUCTION 

Advances in wet-chemical etching processes of silicon
have resulted in atomically flat, unreconstructed Si(111) surfaces
of unprecedented quality. The IR spectra of these surfaces
taken in the attenuated total reflection (ATR) mode
show a single sharp Si-H stretching mode at 2084 cm^-1
with negligible contributions from Si-H2 and Si-H3
configurations. Recent high resolution photoemission measurements
show Si 2p core line spectra that exhibit previously
unattained linewidths below 130 meV
and a sharp Si-H bonding state about 5.1 eV below the
valence band maximum. All these observations point towards
an ideally Si-H terminated, unreconstructed Si(111)
surface with a minimum of steps and imperfections. As a
consequence, one would expect a surface essentially free of
surface states in the fundamental gap and thus, in contrast
to the hydrogen-free, reconstructed Si(111) surfaces (7×7 or
2×1), an unpinned Fermi level position and a vanishingly
small surface recombination velocity. The latter is indeed
borne out by the measurements of Yablonovitch et al., who
determine a surface recombination velocity of only 0.25
cm/s, which is even lower than that of the best Si-SiO2
interfaces reported so far and corresponds to a density of
surface states of only 2.5×10^7 cm^-2. Here we present data
on p- and n-doped hydrogen passivated Si(111) samples that
exhibit, contrary to expectation, a surface Fermi level position
close to midgap, corresponding to near-surface depletion
layers in all cases.

II.  EXPERIMENT 

The surface Fermi level position was determined through
the binding energies of the Si 2p core level as measured in
x-ray excited photoelectron spectroscopy (XPS). Strictly
speaking, this method determines only differences in Fermi
level position; these can, however, be converted
into absolute Fermi level positions, as will be shown below.

We investigated three kinds of Czochralski-grown Si
samples: P and Sb doped n+ samples with a resistivity between
0.02 and 0.05 Ω cm and a donor concentration of 1.5
to 2.5×10^^ cm^-3, highly B doped p+ samples with resistivities
of 0.01 to 0.02 Ω cm and acceptor concentrations between
4.5 and 11×10^^ cm^-3, and lightly B-doped p-type
samples with acceptor concentrations of 1.3 to 2.3×10^^
cm^-3. The samples were cleaned according to the recipe
given by Higashi et al., using buffered HF (40% NH4F in
H2O) as the last etching step, which was followed by a 3 s
dip in de-ionized water to remove traces of fluorine at the
surface. Infrared absorption spectra of the samples so prepared
showed the Si-H stretching mode at 2084 cm^-1 as the
dominant absorption band, with less than 10% contributions
from either Si-H2 or Si-H3 modes. Only in a few cases
could contaminants in the form of physisorbed C and O be
traced, just at the detection limit of XPS (~1%). A remarkably
sharp low energy electron diffraction (LEED) pattern
with very low background confirmed the unreconstructed
1×1 surface unit mesh. XPS was performed with a resolution
of 0.5 eV using monochromatized Al Kα radiation
(1486 eV), and ultraviolet excited photoemission (UPS) with
a resolution of 0.11 eV utilizing He II radiation (40.8 eV) was
used to measure the valence band spectra. In order to exclude
band flattening under x-ray or UV illumination, we measured
the surface photovoltage in situ using a semitransparent
Kelvin probe (copper mesh). None of the samples developed
photovoltages exceeding the detection limit of 1 meV under
the illumination conditions encountered in the photoemission
measurements.

Fig. 1. Si 2p3/2 binding energies of hydrogen passivated Si(111) surfaces as a function of annealing temperature. The lines are guides to the eye and the
average binding energy as measured before annealing is given next to each group belonging to one doping class. On the right ordinate, binding energies
measured relative to the Fermi level E_F have been converted into Fermi level positions with respect to the valence band edge E_V of Si by subtracting 98.56
eV from the former. The hatched regions indicate the bulk Fermi level positions of the samples according to their doping and the open rectangles mark the
range of pinning positions of E_F as observed for unhydrogenated and reconstructed Si(111) surfaces according to Ref. 4.

III.  RESULTS 

In Fig. 1 we show the binding energies of the Si 2p3/2
core level with respect to the Fermi level for different
samples as obtained from a fit to the XPS spectra (see inset)
in the as-prepared state and after various in situ annealing
steps (5 min held at the annealing temperature, no rapid cooling).
Temperatures were measured with an optical pyrometer.
From the individual binding energies of the as-prepared
samples we derive average binding energies for each group
of differently doped samples: 99.39±0.08 eV for the n+
samples, 99.14±0.06 eV for the p-type samples, and 99.01±0.11 eV
for the p+ samples. The scatter of the data is due
to the preparation process as well as to the experimental
uncertainty in the Si 2p line position (±50 meV, indicated by way
of example on some data points).

On the right side of Fig. 1, the gap of Si and the bulk
Fermi level positions of the three kinds of samples are indicated
on an energy scale that is shifted by 98.56 eV with
respect to the binding energy scale. The bulk Fermi levels are
appropriate for 300 K and were calculated from the dopant
concentrations given above. It is obvious from this comparison
that the variation in surface Fermi level position, which
amounts to 0.38 eV, falls considerably short of the difference
in bulk Fermi level of ~1.00 eV between n+ and p+
samples. Anticipating the discussion that leads to the alignment
of the binding energy and Fermi level scales as given in
Fig. 1, we can infer that the Fermi levels tend to lie closer to
midgap at the surface than in the bulk. The midgap position
corresponds to a binding energy of 99.15 eV and the band
bending amounts on average to 0.18 eV for the n+ samples,
0.37 eV for the p type, and 0.42 eV for the p+ samples. Note
that the surface is never pulled into inversion, i.e., the Fermi
level position of the as-prepared samples is always found
between its bulk value and midgap.
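
The arithmetic behind these numbers can be retraced with a short script. The sketch below is only an illustration: it takes the 98.56 eV offset between the two energy scales and the average binding energies and band bendings quoted in the text, and it assumes that the bulk Fermi level lies above the surface value for the n+ samples and below it for the p-type samples, as implied by the statement that the surface Fermi level is always found between its bulk value and midgap. All names in the code are hypothetical.

```python
# Offset between the Si 2p3/2 binding-energy scale and the E_F - E_V scale (eV),
# as quoted in the text.
E_B_AT_VBM = 98.56

# Average as-prepared binding energies and band bendings quoted in the text (eV).
AVG_BINDING = {"n+": 99.39, "p": 99.14, "p+": 99.01}
BAND_BENDING = {"n+": 0.18, "p": 0.37, "p+": 0.42}


def fermi_above_vbm(binding_energy_ev):
    """Surface Fermi level position E_F - E_V for a measured binding energy."""
    return binding_energy_ev - E_B_AT_VBM


def bulk_fermi_above_vbm(sample):
    """Bulk E_F - E_V, assuming a depletion layer: the surface E_F lies between
    the bulk value and midgap (bulk higher for n-type, lower for p-type)."""
    sign = 1.0 if sample.startswith("n") else -1.0
    return fermi_above_vbm(AVG_BINDING[sample]) + sign * BAND_BENDING[sample]


surface = {s: round(fermi_above_vbm(e), 2) for s, e in AVG_BINDING.items()}
bulk = {s: round(bulk_fermi_above_vbm(s), 2) for s in AVG_BINDING}
surface_spread = round(surface["n+"] - surface["p+"], 2)
bulk_spread = round(bulk["n+"] - bulk["p+"], 2)

print("surface E_F - E_V (eV):", surface)
print("bulk    E_F - E_V (eV):", bulk)
print("spread n+ vs p+ (eV): surface", surface_spread, "bulk", bulk_spread)
```

With these inputs the surface Fermi levels come out at 0.83, 0.58, and 0.45 eV above E_V, a spread of 0.38 eV, while the reconstructed bulk positions differ by 0.98 eV between n+ and p+ samples, in line with the ~1.00 eV quoted above.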

For annealing temperatures up to 400 °C, the binding energies
and thus the surface Fermi levels move in opposite
directions depending on the type of doping, i.e., towards
their respective bulk position (hatched regions on the right
hand scale). The extreme binding energy difference of 0.93
eV that is reached by the p+ and n+ samples after annealing
at 400 °C agrees with the bulk Fermi level difference of 1.00
eV as well as can be expected. We conclude therefore that
the heavily doped samples attain flat band condition after a
400 °C anneal and we have used this fact to align the two
energy scales of Fig. 1 as indicated by the dashed horizontal
lines. As a by-product, we obtain the Si 2p3/2 binding energy
with respect to the valence band edge E_V as 98.56±0.05 eV.

For annealing temperatures above about 450 °C, the binding
energies of the p+ samples increase again, corresponding
to a tendency of E_F to move back towards midgap. Three of
the six p-type samples were annealed at various temperatures
up to 900 °C. None of them attained the bulk Fermi level
position or showed the reversal in band bending observed for
the p+ samples. Instead, we measure a monotonic movement
of E_F towards its bulk value, albeit by no more than 0.1 eV.
Annealing the n+ samples above 400 °C led to their oxidation,
a fact that is currently being investigated. We shall
therefore concentrate our discussion on the p+ samples in
what follows.

Fig. 2. He II valence band spectra of a hydrogen passivated Si(111)
surface as a function of annealing temperature.

Connected with annealing above 400 °C is a loss of hydrogen
at the surface as witnessed by the UPS valence band
spectra of a representative p+ sample in Fig. 2, where the
sharp peak at ~5.3 eV represents Si-H bonding states at the
surface. It is evident from the first two spectra that the maximum
Fermi level shift of ~0.5 eV obtained after annealing
at 400 °C occurs without the loss of hydrogen. Only beyond
that temperature does the intensity of the Si-H peak drop,
signalling a corresponding loss in surface hydrogen that is
now indeed accompanied by a movement of E_F back towards
midgap. In this state, samples are considerably more
susceptible to oxygen contamination as witnessed by the appearance
of the O 2p signal at ~7 eV. This contamination
could, however, be held below 4% in all cases as measured
by XPS. The oxygen is reduced again after the 900 °C annealing
step and the characteristic surface states S1 and S2 of
the 7×7 reconstruction can be clearly observed, together
with the corresponding LEED pattern.

For one p+ sample, we have followed the desorption kinetics
of hydrogen in somewhat more detail, as illustrated in
Fig. 3. Here we plot the area under the Si-H peak at 5.3 eV
binding energy relative to the amplitude of bulk derived
states at a binding energy of about 4.5 eV as a measure
of the hydrogen coverage as a function of annealing temperature.
Also shown as solid lines are coverages calculated for
first and second order desorption kinetics, respectively,
according to Ref. 5 using an attempt
frequency of 10^^ s^-1 and an annealing time at each step of
300 s as stated above. It is obvious that the data are not
accurate enough to distinguish unambiguously between first
and second order desorption kinetics, despite the fact that
second order appears to be slightly favored at lower coverages.
From a least squares fit to the data, we obtain
E_d2 = 2.4±0.1 eV for the activation energy of the second
order process. The first order line was calculated using the
same activation energy. Schulz and Henzler deduce first order
kinetics for the desorption of H from Si(111) at high
coverages and second order at low coverages with activation
energies of 2.1 and 2.5 eV, respectively, values that are in as
good agreement with our result as can be expected.

Fig. 3. Hydrogen coverage as a function of annealing temperature. The solid
lines have been calculated for first and second order desorption kinetics,
respectively, using a common activation energy of 2.38 eV.
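
The stepwise desorption calculation described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the procedure of Ref. 5: each annealing step is treated as isothermal for 300 s, the coverage is normalized to one, the attempt frequency is set to 10^13 s^-1 (an assumed textbook value, since the exponent is illegible in the source), and the list of annealing temperatures is hypothetical.

```python
import math

K_B = 8.617333e-5   # Boltzmann constant (eV/K)
E_D = 2.38          # activation energy from the Fig. 3 caption (eV)
NU = 1.0e13         # ASSUMED attempt frequency (1/s); not taken from the source
T_STEP = 300.0      # annealing time per step (s), as stated in the text


def rate_constant(temp_c, e_d=E_D, nu=NU):
    """Arrhenius rate constant k = nu * exp(-E_d / k_B T) in 1/s."""
    temp_k = temp_c + 273.15
    return nu * math.exp(-e_d / (K_B * temp_k))


def anneal_first_order(theta, temp_c, t=T_STEP):
    """d(theta)/dt = -k * theta, integrated over one isothermal step."""
    return theta * math.exp(-rate_constant(temp_c) * t)


def anneal_second_order(theta, temp_c, t=T_STEP):
    """d(theta)/dt = -k * theta**2 (theta normalized), integrated over one step."""
    return theta / (1.0 + rate_constant(temp_c) * theta * t)


def coverage_curve(temps_c, step):
    """Coverage after each successive annealing step, starting from theta = 1."""
    theta, curve = 1.0, []
    for temp_c in temps_c:
        theta = step(theta, temp_c)
        curve.append(theta)
    return curve


temps = [300, 400, 450, 500, 550, 600]   # hypothetical annealing sequence (deg C)
first = coverage_curve(temps, anneal_first_order)
second = coverage_curve(temps, anneal_second_order)
for temp_c, th1, th2 in zip(temps, first, second):
    print(f"{temp_c:4d} C  first order: {th1:.3f}  second order: {th2:.3f}")
```

Under these assumptions the coverage is essentially unchanged up to 400 °C and drops steeply above about 450 °C for both reaction orders, with the second order curve decaying more slowly at low coverage.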

IV.  DISCUSSION 

A common explanation for surface band bending is to
assume surface charges in conjunction with a space charge
region inside the doped semiconductor. For the p+ samples,
a surface charge density (in units of the elementary charge)
of 7×10^^ cm^-2 is necessary to account for the observed
band bending. These surface charges can be present either in
the form of charged surface states or as surface ions with
energy levels possibly outside the gap region. Surface concentrations
of Na+ and F- as the most likely ionic contaminants
were shown by XPS to be at least an order of magnitude
lower than required to explain the p+ results. Moreover,
the sign and concentration of surface ions would have to be just
appropriate to pull the Fermi level close to, but never beyond,
midgap for all three types of samples, which we consider a
rather unlikely coincidence.
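
How a band bending translates into a required surface charge can be estimated with the abrupt depletion approximation, in which the areal charge is sqrt(2 ε ε0 V N / q). The sketch below is an order-of-magnitude illustration only and is not the calculation used in this work: the acceptor density is an assumed value chosen for illustration (the exponents of the doping levels are illegible in the source), thermal corrections are neglected, and all names are hypothetical.

```python
import math

Q_E = 1.602176634e-19      # elementary charge (C)
EPS_0 = 8.8541878128e-14   # vacuum permittivity (F/cm)
EPS_SI = 11.7              # relative permittivity of Si


def surface_charge_density(band_bending_ev, doping_cm3):
    """Areal charge (elementary charges per cm^2) balancing a depletion layer
    with the given band bending, in the abrupt depletion approximation."""
    return math.sqrt(2.0 * EPS_SI * EPS_0 * band_bending_ev * doping_cm3 / Q_E)


def depletion_width_cm(band_bending_ev, doping_cm3):
    """Width of the fully depleted layer (cm) holding that areal charge."""
    return surface_charge_density(band_bending_ev, doping_cm3) / doping_cm3


N_A = 8.0e18   # ASSUMED acceptor density (cm^-3), illustrative only
V_BB = 0.42    # average band bending of the p+ samples quoted in the text (eV)

n_s = surface_charge_density(V_BB, N_A)
w_nm = depletion_width_cm(V_BB, N_A) * 1.0e7
print(f"surface charge: {n_s:.2e} cm^-2, depletion width: {w_nm:.1f} nm")
```

For a heavily doped sample the required charge scales with the square root of the doping, so any surface species invoked to explain the band bending must be present at a correspondingly high areal density.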

Surface Fermi level pinning by surface states would thus
require a density of surface states on the order of 10^^ cm^-2
or more. These conditions are obviously not met at the hydrogen
terminated (111) surface as demonstrated by
Yablonovitch et al. (see above). The lack of deep gap states
that could be responsible for the pinning of E_F is further
confirmed by electron yield spectroscopy [compare Fig.
4(a)], which provides no evidence whatsoever for emission
below the onset of transitions from the valence band edge at
threshold energies of 5.1 (n+, p samples) and 4.8 eV (p+
samples), respectively. The sensitivity of this technique is
sufficient to detect surface densities of states as low as 10^^
cm^-2. As the samples are heated above 450 °C, emission
from surface states can be detected below the bulk threshold
as demonstrated in Fig. 4(b) for temperatures of 600 and
860 °C, respectively.

Fig. 4. Yield spectra of two representative Si samples: (a) as-prepared; (b) p-type sample after annealing.

We therefore suggest that the surface Fermi level position
in our Si(111):H samples is the result of a hydrogen passivation
of dopant atoms in a surface layer; that is to say, we are
really looking at undoped surfaces here. The hydrogen passivation
of dopant atoms is a well-known phenomenon that is
intentionally performed and studied extensively by exposing
silicon at temperatures above ~100 °C to atomic hydrogen
created in an H2 plasma. By comparison, there are only a few
reports on hydrogen injection and near-surface passivation at
room temperature (RT) following wet-etching procedures,
and all apply to boron compensation with the exception of
Ref. 11, which suggests that donor passivation might have
been observed in one sample as well.

With the assumption that hydrogen passivation is indeed
responsible for the depletion layers in our samples, the annealing
behavior can be understood as follows. For temperatures
between 200 and 350 °C the donor- and acceptor-hydrogen
complexes dissociate, leading to a gradual
reactivation of the dopant atoms and a concomitant movement
of E_F towards its respective bulk value. A complete
reactivation is apparently reached at an annealing temperature
of 400 to 450 °C for both the n+ and the p+ samples.

Above 450 °C, the desorption of hydrogen from the surface
commences, in agreement with the temperature controlled
desorption study presented above. With the loss of
surface hydrogen the now unsatisfied bonds start to rearrange
towards the thermodynamically stable 7×7 reconstruction
with a corresponding pinning of the Fermi level by surface
states. As mentioned earlier, heating the
n+ samples above 400 °C led to a surface contamination and
the resulting Fermi level positions have therefore not been
included in Fig. 1.

The near-surface depletion of doped samples as a result of
hydrogen passivation could explain some as yet
unaccounted-for observations. Landemark et al. commented
on the unexpectedly low Fermi level position, 0.68 eV above
E_V, for an n+-type Si(111):H sample, whereas the annealed
(at 400 °C) Si(111):H surface exhibits the unpinned bulk
Fermi level. Ley et al. measured a lower than expected
Schottky barrier height in the PtSi/Si system prepared on
hydrogen terminated n-type Si(111) samples. All these observations
are compatible with a (partial) passivation of donors
at the surface.

This leaves the question of to what degree and to what depth
the dopant atoms in the as-prepared samples have to be passivated
to account for the observed Fermi level positions at
the surface. A lower limit for the compensation depth can be
obtained under the assumption that all dopants are passivated
up to a certain depth. The potential profile calculated for the
ensuing i-n and i-p transitions under the proper boundary
conditions yields the Fermi level at the surface as a function
of compensation depth as shown for three dopant concentrations
in Fig. 5. For simplicity, we did not distinguish between
n- and p-type doping in this calculation, which means
that we have used the effective valence band density of states
for the conduction band as well. Comparing the experimentally
determined surface Fermi level positions with the calculation,
we obtain compensation depths of 2 and 10 μm for
the n+ and p+ samples, respectively. These values are compatible
with what has been reported by Tavendale et al. A
depletion depth of about 100 μm as required to account for
the surface Fermi position in the p-type samples appears
excessive by comparison. However, as is also indicated in
Fig. 5, a density of surface charge of 10^^ cm^-2, which is
below the detection limit of our photoyield experiment, substantially
reduces the depletion width necessary for all doping
levels and leads to values below 2 μm for all samples.
We are quick to admit that these surface densities are considerably
higher than the results of Yablonovitch et al. However,
at least for plasma hydrogenated silicon, there is ample
evidence for hydrogen induced near-surface defects.

Fig. 5. Surface Fermi level position on Si as a function of dopant compensation
depth. The solid lines are calculated for a surface free of surface
charge, whereas the dotted lines represent the calculations for fixed densities
of charged surface states n_ss. In either case the curve parameter is the dopant
concentration. The dashed horizontal lines indicate the average surface Fermi
level positions for the three types of samples investigated. Details of the
calculation may be found in Ref. 17.

V. CONCLUSIONS

In summary, we have observed a Fermi level position near
midgap at the surface of hydrogen passivated Si(111):H independent
of doping. The annealing behavior of E_F points
towards a near-surface depletion layer caused by hydrogen
passivation during the wet-chemical surface treatment as the
likely cause. The bulk Fermi level positions for p+ and n+
samples are restored at the surface after a 400 °C anneal,
while hydrogen desorption above 450 °C causes a pinning of
E_F by surface gap states. These results explain some
unaccounted-for observations in the literature concerning the
Fermi level position on Si(111):H. A calculation yields passivation
depths of the order of 2 μm to account for the observed
Fermi level positions.

Correlation between surface structure and ordering in GaInP

Ga and In atoms in Ga0.52In0.48P layers spontaneously segregate to form alternating In- and Ga-rich
{111} monolayers during organometallic vapor phase epitaxial (OMVPE) growth on (001) oriented
GaAs substrates, thus forming the CuPt ordered structure. This ordering phenomenon is believed to
be driven by surface processes, although little direct experimental information is available. This
work presents evidence, based on surface photoabsorption data, that [-110] oriented P dimers are
present on the surface during OMVPE growth using trimethylgallium and ethyldimethylindium
combined with tertiarybutylphosphine, suggesting a (2×4)-like surface reconstruction. Furthermore,
when the growth temperature is increased above 620 °C, with other parameters constant, both the
concentration of these P dimers and the degree of order are observed to decrease. A similar
correlation of decreased P-dimer concentration with decreased degree of order is observed for
decreases in V/III ratio. Thus, the changes in order parameter for variations in temperature and TBP
flow rate are found to be closely correlated with the changes in the P-dimer concentration. A third
parameter studied was the misorientation of the substrate from (001) toward either the {111}A or
{111}B planes. The concentration of P dimers decreased as the misorientation increased in either
direction. The degree of order was also observed to generally decrease, supporting the connection
between surface reconstruction and ordering. However, the difference in order parameter observed
for the two misorientation directions suggests the importance of a second parameter, the step
structure itself. For exactly (001) oriented substrates the surface was observed, using high
resolution atomic force microscopy, to consist of islands, elongated in the [110] direction, with
heights of 30-60 Å. Monolayer steps are observed for some growth conditions, but for most
conditions the boundaries are formed exclusively of bilayer (5.7 Å) steps. Predominantly monolayer
steps are formed for low V/III ratios and bilayer steps for high V/III ratios. © 1996 American
Vacuum Society.


I.  INTRODUCTION 

The spontaneous formation of monolayer superlattices
during vapor phase epitaxial growth is a phenomenon of both
fundamental and technological importance. The most widely
studied example of this phenomenon is the formation of the
CuPt ordered structure, with ordering on {111} planes, during
organometallic vapor phase epitaxial (OMVPE) growth of
Ga0.52In0.48P. The thermodynamic driving force for CuPt
ordering is believed to be related to formation of the (2×4)
surface reconstruction, since this ordered structure is not
stable in the bulk. The ordered structure is formed at the
surface during epitaxial growth. Evidence of this is the
change in ordered structure correlated with the change in
surface reconstruction for GaInP grown by molecular beam
epitaxy (MBE). The initial studies of Murata et al. have
demonstrated that for GaInP grown by OMVPE a (2×4)-like
surface reconstruction can be detected in situ using the optical
technique of surface photoabsorption (SPA). This provides
the opportunity to probe the correlation between the degree
of CuPt order, or order parameter, S, and the extent of formation
of the (2×4)-like reconstruction as a function of
growth parameters such as temperature, V/III ratio, and substrate
misorientation from (001), all of which are known to
have marked effects on the degree of order observed in the
GaInP epitaxial layers grown by OMVPE.

This article will present the results of a study of the effects
of the three growth parameters mentioned above on the
SPA spectrum and the degree of order determined from the
photoluminescence peak energy as well as the step structure
of the surface determined using atomic force microscopy
(AFM). The results suggest a direct correlation between the
(2×4)-like reconstruction determined by SPA and the degree
of CuPt order in the GaInP epitaxial layers, although changes
in the step structure are also found to be important.

II. EXPERIMENT

The epitaxial layers studied in this work were grown by
OMVPE in a horizontal, atmospheric pressure apparatus using
the newly developed, less hazardous phosphorus source
tertiarybutylphosphine (TBP). The flow rates of the group III
precursors trimethylgallium (TMGa) and ethyldimethylindium
(EDMIn) were adjusted to give the solid composition
lattice matched to the GaAs substrates, i.e., Ga0.52In0.48P, and
a growth rate of 0.3 μm/h. The TBP flow rate was varied to
give the various V/III ratios used. The semi-insulating GaAs
substrates were nominally (001), misoriented by angles of
0°, 3°, 6°, 11°, and 15.8° toward either the [-110] or the [110]
direction in the lattice.

For the SPA measurements, P-polarized light from a 150
W Xe lamp irradiates the GaInP layer at an incidence angle
of 70° through a polarizer and a chopper. The direction of
the incident light is parallel to the direction of gas flow in the
reactor. The reflected light is monochromatized and detected
by a Si PNN+ photodiode using standard lock-in amplification
techniques. Each run consists of the growth of two epitaxial
layers, one where the [110] direction of the layer is parallel
to the gas flow and one where the substrate is rotated by
90°. The P-stabilized and group III element-stabilized surfaces
were formed by switching the TBP flow between reactor
and vent, respectively.

The characterization of the surface structure was carried
out using a Nanoscope III atomic force microscope (AFM) in
the tapping mode. Etched single-crystalline Si tips were used
with an end radius of about 5 nm and a sidewall angle of
about 35°. Scan rates of 1 to 2 lines per second were used
and data were taken at 512 points per line and 512 lines per scan
area. This technique has been shown capable of revealing
atomic-scale features on the surface. Monolayer steps have
been observed ex situ for GaInP layers grown by OMVPE.

The solid composition of the GaInP layers was determined,
using Vegard's law, from x-ray diffraction measurements
using Cu Kα radiation. The 20 K PL was excited with
the 488 nm line of an Ar+ laser. The emission was dispersed
using a Spex Model 1870 monochromator and detected using
a Hamamatsu R1104 head-on photomultiplier tube.
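
The Vegard's law step mentioned above amounts to a linear interpolation between the binary lattice constants. The sketch below is a simplified illustration: the lattice constants are commonly quoted room-temperature values rather than numbers from this work, the layer is treated as fully relaxed (a real analysis of a coherently strained layer would first correct the measured perpendicular lattice constant for tetragonal distortion), and the function names are hypothetical.

```python
# Commonly quoted room-temperature lattice constants (angstrom); assumed values.
A_GAP = 5.4505
A_INP = 5.8687
A_GAAS = 5.6533


def lattice_constant(x_ga):
    """Vegard's law for Ga(x)In(1-x)P: linear interpolation between GaP and InP."""
    return x_ga * A_GAP + (1.0 - x_ga) * A_INP


def ga_fraction(a_layer):
    """Invert Vegard's law: Ga fraction x for a relaxed layer lattice constant."""
    return (A_INP - a_layer) / (A_INP - A_GAP)


# Composition whose relaxed lattice constant equals that of the GaAs substrate.
x_matched = ga_fraction(A_GAAS)
print(f"lattice-matched Ga fraction: {x_matched:.3f}")
```

With these constants the lattice-matched Ga fraction comes out close to 0.52, consistent with the Ga0.52In0.48P composition quoted for layers lattice matched to GaAs.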

III.  RESULTS  AND  DISCUSSION 

The major feature of the SPA spectrum, i.e., the difference
in SPA signal with incident light along [-110] and [110], is a
strong absorption peak observed at approximately 400 nm.
This has been attributed to the [-110] oriented P dimers on the
surface based on the similarity to the SPA result for InP
reported by Kobayashi and Kobayashi. These [-110] oriented
group V dimers are the major characteristic of the
(2×4) reconstructed surface. Thus, in what follows, the
intensity of this peak is taken as a measure of the degree of
(2×4) order on the surface during growth, although SPA is a
probe of the local bonding, so it cannot be used to determine
the long range order on the surface. For this reason, the reconstruction
will be referred to as "(2×4)-like."

The effect of growth temperature on the SPA difference
signal at 400 nm is shown in Fig. 1. The SPA signal difference,
and hence the concentration of [-110] oriented P dimers,
[P[-110]], decreases monotonically as the temperature increases
from 620 to 720 °C. This is simply due to a shift in
the P adsorption/desorption balance at the surface. The SPA
difference signal approaches zero at 720 °C. The SPA signal
at 400 nm continues to climb as the temperature is further
reduced to 520 °C.

The degree of order is included in Fig. 1. It was determined
from the dependence of the band gap energy on the
degree of order, with a band gap energy difference between
ordered and disordered Ga0.5In0.5P of 471 meV. The PL
peak energy is used to determine the band gap energy. A
potential problem with this approach is the possibility of
Fig. 1. The effect of growth temperature on the degree of order, S, and the
SPA signal difference at 400 nm: (•) degree of order for a V/III ratio of 40
(TBP partial pressure of 50 Pa), (○) SPA signal difference for a TBP partial
pressure of 50 Pa.

nonband edge emission dominating the PL spectra. Impurities
are one possible source of such low-energy emission
peaks, although impurity emission is not normally dominant
for the high purity GaInP layers considered here. Impurity
effects can normally be minimized by using high excitation
energies. A more serious problem for these samples is the
possibility of band tailing effects associated with the inhomogeneity
of the order parameter or the presence of antiphase
boundaries. Such effects have been studied by several
workers. Again in this case, the best method for
minimizing these problems is the use of high excitation intensities.
Thus, all order parameters have been deduced from
PL spectra measured at a laser power of 10 mW. For the size
of the laser spot used here, this corresponds to an excitation
power density of 30 W/cm².
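The conversion from PL peak energy to degree of order can be sketched as follows. The text gives only the 471 meV gap difference between fully ordered and disordered material; the quadratic form E(S) = E(0) - ΔE·S² is a commonly used relation that is assumed here, and the disordered-alloy PL energy `E_DISORDERED_MEV` is a placeholder value, not a number taken from this article.

```python
import math

DELTA_E_MEV = 471.0        # gap difference, ordered vs disordered (from the text)
E_DISORDERED_MEV = 2005.0  # ASSUMPTION: low-temperature PL energy of disordered GaInP


def degree_of_order(e_pl_mev: float,
                    e_disordered_mev: float = E_DISORDERED_MEV,
                    delta_e_mev: float = DELTA_E_MEV) -> float:
    """Order parameter S from a PL peak energy (meV).

    Assumes E_PL = E_disordered - delta_e * S**2, clipped to the range 0..1.
    """
    reduction = e_disordered_mev - e_pl_mev
    if reduction <= 0.0:
        return 0.0  # peak at or above the disordered gap: treat as disordered
    return min(1.0, math.sqrt(reduction / delta_e_mev))


# Hypothetical PL peak energies, for illustration only.
for e_pl in (2005.0, 1950.0, 1900.0):
    print(f"E_PL = {e_pl:.0f} meV -> S = {degree_of_order(e_pl):.2f}")
```

Because S enters quadratically, a given uncertainty in the PL peak energy translates into a larger uncertainty in S for weakly ordered samples than for strongly ordered ones.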

The data plotted in Fig. 1 indicate that for temperatures of
620 °C and above, both the degree of order and [P[110]]
decrease with increasing growth temperature. This suggests
that loss of the (2×4)-like surface reconstruction results in a
reduction of the degree of CuPt ordering. As discussed
above, this is consistent with theoretical predictions summarized
in Ref. 2. The order parameter is a maximum at approximately
620 °C. A clear decrease is observed with decreasing
temperatures below 620 °C. Thus, the degree of
order decreases with decreasing temperature even though
[P[110]] increases in this range. This suggests that the degree
of order decreases at low temperatures for kinetic reasons.
Lower temperatures lead to lower rates of atomic migration
on the surface, which may prevent formation of the CuPt
structure during the time available before the surface layer is
covered by the next layer deposited. This mechanism is basically
the same as that postulated to account for the decrease
in degree of order with increasing growth rate at a constant
temperature.

The effect of V/III ratio on both ordering and the surface
reconstruction deduced from the SPA measurements is also
significant. As seen in Fig. 2, for layers grown using TBP on
exactly (001) substrates, a decrease in the TBP partial pressure
from 200 to 10 Torr results in a significant decrease in
the extent of (2×4)-like reconstruction at both 620 and
670 °C. Consistent with the temperature effect discussed
above, the SPA signal is always larger at the lower growth
temperature. The degree of order deduced from the PL peak
energy is also plotted versus input TBP partial pressure in
Fig. 2 for growth at both 620 and 670 °C. The monotonic
trend of decreasing degree of order for decreasing (2×4)-like
reconstruction observed at 670 °C is qualitatively similar to
the correlation observed when the growth temperature is
changed. At 620 °C, the behavior is more complex. At high
V/III ratios at this lower growth temperature, the degree of
order is observed to decrease slightly. This is consistent with
previous observations for growth using either phosphine or
TBP. This behavior at high P surface coverages is not completely
understood. However, evidence of formation of an
isotropic P dimer on the surface at high group V flow rates
and low temperatures observed in InP is also seen in the
GaInP layers studied here. This change in surface reconstruction
may be responsible for the observed decrease in the
degree of order. This also provides an alternative explanation for
the reduced ordering observed at lower growth temperatures.

Fig. 2. Influence of input TBP partial pressure on the SPA difference signal
at 400 nm and the degree of order, deduced from the 15 K PL peak energy,
for GaInP samples grown lattice matched to exactly (001) GaAs substrates.
○, □: 670 °C and •, ■: 620 °C.

The step structure, another factor that may affect ordering,
is also found to be dependent on the V/III ratio. Figure 3
shows AFM scans for layers grown at 620 °C with V/III
ratios of 8, 40, and 160. The surfaces for V/III ratios of 40
and 160 are smooth, giving easily interpreted AFM profiles.
Growth at the low V/III ratio of 8 is difficult, resulting in
rougher surfaces. The scan shown in Fig. 3 is representative
of the smoothest areas observed. The [110] steps are seen to
be mainly monolayers (2.8 Å) at low V/III ratios and mainly
bilayers at high V/III ratios. The average step height is plotted
versus input TBP flow rate in Fig. 4 for growth temperatures
of both 620 and 670 °C. The formation of bilayer (5.7
Å) steps at high TBP partial pressures has been attributed to
the stabilization of the (2×2) structure on the (111)B step
edge at high phosphorus partial pressures. Biegelsen et al.



Fig. 3. Cross sectional AFM scans of the surfaces of GaInP layers grown on
exactly (001) GaAs substrates at a temperature of 620 °C. (a) V/III=8; (b)
V/III=40; and (c) V/III=160.

observed the As-rich (2×2) structure on (111)B GaAs surfaces
at high As pressures. The (2×2) unit cell will just fit on
the (111)B face of a bilayer step. Recent calculations indicate
that bilayer type A ([110]) steps become more stable for
high anion/cation ratios, although the effect for B type
([110]) steps has not been considered.
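The quoted monolayer and bilayer step heights follow directly from the zinc-blende geometry. The short check below assumes the layer is lattice matched to GaAs and uses an assumed literature value for the GaAs lattice constant. In the usage of this article, one "monolayer" is one group III plus group V layer pair, i.e., half a lattice constant along [001].

```python
# Step heights on an (001) zinc-blende surface, assuming the GaInP layer is
# lattice matched to GaAs (lattice constant is an ASSUMED literature value).
A_GAAS = 5.6533  # Å

# One monolayer = one cation plane plus one anion plane = a/2 along [001].
monolayer_step = A_GAAS / 2.0
# A bilayer step is two such monolayers = a.
bilayer_step = A_GAAS

print(f"monolayer step: {monolayer_step:.2f} Å")
print(f"bilayer step:   {bilayer_step:.2f} Å")
```

Rounded to one decimal place these reproduce the 2.8 Å and 5.7 Å values used to classify the steps in the AFM profiles.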

The bilayer step structure observed for growth at high
V/III ratios apparently explains one type of defect observed
in GaInP layers. For growth on exactly (001) oriented substrates,
both the (111) and (111) variants of the CuPt ordered
structure are observed. The domains of the two variants are
separated by order twin boundaries having the (001) orientation.
The thicknesses of the (001) laminae of the two variants
are found experimentally to be always equal to an even number
of atomic layers. The results presented here suggest
that this is due to the formation of the CuPt structure via
propagation of bilayer steps. The order twin boundaries are
apparently due to a change in the direction of step propagation
caused by the undulating nature of the surface as growth
proceeds.

Fig. 4. Average step height (neglecting bunched steps) vs V/III ratio of
GaInP layers grown on exactly (001) oriented GaAs substrates at temperatures
of 620 (■) and 670 °C (•).

Fig. 5. Superimposed SPA spectra for GaInP samples grown at 620 °C on
substrates misoriented from (001) by angles of 0°, 6°, 11°, and 15.8° toward
the [110] direction in the lattice.

Fig. 6. SPA difference intensity at 400 nm vs misorientation angle, for
misorientations in both the A and B directions, to produce [110] and [110]
steps, respectively. Also plotted are the order parameters determined for
these samples.

The SPA spectra for GaInP layers grown at 620 °C with a
V/III ratio of 40 on substrates with misorientation angles of
0°, 6°, 11°, and 15.8° toward the [110] direction are superimposed
in Fig. 5. The substrate with a misorientation of
15.8° is, of course, the (115)B surface. Clearly, the feature at
approximately 400 nm is strongly dependent on the substrate
misorientation. As mentioned above, this strongly anisotropic
feature is attributed to [110] oriented P dimers on the
surface. The SPA difference intensity is plotted in Fig. 6
versus misorientation angle, for misorientation in the [110]
direction, to produce A-type steps, and in the [110] direction,
to produce B-type steps. The data clearly show the decrease
in the (2×4)-like reconstruction with increasing misorientation
angle. The effect is stronger for misorientation in the B
direction. Also included in Fig. 6 are the order parameters
calculated as described above from the PL peak energies.
For misorientation in the A direction, the degree of order
decreases monotonically with increasing misorientation
angle, as reported previously for GaInP growth using
phosphine. For misorientation in the B direction, the degree
of order increases with increasing misorientation angle for
angles of less than approximately 3°. However, for larger
misorientation angles, a decrease in order parameter with
increasing misorientation angle is observed, similar to the
data for misorientation in the A direction. These data are
similar to those obtained earlier using PH3.
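The statement that a 15.8° misorientation corresponds to the (115) surface can be checked with the standard formula for the angle between two planes in a cubic lattice. The helper below is a generic sketch (the function name is hypothetical) and says nothing about the A or B polarity of the surface.

```python
import math


def interplanar_angle_deg(hkl1, hkl2):
    """Angle (degrees) between planes (h1 k1 l1) and (h2 k2 l2) in a cubic crystal.

    In a cubic lattice the plane normal is parallel to [hkl], so the angle
    between planes is the angle between the two index vectors.
    """
    dot = sum(a * b for a, b in zip(hkl1, hkl2))
    norm1 = math.sqrt(sum(a * a for a in hkl1))
    norm2 = math.sqrt(sum(b * b for b in hkl2))
    return math.degrees(math.acos(dot / (norm1 * norm2)))


# Tilt of a few low-index vicinal planes away from (001).
for hkl in [(1, 1, 5), (1, 1, 3), (1, 1, 1)]:
    angle = interplanar_angle_deg((0, 0, 1), hkl)
    print(f"(001) to {hkl}: {angle:.2f} degrees")
```

For (115) the result is arccos(5/√27), which rounds to the 15.8° quoted in the text.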

By combining these observations, it is clear that the decrease
in the (2×4)-like reconstruction induced by increasing
the misorientation angle corresponds to a decrease in the
degree of order, as for the data obtained for variations in both
temperature and V/III ratio. However, a second factor also
appears to be important. The [110] (B-type) steps themselves
clearly assist the ordering process and the [110] (A-type)
steps have a disordering effect.

The  effect  of  the  surface  [110]  P-dimer  concentration  on 
the degree of order measured in the GaInP layers can be seen
most  clearly  by  directly  examining  the  relation  between  the 
degree  of  order,  deduced  from  the  PL  peak  energy,  and  the 


Fig. 7. Degree of order vs SPA signal difference (%): (•) Ref. 7 for samples
grown at several temperatures for exactly (001) substrates; (■) Ref. 7 for
samples grown at several values of V/III ratio for exactly (001) substrates;
(□) substrates misoriented toward [110]; (○) substrates misoriented toward
[110].


J.  Vac.  Sci.  Technol.  B,  Vol.  14,  No.  4,  Jul/Aug  1996 




Murata  et  al.:  Surface  structure  and  ordering  in  GalnP 

SPA difference signal measured at 400 nm. The data are
plotted in Fig. 7. For samples with the exact (001) orientation,
the correlation between the degree of order and the SPA
signal intensity induced by changing either the growth temperature
or the input TBP flow rate is nearly perfect. The
general trends for changes in the misorientation angle are
similar, although it is apparent that [110] steps have an additional
beneficial effect and the [110] steps have an additional
deleterious effect.

The disappearance of the (2×4)-like reconstruction with
increasing misorientation angle is thought to relate to the
inability to form dimers involving all atoms on the (001)
facets between steps. For an exactly (001) oriented surface
having the (2×4) reconstruction, all P surface atoms will be
able to reduce the energy of their dangling bonds via formation
of [110] dimer rows. The introduction of [110] steps
produces (001) facets of finite widths. This interferes with
the ability of all surface atoms to form dimers. For example,
if the (001) facet in a particular location consists of an odd
number of [110] rows of P atoms, one row will not be able to
participate in the reconstruction. The fraction of surface P
atoms not able to participate generally increases with increasing
misorientation angle. In terms of the simplest surface
structure involving P dimers, the (2×1) reconstruction,
one can see qualitatively that a larger fraction of the surface
P atoms will be able to dimerize on the surface in the presence
of [110] steps, in agreement with the results presented
in Fig. 6.
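To give a feeling for how narrow the (001) facets become, the sketch below estimates the mean terrace width and the number of atomic rows per terrace as a function of misorientation angle. It assumes an idealized staircase of equally spaced steps with no bunching, a layer lattice matched to GaAs (assumed lattice constant), and a row spacing of a/√2 for the unreconstructed (001) surface. These are simplifying assumptions, not measured quantities.

```python
import math

A_GAAS = 5.6533                     # Å, ASSUMED literature lattice constant
ROW_SPACING = A_GAAS / math.sqrt(2)  # Å, spacing of atomic rows on ideal (001)
MONOLAYER = A_GAAS / 2.0             # Å, monolayer step height
BILAYER = A_GAAS                     # Å, bilayer step height


def terrace_width(theta_deg: float, step_height: float) -> float:
    """Mean terrace width (Å) for an ideal staircase tilted by theta from (001)."""
    return step_height / math.tan(math.radians(theta_deg))


def rows_per_terrace(theta_deg: float, step_height: float) -> float:
    """Mean number of atomic rows that fit on one terrace."""
    return terrace_width(theta_deg, step_height) / ROW_SPACING


for theta in (3.0, 6.0, 11.0, 15.8):
    mono = rows_per_terrace(theta, MONOLAYER)
    bi = rows_per_terrace(theta, BILAYER)
    print(f"{theta:5.1f} deg: {mono:5.1f} rows (monolayer steps), "
          f"{bi:5.1f} rows (bilayer steps)")
```

At the largest angles only a few rows fit on each terrace, so a single unpaired row represents a large fraction of the surface atoms, which is consistent with the qualitative argument above.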

For  both  the  A  and  B  misorientations,  the  steps  on  the 
surface  are  also  expected  to  affect  the  reconstruction.  Simple 
electron  counting  (local  charge  density)  arguments  are  often 
used  to  understand  the  stabilities  of  particular  surface 
reconstructions. The presence of steps will certainly affect
the  electron  counting  results  for  large  misorientation  angles, 
leading  to  the  expectation  that  other  reconstructions  may 
form. 

The general effects of surface steps, in addition to the
effect on the surface reconstruction, have been considered
qualitatively. For example, simple models to explain the formation
of the B variants of the CuPt structure involve [110]
steps. Additional factors related to the attachment of adatoms
at steps were discussed qualitatively by Asahi. For
example, a group III adatom approaching a [110] step makes
no bonds in addition to the two bonds to the underlying (001)
surface. This is expected to lead to a small sticking coefficient
which leads, in turn, to a higher probability for ordering,
since the lowest energy structure is more likely to form
when many adsorption/desorption steps occur at the step
edge before the final configuration is attained. On the other
hand, for high V/III ratios, the group V adatom approaching
a [110] step will be able to form a third bond. This is expected
to result in a high sticking coefficient, which would not be
conducive to formation of an ordered structure. For example,
if the sticking coefficient were unity, the step propagation
rate would be a maximum, the [110] step would be extremely
rough, and the arrangement of the Ga and In atoms
would be random. This simple model is consistent with the
observation that islands on exactly (001) surfaces of GaInP
grown by OMVPE are elongated in the [110] direction.
Thus, [110] steps are expected to enhance the formation of
the B variants of the CuPt structure and [110] steps are expected
to have a deleterious effect.

Examination of the data collected here for variations in
growth temperature, V/III ratio, and substrate misorientation
suggests that formation of the CuPt ordered structure is intimately
related to the formation of the (2×4)-like ordered
structure on the surface during growth. However, it is too
early to claim a direct mechanistic relationship. Recent efforts
to produce order/disorder heterostructures by changing
the flow rate of the P precursor during growth have resulted
in a clear memory effect. The degree of order in the bottom
layer apparently “propagates” into the top layer for a significant
distance, i.e., the change in order parameter is not
abrupt. The change in the SPA signal, by contrast, is virtually
instantaneous. This suggests that the step structure may play an
important role in ordering. Investigations of the dynamics of
the change in step structure are currently underway.

IV.  CONCLUSIONS 

The effects of three growth parameters, temperature, V/III
ratio, and substrate misorientation, on both the occurrence of
CuPt ordering and the surface structure have been studied.
Considering first the data for singular (001) substrates, increasing
the growth temperature above 620 °C is found to decrease
the degree of order and to result in a dramatic decrease
in the SPA signal at 400 nm attributed to the [110]
dimers characteristic of the (2×4)-like surface reconstruction.
Decreasing the input TBP flow rate, giving V/III ratios
of <160 at 620 and 670 °C, is similarly found to decrease
both the 400 nm SPA signal and the degree of order measured
in the GaInP epitaxial layers. Together, these data
strongly suggest a correlation between the concentration of
[110] P dimers on the surface, characteristic of the (2×4)
reconstruction, and the degree of order produced during epitaxial
growth. In addition, the structure of [110] steps is
found to change from predominantly bilayer at high V/III
ratios to monolayer at low V/III ratios. Increasing the concentration
of [110] (B-type) steps by misorientation of the (001) substrate
is found to decrease the concentration of [110] P
dimers on the surface. It is found to increase the degree of
order for angles of <3° and to decrease the degree of order
for larger angles. Increasing the concentration of [110] (A-type) steps
is found to decrease both the concentration of P dimers and
the degree of order. This supports the relationship between
the (2×4)-like reconstruction and ordering, but indicates that
a second factor is the concentration of steps: [110] (B-type) steps
assist the ordering process and [110] (A-type) steps lead to disordering.
In this article, we analyze the kinetics of heteroepitaxial growth of GaP on Si(100) by pulsed
chemical beam epitaxy on the basis of results obtained by real-time optical process monitoring. In
view of the large barrier to epitaxial growth on oxygen or carbon contaminated silicon surface
elements and the low stacking fault energy for GaP, residual contamination of the silicon surface
contributes to defect formation in the initial phase of GaP heteroepitaxy on Si, and requires special
measures, such as surface structuring, to limit the propagation of defects into the epitaxial film. The
control of the supersaturation during the first 10-20 s of film formation is essential for the quality
of subsequent epitaxial growth and is limited to a narrow process window between
three-dimensional nucleation and overgrowth at low Ga supersaturation and gallium-cluster
formation at high Ga supersaturation. Steady state heteroepitaxial growth is described by a
four-layer stack substrate/epilayer/surface reaction layer (SRL)/ambient and, depending on the
source vapor flux, allows for more than monolayer coverage. Under this special condition of
low-temperature CBE, the kinetics of chemical reactions in the SRL is composed of homogeneous
reactions creating/consuming intermediates that participate in surface reactions including the
incorporation of Ga and P atoms into the epitaxial film. For the homogeneous part of the surface
kinetics the dipole approximation provides an adequate correlation of the changes in the dielectric
function of the SRL to the activities of randomly distributed reactants and products. No adequate
correlation of the dielectric function exists to the concentrations of strongly bonded surface atoms
and surface molecules. Therefore, quantitative assessments of the heterogeneous kinetics on the
crystal surface cannot use real-time optical monitoring as a reliable basis. © 1996 American
Vacuum Society.


I.  INTRODUCTION 

Research concerning polar-on-nonpolar heterostructures
has been motivated for more than a decade by potential technological
applications and scientific questions of general importance.
However, progress with regard to both technological
and scientific goals has been slow. The primary reasons
for this are as follows.

(1) The formation of defects during early stages of nucleation
and heteroepitaxial overgrowth and their propagation
into the epitaxial film.

(2) The roughening of the surface of the epilayer in the
course of epitaxial growth, complicating the modeling and
control of the kinetics of heteroepitaxial growth.

(3) Interface broadening related to surface reactions, reconstruction,
and interdiffusion.

As discussed in Sec. III in more detail, these three problem
areas are linked, so that their separation and control
poses a considerable challenge. Here we focus on
GaP heteroepitaxy on Si(100) by pulsed chemical beam epitaxy.

II.  EXPERIMENT 

Figures 1(a) and 1(b) show schematic representations of
the experimental arrangement for pulsed CBE. Pulsed ballistic
beams of the source vapors, e.g., tertiary-butylphosphine
(TBP) and triethylgallium (TEG) for the
growth of GaP, are directed onto a substrate wafer, e.g., a
silicon wafer that is heated from the back. For real-time
monitoring of the kinetics of heteroepitaxy, we combine several
complementary nonintrusive optical techniques, such as
p-polarized reflectance spectroscopy (PRS), reflectance difference
spectroscopy (RDS), and laser light scattering
(LLS). In addition, reflection high energy electron diffraction
(RHEED) and mass spectrometric sampling are available on
the system. Here we focus on results of single-wavelength
PRS and LLS. A comparison of these techniques to RDS is
provided in our companion article. PRS is based on the
changes in the reflectivity during the formation of a heteroepitaxial
stack with regard to a chopped beam of
p-polarized laser light of below band gap energy that impinges
onto the surface at the Brewster angle of the substrate.
For monitoring of GaP heteroepitaxy on silicon, we use
HeNe laser radiation at 632.8 nm wavelength, polarized
by a Glan-Thompson polarizer to a ratio of the s-
and p-polarized components of the incident intensity
IJlp^  10“^.  This  corresponds  to  a  residual  reflectivity  of  the 
Si  surface  of  the  order  of  lO”"^.  The  reflected  laser  beam  is 
detected  by  a  photodiode,  the  output  of  which  is  processed 
through  a  phase  sensitive  amplifier  and  read  into  a  computer. 
Also,  the  radiation  that  is  scattered  by  the  front  and  back 
interfaces  of  the  growing  film  into  a  solid  angle  of  5^  is 
focused  by  a  lens  at  a  location  well  removed  from  the  plane 
of  incidence  of  the  impinging  light  beam  onto  the  entrance 


Fig.  1.  Schematic  representation  of  (a)  the  vacuum  chamber  with  real-time  optical  process  monitoring  ports  for  single-wavelength  PR,  LLS,  and  RDS.  (b) 
Schematic  representation  of  source  vapor  switching  and  substrate  stage. 


slit of a photomultiplier. Because of its technological significance,
most of the experimental data discussed here refer to
the Si(100) substrate surface, including vicinal surface orientations,
misoriented toward [011]. Upon appropriate surface
conditioning, the misorientation for such vicinal surfaces is
accommodated by double steps parallel to the [011] direction,
which are energetically favored with regard to single
steps and result in large (2×1) domains with parallel dimer
rows on the top and bottom surfaces of the step.
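The reason for working at the Brewster angle with p-polarized light can be illustrated with the Fresnel equations for a bare silicon surface. The sketch below uses an assumed literature value for the complex refractive index of Si at 632.8 nm. It is an illustration of the principle only and does not attempt to reproduce the residual reflectivity quoted for the actual setup, which also depends on the polarizer extinction ratio.

```python
import cmath
import math

# ASSUMED literature value for crystalline Si at 632.8 nm (n + i*k convention).
N_SI = complex(3.882, 0.019)


def fresnel_reflectance(n_substrate: complex, theta_deg: float):
    """Return (R_s, R_p) for light incident from vacuum onto a bare substrate."""
    theta = math.radians(theta_deg)
    eps = n_substrate * n_substrate
    cos_t = math.cos(theta)
    root = cmath.sqrt(eps - math.sin(theta) ** 2)
    r_s = (cos_t - root) / (cos_t + root)
    r_p = (eps * cos_t - root) / (eps * cos_t + root)
    return abs(r_s) ** 2, abs(r_p) ** 2


# Brewster angle estimated from the real part of the index.
theta_b = math.degrees(math.atan(N_SI.real))
r_s, r_p = fresnel_reflectance(N_SI, theta_b)

print(f"Brewster angle of Si: {theta_b:.2f} degrees")
print(f"R_s at that angle:    {r_s:.3f}")
print(f"R_p at that angle:    {r_p:.2e}")
```

Because R_p of the bare substrate is several orders of magnitude smaller than R_s at this angle, even a thin overlayer or surface reaction layer produces a large relative change in the p-polarized signal, and any s-polarized leakage through the polarizer contributes directly to the background.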

The single-wavelength PR intensity shows periodic variations
in time due to interference between the partial waves
reflected at the top and bottom interfaces of the heteroepitaxial
GaP as illustrated in Fig. 2(a). The periodicity of these
oscillations is related by analytic expressions to the thickness
of the growing epilayer and can be utilized to calculate the
growth rate in real time, which is reviewed in detail in Ref.
3. Superimposed on the interference oscillations is a fine
structure that varies periodically with the periodicity of the
source vapor pulse sequence. Figure 2(a) shows the PR intensity
as a function of time over a period of growth corresponding
to several interference fringes, and in the inset the
magnified fine structure in the region of the second maximum.
In the particular experiment of Fig. 2 the LLS intensity
exhibits only weak variations, that is, the surface roughness
does not undergo a large change from start to finish of the
experiment. The top part of Fig. 2(b) shows a schematic
representation of the pulse sequence, consisting of a
t-butPH2 pulse that is followed by pause No. 1, a TEG pulse
followed by pause No. 2, and so on. The bottom part of Fig. 2(b)
shows the associated response in the PR intensity. From the
periods in the interference fringes and the precursor cycle the
average incremental increase of the GaP epilayer thickness
Fig. 2. (a) PR intensity as a function of time for pulsed CBE growth of GaP
on Si(100), (b) pulse sequence and the associated PR fine structure.

per cycle can be determined for steady state growth with an
accuracy of approximately 5%. The measured time-dependent
LLS intensity provides information on the roughness
of the top and bottom interfaces of the film.
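The way the growth per cycle follows from the two periods can be sketched with the standard thin-film interference condition for a transparent film at oblique incidence. This is a simplified stand-in for the analytic expressions cited in the text: it neglects absorption and the surface reaction layer, the refractive index of GaP and the angle of incidence are assumed values, and the fringe and cycle periods in the example are purely hypothetical.

```python
import math

WAVELENGTH_NM = 632.8   # HeNe probe wavelength (from the text)
N_GAP = 3.31            # ASSUMED refractive index of GaP at 632.8 nm
ANGLE_DEG = 75.5        # ASSUMED angle of incidence, near the Brewster angle of Si


def thickness_per_fringe(wavelength_nm: float, n_film: float, angle_deg: float) -> float:
    """Film thickness (nm) grown between two successive interference maxima."""
    sin_theta = math.sin(math.radians(angle_deg))
    return wavelength_nm / (2.0 * math.sqrt(n_film ** 2 - sin_theta ** 2))


def growth_per_cycle(fringe_period_s: float, cycle_period_s: float,
                     d_fringe_nm: float) -> float:
    """Average thickness increment (nm) per precursor cycle."""
    cycles_per_fringe = fringe_period_s / cycle_period_s
    return d_fringe_nm / cycles_per_fringe


d_fringe = thickness_per_fringe(WAVELENGTH_NM, N_GAP, ANGLE_DEG)
# Hypothetical example: one fringe every 600 s with a 3 s precursor cycle.
per_cycle = growth_per_cycle(600.0, 3.0, d_fringe)
print(f"thickness per fringe: {d_fringe:.1f} nm")
print(f"growth per cycle:     {per_cycle:.3f} nm")
```

Since both periods are read directly from the recorded PR trace, the uncertainty of the result is dominated by how well the film index and the angle of incidence are known.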

III.  CONTROL  OF  DEFECT 
FORMATION/PROPAGATION 

At room temperature, GaP has a lattice constant of 5.4512
Å that nearly matches the lattice constant of Si of 5.4301 Å.
Exact lattice matching of the III-V epilayer to the silicon
substrate is possible at a chosen temperature through alloying,
e.g., formation of a GaNxP1-x solid solution of the appropriate
composition. However, the difference in the thermal
expansion coefficients for the epilayer and the substrate
invariably results in the generation of strain upon thermal
cycles in the processing. Also, antiphase domains (APDs) are
formed irrespective of misfit strain upon III-V heteroepitaxy
on group IV substrate surfaces that exhibit single steps or
odd multiples thereof. Fortunately, one APD variant often
outgrows the other, so that the APDs remain restricted to
regions close to the heterointerface.
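The size of the room-temperature mismatch implied by the two lattice constants quoted above can be worked out directly. The snippet uses only those two values and defines the misfit relative to the substrate, which is one common convention.

```python
A_GAP = 5.4512  # Å, GaP at room temperature (from the text)
A_SI = 5.4301   # Å, Si at room temperature (from the text)

# Misfit defined relative to the substrate lattice constant.
misfit = (A_GAP - A_SI) / A_SI
print(f"GaP/Si lattice misfit: {misfit * 100:.2f} %")
```

The film lattice constant is the larger of the two, so a coherent GaP layer on Si is under in-plane compression at room temperature.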

At  interfaces  between  singular  Si  and  III-V  faces  that  are 
polar,  such  as  {100}  or  {111},  either  faceting  into  nonpolar 


Fig.  3.  Cross  sectional  HREM  image  of  a  GaP/Si(100)  interface  prepared  by 
CBE  using  TEG  and  TBP  as  source  materials. 


surface elements, such as {110} or {112}, or intermixing have
been predicted based on assessments of interface charge.
Figure 3 shows a high resolution transmission electron microscopy
(HREM) image of a cross section of a GaP/Si(100)
heterostructure created by pulsed CBE. No evidence for
faceting is observed. However, charge build-up at the polar
interface can contribute to interface broadening. In addition,
there are other mechanisms of interface broadening such as
interdiffusion, which may be enhanced by the formation of
native point defects as part of the interfacial reactions. Therefore,
it is presently impossible to determine whether or not
electrostatic effects play a significant role. Recent STM investigations
provide insights into the early stages of the interactions
of the Si(100) 2×1 surface with phosphine. At
low to intermediate dose, the Si dimers on the Si(100) 2×1
surface are completely replaced by Si-P heterodimers, pushing
Si surface atoms from their original Si-dimer positions
into adatom positions. Since these Si adatoms are likely to
become incorporated into the epitaxial film upon further
growth, they contribute to interface broadening. On the
P-saturated surface, the Si-Si dimers are replaced by P-P
homodimers generating surface strain that is relieved, in part,
by the generation of shallow trenches parallel to the dimer
rows that are bound by elongated {111} facets and can affect
the kinetics of subsequent heteroepitaxial overgrowth. In
particular, the {111} facets in the Si(100) surface upon overgrowth
by GaP can act as nucleation sites for the formation
of stacking faults that propagate deeply into the heteroepitaxial
film. Driving the phosphorus coverage of the surface
to saturation thus is not necessarily a desirable condition of
GaP heteroepitaxy on Si, which must be weighed against the
control of stoichiometry, requiring a large V:III source vapor
flux ratio.

Fig. 4. Auger electron spectroscopy line scans at the energies of the (a) Ga-LMM, (b) P-LVV, (c) O-KLL, and (d) Si-LVV lines over a SiO2-masked silicon
surface with 10×10 μm window areas filled with epitaxial GaP.

An  important  cause  of  defect  formation  is  residual  con¬ 
tamination  of  the  silicon  surface.  In  our  experiments,  a  modi¬ 
fied  RCA  cleaning  procedure  ending  with  an  aqueous  HF  dip 
was used to create a hydrogen-passivated Si(100) 1×1 surface prior to transfer into the CBE chamber through a load-lock. This initial reconstruction was verified by RHEED. The
control  of  the  pH  and  the  concentration  of  the  etching  solu¬ 
tion,  absence  of  dissolved  oxygen,  and  the  conditions  of  oxi¬ 
dation  prior  to  HF  (or,  for  {111}  substrate  wafers,  NH4F) 
etching  are  all  important  factors  in  controlling  the  surface 
roughness  and  chemical  integrity  of  the  hydrogen-terminated 
silicon  surface.^^"^^  Recent  quantitative  high  resolution  XPS 


studies of carefully prepared hydrogen-terminated Si(001) (1×1) surfaces revealed surface concentrations of residual
oxygen  and  fluorine  atoms  in  the  mid- 10^^  cm~^  range. 
Because  of  the  high  selectivity  of  the  epitaxial  growth  of 
GaP on Si vs SiO2, this level of residual contamination can
make  a  significant  contribution  to  the  generation  of  defects. 

Figure 4 shows Auger electron spectroscopy (AES) line scans across 10×10 μm windows in a SiO2 mask on a Si(100) substrate after deposition of heteroepitaxial GaP in the window areas at (a) 120, (b) 1070, (c) 510, and (d) 92 eV. They reveal a substantially larger kinetic barrier to nucleation of GaP on the SiO2-masked surface areas as compared to the bare Si window areas. Although high selectivity is a
welcome  property  in  the  context  of  device  fabrication,  in  the 
context  of  defect  formation,  the  kinetic  barrier  to  overgrowth 
of  surface  areas  contaminated  by  oxygen  is  a  detriment. 
Upon  forced  overgrowth  of  such  contaminated  surface 
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patches,  defects  may  form  in  the  epitaxial  film  due  to  the 
generation  of  strain.  This  strain  may  be  relieved  by  the  for¬ 
mation  of  60°  dislocations  that  dissociate  into  partials.  Also, 
for  amorphous  contaminant  patches,  loss  of  registry  to  the 
underlying  substrate  lattice  can  cause  the  formation  of  stack¬ 
ing  faults.  Similar  experiments  conducted  on  partially  SiC- 
coated  Si  surfaces  show  that  the  overgrowth  by  GaP  is  also 
highly  selective  with  regard  to  SiC,  that  is,  carbon  contami¬ 
nation  of  the  silicon  surface  is  also  of  concern.  Burying  the 
surface  contaminants  under  a  homoepitaxial  layer  of  silicon 
that  exposes  a  virgin  surface  at  the  initiation  of  heteroepitaxy 
can result in improvements, because the stacking fault energy for silicon (65 mJ/m²) is substantially larger than for GaP (41 mJ/m²).^^ The relatively small stacking fault energy of GaP is a consequence of the decrease of the stacking fault energy in the zincblende structure III-V compounds with increasing ionicity of the bonding. Thus, the structure becomes unstable in the electronegativity difference range between AlP, which has zincblende structure at slightly larger lattice constant than silicon, and InN, which has wurtzite structure at smaller lattice constant than silicon. Therefore, the insistence on a nearly lattice-matched materials combination generates conditions where the gains in minimizing strain are paid for by an increased probability of stacking errors. Indeed, high densities of stacking faults have been observed in GaP/Si(100) heterostructures by TEM.^®

Since  in  zincblende  structure  compounds  stacking  faults 
nucleate on {111} and, for GaP growth on Si(100), the GaP nuclei are bounded by {111} facets, the formation of three-dimensional faceted GaP islands in the initial period of
nucleation  and  overgrowth  of  the  Si  surface  must  be  cur¬ 
tailed  to  control  the  density  of  planar  defects  in  the  hetero¬ 
structure.  The  aspect  ratio  A  of  height  to  width  of  the  nuclei 
depends  on  both  thermodynamic  and  kinetic  parameters.  The 
assessment  of  habit  of  heteroepitaxial  nuclei  governed  by 
thermodynamics  is  possible  on  the  basis  of  the  modified 
Wulff’s  law 

$$\frac{\sigma_{hkl} - \alpha_{hkl}}{h'_{hkl}} = \frac{\Delta\mu}{2\upsilon}, \qquad (1)$$

stating that the habit of the nuclei, formed under the conditions of heteroepitaxy on the substrate surface of orientation (hkl), is related to the equilibrium habit of homogeneously formed three-dimensional nuclei at the same supersaturation $\Delta\mu$ by truncation. This truncation is characterized by the central distance $h'_{hkl}$ in the heterogeneously formed nucleus to the interface of orientation (hkl), which, for a given supersaturation and volume $\upsilon$ of the crystal building blocks, depends on the relative magnitudes of the surface free energy $\sigma_{hkl}$ on the facet (hkl) of the homogeneously formed nucleus and the interfacial attachment energy $\alpha_{hkl}$. A three-dimensional nucleation mechanism thus ensues for $0 < \alpha_{hkl} < 2\sigma_{hkl}$, with decreasing aspect ratio A (height/width) of the three-dimensional nuclei for increasing $\alpha_{hkl}$, approaching a two-dimensional nucleation and growth mechanism in the limit $\alpha_{hkl} \to 2\sigma_{hkl}$. However, in addition to thermodynamic causes
for  three-dimensional  overgrowth,  kinetic  enhancements  of 


A result from the catalysis of TBP decomposition on the GaP surface elements. Upon growth, the three-dimensional nuclei thus formed expand laterally and meet with patches of surface contaminants that are randomly distributed on the silicon surface. In view of the large barrier to GaP growth on the surface of the oxide patch, its overgrowth by the GaP nucleus requires a substantial fluctuation in the chemical potential that, given the relatively low stacking fault energy, results in a high probability of stacking fault formation. Although it is thus difficult to avoid the nucleation of stacking faults, their propagation into the III-V epilayer can be minimized by structuring of the Si(100) surface into V grooves bounded by (111) and $(1\bar{1}\bar{1})$ facets along $[0\bar{1}1]$. This concept has been applied successfully in controlling stacking fault propagation in InP/Si(100) heterostructures.^^"^"^

The blockage of the propagation of stacking faults is explained by interactions between the partial dislocations that support them. Stacking faults form upon dissociation of unit 60° dislocations on (111) and $(1\bar{1}\bar{1})$ according to

$$b_1 = \frac{a}{2}[\bar{1}10] = \frac{a}{6}[\bar{2}11] + \frac{a}{6}[\bar{1}2\bar{1}] = b_2 + b_3, \qquad (2)$$

$$b_4 = \frac{a}{2}[101] = \frac{a}{6}[211] + \frac{a}{6}[1\bar{1}2] = b_5 + b_6, \qquad (3)$$

respectively, where $b_1$ and $b_4$ refer to the Burgers' vectors of the 60° dislocations and $b_2$, $b_3$ and $b_5$, $b_6$ refer to the Burgers' vectors of the associated partials. This is illustrated in Fig. 5(a). Because of their smaller length as compared to $b_1$ and $b_4$, the dissociation reactions are energetically favored and provide a driving force for the generation of the stacking faults suspended between the partials. For InP, which has a stacking fault energy of 18 mJ/m², large separations between the partials have been observed.^^ Therefore, for submicrometer groove depth, the faults can expand after nucleation over the entire V-groove facet. Upon interaction between the partials with Burgers' vectors $b_3$ and $b_6$ on the intersection of the two glide planes according to

$$\frac{a}{6}[1\bar{1}2] + \frac{a}{6}[\bar{1}2\bar{1}] \rightarrow \frac{a}{6}[011] = b_7, \qquad (4)$$

Lomer-Cottrell sessile dislocations of pure edge character can be formed. Although this dislocation cannot move by glide, in principle, it can move by climb. However, dislocation climb in this case is tied to motion of the entire stacking fault, which is highly unlikely. In view of the polar nature of the dislocations and possible presence of other defects, that is, APBs and misfit dislocations near the interface, more complex interactions can occur and require further study. Also, the above discussed mechanism of self-termination of stacking fault propagation relies on perfect faceting at a submicrometer scale and the absence of nucleation of stacking faults on {111} variants other than (111) and $(1\bar{1}\bar{1})$ due to the faceting of three-dimensional GaP nuclei or uncontrolled formation of microfacets. The first of these two potential problems can be addressed on the basis of optical process monitoring during the initial phase of GaP nucleation and overgrowth of the Si surface by a contiguous GaP film.

Fig. 5. Initial PR intensity traces for pulsed CBE of GaP on Si(100): (a) at small Ga supersaturation; (b) under optimized processing conditions; (c) at large Ga supersaturation.
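
The dislocation reactions of Eqs. (2)-(4) can be checked with exact arithmetic. The following is a minimal sketch, not part of the original analysis. It assumes the index signs b1 = (a/2)[-110], b2 = (a/6)[-211], b3 = (a/6)[-12-1], b4 = (a/2)[101], b5 = (a/6)[211], b6 = (a/6)[1-12], glide planes (111) and (1-1-1), and a groove direction [0-11]; these signs are an assumption chosen so that both sides of each reaction balance. All function and variable names are illustrative.

```python
from fractions import Fraction


def burgers(num, den, hkl):
    """Burgers vector (num/den)*[hkl] in units of the lattice constant a."""
    return tuple(Fraction(num, den) * c for c in hkl)


def add(u, v):
    return tuple(x + y for x, y in zip(u, v))


def dot(u, v):
    return sum(x * y for x, y in zip(u, v))


def b_squared(u):
    """Squared length; the dislocation line energy scales with this quantity."""
    return dot(u, u)


# Assumed index signs (overbars written as minus signs).
b1 = burgers(1, 2, (-1, 1, 0))
b2 = burgers(1, 6, (-2, 1, 1))
b3 = burgers(1, 6, (-1, 2, -1))
b4 = burgers(1, 2, (1, 0, 1))
b5 = burgers(1, 6, (2, 1, 1))
b6 = burgers(1, 6, (1, -1, 2))
b7 = add(b6, b3)  # product of the reaction in Eq. (4)

plane_a = (1, 1, 1)    # assumed glide plane of b1, b2, b3
plane_b = (1, -1, -1)  # assumed glide plane of b4, b5, b6
groove = (0, -1, 1)    # intersection line of the two glide planes

# Eqs. (2) and (3): the partials must add up to the unit dislocation.
dissociation_2_balances = add(b2, b3) == b1
dissociation_3_balances = add(b5, b6) == b4

# Frank's criterion: a reaction is favored if the summed b^2 decreases.
dissociation_2_favored = b_squared(b2) + b_squared(b3) < b_squared(b1)
dissociation_3_favored = b_squared(b5) + b_squared(b6) < b_squared(b4)
reaction_4_favored = b_squared(b7) < b_squared(b3) + b_squared(b6)

# b7 is perpendicular to the dislocation line (pure edge) and lies in
# neither glide plane (sessile).
b7_is_pure_edge = dot(b7, groove) == 0
b7_is_sessile = dot(b7, plane_a) != 0 and dot(b7, plane_b) != 0
```

Under these assumed signs, each dissociation lowers the summed b² from 1/2 to 1/3 (in units of a²), and the reaction of Eq. (4) lowers it from 1/3 to 1/18, which is consistent with the energetic argument in the text.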
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IV.  INITIAL  PHASE  OF  NUCLEATION  AND 
OVERGROWTH 

Since,  under  the  conditions  of  vapor  phase  epitaxy,  the 
dielectric  function  of  the  ambient  is  smaller  than  the  dielec¬ 
tric  function  of  the  deposited  compound,  a  corrugated  incom¬ 
plete  heteroepitaxial  film  is  associated  with  an  effective  di¬ 
electric  function  that  increases  with  decreasing  pore  size 
upon  progressive  overgrowth  of  the  substrate  surface  from 
initially  close  to  the  ambient  dielectric  function  to  the  dielec¬ 
tric  function  of  the  contiguous  heteroepitaxial  film.  Conse¬ 
quently,  during  this  initial  phase  of  heteroepitaxy,  distinct 
features  are  observed  in  the  time-dependent  PR  intensity, 
marked  by  a  vertical  arrow  in  Fig.  5(a).  Figures  5(a)-5(c) 
represent  a  set  of  experiments  carried  out  under  the  same 
conditions,  except  for  different  TEG  flux  in  the  initial  20  s  of 
the  experiment.  At  small  TEG  flux,  that  is,  small  supersatu¬ 
ration  of  the  surface  with  regard  to  Ga  precursors  to  growth, 
the  low  nucleation  rate  results  in  the  formation  of  widely 
spaced  nuclei  that  grow  to  substantial  height  before  coalesc¬ 
ing  into  a  contiguous  epilayer.  The  associated  roughening  of 
the  surface  is  observed  in  Fig.  5(a)  in  the  time-dependent 
LLS  intensity  and  is  also  marked  by  a  vertical  arrow.  At  the 
time  of  complete  coverage  of  the  substrate  surface  by  a  con¬ 
tiguous  heteroepitaxial  film,  that  is,  at  the  end  of  the  period 
labeled  reflectance  and  LLS  signals  merge  into  the 

time  dependence  for  contiguous  film  growth.  The  magnitude 
of the PR and LLS intensity changes in this initial period depends on the aspect ratio A of the height to width of the GaP islands. Ex situ atomic force microscopy investigations reported by us elsewhere confirm the roughening of the
surface  relative  to  the  initial  roughness  of  the  silicon  surface 
in  the  nucleation  period  under  the  conditions  of  Fig.  5(a),  the 
subsequent  smoothening  of  the  epilayer,  once  a  contiguous 
GaP  film  has  been  formed,  and  the  increase  of  surface  rough¬ 
ness  upon  further  growth.^^  This  correlation  of  the  initial 
feature  during  the  nucleation  stage  to  significant  roughening 
in  the  later  stages  of  steady-state  film  growth  has  been  ob¬ 
served  consistently  in  several  experiments.  Therefore,  we 
conclude  that  improvements  in  the  control  of  A  in  the  initial 
nucleation  and  overgrowth  phase  can  affect  the  quality  of  the 
epilayer  during  later  stages  of  growth. 

Figure  5(b)  shows  the  result  of  increasing  the  TEG  flux  in 
the  first  20  s  of  the  nucleation  and  overgrowth  period  at  the 
same  TBP  flux  as  in  Fig.  5(a),  increasing  the  supersaturation 
on  the  surface.  Since  at  higher  supersaturation  a  higher  den¬ 
sity  of  nuclei  of  smaller  height  is  generated  and  an  increased 
supply  of  nutrients  exists  between  these  islands,  they  merge 
into  a  contiguous  film  upon  a  shorter  period  of  lateral 
growth.  In  addition,  the  higher  Ga  supersaturation  on  the 
silicon  surface  elements  between  the  GaP  nuclei  should  re¬ 
duce  the  local  differences  in  the  catalytic  properties  on  the 
partially  GaP  coated  surface.  In  contrast  to  Fig.  5(a),  no  ex¬ 
cess  intensity  in  the  time-dependent  PR  and  LLS  signals  is 
observed  during  the  initial  phase  of  film  formation  under  the 
conditions  of  Fig.  5(b).  However,  there  exists  a  sensitive  up¬ 
per  limit  in  the  initial  increase  of  the  TEG  flux  at  a  given 
TBP  flux,  above  which  gallium  droplets  form.  Also,  the  tim- 
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ing  of  the  merging  of  the  initially  higher  TEG  flux  into  the 
smaller  optimum  TEG  flux  for  steady-state  growth  of  the 
contiguous  heteroepitaxial  film  is  critical  to  avoid  phase 
separation  on  the  surface.  Thus,  upon  exceeding  a  critical 
upper  limit  for  the  gallium  supersaturation,  surface  roughen¬ 
ing  due  to  Ga  droplet  formation  results  in  a  rapid  deteriora¬ 
tion  of  the  surface  morphology,  as  illustrated  in  Fig.  5(c), 
which  shows  a  clearly  resolved  nucleation  peak  in  the  PR 
intensity — marked  by  a  vertical  arrow — and  an  immediate 
continuous  rise  in  the  LLS  intensity  that  accelerates  in  later 
periods  of  growth  not  shown  in  Fig.  5(c).  Under  extreme 
conditions, a vapor-liquid-solid (VLS) growth mechanism
may  take  over,  leading  to  very  large  aspect  ratios  of  the 
surface  features. 

V.  KINETICS  OF  STEADY-STATE 
HETEROEPITAXIAL  GROWTH 

Details  of  the  growth  kinetics  after  complete  sealing  of 
the  silicon  surface  by  a  contiguous  GaP  film  are  revealed  by 
the  response  in  the  time-dependent  PR  intensity  to  variations 
in  the  source  vapor  cycle  and  perturbations  of  the  steady- 
state  surface  composition.  For  example,  in  a  preceding 
publication,^^  we  have  shown  that  the  growth  rate  per  cycle 
$V_c$ for constant source vapor pulse heights and widths is insensitive to the duration of pause No. 1 between the trailing edge of the TBP pulse and the leading edge of the following TEG pulse. However, $V_c$ increases with increasing length of pause No. 2 between the trailing edge of the TEG pulse and the leading edge of the following TBP pulse. We
interpreted  this  result  as  a  consequence  of  a  kinetic  barrier  to 
the  complete  dealkylation  of  the  impinging  TEG  molecules 
on  the  GaP  surface.  Their  conversion  into  surface  species 
that  contribute  to  the  growth,  for  example,  Ga  surface  atoms, 
represents the rate-limiting step in the heteroepitaxial growth process. Thus, a reservoir of TEG fragments that do not directly contribute to epitaxial growth coexists on the surface with active fragments that are derived from the inactive fragments over time and feed the GaP growth. Since $V_c$ can correspond to several bilayers of GaP per cycle,^ in
steady  state,  a  store  of  up  to  several  monolayers  of  the  source 
vapor  fragments  must  exist  on  the  surface  to  sustain  the 
growth.  Therefore,  the  modeling  of  steady-state  heteroepi¬ 
taxial  growth  must  be  based  on  a  four-layer  stack:  (a)  ambi¬ 
ent;  (b)  surface  reaction  layer  (SRL);  (c)  heteroepitaxial  film; 
(d)  substrate.  The  SRL  is  strictly  stabilized  by  the  kinetics  of 
the  primary  pyrolysis  and  follow-up  reactions.  Since  only 
part  of  the  molecules  in  the  SRL  are  attached  to  the  surface, 
under  this  specific  condition  of  pulsed  CBE,  part  of  the 
chemical  reactions  on  the  surface  proceed  homogeneously 
between molecules that are not at all or only weakly bound to
the  surface  atoms  of  the  GaP  film  and  consequently  are  ran¬ 
domly  oriented.  Thus,  a  set  of  coupled  parallel  and  consecu¬ 
tive  homogeneous  reactions  within  the  SRL  proceed  simul¬ 
taneously  with  heterogeneous  surface  reactions  and 
crystallization  processes.  The  latter  heterogeneous  reactions 
are  coupled  to  the  homogeneous  reactions  through  interme¬ 
diates. However, there exist also competing reaction pathways, e.g., β elimination of radical ligands by reaction with
adsorbed  hydrogen  atoms  in  parallel  to  homogeneous  radical 
chemistry  in  the  SRL.  Since  the  bonding  of  products  to  the 
surface  strengthens  as  the  radical  ligands  are  shed,  the  ho¬ 
mogeneous  reactions  will  dominate  the  initial  steps  of 
dealkylation  while  denuded  products,  such  as  Ga  and  P  at¬ 
oms  that  bond  strongly  to  the  surface  and  eventually  incor¬ 
porate  into  the  GaP  lattice,  will  probably  not  be  present  at 
large  concentration  in  the  random  part  of  the  SRL. 

The complex reflectivity coefficient for p-polarized light impinging onto the four-layer stack for a specific value of the film thickness can be calculated on the basis of the Fresnel equations.^^ In addition to knowledge of the dielectric functions of the Si substrate, the GaP film, and the ambient, the angle of incidence, and the wavelength, this requires an assessment of the dielectric function of the surface reaction layer and its thickness. Generally, both the dielectric function and thickness of the SRL are functions of time.
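
The four-layer calculation described above can be sketched as follows. This is a minimal illustration, assuming the standard Fresnel coefficients for p-polarized light and the usual recursive combination of interfaces; it is not the authors' code. The dielectric functions used for Si and GaP are rough illustrative values near 632.8 nm, and the SRL dielectric function, the layer thicknesses, and the angle of incidence are hypothetical.

```python
import cmath
import math


def kz(eps, eps_ambient, phi0):
    """Normal wave-vector component in a layer, in units of 2*pi/wavelength."""
    return cmath.sqrt(eps - eps_ambient * math.sin(phi0) ** 2)


def r_p(eps_i, eps_j, eps_ambient, phi0):
    """Fresnel coefficient for p-polarized light at the interface i -> j."""
    k_i = kz(eps_i, eps_ambient, phi0)
    k_j = kz(eps_j, eps_ambient, phi0)
    return (eps_j * k_i - eps_i * k_j) / (eps_j * k_i + eps_i * k_j)


def add_layer(r_top, r_below, eps_layer, d_layer, eps_ambient, phi0, wavelength):
    """Combine the top interface of a layer with everything beneath it."""
    phase = 2 * math.pi * d_layer / wavelength * kz(eps_layer, eps_ambient, phi0)
    factor = cmath.exp(2j * phase)
    return (r_top + r_below * factor) / (1 + r_top * r_below * factor)


def four_layer_rp(eps_a, eps_srl, eps_film, eps_sub, d_srl, d_film, phi0, wavelength):
    """Complex p-polarized reflection coefficient of ambient/SRL/film/substrate."""
    r_film_sub = r_p(eps_film, eps_sub, eps_a, phi0)
    r_srl_film = r_p(eps_srl, eps_film, eps_a, phi0)
    r_amb_srl = r_p(eps_a, eps_srl, eps_a, phi0)
    r_below_srl = add_layer(
        r_srl_film, r_film_sub, eps_film, d_film, eps_a, phi0, wavelength
    )
    return add_layer(r_amb_srl, r_below_srl, eps_srl, d_srl, eps_a, phi0, wavelength)


# Illustrative inputs only (thicknesses and wavelength in nm).
EPS_AMBIENT = 1.0
EPS_SRL = 4.0 + 0.5j    # hypothetical SRL dielectric function
EPS_GAP = 11.0          # rough value for the GaP film
EPS_SI = 15.1 + 0.15j   # rough value for the Si substrate
PHI0 = math.radians(72.0)  # hypothetical angle of incidence
REFLECTANCE = abs(
    four_layer_rp(EPS_AMBIENT, EPS_SRL, EPS_GAP, EPS_SI, 1.0, 100.0, PHI0, 632.8)
) ** 2
```

Evaluating `four_layer_rp` for a sequence of film thicknesses and time-dependent SRL parameters would trace out the interference oscillations and the superimposed fine structure discussed in the text.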

For a preliminary assessment, we recently introduced a simplified model that reduces the complex surface reaction scheme discussed above to an exponential build-up of the SRL thickness upon exposure of the surface to a pulse of TBP, followed by an exponential decay to zero thickness after supply of the pulse of TEG, which is accompanied by an incremental increase in the thickness of the GaP epilayer.^^ This model reveals two important features in the PR intensity:

(i) A periodic increase and decrease in the amplitude of the fine structure that is tied to the interference oscillations. The relative positions of minima in the fine structure amplitude with respect to the positions of the minima of the interference oscillations depend on the value of the dielectric function of the SRL.

(ii)  A  change  of  the  signs  of  the  response  to  the  TBP  and 
TEG  pulses  at  the  minima  in  the  fine  structure  amplitude. 
Because  of  the  turnover  in  the  pulse  response  at  these  posi¬ 
tions,  we  refer  to  them  as  turning  points  in  the  PR  fine  struc¬ 
ture. 
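
The simplified model behind features (i) and (ii) can be written down compactly. The sketch below assumes a single pulse cycle in which the SRL thickness builds up exponentially from the start of the TBP pulse and decays exponentially from the start of the TEG pulse; the saturation thickness and both time constants are hypothetical parameters, not values from the experiments.

```python
import math


def srl_thickness(t, t_tbp, t_teg, d_sat, tau_build, tau_decay):
    """SRL thickness at time t within one pulse cycle (same unit as d_sat).

    t_tbp: time at which the TBP pulse starts the build-up
    t_teg: time at which the TEG pulse starts the decay (t_teg > t_tbp)
    """
    if t < t_tbp:
        return 0.0
    if t < t_teg:
        # Exponential build-up toward the saturation thickness d_sat.
        return d_sat * (1.0 - math.exp(-(t - t_tbp) / tau_build))
    # Exponential decay toward zero, starting from the thickness reached at t_teg.
    d_at_teg = d_sat * (1.0 - math.exp(-(t_teg - t_tbp) / tau_build))
    return d_at_teg * math.exp(-(t - t_teg) / tau_decay)


# Hypothetical cycle: TBP at 0 s, TEG at 1.5 s, times in s, thickness in nm.
CYCLE = [srl_thickness(0.1 * i, 0.0, 1.5, 1.0, 0.5, 0.3) for i in range(31)]
```

Feeding this time-dependent thickness into a four-layer reflectance calculation, together with the incremental growth of the GaP film per cycle, would reproduce the periodic fine structure qualitatively.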

Here we expand the model by adding an assessment of the changes in the dielectric function due to changes in the composition of the homogeneous part of the SRL as the growth process progresses. Since in this case the molecules are randomly spaced and oriented, the electronic transitions contributing to the dielectric function refer to localized states. Thus, the dielectric function can be represented by a modification of the Sellmeier equation, summing the contributions of all constituent molecules, that is,

$$\epsilon = 1 - \frac{4\pi}{V} \sum_{l} \sum_{k} \sum_{Q_{kl},Q'_{kl}} \frac{2\,|\xi\cdot\mu_{Q_{kl}Q'_{kl}}|^{2}\,E_{Q_{kl}Q'_{kl}}}{E_{\omega}^{2} - E_{Q_{kl}Q'_{kl}}^{2}}, \qquad (5)$$

where V is the volume of the surface reaction layer, $\xi$ and $\mu_{Q_{kl}Q'_{kl}}$ represent the unit polarization vector and the matrix element of the electric dipole operator for the electronic transition between filled states $Q_{kl}$ and empty states $Q'_{kl}$ associated with the individual molecule labeled by subscript k of the constituent labeled by subscript l, $E_{Q_{kl}Q'_{kl}}$ is the energy of this transition, and $E_{\omega} = \hbar\tilde{\omega} = \hbar(\omega + i\eta)$ is the energy at which $\epsilon$ is evaluated in the limit of turn-on parameter $\eta \to 0^{+}$. Since all molecules of a particular constituent have the same molecular orbitals and electric dipole-allowed transitions, the summation over k results simply in the number of molecules of each species. Also, the number of molecules of one particular constituent l, divided by the volume, represents its concentration $c_l$ in the surface reaction layer. Therefore, Eq. (5) provides the desired correlation between the composition of the homogeneous part of the SRL and the dielectric function.

However, Eq. (5) does not account for intermolecular interactions, which in view of the high concentrations of at least some of the constituents must be considered. Therefore, we replace the concentrations of all constituents of the surface reaction layer by activities, that is, $a_l = \gamma_l c_l$, with a specific activity coefficient $\gamma_l$ for each constituent that generally depends on both temperature and composition. Then,

$$\epsilon = 1 - 4\pi \sum_{l} a_l \sum_{Q_l,Q'_l} \frac{2\,|\xi\cdot\mu_{Q_lQ'_l}|^{2}\,E_{Q_lQ'_l}}{E_{\omega}^{2} - E_{Q_lQ'_l}^{2}}, \qquad (6)$$

where the first sum extends over all constituents of the surface reaction layer and the second sum extends over the electric dipole allowed transitions from filled states for each of these constituent molecules.
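
The structure of this activity-weighted Sellmeier sum can be illustrated numerically. The sketch below is an assumption-laden illustration rather than the authors' implementation: all prefactors and the squared dipole matrix element of each transition are lumped into a single hypothetical `strength` parameter, and the activities, strengths, and transition energies are invented values.

```python
def srl_dielectric(energy_ev, constituents, eta_ev=1e-3):
    """Dielectric function of the SRL at photon energy energy_ev.

    constituents: list of (activity, transitions), where transitions is a list
    of (strength, transition_energy_ev) pairs. 'strength' is a hypothetical
    lumped parameter absorbing the prefactors and dipole matrix element.
    eta_ev: small positive turn-on parameter (imaginary part of the energy).
    """
    e_omega = complex(energy_ev, eta_ev)
    eps = 1.0 + 0.0j
    for activity, transitions in constituents:
        for strength, e_t in transitions:
            eps -= activity * strength * e_t / (e_omega ** 2 - e_t ** 2)
    return eps


# Two hypothetical constituents with invented UV transitions.
CONSTITUENTS = [
    (0.6, [(2.0, 5.0), (1.0, 6.5)]),
    (0.4, [(3.0, 4.5)]),
]
EPS_AT_HENE = srl_dielectric(1.96, CONSTITUENTS)  # 632.8 nm is about 1.96 eV
```

Because the sum is linear in the activities, a small change in the composition of the SRL maps directly onto a proportional change in the deviation of the dielectric function from unity, which is the correlation exploited in the analysis.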

Note that Eqs. (5) and (6) rely on the assumption that the volume of the surface reaction layer is constant in time, which holds only in the limit of small variations in its composition. Therefore, we restrict the following discussion of specific correlations between surface reaction layer composition and responses in the PRS intensity to experiments carried out at constant temperature with a preceding reproducible surface conditioning step and small variations in composition. Within these restrictions, the activity coefficients can be considered to be constants, which enables the analysis of the homogeneous reactions in the SRL on the basis of analytic expressions for the integrated rate laws.

The  pyrolysis  of  the  primary  source  molecules  impinging 
onto  the  SRL  is  represented  by 


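$$\mathrm{C_4H_9PH_2} \rightarrow \mathit{C_4H_9^{\cdot}} + \mathit{PH_2^{\cdot}}, \qquad (7)$$

$$\mathrm{Ga(C_2H_5)_3} \rightarrow \mathit{Ga(C_2H_5)_2^{\cdot}} + \mathit{C_2H_5^{\cdot}}. \qquad (8)$$

In these equations, fragments located in the SRL are represented in italics and molecules that are expelled into the vapor phase and pumped off are represented in plain lettering. The t-butyl-phosphine fragments and ethyl radicals can decompose further under the generation of adsorbed hydrogen atoms according to

$$\mathit{C_2H_5^{\cdot}} \rightarrow \mathrm{C_2H_4} + \mathit{H^{\cdot}}, \qquad (9)$$

$$\mathit{C_4H_9^{\cdot}} + \mathrm{C_4H_9PH_2} \rightarrow \mathrm{C_4H_{10}} + \mathit{C_4H_9PH^{\cdot}}, \qquad (10)$$

$$\mathit{C_4H_9PH^{\cdot}} \rightarrow \mathit{C_4H_9^{\cdot}} + \mathit{{:}PH}, \qquad (11)$$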

(12) 

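$$\mathit{PH_2^{\cdot}} \rightarrow \mathit{{:}PH} + \mathit{H^{\cdot}}, \qquad (13)$$

$$\mathit{{:}PH} \rightarrow \mathit{{:}P^{\cdot}} + \mathit{H^{\cdot}}, \qquad (14)$$

that assist the dealkylation of the TEG fragments on the surface according to

$$\mathit{Ga(C_2H_5)_2^{\cdot}} + \mathit{H^{\cdot}} \rightarrow \mathit{Ga(C_2H_5)} + \mathrm{C_2H_6}, \qquad (15)$$

$$\mathit{Ga(C_2H_5)} + \mathit{H^{\cdot}} \rightarrow \mathit{Ga^{\cdot}} + \mathrm{C_2H_6}, \qquad (16)$$

in competition with

$$\mathit{C_2H_5^{\cdot}} + \mathit{H^{\cdot}} \rightarrow \mathrm{C_2H_6}, \qquad (17)$$

$$\mathit{C_4H_9^{\cdot}} + \mathit{H^{\cdot}} \rightarrow \mathrm{C_4H_{10}}, \qquad (18)$$

$$\mathit{{:}PH} + \mathit{H^{\cdot}} \rightarrow \mathit{PH_2^{\cdot}}, \qquad (19)$$

$$\mathit{PH_2^{\cdot}} + \mathit{H^{\cdot}} \rightarrow \mathrm{PH_3}, \qquad (20)$$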

and 

(21) 

The final step in the overall reaction of TBP and TEG to form GaP is the incorporation of Ga and P atoms into the GaP lattice. As established in a series of experiments using deuterated TBP by Li et al.^ for the growth of GaP using TBP and trimethylgallium (TMG) as source vapors, adsorbed PH2 and PH radicals play an important direct role in the pyrolysis of both TBP according to reactions (11)-(14) and TMG according to

$$\mathit{Ga(CH_3)_y} + \mathit{PH_x} \rightarrow \mathit{PH_{x-1}} + \mathit{Ga(CH_3)_{y-1}} + \mathrm{CH_4}, \qquad x = 1,2 \text{ and } y = 1,2. \qquad (22)$$

Although there exist significant differences in the interactions of TEG and TMG fragments with the GaP surface, we expect that adsorbed $PH_x$ also plays an important role in the heterogeneous dealkylation of TEG and its fragments. All
reactants  and  products  shown  in  the  equations  in  italics  con¬ 
tribute  to  the  composition  of  the  surface  reaction  layer, 
which  exists  strictly  on  the  basis  of  kinetic  barriers  to  the 
desorption  of  its  constituents,  with  the  exception  of  gallium, 
which  has  at  the  growth  temperature  a  vapor  pressure  well 
below  the  background  pressure  in  the  reactor.  We  note  that 
the  set  of  equations  written  above  is  not  exhaustive;  that  is, 
other  reactions  may  be  added  as  our  knowledge  of  details 
grows.  Since  the  heterogeneous  reactions  that  contribute  to 
crystal  growth  probably  entail  competition  for  active  surface 
sites  and  replacement  of  inhibitors  at  such  sites,  it  is  reason¬ 
able  to  expect  that  they  will  be  slower  than  the  homogeneous 
radical reactions in the SRL. Also, the denuded fragments, e.g., gallium atoms and possibly monoethylgallium (MEG), are expected to be bonded more tightly to the surface than organometallic fragments that retain several ligands, e.g., diethylgallium (DEG). Therefore, we limit the discussion of the homogeneous kinetics to reactions involving primary and secondary products of the pyrolysis reaction. Under conditions of pulsed growth, the concentrations and activity coefficients of all reactants and products vary with time, and reactions (9)-(22) are coupled through the bimolecular reactions, having surface hydrogen as one of the reactants. No analytical solutions are available for the quantitative analysis of the PR intensity in this case, requiring the application of numerical methods.

Fig. 6. Schematic representation of the timing of the source gas flow (top) and response in the PR intensity to a TEG probe pulse during interruptions of steady state and switching to continuous TBP flow (0.8 sccm) during the interruption.
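
To illustrate what such a numerical treatment involves, the sketch below integrates a deliberately reduced network with a classical fourth-order Runge-Kutta scheme. It is a toy model under stated assumptions, not the full scheme of reactions (9)-(22): only the two dealkylation steps (15) and (16) are kept, hydrogen is supplied at a constant hypothetical rate, activities are treated as dimensionless concentrations, and all rate constants and starting values are invented.

```python
def rates(y, k15, k16, h_supply):
    """Time derivatives for y = (DEG, MEG, Ga, H) in the reduced network."""
    deg, meg, ga, h = y
    r15 = k15 * h * deg  # DEG + H -> MEG + C2H6, reaction (15)
    r16 = k16 * h * meg  # MEG + H -> Ga + C2H6, reaction (16)
    return (-r15, r15 - r16, r16, h_supply - r15 - r16)


def rk4_step(y, dt, *params):
    """One classical Runge-Kutta step of size dt."""
    def shifted(base, slope, factor):
        return tuple(b + factor * s for b, s in zip(base, slope))

    k1 = rates(y, *params)
    k2 = rates(shifted(y, k1, dt / 2), *params)
    k3 = rates(shifted(y, k2, dt / 2), *params)
    k4 = rates(shifted(y, k3, dt), *params)
    return tuple(
        yi + dt / 6 * (a + 2 * b + 2 * c + d)
        for yi, a, b, c, d in zip(y, k1, k2, k3, k4)
    )


def integrate(y0, t_end, dt, *params):
    """Return the list of states from t = 0 to t_end in steps of dt."""
    steps = round(t_end / dt)
    trajectory = [tuple(y0)]
    for _ in range(steps):
        trajectory.append(rk4_step(trajectory[-1], dt, *params))
    return trajectory


# Hypothetical starting activities (DEG, MEG, Ga, H) and rate constants.
Y0 = (1.0, 0.2, 0.0, 0.5)
TRAJECTORY = integrate(Y0, 10.0, 0.01, 0.8, 0.3, 0.4)
```

A useful sanity check on any such integration is that the total gallium inventory (DEG + MEG + Ga) stays constant, since the reduced network only converts one gallium species into another.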

Considerable simplifications are gained under the conditions of interrupted steady-state growth, where the TEG pulses are switched off at initiation time $t_{int}$, continuing the exposure of the surface to a continuous TBP flux, as illustrated in Fig. 6. Since we use in our experiments typically TBP to TEG flux ratios in the range 10:1 to 40:1, at constant TBP flux, the hydrogen and phosphorus activities in the SRL are only marginally perturbed by the TEG probe pulse and can be taken as constants during the delay period. Going back to Fig. 5(b), we note that initially the PR response to the application of TEG pulses after heating the surface of the substrate under TBP exposure to the growth temperature is very small. The development of the fine structure to full swing from low PR intensity after application of the TEG pulse to high PR intensity after the TBP pulse requires several cycles. Also, we note that in Fig. 6, the response in the PR intensity to a TEG probe pulse at a delay after switching from pulsed to continuous TBP flow and discontinuation of TEG pulse supply at time $t_{int}$ is substantially smaller than the response to the last TEG pulse under steady-state conditions, $\Delta R_{ss}$. We interpret these features in Figs. 5(b) and 6 as being due to the build-up of the surface reaction layer under the conditions of initial nucleation and overgrowth and its destruction upon interruption of steady-state growth, respectively, that is, we relate the probe-pulse response to the deviations of the surface reaction layer from steady state, which includes its coupling to the surface of the substrate. Thus, we can utilize the response to a TEG probe pulse for an assessment of the kinetics of the chemical changes in the reaction layer upon perturbations of steady state, e.g., switching from pulsed to continuous TBP flow and discontinuation of TEG pulse supply.

In the following discussion, we set $t_{int} = 0$, that is, measure the delay from t = 0 to the rising edge of the TEG probe pulse. The




time  (s) 

Fig.  7.  Single  wavelength  PRS  response  to  the  TEG  probe  pulse  under  the 
conditions of Fig. 9 at λ = 632.8 nm for selected values of


exposure  to  the  continuous  TBP  beam  is  maintained  for  a 
period  +  .  The  time  interval  refers  to  the  re¬ 

covery  period  after  the  application  of  the  TEG  probe  pulse  in 
the  PR  intensity  before  reconditioning  the  surface  to  steady- 
state  growth  for  the  next  experiment  and  is  typically  10  s.  A 
considerable  number  of  experiments  can  be  conducted  within 
one  heteroepitaxial  growth  run.  Comparisons  between  ex¬ 
periments  from  several  runs  were  made  to  establish  the  re¬ 
producibility  of  our  findings.  All  experiments  described  be¬ 
low  applied  the  same  pulse  cycle  sequence  for  conditioning 
to  steady-state  growth.  Figure  7  shows  the  decrease  in  the 
amplitude of the response in the PR intensity to the TEG probe pulse for selected delays of increasing length. Note that, in view of the increase in the amplitude of the PR intensity with increasing distance from the turning points, the response to the TEG probe pulse in a particular experiment depends on the timing of the experiment relative to the extrema in the interference oscillations. In the following evaluations, this effect is eliminated by normalization of the probe-pulse response to $\Delta R_{ss}$.

Figures 8(a) and 8(b) show plots of the normalized measured response, that is, the probe-pulse response divided by $\Delta R_{ss}$, versus the delay for two experiments under the conditions of Fig. 6. The curves labeled exp represent cubic spline fits to the experimental data. While for a given set of experimental parameters, the initial decrease of the normalized response with time is reproducible, after 5-10 s, variations between different experiments are observed. This is interpreted in terms of the kinetics of the chemical reactions on the GaP surface, starting with the decay of DEG radicals according to Eq. (15). The rate law for this reaction step is of the form

$$\frac{d a_{Ga(C_2H_5)_2}}{dt} = -k_{15}\, a_H\, a_{Ga(C_2H_5)_2}. \qquad (23)$$

Fig. 8. Plot of the normalized response vs time delay, measured
from the last TEG pulse under steady-state conditions of growth to
the application of a single TEG probe pulse: curve 1 based on
Eqs. (6) and (25), curve 2 based on Eqs. (6) and (28). (a) and (b)
refer to two different experiments run under the same conditions.

Since at large constant flux of TBP the activity of hydrogen in
the surface reaction layer, $a_{\mathrm{H}}$, is only slightly
perturbed by the ongoing surface reactions, we may in a first
approximation treat it as a constant, that is, rewrite Eq. (23)
as

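$$\frac{d a_{\mathrm{Ga(C_2H_5)_2}}}{dt} = -k'_{15}\, a_{\mathrm{Ga(C_2H_5)_2}}, \qquad k'_{15} = k_{15}\, a_{\mathrm{H}}. \tag{24}$$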

With initial condition
$a_{\mathrm{Ga(C_2H_5)_2}}(t=0) = a^{ss}_{\mathrm{Ga(C_2H_5)_2}}$
we thus get the integrated rate law

$$a_{\mathrm{Ga(C_2H_5)_2}}(t) = a^{ss}_{\mathrm{Ga(C_2H_5)_2}} \exp(-k'_{15} t) = a^{ss}_{\mathrm{Ga(C_2H_5)_2}} \exp\left(-\frac{t}{\tau_1}\right), \tag{25}$$

that is, the DEG activity decays from the value established in
steady state, $a^{ss}_{\mathrm{Ga(C_2H_5)_2}}$, exponentially with
time constant $\tau_1 = 1/k'_{15}$, that is, $k'_{15} = 0.4$
s$^{-1}$. We note that under the conditions of the experiment the
activities of PH$_x$ are also constants, so that adding Eq. (22) as
a parallel reaction path merely changes the interpretation of
$k'_{15}$, but not the rate equation.

The time evolution of the activity of monoethylgallium (MEG) in
the surface reaction layer is composed of a contribution to the
MEG activity, generated in the course of the decay of the DEG
according to reaction (15), that adds at $t>0$ to the activity of
MEG, $a^{ss}_{\mathrm{Ga(C_2H_5)}}$, measured at $t=0$. By
reaction (15), the rate with which this contribution is generated
is coupled to the rate of the DEG decay according to

$$\left(\frac{d a_{\mathrm{Ga(C_2H_5)}}}{dt}\right)_{(15)} = -\frac{d a_{\mathrm{Ga(C_2H_5)_2}}}{dt}. \tag{26}$$


Also,  with  the  assumption  of  constant  hydrogen  activity,  the 
rate  law  for  reaction  (16)  can  be  written  as 


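$$\frac{d a_{\mathrm{Ga(C_2H_5)}}}{dt} = -k_{16}\, a_{\mathrm{H}}\, a_{\mathrm{Ga(C_2H_5)}} = -k'_{16}\, a_{\mathrm{Ga(C_2H_5)}}. \tag{27}$$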


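Therefore, we get

$$a_{\mathrm{Ga(C_2H_5)}}(t) = \left[a^{ss}_{\mathrm{Ga(C_2H_5)}} + k'_{15}\, a^{ss}_{\mathrm{Ga(C_2H_5)_2}} \int_0^t \exp\left(-\frac{t'}{\tau_1}\right) dt'\right] \exp\left(-\frac{t}{\tau_2}\right)$$
$$\qquad = \left\{a^{ss}_{\mathrm{Ga(C_2H_5)}} + a^{ss}_{\mathrm{Ga(C_2H_5)_2}} \left[1 - \exp\left(-\frac{t}{\tau_1}\right)\right]\right\} \exp\left(-\frac{t}{\tau_2}\right), \tag{28}$$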


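where $\tau_2 = 1/k'_{16}$. This function has a single extremum at

$$t_{\max} = \tau_1 \ln\left(\frac{a^{ss}_{\mathrm{Ga(C_2H_5)_2}}}{a^{ss}_{\mathrm{Ga(C_2H_5)}} + a^{ss}_{\mathrm{Ga(C_2H_5)_2}}}\; \frac{\tau_1 + \tau_2}{\tau_1}\right) \tag{29}$$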

and  initial  slope 

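$$\left.\frac{d a_{\mathrm{Ga(C_2H_5)}}}{dt}\right|_{t=0} = \frac{a^{ss}_{\mathrm{Ga(C_2H_5)_2}}}{\tau_1} - \frac{a^{ss}_{\mathrm{Ga(C_2H_5)}}}{\tau_2} \tag{30}$$

for $a^{ss}_{\mathrm{Ga(C_2H_5)}}/\tau_2 < a^{ss}_{\mathrm{Ga(C_2H_5)_2}}/\tau_1$.
Fitting the initial decay of the normalized response on the basis
of $F_1$ and $F_2$ can be accomplished and can be extended in time
by incorporating a time-dependent loss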


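$$a_{\mathrm{Ga(C_2H_5)}}(t) = \left\{a^{ss}_{\mathrm{Ga(C_2H_5)}} + L(t)\, a^{ss}_{\mathrm{Ga(C_2H_5)_2}} \left[1 - \exp\left(-\frac{t}{\tau_1}\right)\right]\right\} \exp\left(-\frac{t}{\tau_2}\right), \tag{31}$$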

with loss factor $L(t)$ starting from $L(t=0)=1$ and varying
with the same time dependence as $F_2$, corresponding to a
maximum loss of 10% at the maximum in $F_2$ for both
experiments of Fig. 8. The curves labeled 1 and 2 in Figs.
8(a) and 8(b) are modeled with the same set of parameters:
$\tau_1 = 2.5$ s, $\tau_2 = 4.2$ s, $L(t)$, and
$a^{ss}_{\mathrm{Ga(C_2H_5)}} = 0.05 \times a^{ss}_{\mathrm{Ga(C_2H_5)_2}}$.
A possible interpretation of the loss of MEG is its parallel
dealkylation through heterogeneous surface reactions at a
rate that is tied to the concentration of MEG in the
homogeneous part of the SRL. The difference between the
predicted behavior according to Eqs. (25) and (28) and the
experimental data, represented in Figs. 8(a) and 8(b) by the
curves labeled 3, in the later stages of the decay of the SRL
from its steady-state composition is attributed to surface
reactions that become dominant as the TEG fragments become
increasingly denuded of radical ligands. In this case, the
dipole approximation is no longer a reliable basis for the
evaluation of the dielectric function, and no interpretation of
these surface reactions can be offered on the basis of
single-pulse PR monitoring.
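As a numerical check on the kinetic model, the short Python sketch below evaluates the DEG decay, the MEG activity, and the position of the MEG maximum for the fitted time constants quoted above. It assumes that Eq. (28) has the product form {a_MEG + a_DEG [1 - exp(-t/tau1)]} exp(-t/tau2), which is the form consistent with Eq. (29); the function names, the normalization a_DEG = 1, and the omission of the loss factor L(t) are illustrative choices, not part of the original analysis.

```python
import math

def deg_activity(t, a_deg_ss, tau1):
    """DEG activity: exponential decay from its steady-state value (form of Eq. 25)."""
    return a_deg_ss * math.exp(-t / tau1)

def meg_activity(t, a_meg_ss, a_deg_ss, tau1, tau2):
    """MEG activity: generation by DEG decay times first-order loss (assumed form of Eq. 28)."""
    generated = a_deg_ss * (1.0 - math.exp(-t / tau1))
    return (a_meg_ss + generated) * math.exp(-t / tau2)

def t_max(a_meg_ss, a_deg_ss, tau1, tau2):
    """Time at which meg_activity has its single extremum (form of Eq. 29)."""
    ratio = a_deg_ss / (a_meg_ss + a_deg_ss) * (tau1 + tau2) / tau1
    return tau1 * math.log(ratio)

# Parameters quoted in the text; activities are normalized to a_DEG = 1 (assumption).
TAU1, TAU2 = 2.5, 4.2          # s
A_DEG = 1.0
A_MEG = 0.05 * A_DEG

if __name__ == "__main__":
    tm = t_max(A_MEG, A_DEG, TAU1, TAU2)
    print(f"k'_15 = {1.0 / TAU1:.2f} 1/s, k'_16 = {1.0 / TAU2:.3f} 1/s")
    print(f"t_max = {tm:.2f} s, MEG activity there = "
          f"{meg_activity(tm, A_MEG, A_DEG, TAU1, TAU2):.3f}")
```

With these parameters the maximum falls between 2 and 3 s after the last TEG pulse, that is, within the interval in which the text describes the initial decay as reproducible.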

VI.  SUMMARY  AND  CONCLUSIONS 

On the basis of real-time process monitoring by
p-polarized reflectance in combination with laser light
scattering we have shown that, for a given flux of the phosphorus
source vapor, there exists a narrow band of initial gallium
supersaturations for the overgrowth of the silicon surface by
a contiguous film of GaP with minimum roughening. Also,
we have shown that steady-state heteroepitaxial growth must
be represented by a four-layer stack: ambient, surface reaction
layer, epilayer, and substrate. Depending on the source vapor
fluxes, the surface reaction layer can have a thickness of
several monolayers of fragments of the gallium and phosphorus
source vapor molecules. Based on the response in the
PR intensity to variations and interruptions in the periodic
source vapor pulse sequence utilized for establishing
steady-state conditions, we have shown that the dealkylation
of the gallium source molecules is the rate-limiting step in
the low temperature heteroepitaxial growth process. Its initial
stages, involving molecules and radicals that do not interact
strongly with the underlying substrate, can be modeled in
terms of homogeneous reactions in the SRL. There exists at
present no adequate description of the response in the PR
intensity due to changes in the concentration of molecules
and atoms bound directly to the surface atoms of the
heteroepitaxial film. An assessment of the heterogeneous
kinetics thus requires further research supplementing optical
real-time process monitoring by additional methods of surface
analysis.

Si/Si0.97C0.03 superlattices grown on Si(001) substrates by Sb surfactant assisted molecular beam
epitaxy are characterized by in situ reflection high energy electron diffraction (RHEED), atomic
force microscopy, transmission electron microscopy (TEM), and high resolution x-ray diffraction.

The RHEED shows that, in the absence of Sb, the growth front roughens during Si0.97C0.03 growth
and smooths during subsequent Si growth. In contrast, when Sb is present, the growth front remains
smooth throughout the growth. This observation is confirmed by cross-sectional TEM, which
reveals that for samples grown without the use of Sb, the Si/Si0.97C0.03 interfaces (Si0.97C0.03 on Si)
are much more abrupt than the Si0.97C0.03/Si interfaces. In the case of Sb assisted growth, there is no
observable difference in abruptness between the two types of interfaces. Atomic force microscopy
micrographs of the Si0.97C0.03 surface reveal features that could be the source of the roughness
observed by RHEED and TEM. © 1996 American Vacuum Society.


I.  INTRODUCTION 

By  introducing  carbon  into  the  Si/Ge  material  system,  one 
could  increase  flexibility  in  the  design  of  electronic  structure 
in  a  Si-based  electronics  technology.  The  reason  for  this  is 
that  the  SiGeC  alloys  may  provide  useful  conduction  band 
offsets  for  layers  grown  coherently  strained  to  Si.  This  offset 
might allow fabrication of novel n-type devices such as
n-type resonant tunneling devices (RTDs) or high-electron-mobility
transistors (n-HEMTs) that are compatible with
VLSI processing lines. In addition, due to the smaller lattice
constant of diamond ($a_0$ = 0.357 nm vs $a_0$ = 0.543 nm for Si
and $a_0$ = 0.566 nm for Ge), carbon could be used for strain
compensation of SiGe structures, resulting in alloy layers
that are lattice-matched to Si and therefore in devices that are
stable during high temperature processing steps.
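The strain-compensation idea can be made concrete with a rough estimate of how much Ge each C atom can compensate. The sketch below assumes Vegard's law (linear interpolation of the lattice constant with composition), which is not stated in the text and is known to be only approximate for carbon-containing alloys; the lattice constants are the ones quoted above, and the function names are illustrative.

```python
# Lattice constants quoted in the text (nm).
A_C, A_SI, A_GE = 0.357, 0.543, 0.566

def vegard_lattice_constant(x_ge, y_c):
    """Lattice constant of Si(1-x-y)Ge(x)C(y) under the Vegard's-law assumption."""
    return (1.0 - x_ge - y_c) * A_SI + x_ge * A_GE + y_c * A_C

def ge_to_c_ratio_for_matching():
    """Ge:C ratio x/y at which the interpolated lattice constant equals that of Si."""
    return (A_SI - A_C) / (A_GE - A_SI)

if __name__ == "__main__":
    ratio = ge_to_c_ratio_for_matching()
    print(f"Ge:C ratio for lattice matching to Si: {ratio:.1f}")
    y = 0.01
    print(f"a(Si-Ge-C, y = {y}) = {vegard_lattice_constant(ratio * y, y):.4f} nm")
```

Under this assumption roughly eight Ge atoms are compensated per C atom, so even a few percent of carbon offsets the strain of a substantial Ge fraction.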

When Si$_{1-x}$Ge$_x$ is grown coherently strained to Si(001),
the compressive strain causes the fourfold-degenerate $\Delta_4$
band to become the lowest conduction band state. The strain
splitting and the compositional shift cancel for this band,
resulting in almost no conduction band offset for
Si$_{1-x}$Ge$_x$ alloys grown strained to Si.

While conduction band offsets can be achieved in the
Si/Si$_{1-x}$Ge$_x$ system by growing tensile-strained layers on
relaxed Si$_{1-x}$Ge$_x$ buffer layers, the incorporation of carbon
may provide this desirable feature without the substantial
defect densities and complicated processing inherent in
growth on relaxed buffer layers. Since Si$_{1-x-y}$Ge$_x$C$_y$ alloys
can be grown lattice matched to Si, the strain splitting of the
conduction band can be eliminated and a useful conduction
band offset might be provided by a compositional shift in
band gap. According to estimates, the band gap for Si$_{1-y}$C$_y$
alloys strained to Si decreases only slightly with y. Therefore,
it is essential to develop growth techniques that allow
the introduction of substantially more than 1% carbon to
achieve band gap differences and band offsets that are larger
than $k_B T$ at room temperature.

Our experiments show that for carbon concentrations in
excess of 2%, the normal, two-dimensional layer-by-layer
growth of Si(100) is disrupted, resulting in a rough surface.
This roughness manifests itself in reflection high energy
electron diffraction (RHEED) as "spottiness" in the pattern,
rather than the normally streaked (2×1)+(1×2) pattern
associated with growth on atomically smooth,
two-domain-reconstructed Si(001) surfaces.

In this article, we report a study of surfactant-mediated
growth of Si/Si0.97C0.03 superlattices through the use of
RHEED, transmission electron microscopy (TEM), atomic
force microscopy (AFM), and high resolution x-ray diffraction
(HRXRD). The RHEED and TEM results were reported
in Ref. 3; here we explain the RHEED analysis in detail and
correlate the previous results with AFM measurements. Actual
composition and layer thicknesses were confirmed using
HRXRD. We demonstrate that through the use of a surfactant,
the tendency for the Si$_{1-y}$C$_y$ surface to roughen during
growth can be reduced or eliminated, allowing layers with
even higher carbon concentrations to be grown. This technique
has been used previously with encouraging results to
suppress Stranski-Krastanov islanding during the growth of
Si/Ge superlattices and Ge$_{1-y}$C$_y$ alloy layers on Si
substrates.

The article is organized as follows. First, the sample
growth is described. Details of the superlattice structure and
postgrowth compositional analysis by HRXRD are presented.
We then present an analysis of digitized RHEED data taken
during growth of the samples and discuss the findings in
comparison with cross-sectional TEM micrographs and AFM
images of the sample surfaces. Finally, we put forward an
explanation as to how the surfactant might suppress the
islanding.


II.  EXPERIMENT 

The samples used in this experiment consisted of two
15-period, 26.1 nm Si/4.4 nm Si0.97C0.03 superlattices grown at
525 °C on 100-nm-thick Si buffer layers. Prior to growth,
2000 Ω cm Si(001) substrates were degreased in trichloroethane
and acetone for 2 min each, followed by rinses in
methanol for 10 min and de-ionized H2O. A 15 s dip in 5%
HF just prior to loading into the MBE system (Perkin-Elmer
Model 430S) was used to hydrogen passivate the wafer
surfaces. In situ, the wafer surfaces were cleaned at 875 °C
under the influence of a slight Si flux (about 0.1 Å/s) until
the RHEED pattern consisted of the usual (2×1)+(1×2)
streaked pattern indicative of a clean, reconstructed Si(001)
surface.

After deposition of a 100 nm undoped Si buffer layer,
approximately one monolayer (6.8×10$^{14}$ atoms/cm$^2$) of Sb
was deposited on the surface of sample SL-Sb. Sample SL
received no such Sb predeposition. The superlattices were
then grown on each sample using growth rates of approximately
1.8 Å/s and 0.015 Å/s for Si and C, respectively,
evaporated from electron beam sources. The Si0.97C0.03 layer
was grown with both shutters opened, so the growth rate for
this layer was 1.815 Å/s. Closed-loop control of the flux was
accomplished for Si through the use of a Sentinel III
deposition controller and for C by monitoring the amplitude of
amu 36 (C3) with a residual gas analyzer (RGA) and adjusting
the power to the electron gun to maintain a predetermined
signal. Growth rates, layer thicknesses, and compositions
were determined later using HRXRD. From the
superlattice peak positions, the average carbon concentration
and superlattice period were measured and, together with a
knowledge of the shutter opening times, used to calculate
individual layer thicknesses and the carbon content of the
Si$_{1-y}$C$_y$ layer.
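The relation between growth rates, layer thicknesses, and shutter opening times can be illustrated with a few lines of Python. The sketch below uses the nominal layer thicknesses and growth rates quoted above; the shutter times it prints are derived values for illustration only (the actual shutter times are not given in the text), and the function name is hypothetical.

```python
ANGSTROM_PER_NM = 10.0

def growth_time_s(thickness_nm, rate_angstrom_per_s):
    """Shutter-open time needed to grow a layer of the given thickness."""
    return thickness_nm * ANGSTROM_PER_NM / rate_angstrom_per_s

# Nominal values from the text.
SI_RATE = 1.8                   # Å/s, Si source alone
C_RATE = 0.015                  # Å/s, C source alone
ALLOY_RATE = SI_RATE + C_RATE   # Å/s, both shutters open
SI_THICKNESS_NM = 26.1
ALLOY_THICKNESS_NM = 4.4
PERIODS = 15

t_si = growth_time_s(SI_THICKNESS_NM, SI_RATE)
t_alloy = growth_time_s(ALLOY_THICKNESS_NM, ALLOY_RATE)
period_nm = SI_THICKNESS_NM + ALLOY_THICKNESS_NM

if __name__ == "__main__":
    print(f"Si layer:    {t_si:.0f} s per period")
    print(f"alloy layer: {t_alloy:.1f} s per period")
    print(f"period = {period_nm:.1f} nm, total = {PERIODS * period_nm:.1f} nm")
```

The same arithmetic run in reverse (measured period and known shutter times giving the individual layer thicknesses) is the procedure the text describes for the HRXRD analysis.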

During superlattice growth, images of the RHEED patterns
were digitized and captured for later analysis. The setup
consisted of a standard RHEED system (Perkin-Elmer), a
black and white CCD camera (SONY CCD-IRIS), and an
S-VHS VCR (Panasonic AG 7355). Data were recorded on
S-VHS videotape and digitized at a resolution of 512×512
pixels with a 256 level grayscale using a computer (Sun
Sparc 2) and a video capture board (Data Cell S2200). In
order to filter out some of the background due to stray light
from the e-gun sources, we inserted a green filter between
the camera and the RHEED screen.

The  AFM  measurements  were  performed  in  laboratory 
ambient  without  any  surface  preparation  using  a  Nanoscope 
III  from  Digital  Instruments.  The  TEM  cross  sections  were 
prepared  by  mechanical  polishing  and  ion  milling  and  the 
micrographs  were  acquired  at  an  acceleration  voltage  of  300 
kV  on  a  Philips  EM430  electron  microscope. 


Fig. 1. Analysis of the RHEED pattern. The amplitudes of the intensity
along the line [marked in white in (a) and (c)] are shown in (b) and (d). The
areas enclosed by the solid curves and the dotted lines in (b) and (d)
represent the total intensity associated with that streak or spot. By calculating the
ratio of the spot intensity to the total intensity of the pattern, we obtain a
qualitative measure of the roughness of the surface R [see Eq. (1)]. This
ratio can be used as a qualitative measure of surface roughness since, for a
spotty pattern (c), the areas associated with the spots will dominate, making
$R \approx 1$. On the other hand, when the RHEED pattern is streaked (a), all the
areas are comparable, giving an R of about 2/5 = 0.4.


III.  RESULTS 

The RHEED pattern from both samples prior to growth of
the Si buffer layer exhibited the usual (2×1)+(1×2)
streaked pattern typical of a clean Si(001) surface. During
growth of sample SL, immediately upon opening the carbon
shutter, the pattern became spotty in appearance, indicative
of a rough surface. Each subsequent Si layer caused the
pattern to revert to the (2×1)-reconstructed pattern,
suggesting that the Si deposition caused the surface to become
smooth again. This alternating behavior of roughening
followed by smoothing persisted throughout the growth of this
sample. For sample SL-Sb, the half-order streaks originally
visible in the pattern diminished in intensity after Sb
deposition due to a realignment of the surface reconstruction.
The observed (1×1) pattern exhibited no spottiness during
the subsequent growth of the superlattice and remained
streaked (smooth).

In order to study the differences between the spotted and
streaked patterns observed in the growth of sample SL, we
employed the following analysis of the digitized RHEED
data. First, the intensity along a line [marked in white in
Figs. 1(a) and 1(c)] perpendicular to the streak direction and
intersecting the (10) and ($\bar{1}$0) spots was digitized. The
amplitude of the intensity along the line is shown in Figs. 1(b)
and 1(d). Then, the intensity associated with a certain spot or
streak (n0) was integrated along the line, to take into account
the intensity from the full width of the streak or spot,
to give the quantity $I_{n0}$. The background intensity due to
light from the e-gun sources was subtracted, as indicated by
the dotted lines in Figs. 1(b) and 1(d). Finally, we calculated
the ratio, R, of the spot intensity to the intensity of the whole
pattern [see Eq. (1)]. This ratio can then be used as a
qualitative measure of surface roughness since, for a spotty
pattern, $I_{\bar{1}0}$ and $I_{10}$ will dominate, making
$R \approx 1$. On the other hand, when the RHEED pattern is
streaked [see Figs. 1(a) and 1(b)], all the terms are of
comparable magnitude, giving an R of about 2/5 = 0.4.

$$R = \frac{I_{\bar{1}0} + I_{10}}{I_{\bar{1}0} + I_{\bar{\frac{1}{2}}0} + I_{00} + I_{\frac{1}{2}0} + I_{10}}. \tag{1}$$

Fig. 2. Roughness ratio R vs growth time. See Fig. 1 for the derivation of R.
An R of 1 indicates a spotted RHEED pattern and an R of 0.4 indicates a
streaked pattern. For sample SL (no Sb predeposition), immediately upon
opening the C shutter the ratio is shown to increase rapidly, saturating at a
value near 1. Half (≈13 nm) of the Si layer thickness was required to
completely recover the original pattern. During growth of sample SL-Sb, the
RHEED pattern remained unchanged from the (1×1), streaked pattern
observed immediately following Sb predeposition.
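The roughness ratio described above is simple to compute from a digitized line profile. The following Python sketch is an illustration rather than the authors' analysis code: it assumes that the background under each streak is a straight line joining the two ends of the integration window (one reading of the dotted lines in Fig. 1), that five streaks of order -1, -1/2, 0, 1/2, and 1 lie on the line, and that the window limits are chosen by hand. All function names are hypothetical.

```python
def integrate_peak(profile, lo, hi):
    """Integrate profile[lo..hi] above a straight background joining its end points."""
    if hi <= lo:
        raise ValueError("integration window must contain at least two points")
    total = 0.0
    for i in range(lo, hi + 1):
        # Linear background interpolated between the two ends of the window.
        background = profile[lo] + (profile[hi] - profile[lo]) * (i - lo) / (hi - lo)
        total += profile[i] - background
    return total

def roughness_ratio(intensities):
    """Ratio R of the (-1 0) and (1 0) intensities to the summed intensity of all streaks.

    intensities maps the streak order n (-1, -0.5, 0, 0.5, 1) to the integrated I_n0.
    """
    total = sum(intensities.values())
    return (intensities[-1] + intensities[1]) / total

if __name__ == "__main__":
    streaked = {-1: 1.0, -0.5: 1.0, 0: 1.0, 0.5: 1.0, 1: 1.0}
    spotty = {-1: 5.0, -0.5: 0.0, 0: 0.0, 0.5: 0.0, 1: 5.0}
    print("streaked pattern: R =", roughness_ratio(streaked))
    print("spotty pattern:   R =", roughness_ratio(spotty))
```

Five streaks of equal intensity give R = 2/5, and a pattern in which only the two first-order spots survive gives R = 1, matching the two limits quoted in the text.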

In Fig. 2, the ratio, R, is plotted as a function of time
during growth of the two samples (C shutter opens at t = 0 s).
For sample SL (no Sb predeposition), immediately upon
opening the C shutter the ratio is shown to increase rapidly,
saturating at a value near 1. During subsequent growth of the
Si layer, the RHEED pattern slowly recovered its original
(2×1)+(1×2), streaked pattern and the ratio returned to
approximately 0.4 (smooth). From Fig. 2, we note that fully
half (≈13 nm) of the Si layer thickness was required to
completely recover the original pattern. During growth of sample
SL-Sb, the RHEED pattern remained unchanged from the
(1×1), streaked pattern observed immediately following Sb
predeposition. Data were not available for SL-Sb during
Si0.97C0.03 deposition because stray light from the e-guns
washed out the pattern. Nevertheless, it was possible to view
the pattern visually during these periods and no spottiness
was observed.

In Fig. 3, we present cross-sectional TEM images of the
samples that show features consistent with the RHEED
observations. Figure 3(a) is an image taken from sample SL,
showing alternating thick and thin layers corresponding to
the Si and Si0.97C0.03 layers, respectively. The surface of the
superlattice is marked A near the top of the figure. Clearly,
the interfaces that are formed when Si0.97C0.03 is grown on Si
are much more abrupt than the interfaces formed when Si is
grown on Si0.97C0.03. In the case of sample SL-Sb [Fig. 3(b)],
both interfaces appear equally abrupt. In comparison with
sample SL, they appear more abrupt than the case for which
Si is grown on Si0.97C0.03 and less abrupt than the case for
which Si0.97C0.03 is grown on Si. Sample SL was terminated
with a Si0.97C0.03 layer, resulting in a rough surface
morphology evident in the TEM and AFM micrographs (see Fig. 5).
Sample SL-Sb was terminated with a Si layer, which gives
the superlattice a smooth surface morphology.

Fig. 3. Cross-sectional TEM images of the superlattice samples: (a) sample
SL and (b) sample SL-Sb. (a) is an image taken from sample SL, showing
alternating thick and thin bands corresponding to the Si (B) and
Si0.97C0.03 (C) layers, respectively. The surface of the superlattice is
marked A near the top. Sample SL (a) was terminated with a Si0.97C0.03
layer, resulting in rough surface morphology (see Fig. 5). Sample SL-Sb (b)
was terminated with a Si layer. Clearly, the interfaces that are formed when
Si0.97C0.03 is grown on Si are much more abrupt than the interfaces formed
when Si is grown on Si0.97C0.03. In the case of sample SL-Sb (b), both
interfaces appear equally abrupt. In comparison with sample SL, they appear
more abrupt than the case for which Si is grown on Si0.97C0.03 and less
abrupt than the case for which Si0.97C0.03 is grown on Si.

In Fig. 4, we display HRXRD of the two samples. The
peaks associated with sample SL are larger in magnitude and
have narrower widths than those of sample SL-Sb. One reason
for this could be that the Si0.97C0.03/Si interfaces (when
Si0.97C0.03 is grown on Si) in sample SL are more abrupt than
the Si/Si0.97C0.03 interfaces in sample SL-Sb, as seen in the
TEM image (see Fig. 3). Another reason might be that there
is a higher density of defects that look like stacking faults in
sample SL-Sb as compared to sample SL. These defects
could reduce the lateral coherence of the superlattices and
thus broaden the HRXRD peaks. The defects are discussed in
more detail below. As mentioned above, the superlattice peak
positions were measured to calculate the layer thicknesses
and carbon content of the samples.

In Fig. 5, we show an AFM micrograph taken from the
surface of sample SL, terminated with Si0.97C0.03. The figure
shows features on the order of 1.5 nm peak-to-valley
perpendicular to the surface and 40.0 nm laterally. There does not
appear to be any directional dependence to the features. The
RMS roughness of this surface was measured to be approximately
0.36 nm. These features are apparently responsible
for the spotted pattern observed in the RHEED image of Fig.
1(c), since additional samples we have studied, for which Si
in one sample and Sb-terminated Si0.985C0.015 in another were
grown on Si(001), lack these features and have streaked
RHEED patterns. The feature height observed in the AFM
micrograph agrees with the thickness variation seen in the
TEM image [see Fig. 3(a)].

Fig. 4. HRXRD of a 15 period Si/Si0.97C0.03 superlattice grown by standard
MBE (sample SL) and by Sb assisted MBE (sample SL-Sb). The sample SL
superlattice peaks are sharper than those of sample SL-Sb. This is probably
due to the Si/Si0.97C0.03 interfaces being less sharp in the SL-Sb sample (as
observed in the cross-sectional TEM micrographs, Fig. 3) or a higher
density of stacking faults in sample SL-Sb as compared to sample SL.

In addition to the interface structure, the TEM images also
reveal that there are defects that look like stacking faults,
possibly originating from point defects such as C dimers or
trimers, in the SL-Sb sample. These defects are also present
in the SL sample, albeit at a lower density. Most of the
defects in sample SL-Sb originate in the region between
superlattice layers 7 and 10 (counting up from the buffer layer).
Only a small fraction (0.4%) of the volume of the superlattice
is enclosed by these defects.

Fig. 5. AFM micrograph of the top layer (Si0.97C0.03) of a 15 period
Si/Si0.97C0.03 superlattice grown by standard MBE (sample SL). The features
are on the order of 1.4 nm tall and 40 nm on the side. These surface features
apparently give rise to the spotty RHEED pattern observed [see Fig. 1(c)]
and the variation in thickness seen in the TEM (see Fig. 3).

IV.  DISCUSSION 

The data presented in the above section suggest that one
or more of the species in the carbon flux disrupts the epitaxial
growth of Si0.97C0.03 on Si (sample SL). These species
form surface nucleation centers where diffusing adatoms can
incorporate in competition with surface steps, resulting in 3D
growth. Residual gas analysis of the growth flux shows that
it primarily consists of monomers (C), dimers (C2), and
trimers (C3). In principle, any of the carbon species could act as
nucleation centers. We expect the monomers to be the most
mobile on the surface, and hence, the most likely to simply
be incorporated in step flow growth. Of the multi-atom carbon
species, the dimers are dominant.

All of the multi-atom carbon species, however, are potential
candidates for nucleating the rough growth. First, the C-C
bond is 1.8 times stronger than the Si-Si bond and, based on
the observation that the Si dimer is stable up to 600 K, one
expects the C dimers and trimers to be stable up to 1100 K or
about 800 °C. Since the growth temperature used in this
experiment was 525 °C, the carbon dimers and trimers impinging
on the surface remain undissociated. Another argument
for  the  stability  of  the  carbon  dimers  is  that  the  equilibrium 
constant  for  sublimation  of  solid  carbon  (to  monoatomic  car¬ 
bon  gas),  Kp,  is  3X10“^^  at  the  growth  temperature  men¬ 
tioned  above. 

Second,  the  diffusion  length  of  these  dimers  and  trimers  is 
expected  to  be  negligible  compared  to  that  of  the  monomers 
because  of  the  large  activation  energy  due  to  the  bond  bend¬ 
ing  and  stretching  required  for  a  dimer  or  trimer  to  move  on 
the  surface.  Thus,  while  the  monomers  might  diffuse  to  a 
step  and  thus  contribute  to  step  flow  growth,  the  dimers  or 
trimers  will  incorporate  at  the  site  of  impingement  and  form 
nucleation  centers. 

Third, the nucleation center density generated by the
dimers and trimers is large enough for 3D growth to dominate
over 2D step flow growth. To show one possibility for
how this could occur, we need to consider the probability of
an adatom attaching to a dimer or trimer rather than to a step.
To assess the relative probability, we follow the argument of
Mo et al. Adapting their argument, we consider a square with
one side aligned with a step on a slightly miscut substrate. We
set the length of the side equal to the average terrace width
W of the steps, which is given by the degree of miscut of the
substrate. Under the conditions of our experiment, this
square is the area from which this portion of the step
accumulates adatoms. Let us say that the square has a dimer at
the center and assume that both the step and the dimer are
perfect sinks for adatoms. According to 2D random walk
theory, the number of hops required for an adatom impinging
at a random site in the square to reach the dimer is on
average $\sim (W/a)^2$, where a is the length of a hop. The
number of hops required for the adatom to find the step is also
on average $\sim (W/a)^2$. In this case then, both island growth
and step flow growth will take place. For both sample SL and
SL-Sb, the fluxes of the dimers and trimers were about 10% and
5% of the monomer flux, respectively. These fluxes yield a dimer
and trimer density of about $10^{12}$ molecules/cm$^2$ in the time
required to complete a monolayer of Si0.97C0.03 growth.
Given a limiting case of a wafer miscut of about 0.5°, the
terrace width is $W \approx 2\times10^{-6}$ cm. At the density
calculated above, we get four dimers per square, so 3D nucleation
should compete effectively with 2D step-flow growth.
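The numbers in this estimate can be reproduced with a short calculation. The Python sketch below assumes single-layer steps of height a0/4 on Si(001) to convert the miscut angle into a terrace width; this step height and the function names are assumptions made for illustration, while the cluster density and the rounded terrace width are the values used in the text.

```python
import math

A0_SI_CM = 0.543e-7               # Si lattice constant in cm (0.543 nm)
STEP_HEIGHT_CM = A0_SI_CM / 4.0   # assumed single-layer step height on Si(001)

def terrace_width_cm(miscut_deg):
    """Average terrace width for a vicinal surface with the given miscut angle."""
    return STEP_HEIGHT_CM / math.tan(math.radians(miscut_deg))

def clusters_per_square(density_per_cm2, width_cm):
    """Number of nucleation centers inside one W x W terrace square."""
    return density_per_cm2 * width_cm ** 2

def hops_to_sink(width_cm, hop_length_cm):
    """Order-of-magnitude 2D random-walk estimate (W/a)^2 of hops needed to reach a sink."""
    return (width_cm / hop_length_cm) ** 2

if __name__ == "__main__":
    w = terrace_width_cm(0.5)
    print(f"terrace width at 0.5 deg miscut: {w:.2e} cm")
    print("clusters per square, W rounded to 2e-6 cm:",
          clusters_per_square(1e12, 2e-6))
    print(f"clusters per square, unrounded W: {clusters_per_square(1e12, w):.1f}")
```

With the terrace width rounded to 2×10^-6 cm the estimate gives the four clusters per square quoted in the text; the unrounded width gives between two and three, which leaves the conclusion unchanged.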

We  speculate  that  in  the  case  of  sample  SL-Sb,  where  the 
surface  stays  smooth  throughout  the  growth,  the  surfactant 
assisted  growth  mechanism  prevents  the  dimers  and  trimers 
from  forming  effective  nucleation  centers.  The  primary  role 
of  the  Sb  is  to  ride  as  a  surface  layer,  burying  the  carbon 
dimer  and  trimers.  An  impinging  silicon  adatom  diffuses  on 
the  Sb  layer  until  it  reaches  a  proper  site.  At  that  site,  the 
adatom  undergoes  an  exchange  with  a  surfactant  atom  and 
incorporates.  Again,  the  carbon  clusters  do  not  diffuse  on  the 
surface;  rather,  they  are  incorporated  immediately  by  some 
exchange  mechanism.  Since  direct  contact  between  Si  ada¬ 
toms  and  the  carbon  clusters  is  reduced,  the  clusters  no 
longer  serve  as  effective  nucleation  centers  and  the  growth 
stays  relatively  smooth  as  shown  by  AFM  and  the  streaked 
RHEED  pattern. 

The TEM picture in Fig. 3 shows that the interface between
the Si and Si0.97C0.03 on the substrate side is slightly
rougher on a sample superlattice grown with Sb than it is on
a sample grown without Sb. One possible reason for this is
that, in our case, a perfectly ordered Sb-terminated Si(001)
surface was not achieved [the RHEED pattern was (1×1) as
opposed to (2×1)]. The unordered surface could prompt the
exchange to occur at sites other than steps, thus creating a
less abrupt growth front. A highly ordered Sb-terminated
(2×1)-reconstructed surface could be expected to aid in the
formation of perfectly flat interfaces.

In conclusion, the Sb prevents the carbon dimers and trimers
from serving as effective nucleation centers for 3D
growth, and as a consequence the surface stays smooth during
growth of Si0.97C0.03 on a sample grown with Sb (sample
SL-Sb), whereas the surface roughens during Si0.97C0.03
growth on the sample grown without Sb (sample SL).

V.  SUMMARY 

We studied the effect of adding Sb as a surfactant in the MBE growth of
Si/Si0.97C0.03 superlattices. Our analysis of reflection high energy electron
diffraction, transmission electron microscopy, x-ray diffraction, and atomic force
microscopy data shows that Sb induces 2D growth of Si0.97C0.03 under conditions in
which standard MBE yields 3D growth. Epitaxial growth on the (2×1) Si(001) surface
can be easily disrupted by carbon dimers and trimers, which introduce additional
sites for incorporation of Si adatoms. The Sb-terminated Si surface can prevent
direct contact between the carbon dimers and the silicon, and hence suppress the
tendency of the surface to roughen. The use of Sb as a surfactant during growth of
high-carbon-content Si0.97C0.03 alloys was shown to result in sharper film
interfaces and appears useful for achieving carbon contents in excess of what would
normally be possible for growth on bare Si(001).

We have grown modulation-doped In0.53Ga0.47As/In0.52Al0.48As heterostructures on
GaAs substrates using compositionally step-graded In_xGa_{1-x}As buffers.
Triple-axis x-ray diffraction measurements indicate nearly complete and isotropic
strain relaxation in the buffer, lattice matching of the active layers with the top
of the buffer, and no significant epilayer tilt. The temperature dependence and the
photoresponse of the electron mobility suggest that transport in the
heterostructures is limited principally by remote ionized-impurity scattering, with
mobility values comparable to those of heterostructures grown lattice-matched to
InP. © 1996 American Vacuum Society.


I.  INTRODUCTION

In_xGa_{1-x}As alloys may be employed as quantum wells for the confinement of a
two-dimensional electron gas (2DEG). Modulation-doped InGaAs/InAlAs
heterostructures are advantageous for field-effect transistor applications due to
their high room temperature electron mobilities, high conduction band offsets, and
minimal deep-level trapping in InAlAs barrier layers. However, only one
heterostructure, In0.53Ga0.47As/In0.52Al0.48As, can be grown lattice-matched to a
binary substrate (InP). In order to maximize the flexibility in choice of alloy
composition without the constraint of a lattice-matched substrate, it is necessary
to interpose a strain relaxing buffer between the substrate and heterostructure.
The heterostructure which is lattice-matched to InP provides a useful test
structure for monitoring the effectiveness of strain relaxing buffers in providing
a suitable substrate for optimum electronic properties. A number of buffer layers
have been considered for the lattice-mismatched growth of
In0.53Ga0.47As/In0.52Al0.48As on GaAs (Δa/a≈3.8%), including compositionally
step-graded buffers, linearly graded buffers, and a combination of step-grading and
strained-layer superlattices. In most of these studies, a variety of complicated
layer designs and growth conditions have been implemented. Furthermore, the
effectiveness of the buffer in relaxing strain was not correlated with the
electronic properties of the active layers. Here, we report the growth of
modulation-doped In0.53Ga0.47As/In0.52Al0.48As heterostructures on GaAs substrates
using simple step-graded In_xGa_{1-x}As buffers with a total thickness of 1 μm. In
addition, we present a correlation between nearly complete buffer strain
relaxation, lattice matching of the active layers with the top of the buffer, and
high electron mobility of the heterostructures.


a)Present address: Department of Physics, Carnegie Mellon University, Pittsburgh,
PA 15213; Electronic mail: rgoldman-l-@andrew.cmu.edu


II.  EXPERIMENTAL  PROCEDURES 

The samples were grown by solid-source molecular beam epitaxy (MBE), with
As4/group III beam equivalent pressure ratios ~35 and growth rates ~0.9 μm/h. The
buffers were graded in 200 nm thick steps, from indium mole fraction x=0 to 0.5, in
increments of Δx=0.1, resulting in a composition gradient of 50 at. % In/μm. A
cross section of the targeted layer structure is presented in Fig. 1. The GaAs,
step-graded In_xGa_{1-x}As buffers, and active layers were grown at 580, 350, and
450 °C, respectively.
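
The buffer design described above can be checked with a few lines of arithmetic. The following is a minimal sketch, not the authors' procedure: it assumes Vegard's law and commonly quoted room-temperature lattice constants for GaAs and InAs (these two numbers are assumptions, not values from the text), and the function names are illustrative only.

```python
# Lattice constants in angstroms. These are widely quoted room-temperature
# values and are an assumption of this sketch, not numbers from the text.
A_GAAS = 5.6533
A_INAS = 6.0583


def ingaas_lattice_constant(x):
    """Vegard's-law lattice constant (angstroms) of relaxed In(x)Ga(1-x)As."""
    return A_GAAS + x * (A_INAS - A_GAAS)


def mismatch_to_gaas(x):
    """Fractional lattice mismatch of relaxed In(x)Ga(1-x)As relative to GaAs."""
    return (ingaas_lattice_constant(x) - A_GAAS) / A_GAAS


def step_graded_buffer(x_final=0.5, dx=0.1, step_nm=200.0):
    """List of (indium fraction, thickness in nm) for each buffer step."""
    n_steps = round(x_final / dx)
    return [(round((i + 1) * dx, 10), step_nm) for i in range(n_steps)]


steps = step_graded_buffer()
total_um = sum(thickness for _, thickness in steps) / 1000.0
# Composition gradient in at. % In (on the group-III sublattice) per micrometer.
gradient = 100.0 * steps[-1][0] / total_um

print(steps)
print(f"total thickness = {total_um:.1f} um, gradient = {gradient:.0f} at.% In/um")
print(f"mismatch of In0.53Ga0.47As to GaAs = {100 * mismatch_to_gaas(0.53):.1f} %")
```

Under these assumptions the five 200 nm steps add up to the 1 μm buffer and the 50 at. % In/μm gradient quoted in the text, and the mismatch of the lattice-matched-to-InP composition comes out near 3.8%.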

Triple-axis x-ray diffraction measurements were performed at Beamline X18A at the
National Synchrotron Light Source (NSLS) at Brookhaven National Laboratory. The
wavelength of incident radiation was selected to be 1.653 Å using a double-crystal
Si(111) monochromator. In addition, the source point of the synchrotron was focused
on the sample by means of a Pt-coated aluminum mirror placed in the beam path,
following the monochromator. The samples were mounted onto a six-circle Huber
diffractometer and the signal was collected by a Ge(111) analyzer crystal and
scintillation counter. With this configuration, the instrumental resolution was
0.012°. Contour maps were measured near the (004) and (224) Bragg peaks. These
contour maps consisted of a series of ω-2θ scans, with the same initial value of 2θ
(angle between incident x-ray beam and detector) but a slightly different value of
ω (angle between the incident beam and sample). The (004) and (224) d spacings were
determined from this data.

Electron transport measurements were implemented with six-arm Hall bars (200×800
μm) aligned along both ⟨110⟩ in-plane directions, fabricated by standard contact
photolithographic processes, with Ni/AuGe/Ni ohmic contacts deposited in vacuum and
annealed at 420 °C. Low magnetic field Hall and resistivity measurements as a
function of temperature, from 300 to 1.6 K, and resistivity measurements as a
function of magnetic field, from 0 to 7.5 T, were performed in a superconducting
NbTi magnet. In addition, the photoionization of electrons trapped in deep levels
was monitored by illuminating the sample with a GaAs light-emitting diode (LED).

Fig. 1. Cross section of targeted layer structures. The step-graded buffers were
graded in 200 nm thick steps. The layers are undoped, except for the δ-doped
region.

III.  RESULTS  AND  DISCUSSION 

Figures 2(a) and 2(b) show symmetric (004) and (224) ω-2θ scans for the sample. In
the plots, the epilayer peak positions are referenced with respect to the GaAs
substrate peak, arbitrarily set to Δθ=0 arcsec, for visual comparison of Bragg
angle differences between the substrate and epilayer. The dip in the substrate peak
is due to saturation of the detector. It is clear from this data that the top
epilayers are lattice-matched to the buffer. Four sets of (004) and (224)
reflections were measured and indicated a tetragonal lattice with symmetric strain
relaxation in in-plane ⟨110⟩ directions. From the (004) and (224) d spacings, the
in-plane and out-of-plane lattice constants, a and c, were obtained. The In mole
fraction in each layer was determined, assuming a linear interpolation of elastic
and lattice constants between bulk GaAs and InAs.

Fig. 2. Symmetric (a) (004) and (b) (224) ω-2θ scans of the five-step
compositionally graded In_xGa_{1-x}As buffer layer grown on (001) GaAs. [Horizontal
axis: Δθ (arcseconds), from -10000 to 2000.]

Analysis of the x-ray data yields layer In mole fractions 0.15, 0.26, 0.35, 0.45,
and 0.53, respectively. The strain relaxation in each layer with respect to the
substrate ranges from 92 to 95±5%, essentially identical to within experimental
error. A more sensitive indication of the effectiveness of strain relaxation is the
strain relaxation in each layer with respect to the in-plane lattice constant of
the previous layer. This ranges from 94 to 62%, with the expected decreasing trend
towards the surface layers, similar to observations in SiGe/Si heterostructures.
The overall relaxation of the top epilayer is 92%, with in-plane and out-of-plane
lattice constants 5.851 and 5.883 Å, respectively. Assuming that the
strain  is  relaxed  by  the  edge  component  of  60°  misfit  dislo¬ 
cations,  the  density  of  misfit  dislocations  at  the  interface  be¬ 
tween  the  fourth  and  fifth  buffer  step  is  2.6XlO^/cm^. 
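
The step from the measured lattice constants to a composition and a degree of relaxation can be illustrated as follows. This is a sketch under stated assumptions rather than the authors' code: it uses the standard tetragonal-distortion relation for a biaxially strained (001) layer, linear interpolation of lattice and elastic constants between GaAs and InAs as described in the text, and handbook values for those constants (the numerical constants below are assumptions, not taken from the article).

```python
# Handbook-style constants (assumed for this sketch, not from the article).
A_GAAS, A_INAS = 5.6533, 6.0583          # lattice constants, angstroms
C11 = {"GaAs": 118.8, "InAs": 83.3}      # elastic constants, GPa
C12 = {"GaAs": 53.8, "InAs": 45.3}


def relaxed_lattice_constant(a_par, c_perp, n_iter=50):
    """Return (relaxed cubic lattice constant, In fraction x) for a layer.

    For biaxial strain on (001): (c - a_r) = -(2*C12/C11) * (a - a_r).
    The elastic constants depend on x, and x depends on a_r through
    Vegard's law, so the pair is found by fixed-point iteration.
    """
    x = 0.5  # starting guess
    a_rel = a_par
    for _ in range(n_iter):
        c11 = C11["GaAs"] + x * (C11["InAs"] - C11["GaAs"])
        c12 = C12["GaAs"] + x * (C12["InAs"] - C12["GaAs"])
        k = 2.0 * c12 / c11
        a_rel = (c_perp + k * a_par) / (1.0 + k)
        x = (a_rel - A_GAAS) / (A_INAS - A_GAAS)
    return a_rel, x


def relaxation(a_par, a_rel, a_sub=A_GAAS):
    """Fraction of the substrate mismatch that has been relieved in-plane."""
    return (a_par - a_sub) / (a_rel - a_sub)


# In-plane and out-of-plane lattice constants quoted for the top epilayer.
a_rel, x_in = relaxed_lattice_constant(5.851, 5.883)
r_top = relaxation(5.851, a_rel)
print(f"a_relaxed = {a_rel:.4f} A, x = {x_in:.3f}, relaxation = {100 * r_top:.0f} %")
```

With these assumed constants, the quoted a and c for the top epilayer give an indium fraction close to 0.53 and a relaxation of roughly 92%, in line with the values reported above.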

Figures 3(a) and 3(b) present (004) and (224) contour maps taken with the x-ray
beam incident in the [110] direction. In the plots, the y axis corresponds to θ and
the x axis is ω_mid, the center value of ω in each ω-2θ scan. Plotted in this
manner, the y axis is a measure of strain, and the x axis is a measure of lattice
plane tilt. To compare peak broadening stemming from d spacing variations and
microtilting of lattice planes (mosaic spread), the data has been normalized to
the  top  epilayer  peak  intensity  and  plotted  as  contours  with 
lines  of  equal  intensity  ranging  from  (|^)/niax  1^  /jnax» 
crements  of  (5)/n,ax- 

In the (004) map, the substrate peak lies at θ=35.16°, and the five steps in the
buffer lie at successively smaller θ. The ω_mid values of the epilayer peaks are
approximately the same as that of the substrate, indicating that there is no
measurable macroscopic tilt of the epilayer about the [110] axis. Similar data
taken with the sample rotated 90° azimuthally resulted in nominally identical ω_mid
values of the epilayer and substrate peaks, indicating no measurable macroscopic
tilt about the [1̄10] axis. In the (224) map, the substrate peak lies at 45.71°,
and the five steps in the buffer also lie at successively smaller θ. The ω_mid
values of the epilayer peaks are the same as that of the substrate, confirming the
lack of tilt about the [110] axis, and the nearly complete relaxation of the
layers.

For both (004) and (224) reflections, the full width at half-maximum (FWHM) of the
ω-2θ scan for the top epilayers was 0.07°, indicating a uniform distribution of
(004) and (224) d spacings throughout the thickness. Furthermore, the FWHM of the ω
scans of the top epilayers was 0.5° and 0.3° for the (004) and (224) reflections,
respectively. We have observed similar amounts of broadening in ω-2θ and ω scans of
nominally identical epilayers grown on linearly graded Al0.48(In_xGa_{1-x})0.52As
buffers (with nominally identical composition gradient) on GaAs. In addition, the
smaller ω-broadening for (224) in comparison with (004) is similar to results from
studies of step-graded SiGe/Si heterostructures. Assuming that the mosaic spread
originates from a Gaussian distribution of misoriented blocks, an estimate of the
dislocation density is given by D≈(Δω)²/9b², where Δω is the rocking curve width
(in radians), and b is the Burgers vector. The resulting estimated dislocation
density is ~10^[…]/cm². Since this density of dislocations cannot be accounted for
by interfacial misfit dislocations (~10^[…]/cm²), a significant density of
threading dislocations may exist in the top epilayers. However, plan-view
transmission electron microscopy (TEM) studies of the epilayers grown on linearly
graded Al0.48(In_xGa_{1-x})0.52As buffers mentioned before indicated a threading
dislocation density ~10^[…]/cm². Therefore, the accuracy of the dislocation density
estimates from the misoriented block model needs further investigation.

Fig. 3. Triple-axis contour maps (θ vs ω_mid) performed near the (a) (004) and (b)
(224) reflections. The maps consist of a series of ω-2θ scans, each with the same
initial value of 2θ but a slightly different value of ω. To compare peak broadening
stemming from d spacing variations and microtilting of lattice planes (mosaic
spread), the data has been normalized to the top epilayer peak intensity, and
plotted as contours with lines of equal intensity ranging from […]I_max to I_max,
in increments of […]I_max.
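
The misoriented-block estimate D≈(Δω)²/9b² is easy to evaluate for the rocking-curve widths quoted above. The sketch below is illustrative only: the Burgers vector magnitude is taken as a/√2 (the usual value for 60° dislocations in a zinc-blende lattice), and the lattice constant a≈5.87 Å is an assumed value for the relaxed top layer; neither number is stated in this form in the text.

```python
import math


def mosaic_dislocation_density(fwhm_deg, burgers_cm):
    """Dislocation density (cm^-2) from the misoriented-block model.

    D ~ (delta_omega)^2 / (9 * b^2), with delta_omega in radians.
    """
    delta_omega = math.radians(fwhm_deg)
    return delta_omega ** 2 / (9.0 * burgers_cm ** 2)


# Assumed relaxed lattice constant of the top layer (angstrom -> cm) and the
# corresponding a/sqrt(2) Burgers vector; both are assumptions of this sketch.
a_cm = 5.87e-8
b_cm = a_cm / math.sqrt(2.0)

d_004 = mosaic_dislocation_density(0.5, b_cm)  # (004) omega-scan FWHM
d_224 = mosaic_dislocation_density(0.3, b_cm)  # (224) omega-scan FWHM
print(f"D from 0.5 deg width: {d_004:.1e} cm^-2")
print(f"D from 0.3 deg width: {d_224:.1e} cm^-2")
```

Under these assumptions both widths give densities of order 10^9 cm^-2. Because the estimate scales with the square of the width, it is quite sensitive to which reflection is used.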

The 0.5° mosaic spread measured in the top epilayers has not adversely affected the
electronic properties of the 2DEG. Figures 4(a) and 4(b) present the temperature
dependence of the apparent electron mobility μ_H and the apparent electron
concentration [n]_H determined from low-field Hall and resistivity measurements.
The solid and open symbols correspond to data taken in the [110] and [1̄10]
directions, respectively. The peak electron mobility, in the [1̄10] direction, is
comparable to that of similar heterostructures grown lattice-matched to InP, which
have room temperature mobilities ranging from 9600 to 13 600 cm²/V s. Similar
temperature dependencies of μ_H and nearly temperature-independent [n]_H are
observed in both ⟨110⟩ directions. The temperature independence of the mobility in
the range 20-1.6 K indicates that the mobility is limited primarily by remote
ionized impurity scattering, as expected for a well-behaved 2DEG. Furthermore, the
data indicate anisotropic μ_H at all temperatures. These anisotropies, which are
also observed in heterostructures grown lattice-matched to InP, will be discussed
elsewhere.

Fig. 4. Temperature dependence of (a) apparent electron mobility μ_H and (b)
apparent electron concentration [n]_H determined from low magnetic field Hall and
resistivity measurements in ⟨110⟩ in-plane directions. [Horizontal axis:
Temperature (K), 1 to 100.]

Table I. Summary of 300 and 1.6 K data taken in both ⟨110⟩ in-plane directions, in
the dark and under illumination with a GaAs LED. Listed are the apparent electron
mobility μ_H and apparent electron concentration [n]_H determined from low-field
Hall and resistivity measurements. Also listed are the subband concentrations [n]_i
determined from the FFT power spectrum of the ρ_xx oscillations in reciprocal
magnetic field. The experimental uncertainties are approximately ±5%.

          300 K                                1.6 K
Sample    μ[110]   μ[1̄10]   [n]_H       μ[110]   μ[1̄10]   [n]_H       [n]_0   [n]_1   [n]_2 (?)
          (cm²/V s)          (cm⁻²)      (cm²/V s)          (cm⁻²)      (10¹² cm⁻²)
Dark      7500     12 600    2.7×10¹²    37 700   64 000    2.5×10¹²    1.37    0.88    0.40
LED       7500     12 600    2.7×10¹²    40 500   68 300    3.1×10¹²    1.66    0.78    0.45

The two-dimensional nature of the electron gas was confirmed from 1.6 K high-field
Shubnikov-de Haas (SdH) measurements, which indicate oscillations of the
resistivity ρ_xx and quantum Hall plateaus in the transverse resistivity ρ_xy. A
slight beating of the ρ_xx oscillations is observed, indicating multiple subband
occupation of the 2DEG. The electron concentrations [n]_i determined from a fast
Fourier transform (FFT) power spectrum of the ρ_xx oscillations in reciprocal
magnetic field are summarized in Table I. Charge-carrier confinement in the 2DEG is
confirmed through a comparison of the total electron concentration
[n]_T=[n]_0+[n]_1 determined from SdH measurements with [n]_H; these agree to
within 2%, indicating excellent carrier confinement.
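
The conversion from an SdH oscillation frequency to a subband density can be sketched as below. This is a hypothetical illustration on synthetic data, not the authors' analysis: it assumes spin-degenerate subbands, so that n = 2ef/h with f the oscillation frequency in reciprocal field, and it locates the dominant frequency with a plain discrete Fourier sum rather than a library FFT. The function names and the synthetic trace are inventions of this sketch.

```python
import cmath
import math

E_CHARGE = 1.602176634e-19   # C
PLANCK = 6.62607015e-34      # J s


def subband_density_cm2(freq_tesla, spin_degeneracy=2):
    """Sheet density (cm^-2) of one subband from its SdH frequency in 1/B."""
    return spin_degeneracy * E_CHARGE * freq_tesla / PLANCK * 1e-4


def dominant_frequency(inv_b, signal, f_min, f_max, df=0.05):
    """Frequency (tesla) with the largest Fourier power on a trial grid."""
    mean = sum(signal) / len(signal)
    best_f, best_power = f_min, -1.0
    n_trial = int(round((f_max - f_min) / df)) + 1
    for k in range(n_trial):
        f = f_min + k * df
        amp = sum((s - mean) * cmath.exp(-2j * math.pi * f * u)
                  for u, s in zip(inv_b, signal))
        power = abs(amp) ** 2
        if power > best_power:
            best_f, best_power = f, power
    return best_f


# Synthetic single-subband oscillation, uniformly sampled in 1/B between
# 1 T and 7.5 T. The 28.3 T frequency is chosen for illustration only.
f_true = 28.3
u_lo, u_hi, n_pts = 1.0 / 7.5, 1.0, 400
inv_b = [u_lo + i * (u_hi - u_lo) / (n_pts - 1) for i in range(n_pts)]
rho_osc = [math.cos(2.0 * math.pi * f_true * u) for u in inv_b]

f_est = dominant_frequency(inv_b, rho_osc, 5.0, 60.0)
n_est = subband_density_cm2(f_est)
print(f"f = {f_est:.2f} T  ->  n = {n_est:.2e} cm^-2")
```

A frequency near 28 T corresponds to a density near 1.4×10¹² cm⁻² under the spin-degeneracy assumption, which is the order of magnitude of the subband densities discussed in the text. Real data would show several peaks, one per occupied subband, plus damping that this sketch ignores.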

Further confirmation of the high quality of this heterostructure was obtained by
illuminating the sample with a GaAs LED. The photoionization of electrons trapped
in deep levels within the fundamental bandgap of InAlAs was monitored at room
temperature and 1.6 K, and the results are listed in Table I. At room temperature,
the LED has a negligible effect on μ_H and [n]_H. At 1.6 K, the LED has a small
effect on μ_H, Δμ below 5000 cm²/V s, with a change in [n]_H of Δ[n]≈0.5×10¹²/cm².
The increase in [n]_H leads to an increase in μ_H, suggesting that illumination
with the LED has led to the photoionization of electrons trapped in acceptor-like
deep levels. The light sensitivity of these heterostructures is comparable to or
less than that of those grown lattice-matched on InP. In addition, 1.6 K high-field
SdH measurements were performed after illumination for 30 min (the LED had been
turned off). The subband electron concentrations [n]_i determined from the FFT
power spectrum of the ρ_xx oscillations in reciprocal magnetic field are listed in
Table I. In this case, the agreement between [n]_T and [n]_H is within 7%.

IV.  CONCLUSIONS 

In summary, we have demonstrated that a simple, 1 μm thick, compositionally
step-graded In_xGa_{1-x}As buffer effectively relaxes the ~3.8% mismatch strain
between an In0.53Ga0.47As/In0.52Al0.48As heterostructure and a GaAs substrate.
Triple-axis x-ray diffraction data reveal essentially complete buffer strain
relaxation, lattice matching of the active layers with the buffer, and negligible
epilayer tilt. High electron mobilities and insignificant light sensitivities of
the mobility and carrier concentration of the 2DEG are observed.

Surface processes during the heteroepitaxy of GaP on Si under pulsed chemical beam
epitaxy conditions were investigated simultaneously by the optical methods
reflectance difference/anisotropy spectroscopy, p-polarized reflectance
spectroscopy (PRS), and laser light scattering. Our studies were performed during
both cyclic and interrupted growth, where the surface was exposed to individual
pulses of the precursors triethylgallium (TEG) and tertiarybutylphosphine (TBP).
The data show that the three optical probes provide different perspectives of
growth. Several surface processes exhibit time constants of the order of 1 s. One
such process is the clustering of Ga atoms, or less likely, of TEG fragments, that
occurs with TEG exposure. The optical data also show that TBP dealkylation occurs
essentially instantaneously upon arrival at the surface, and that TEG dealkylation
is the rate-limiting step. The PRS data exhibit fine structure that shows that
heteroepitaxial growth can be described by a four-phase model consisting of the
substrate, a GaP layer, a surface reaction layer containing all adsorbed species
not yet incorporated in the growing layer, and the ambient. By assuming that this
surface layer is very thin we derive approximate equations that allow us to treat
the PRS response quantitatively. © 1996 American Vacuum Society.


I.  INTRODUCTION 

In this article we discuss real-time monitoring of the pulsed chemical beam
heteroepitaxial growth of GaP on Si by three different optical techniques:
reflectance difference/anisotropy spectroscopy (RDS/RAS), p-polarized reflectance
spectroscopy (PRS), and laser light scattering (LLS). The objective is a better
understanding of the growth process, specifically surface chemistry and reaction
kinetics. A discussion of the complicated chemistry of this growth system is given
in Ref. 7.

The results are expected to be useful in the engineering of heteroepitaxial growth
on silicon. In particular, the quality of the heteroepitaxial layers depends
strongly on interfacial properties, and can be optimized only when the deposition
process, especially the early stage, is well understood. As discussed earlier, PRS
in the visible range shows a large sinusoidal signal during growth due to
interference between front- and back-surface reflections associated with the
growing GaP layer, from which the overall growth rate can be determined.
Superimposed on this is a fine structure that is linked to the pulsed supply of the
precursors. This fine structure is also observed in RDS and, to a lesser extent, in
LLS. The three probes provide different perspectives of the growth process, and
show in particular that in the growth system studied here, triethylgallium (TEG)
dealkylation is the rate-limiting step.

II.  EXPERIMENT 

Heteroepitaxial growth was performed in a pulsed mode at low substrate temperatures
of 360-400 °C as measured by thermocouples and calibrated by a pyrometer. The
precursors were tertiarybutylphosphine (TBP) and TEG. The silicon substrates were
lightly (1-10 Ω cm) p doped. To determine the possible influence of steps on
growth, we used (113) substrates and substrates oriented 6° off (001) toward [110].
Since on-axis (001) Si substrates yielded no net RDS signals as a result of the
formation of antiphase domains, (001) substrates were used only for comparison.

a)Published without author corrections.
b)Electronic mail: rossow@unity.ncsu.edu

Details of the RDS configuration are given in Refs. 1 and 2, and those of PRS and
LLS in Ref. 5. For PRS and LLS a He-Ne laser was used as the light source (633 nm,
1.96 eV). RDS transients were taken at a fixed photon energy of 3.6 eV, where the
optical penetration depth of GaP is low and the detected light intensity is high.
The intrinsic RDS time resolution was the RDS sampling period, 1 ms. However, to
improve signal-to-noise ratios we averaged 25 and 50 points for interrupted and
cyclic growth, respectively, yielding effective time resolutions of 25 and 50 ms,
respectively. The time resolution for PRS and LLS was 100 ms. The pneumatic
switching valves for TEG and TBP have a small dead volume and a time response
faster than 20 ms, as specified by the manufacturer (NUPRO).

III.  LINEARIZATION  OF  THE  PRS  SIGNAL 

A typical PRS transient for a GaP layer heteroepitaxially grown on Si is shown in
Fig. 1. These data can be described with the so-called four-phase model, which
involves the ambient (0), a surface reaction layer (1), the GaP layer (2), and the
Si substrate (3), where all interfaces are assumed to be sharp and reflections from
the back of the substrate can be ignored. It is necessary to include a surface
reaction layer, because unreacted species on the surface of the growing GaP film
generally have refractive indices different from that of GaP. This is manifest in
the data of Fig. 1, which show a fine structure in the PRS data that accompanies
cyclic changes in ambient exposure.

Fig. 1. Typical PRS transient during heteroepitaxial growth on Si. The fine
structure is shown on an expanded scale in the inset. The nonzero values of the
minima are due to an offset in the electronic signal.

In the four-phase model the PRS signal is proportional to the absolute square
$R_{4p}=|r_{4p}|^2$ of the complex reflectance coefficient $r_{4p}$ for p-polarized
light, which is given by

$$ r_{4p} = \frac{(r_{01p}+r_{12p}e^{i2\beta_1}) + (r_{01p}r_{12p}+e^{i2\beta_1})\,r_{23p}e^{i2\beta_2}}{(1+r_{01p}r_{12p}e^{i2\beta_1}) + (r_{12p}+r_{01p}e^{i2\beta_1})\,r_{23p}e^{i2\beta_2}}, \tag{1} $$

where

$$ \beta_i = \frac{2\pi d_i}{\lambda}\sqrt{\epsilon_i-\sin^2\theta_0}, \quad i=1,2 \tag{2} $$

and where $\theta_0$ is the angle of incidence. Since the surface reaction layer is
thin ($|\beta_1|\ll 1$), we can linearize Eq. (1) with respect to $\beta_1$ as

$$ r_{4p}(\beta_1) = r_{4p}(\beta_1=0) + \beta_1\left.\frac{\partial r_{4p}}{\partial\beta_1}\right|_{\beta_1=0} = r_{4p0} + \beta_1\frac{\partial r_{4p}}{\partial\beta_1}. \tag{3} $$

The first term on the right side of Eq. (3) describes interference due to the
growing GaP layer and the second the fine structure due to the variation of $d_1$
or $\epsilon_1$ with exposure.
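
The four-phase expression and its linearization can be checked numerically. The sketch below is illustrative, not the authors' code. It assumes the p-polarized Fresnel coefficient in the form $r_{ij}=(\epsilon_j k_i-\epsilon_i k_j)/(\epsilon_j k_i+\epsilon_i k_j)$ with $k_i=\sqrt{\epsilon_i-\sin^2\theta_0}$, a GaP dielectric constant of 10.96 at 632.8 nm (an assumed value), the Si value 15.07+i0.15 quoted in the text, and hypothetical values $\epsilon_1=8$, $d_1=0.5$ nm, $d_2=30$ nm. The closed-form derivative is obtained here by differentiating the four-phase formula at $\beta_1=0$ and is verified against a finite difference.

```python
import cmath
import math


def kz(eps, theta0):
    """sqrt(eps - sin^2(theta0)); the ambient is taken to have eps = 1."""
    return cmath.sqrt(eps - math.sin(theta0) ** 2)


def r_p(eps_i, eps_j, theta0):
    """p-polarized Fresnel coefficient for the i -> j interface."""
    ki, kj = kz(eps_i, theta0), kz(eps_j, theta0)
    return (eps_j * ki - eps_i * kj) / (eps_j * ki + eps_i * kj)


def r4p(r01, r12, r23, b1, b2):
    """Four-phase reflectance coefficient, Eq. (1)."""
    e1, e2 = cmath.exp(2j * b1), cmath.exp(2j * b2)
    num = (r01 + r12 * e1) + (r01 * r12 + e1) * r23 * e2
    den = (1 + r01 * r12 * e1) + (r12 + r01 * e1) * r23 * e2
    return num / den


def dr4p_dbeta1(r01, r12, r23, b2):
    """Derivative of Eq. (1) with respect to beta_1, evaluated at beta_1 = 0."""
    e2 = cmath.exp(2j * b2)
    r02 = (r01 + r12) / (1 + r01 * r12)
    bracket = (r12 + r23 * e2) * (1 + r12 * r23 * e2)
    return (2j * (1 - r01 ** 2) * bracket
            / ((1 + r01 * r12) ** 2 * (1 + r02 * r23 * e2) ** 2))


theta0 = math.radians(70.0)
lam = 632.8                                    # nm
eps0, eps1, eps2, eps3 = 1.0, 8.0, 10.96, 15.07 + 0.15j   # eps1, eps2 assumed
d1, d2 = 0.5, 30.0                             # nm, hypothetical thicknesses

r01, r12, r23 = (r_p(eps0, eps1, theta0), r_p(eps1, eps2, theta0),
                 r_p(eps2, eps3, theta0))
r02 = r_p(eps0, eps2, theta0)
b1 = 2 * math.pi * d1 / lam * kz(eps1, theta0)
b2 = 2 * math.pi * d2 / lam * kz(eps2, theta0)

exact = r4p(r01, r12, r23, b1, b2)
r4p0 = r4p(r01, r12, r23, 0.0, b2)
slope = dr4p_dbeta1(r01, r12, r23, b2)
linear = r4p0 + b1 * slope

e2 = cmath.exp(2j * b2)
three_phase = (r02 + r23 * e2) / (1 + r02 * r23 * e2)
h = 1e-4
slope_fd = (r4p(r01, r12, r23, h, b2) - r4p(r01, r12, r23, -h, b2)) / (2 * h)

print(f"r02p = {r02.real:.3f}, r23p = {r23.real:.3f}")
print(f"|exact - linearized| = {abs(exact - linear):.1e}")
```

For a sub-nanometer surface layer the linearized coefficient differs from the exact one by far less than the first-order term itself, which is the premise of the thin-layer treatment. Setting $\beta_1=0$ also recovers the ordinary three-phase (ambient/GaP/Si) expression.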

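The interference term is given by

$$ r_{4p0} = \frac{r_{02p}+r_{23p}e^{i2\beta_2}}{1+r_{02p}r_{23p}e^{i2\beta_2}}. \tag{4} $$

If the Brewster condition $\theta_0=\theta_B$ is fulfilled exactly, then
$r_{4p0}=r_{03p}=0$ when $\exp(i2\beta_2)=1$ and therefore $r_{02p}=-r_{23p}$.
Under this condition

$$ r_{4p0} = r_{02p}\,\frac{1-e^{i2\beta_2}}{1-r_{02p}^2e^{i2\beta_2}} \tag{5} $$

$$ \approx -2i\,r_{02p}\,e^{i\beta_2}\sin\beta_2. \tag{6} $$

For real $\beta_2$ we therefore have

$$ \text{PRS signal}\sim R_{4p0}=|r_{4p0}|^2=4R_{02p}\sin^2\beta_2, \tag{7} $$

where $R_{02p}=|r_{02p}|^2$. Thus for $\theta_0=\theta_B$ and $\beta_1=0$ the PRS
signal exhibits a sinusoidal behavior with a period
$\Delta d=\lambda/[2(\epsilon_2-\sin^2\theta_0)^{1/2}]$. For GaP at 632.8 nm
[$\epsilon_{\mathrm{GaP}}=$ […] $+\,i0.00$ (see Ref. 13)] and
$\theta_0\approx\theta_B\approx 70°$, $\Delta d$ is approximately 101 nm.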
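
The interference period follows from requiring $\beta_2$ to advance by $\pi$, and it converts the spacing of PRS minima directly into a growth rate. The sketch below is a minimal illustration: the GaP dielectric constant of 10.96 at 632.8 nm is an assumed value (the real part is not legible in the text), and the 50 s spacing of minima used in the example is hypothetical.

```python
import math


def interference_period_nm(wavelength_nm, eps2, theta0_deg):
    """Thickness increment for one PRS oscillation: lambda / (2*sqrt(eps2 - sin^2))."""
    s2 = math.sin(math.radians(theta0_deg)) ** 2
    return wavelength_nm / (2.0 * math.sqrt(eps2 - s2))


def growth_rate_nm_per_s(period_nm, seconds_between_minima):
    """Average growth rate implied by the time between successive minima."""
    return period_nm / seconds_between_minima


period = interference_period_nm(632.8, 10.96, 70.0)   # eps2 is an assumption
rate = growth_rate_nm_per_s(period, 50.0)             # hypothetical 50 s spacing
print(f"period = {period:.1f} nm, growth rate = {rate:.2f} nm/s")
```

With the assumed dielectric constant the period comes out at roughly 100 nm, close to the value quoted in the text; the small difference reflects the uncertainty in the dielectric constant used here.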

For growth monitoring the Brewster condition is difficult to realize because the
dielectric function, and consequently $\theta_B$, is temperature dependent. For the
more general case of $\theta_0\neq\theta_B$, $R_{4p0}$ is given by

$$ R_{4p0}\approx R_{03p} + 2[R_{23p}-\mathrm{Re}(r_{03p}^{*}r_{23p})] - 2\cos 2\beta_2\,[R_{23p}-\mathrm{Re}(r_{03p}^{*}r_{23p})] - 2\sin 2\beta_2\,\mathrm{Im}(r_{03p}^{*}r_{23p}), \tag{8} $$

where the $R_{ijp}$ terms have obvious meaning. If the substrate is weakly
absorbing, as is the case for Si at 632.8 nm, the last term can be neglected and we
obtain

$$ R_{4p0}\approx R_{03p} + 2[R_{23p}-\mathrm{Re}(r_{03p}^{*}r_{23p})][1-\cos 2\beta_2], \tag{9} $$

which is basically Eq. (7) with a dc offset as seen in Fig. 1. If $R_{03p}=0$
($\theta_0=\theta_B$) then from either Eq. (7) or (9) the PRS signal must begin to
rise when GaP growth is initiated. However, Fig. 1 shows that this is not observed:
the signal begins near its maximum and then decreases. From Eq. (9) this is
possible if $R_{03p}\neq 0$ and $r_{23p}$ is smaller than $r_{03p}$, which means
that $[R_{23p}-\mathrm{Re}(r_{03p}^{*}r_{23p})]$ is negative. Furthermore, with
increasing thickness of the GaP layer we observe that the signal approaches zero.
Therefore, $R_{03p}\approx 4\,|R_{23p}-\mathrm{Re}(r_{03p}^{*}r_{23p})|$.
From this result it follows that $r_{03p}\approx 2r_{23p}$. Using these conditions
and the values for the dielectric functions of Si and GaP given above, we can
calculate $\theta_0$. We find this to be about 70.4°, which is slightly less than
[…].
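
The angle quoted above can be reproduced with a short root search. For real Fresnel coefficients, requiring the PRS minima to reach zero, $R_{03p}=4\,r_{23p}(r_{03p}-r_{23p})$, is equivalent to $(r_{03p}-2r_{23p})^2=0$, so the angle of incidence must satisfy $r_{03p}=2r_{23p}$. The sketch below solves this by bisection. It is illustrative only: the GaP dielectric constant of 10.96 is an assumed value, the Si value 15.07+i0.15 is the one quoted in the text, the small imaginary parts are ignored when locating the root, and the function names are inventions of this sketch.

```python
import cmath
import math

EPS_GAP = 10.96            # assumed GaP dielectric constant at 632.8 nm
EPS_SI = 15.07 + 0.15j     # Si value quoted in the text


def kz(eps, theta):
    return cmath.sqrt(eps - math.sin(theta) ** 2)


def r_p(eps_i, eps_j, theta):
    """p-polarized Fresnel coefficient for the i -> j interface."""
    ki, kj = kz(eps_i, theta), kz(eps_j, theta)
    return (eps_j * ki - eps_i * kj) / (eps_j * ki + eps_i * kj)


def condition(theta_deg):
    """Real part of r03p - 2*r23p; zero where the PRS minima touch zero."""
    theta = math.radians(theta_deg)
    r03 = r_p(1.0, EPS_SI, theta)
    r23 = r_p(EPS_GAP, EPS_SI, theta)
    return (r03 - 2.0 * r23).real


def solve_angle(lo=65.0, hi=75.0, tol=1e-6):
    """Bisection on condition(theta) between lo and hi degrees."""
    f_lo = condition(lo)
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        f_mid = condition(mid)
        if (f_mid > 0) == (f_lo > 0):
            lo, f_lo = mid, f_mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


theta_fit = solve_angle()
print(f"angle of incidence satisfying r03p = 2*r23p: {theta_fit:.2f} deg")
```

Under these assumptions the root lies within a few hundredths of a degree of the 70.4° quoted in the text.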

To  determine  the  fine-structure  contribution  we  must 
evaluate  the  scaling  factor 

dr^p  _  2i(l-rgip)/ri2p 

V _ ^  / 

(l+''02p''23pe‘^^2)2 

(10) 


Combining  the  previous  results  the  general  expression  for 
the  PRS  signal,  to  first  order  in  py ,  is 

PRS~/?4p-i?4po+2Re  rlpoA  ^  •  (13) 

Therefore,  the  amplitude  of  the  fine  structure  that  is  caused 
by  a  variation  of  /5i,  i.e.,  variations  in  and/or  di,  depends 
also  on  r^pQ  and  dr^^f Neglecting  for  simplicity 
the  squared  terms  of  the  Fresnel  coefficients  we  obtain 

PRS~/?4^^/?03p  +  2[R23p“R^(^03/?'*23p)][l  “COS  2^2] 

X  Re{''02p(  1  -  «”'^^^)/3i2![ri2p+  r23pe'^^2 

X(l+r?2p)]}-  (14) 


For  dQ^dg,  rQ2p  and  ^23^  are  small  compared  to  1  [<«  0.083 
and  0.072,  respectively,  for  6o=10°  and  esi=15.07+!0.15) 
[see  Ref.  13].  Neglecting  products  of  these  terms  yields 


dr4p 

d/3i 


01  =  0 


2«(l-''oip)  , 


+  r23pe'2^2(l  + 


(11) 


The  term  in  the  brackets  can  be  given  a  simple  geometric 
interpretation.  In  the  transparent  region  of  GaP,  y02  is  real. 
Therefore,  with  increasing  layer  thickness  describes  a 
circle  of  unit  radius  in  the  complex  plane.  If  the  surface 
reaction  layer  is  also  transparent,  ri2p  is  also  real  and  the 
term  in  the  brackets  is  largest/smallest  for  -  ±1  de¬ 
pending  on  the  sign  of  ri2p  and  r23^.  Since  for  ^'^^2  =  ±  1 
the  interference  term  shows  extrema  the  fine  structure  will 
be  largest/smallest  at  the  minima/maxima  of  the  PRS  signal 
(see  Fig.  1).  This  effect  is  most  pronounced  when 
ri2p"^r23p(l  +  ^i2p)j  a  condition  that  is  fulfilled  for  reason¬ 
able  values  of  (for  example,  for  6|  =  8  the  two  terms  are 
nearly  equal).  For  small  surface  absorption  we  can  write 
ri2p=  |ri2p|(l +i<5),  where  ^<^1.  Then 

[''12P  + '■23pe'^H  1  +  ^12p)]~[kl2pl  + 'kl2pl  <5 

+  ^23pe'^Kl  +  kl2pP)]. 

(12) 

where  the  term  Sr23p\ri2p\^  can  be  neglected.  Now  the  center 
of  the  circle  with  radius  r2^p(  1  + 1  f'npl^)  is  located  a  distance 
\ri2p\  on  the  real  axis  and  <5|ri2p|  above  the  real  axis.  The 
amplitude  of  the  fine  structure  reaches  its  maximum  value  at 
tan  2l32=\ri2p\  1  +  ki2pP])-  This  no  longer  coincides 

with  the  extrema  of  the  interference  term.  Therefore,  from 
the  relative  phases  of  the  interference  and  fine-structure  sig¬ 
nals  we  can  obtain  information  about  Im(ri2;7),  and  thus  the 
surface  absorption. 

If  Im(ri2p)^Re(ri2;7),  which  is  the  case  for  a  metallic 
surface  layer  with  Re(6i)<0,  Im(6i)>0,  no  phase  matching 
of  ri2p  and  r2^p[l+r^2pli  can  occur.  However,  if 
Re(ri2p)^l  then  we  again  obtain  the  equation  of  a  circle, 
but  one  with  its  center  near  the  imaginary  axis. 


Therefore,  the  amplitude  of  the  fine  structure  is  also  periodic 
in  2^2'  general,  the  situation  is  complicated  because  the 
fine-structure  term  is  multiplied  by  real  and  imaginary  parts 
of  r4^o »  which  results  in  terms  (1-cos  2^82)  and  sin  2^2  that 
are  not  in  phase.  For  one  extremum  2^2  which  case 


2  Re 

1677(7  I - - 

=  -  ro2p  — ^  [Re(  Vei  -  sin-"  6»o) 

X  Im(  r  i2p  -  r23p[  1  +  r j2p] )  +  Im(  V^i-sin^  Oq) 
XRe(ri2p-r23p[l+ri2p])].  (15) 


This vanishes for real $\epsilon_1$. The same occurs for $2\beta_2=0$. In
contrast, we often observe that the fine structure is largest
near the maxima of PRS signals. Consequently, in these
cases Im($\epsilon_1$) must be significant. This also follows directly
from Eq. (1). At the extrema, $\exp(i2\beta_2)=\pm 1$, and for GaP
on Si at 632.8 nm all Fresnel coefficients are real. Then the
absolute value of $r_{4p}$ contains only terms in $\cos 2\beta_1$, which
is quadratic in $\beta_1$. We note also that for both parts of the PRS
signal the zero-order term due to interference, $|r_{4p}(\beta_1=0)|^2$,
and the fine-structure term, $2\,\mathrm{Re}[r_{4p}^{*}(\beta_1=0)\,\beta_1(\partial r_{4p}/\partial\beta_1)|_{\beta_1=0}]$,
are both of the order of the square of
the Fresnel coefficient. Therefore, the large discrepancy in
value between the two terms can only be explained by $\beta_1$
being small as suggested above. The ratio in the maxima is
approximately


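$$\frac{2\,\mathrm{Re}\!\left[r_{4p}^{*}(\beta_1=0)\,\beta_1\left.\dfrac{\partial r_{4p}}{\partial\beta_1}\right|_{\beta_1=0}\right]}{|r_{4p}(\beta_1=0)|^2} = -\frac{4\pi d}{\lambda}\,\frac{1}{r_{02p}}\Big[\mathrm{Re}\big(\sqrt{\epsilon_1-\sin^2\theta_0}\big)\,\mathrm{Im}\big(r_{12p}-r_{23p}[1+r_{12p}^2]\big) + \mathrm{Im}\big(\sqrt{\epsilon_1-\sin^2\theta_0}\big)\,\mathrm{Re}\big(r_{12p}-r_{23p}[1+r_{12p}^2]\big)\Big]. \quad (16)$$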

Fig. 2. RDS transient at 3.6 eV for one exposure cycle during heteroepitaxial
growth of GaP on a vicinal wafer oriented 6° off (001) toward [110].
The  response  to  the  TEG  pulse  is  slower  than  that  to  the  TBP  pulse.  The 
upper  level  is  not  flat  but  has  overlaid  structure. 


TEG  pulse  GaP/Si(001)6° 


From Fig. 1 this ratio is 0.02. If we assume that the real
and imaginary parts of $(r_{12p}-r_{23p}[1+r_{12p}^2])$ are of the order
of $1/r_{02p}$, which is a crude approximation, and neglect $\sin^2\theta_0$,
the ratio becomes

$$\frac{4\pi d}{\lambda}\left[\mathrm{Re}(\sqrt{\epsilon_1})+\mathrm{Im}(\sqrt{\epsilon_1})\right]. \quad (17)$$

If we further assume that $d$ is about 0.5 nm, then $4\pi d/\lambda$ is
0.01 and $[\mathrm{Re}(\sqrt{\epsilon_1})+\mathrm{Im}(\sqrt{\epsilon_1})]$ is of the order of 2. This sets
upper limits of 10 to the imaginary and absolute values of the
real part of $\epsilon_1$.
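
The arithmetic behind this estimate can be checked with a short sketch. It assumes only the numbers quoted in the text (a ratio of 0.02 from Fig. 1, a layer thickness of about 0.5 nm, and the 632.8 nm probe wavelength); the variable names are illustrative and not taken from the original analysis.

```python
import math

# Values quoted in the text; the variable names are hypothetical.
wavelength_nm = 632.8   # probe wavelength
d_nm = 0.5              # assumed thickness of the surface layer
ratio = 0.02            # fine-structure / interference ratio read from Fig. 1

# Prefactor 4*pi*d/lambda appearing in Eq. (17)
prefactor = 4 * math.pi * d_nm / wavelength_nm

# Implied value of Re(sqrt(eps1)) + Im(sqrt(eps1))
bracket = ratio / prefactor

print(f"4*pi*d/lambda = {prefactor:.4f}")
print(f"Re(sqrt(eps1)) + Im(sqrt(eps1)) is about {bracket:.2f}")
```

The prefactor comes out close to 0.01 and the bracket close to 2, in line with the order-of-magnitude statements above.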

Although our motivation for this analysis is the understanding
of the PRS data for GaP on Si, this analysis holds in
general for any heterostructure where the epitaxial layer has
a high refractive index and is not optically absorbing.

IV.  RESULTS  AND  DISCUSSION 

In the following we discuss the results of three different
types of experiments: continuous cyclic growth, single
pulses of TEG during continuous TBP exposure, and single
pulses of both TEG and TBP. These experiments were performed
for (113) surfaces and surfaces cut 6° off (001) toward
[110] as indicated either in the figures or in the captions.
Similar results were obtained for both orientations. So
far, we have found no evidence of a growth dependence on
step density.

A.  Continuous  cyclic  growth 

Figure  2  shows  a  typical  RDS  transient  obtained  at  3.6  eV 
and  25  ms  averaging  for  a  single  4  s  cycle  of  alternating  TEG 
and  TBP  exposures  during  the  growth  of  GaP  on  a  Si  surface 
oriented  6°  off  (001)  toward  [110].  At  3.6  eV  this  GaP  layer 
is optically thick. The onsets of the rising and falling parts of
the transient correlate with the onsets of TEG and TBP exposures,
respectively, but the rising and falling durations do
not coincide with precursor exposures. Although TEG exposure
was 0.3 s the rise time is broadened to about 0.5 s,
whereas the decay time is less than 0.25 s for a TBP pulse of


Fig.  3.  Comparison  between  RDS  responses  to  a  single  pulse  of  TEG  on  a 
surface  oriented  6°  off  (001)  toward  [110],  where  the  TBP  exposure  is  either 
continuous  (lower  panel)  or  pulsed  (upper  panel).  All  pulse  durations  were 
0.5  s,  as  indicated  by  the  solid  bars.  With  continuous  TBP  exposure  the 
baseline  is  essentially  invariant,  indicating  that  the  surface  recovers  quickly 
after  the  TEG  pulse. 

0.8 s. An overshoot also appears on both rising and falling
edges. The difference between response and exposure times
is likely caused by surface processes.

B.  Single  precursor  pulses 

To  investigate  this  behavior  further  we  applied  single  0.3 
and  0.5  s  TEG  pulses  to  freshly  grown  GaP  annealed  at  the 
growth  temperature  by  a  continuous  or  pulsed  supply  of  TBP. 
The RDS response to a 0.5 s TEG pulse during TBP exposure
for a surface annealed in TBP for 54 s following growth
is  shown  in  the  lower  panel  of  Fig.  3.  At  this  point  TBP 
exposure  was  terminated  but  the  molecular  hydrogen  flow 
remained.  The  upper  panel  shows  the  result  obtained  when 
this  surface  was  exposed  to  TEG  followed  4  s  later  by  a  TBP 
pulse  and  2.5  s  later  by  continuous  exposure  to  TBP.  A  repeat 
of  this  sequence  yielded  the  same  results  except  that  the 
starting  level  was  lower  than  that  shown  in  the  upper  panel 
of  Fig.  3.  This  level  shift  indicates  that  the  surface  becomes 
Ga  rich  when  TBP  exposure  is  terminated. 

The  RDS  response  to  TEG  clearly  depends  on  whether 
TBP  exposure  is  interrupted  or  continuous.  With  continuous 
TBP  exposure  the  RDS  signal  increases  linearly  during  the 
TEG  pulse  and  afterward  decays  exponentially  to  its  starting 
value. With interrupted TBP exposure a TEG pulse generates
a fast (rise time <0.25 s) response followed by an exponential
decay with a time constant of about 0.7 s to a level higher
than the original baseline. The original baseline is recovered
only after further TBP exposure. We expect that the change
of the RDS signal is related to the total amount of Ga present
on the surface. If we take the RDS signal of the
phosphorus-rich, TBP-annealed surface as the reference, the
change  in  the  RDS  signal  after  TBP  exposure  in  the  upper 
transient  (t^5  s)  is  within  20%  of  the  change  induced  by 

Fig. 5. Comparison of PRS, LLS, and 3.6 eV RDS responses upon exposure
of  a  (113)  surface  to  a  TEG  pulse  followed  by  a  TBP  pulse. 


Fig.  4.  Typical  PRS  and  LLS  transients  for  TEG  pulses  for  a  (113)  surface 
under  continuous  exposure  to  TBP. 

TEG  for  continuous  TBP  exposure  in  the  lower  transient.  It 
is  also  comparable  to  the  change  in  signal  for  continuous 
cyclic  growth  when  TBP  is  pulsed  (see  Fig.  2). 

The  PRS  and  LLS  data  provide  another  perspective  of 
these  processes,  as  illustrated  in  Fig.  4.  For  continuous  TBP 
exposure  the  scattered  light  intensity  increases  only  slightly 
if  at  all  following  a  TEG  pulse,  while  the  PRS  transient 
exhibits  steplike  behavior.  From  Eq.  (7)  we  know  that  during 
steady-state  growth,  the  increasing  film  thickness  yields  a 
sinusoidal  PRS  response,  which  is  a  much  weaker  edge  than 
observed. We can therefore conclude that more than one process
is active on the surface. This is further supported by the
surprising fact that Ga does not begin to be incorporated into
the GaP layer until after the TEG pulse is complete. This is
not in contradiction to the faster RDS response of Fig. 2,
because at 3.6 eV RDS is sensitive only to changes of surface
anisotropy and not to layer thickness.

More insight into these processes can be obtained by examining
responses to separate pulses of TEG and TBP. In
Fig. 5 we compare LLS, PRS, and RDS transients for separate
TEG and TBP pulses for GaP layers grown on (113) Si
surfaces. As with (001) surfaces, TEG exposure results in a
broad feature in all three sets of data. However, the onsets for
the LLS and PRS responses are delayed substantially with
respect to that of the TEG pulse. At substrate temperatures
well below those used for normal growth, by contrast, an
immediate PRS and LLS response was obtained. Under present
conditions  a  TBP  pulse  generates  an  immediate  exponential 
decay  in  the  PRS  and  LLS  transients  while  the  RDS  signal 
recovers  essentially  linearly  within  0.4  s.  As  shown  in  Fig.  6 
the  PRS  and  LLS  decays  are  well  described  by  exponentials 
with  time  constants  of  0.54  and  0.70  s,  respectively.  The 
RDS  transient  is  more  complicated  and  is  discussed  below. 
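
As an illustration of how time constants like these can be extracted, the following is a minimal sketch of a single-exponential fit done as a straight-line least-squares fit in log space. The transients here are synthetic and noise-free, generated from the quoted time constants purely for demonstration; the function name and sampling grid are assumptions and do not reproduce the authors' fitting procedure.

```python
import math

def fit_exponential_decay(times, signal):
    """Fit signal = A * exp(-t / tau) by linear least squares on log(signal)."""
    logs = [math.log(s) for s in signal]
    n = len(times)
    t_mean = sum(times) / n
    y_mean = sum(logs) / n
    sxx = sum((t - t_mean) ** 2 for t in times)
    sxy = sum((t - t_mean) * (y - y_mean) for t, y in zip(times, logs))
    slope = sxy / sxx                  # equals -1 / tau
    intercept = y_mean - slope * t_mean
    return math.exp(intercept), -1.0 / slope

# Synthetic transients (hypothetical data) using the time constants from the text.
times = [0.05 * i for i in range(40)]
prs = [math.exp(-t / 0.54) for t in times]
lls = [math.exp(-t / 0.70) for t in times]

amp_prs, tau_prs = fit_exponential_decay(times, prs)
amp_lls, tau_lls = fit_exponential_decay(times, lls)
print(f"PRS tau = {tau_prs:.2f} s, LLS tau = {tau_lls:.2f} s")
```

A log-space fit requires a positive signal measured relative to its final level; with noisy data a nonlinear fit with an offset term would usually be preferable.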

Since the TEG pulse length was 0.5 s and the PRS and
LLS responses occurred after the TEG pulse had terminated,
the PRS and LLS delay times are at least 0.5 s. Since this
delay time is much longer than the specified response time of
the valves and since the response to TBP exposure is effectively
immediate, the results show that rate-limiting intermediate
surface processes are involved. Light scattering is
caused by structures on the surface with a characteristic
length, i.e., a feature size or separation, of order $\lambda$ (here
632.8 nm), which are removed or smoothed by exposure to
TBP. The reflectance of the material in these structures is
probably rather high, because the structures are very unlikely
to be large and should instead have diameters of the
order of a few nm. Since the refractive index of TEG is
low (smaller than 1.002 for the atmospheric-pressure gas
phase) and TEG is unstable at the temperatures used here,
the delayed LLS response indicates that the LLS signal is


time  (s) 

Fig.  6.  Exponential  fit  to  the  PRS  and  LLS  decay  transients  after  TBP 
exposure  for  a  (113)  surface  with  a  pulse  sequence  similar  to  that  used  in 
Fig.  5. 

TEG pulse GaP/Si(113)


Fig.  7.  Two  RDS  transients  for  a  (113)  surface  annealed  for  30  s  in  TBP 
followed  by  the  same  pulse  sequence  as  in  Fig.  5. 

due to the formation of Ga-containing clusters. The most
likely constituent is metallic Ga, since neither monoethylgallium
(MEG) nor diethylgallium (DEG) is likely to aggregate.
Furthermore, even at liquid-phase densities their refractive
indices should not exceed 1.5, which is too small to
explain the LLS signal. However, the species that diffuse to
form the clusters may be MEG, DEG, or even TEG, with
complete dealkylization occurring only after the molecule
arrives at the cluster. If MEG is the diffusing species, the fact
that all its outer electrons are in filled orbitals suggests that it
is relatively stable and hence there exists some probability
that it could also desorb.

In  this  picture  the  delayed  onset  of  the  LLS  and  PRS 
responses  would  be  due  to  complete  and  partial  TEG  dealky¬ 
lization,  which  at  the  low  pressures  (low  10“^  mbar)  and 
substrate  temperatures  (typically  around  360  °C)  used  here  is 
expected  to  occur  on  the  surface  instead  of  in  the  gas  phase. 
The  Ga-rich  clusters  remain  stable  because  the  incorporation 
of  Ga  as  GaP  requires  the  presence  of  P. 

Although the LLS and PRS signals are related, their origins
are different. For sufficiently low TEG exposures the
LLS signal vanishes but a strong PRS response remains.
Moreover, depending on the thickness of the GaP layer and
therefore the phase shift of the electromagnetic wave within
the layer, the peaklike feature in the PRS transient due to the
TEG pulse can be inverted as discussed above in the context
of Eq. (11). Consequently, it is likely that the PRS signal is
caused by a thin surface layer whose dielectric function differs
strongly from that of the underlying film.

Figure 7 shows two RDS responses for a TEG pulse obtained
on a GaP layer grown on a (113) Si surface. The lower
transient is taken at a later time but under otherwise identical
conditions. Clearly, each transient has at least two components.
One is a brief negative spike representing a fast process
that lasts less than 0.5 s. The other is a broad feature
between the negative spike and the TBP pulse that is similar
to that observed in PRS and LLS. The shape of the broad
feature is history dependent, and unlike the negative spike it
can appear either above or below the TBP-established baseline.
This variation of level cannot be due to interference
from a backreflected wave within the GaP layer, as is the


TEG  pulses  GaP/Si(113) 


Fig.  8.  RDS  transients  for  a  (113)  surface  for  various  combinations  of  TEG 
and  TBP  pulses  followed  by  continuous  TBP  exposure. 


case for PRS, since at 3.6 eV the 20 nm penetration depth is
much smaller than the layer thickness. Also, this behavior
cannot be explained by surface chemistry alone, because if
the surface only changes between P-rich and Ga-rich conditions
the final state should be similar for the two transients.
However, it is well known that the presence of anisotropic
structures on the surface, for example ellipsoidal clusters,
causes large RDS responses, with the RDS signal depending
on the shape and the optical properties of the clusters. For
randomly distributed isotropic clusters the RDS signal must
vanish by symmetry. A broad RDS response could be explained
by supposing that initially isotropic clusters become
anisotropic. The cluster material itself is not expected to be
anisotropic since bulk Ga is liquid at the growth temperature.
We also observe a change in the corresponding PRS transient.

Curiously, the sharp negative spike does not always appear
at the onset of TEG exposure but, as shown in Fig. 8, is
further delayed with successive TEG/TBP cycles. The TBP-established
baseline also decreases with successive cycles,
recovering only upon continuous exposure to TBP, as also
shown in Fig. 8. This indicates that the surface retains some
morphology remnant or fraction of fully or partially dealkylized
TEG. The recovery effect of continuous TBP exposure
is most pronounced in Fig. 9, where a steplike transient is
observed.

C.  Possible  surface  processes 

It is puzzling that surfaces being continuously exposed to
TBP react much more slowly to TEG than when TBP exposure
is interrupted. No such variation is observed for TBP:
the PRS, LLS, and RDS data all show that TBP appears to
react immediately upon reaching the surface, regardless of its
initial state. In particular, no LLS response would be expected
if TBP were only physisorbed on the surface, since
the refractive index of TBP is small. One possible explanation
for the slower TEG response with TBP coexposure is
that the butyl group requires time to desorb. The butyl group
probably does not desorb directly as a radical or as fragments.
Because the butyl group is also very bulky, it is also


JVST  B  -  Microelectronics  and  Nanometer  Structures 



Rossow et al.: Investigations of surface processes in GaP heteroepitaxy on Si


TEG pulse GaP/Si(113)


Fig.  9.  As  in  Fig.  8  but  after  further  deposition  of  GaP.  A  steplike  response 
occurs  when  TBP  exposure  becomes  continuous. 

unlikely that two butyl groups will react or that butyl groups
are incorporated into the growing layer. Most likely the butyl
groups will react with hydrogen that originates from either
(i) PH2, (ii) the butyl group itself by conversion to t-butene
(C4H8), or (iii) β-exchange within the ethyl groups to form
C2H4. Hence, the likely possibilities are that (1) butyl reacts
with atomic hydrogen at the surface, desorbing as t-butane, or
(2) t-butene is formed directly. In the former case we can
expect that the attachment of TEG or its fragments to the P
sites will be delayed. A similar argument should hold for
TBP.

For  surfaces  terminated  by  P,  one  P  bond  is  unsaturated. 
Two  neighboring  P  atoms  may  form  a  dimer,  or  the  dangling 
bond  can  be  saturated  by  hydrogen.  It  is  not  clear  at  present 
whether  hydrogen  desorbs  from  the  surface  at  these  growth 
temperatures.  However,  the  behavior  of  Si  surfaces  may  give 
some  hints  about  the  nature  of  the  P  dangling  bond.  The  Si 
dimer  bond  is  strong  and  the  desorption  temperature  of  H 
from  the  dihydride  phase  is  lower  (about  300  °C)  than  that  of 
H  from  the  monohydride  phase  (around  520  °C).  Since  P  has 
a lone pair orbital, a P-rich surface terminated by H should
more closely resemble the Si dihydride than the Si monohydride.
Growth temperatures near 360 °C are therefore above
the temperature for H desorption. Hydrogen desorption may
proceed in three ways: (i) as H2; (ii) as ethane formed by
β-exchange within the ethyl group; and (iii) as t-butane
(C4H10) by saturation within the butyl group. However, in all
cases all surface dangling bonds are saturated and the surface
is passivated in a manner similar to that of As-terminated

V.  SUMMARY  AND  CONCLUSIONS 

Comparison of RDS, PRS, and LLS data for exposure of
GaP layers grown on vicinal (001) and (113) Si surfaces to
single pulses of TEG and TBP reveals that RDS, PRS, and
LLS deliver complementary information, a necessary prerequisite
for an unambiguous interpretation of growth mechanisms.
However, the observed chemistry is very complex.


Further  investigations  using  a  quadrupole  mass  spectrometer 
in  addition  to  the  optical  probes  and  an  optical  multichannel 
analyzer  (OMA)  detection  system  to  obtain  PRS  spectra  are 
in  progress. 

We have derived linearized expressions for the fine structure
in the PRS data, which show that a reaction layer including
all not-yet-incorporated surface species is present on
the surface of the growing GaP layer. This reaction layer is
optically absorbing for the wavelengths used here. The time
constants of the surface reaction processes preceding Ga and
P incorporation are of the order of 1 s under present conditions,
and appear to be substantially faster for TBP than for
TEG. At least two processes occur when the surface is exposed
to TEG. From LLS data we find evidence for Ga clustering,
which is consistent with the observation that the surface
reaction layer is optically absorbing. Unreacted species
that are not directly incorporated into the growing layer also
remain on the surface, possibly for tens of seconds. For this
system we find no evidence for atomic layer epitaxy, i.e., that
growth is self-limiting. Our results are not specific to GaP on
Si and are equally valid for GaP homoepitaxy. We
expect that the ternary system Ga$_x$In$_{1-x}$P will behave similarly.

The As/P exchange behavior at interfaces of As compound/P compound heterojunction superlattices
has been assessed by observing changes in average lattice spacing when very thin but different
thickness As compound layers are inserted into InP, or P compound layers inserted into GaAs.
Compositional changes are inferred from strain behavior measured by x-ray diffraction.
Implications of the data are: (1) As rapidly displaces P and P slowly displaces As; (2) As atoms
displace P atoms and reconstruct as a discrete single layer above 600 °C and as a double layer below
600 °C when InP surfaces are exposed to AsH3; and (3) interfaces and layers of different P
compounds on GaAs show the same temperature-dependent inclusion of As. © 1996 American
Vacuum Society.


I. INTRODUCTION

InGaAs/InP multiple quantum wells are often unintentionally
compressively strained relative to the InP substrate. This
strain is variously attributed to interaction of AsH3 with the
InP surface as InGaAs growth is started, to carryover of
excess As from InGaAs into InP, or to different bond lengths
at heterojunction interfaces.

The purpose of this study was to better understand the
formation of As/P heterojunction interfaces grown by metalorganic
vapor-phase epitaxy (MOVPE) and the implications
for growth of both InGaAs/InP and InGaP/GaAs materials
systems. Superlattices were formed by inserting very thin
strained layers (equivalent to a few angstroms) of a P compound
III-V semiconductor into GaAs, or alternatively an
As compound into InP. The strain of the superlattices was
analyzed from x-ray-diffraction rocking curves. Three implications
of As-P exchange have been drawn and are discussed.
These are that

(1)  As  rapidly  displaces  P  and  P  slowly  displaces  As, 

(2) As atoms displace P atoms and reconstruct as a discrete
single layer above 600 °C and as a double layer below
600 °C when InP surfaces are exposed to AsH3, and

(3) interfaces and layers of different P compounds on
GaAs show the same temperature-dependent (In)GaAs$_{1-x}$P$_x$
composition.

II.  EXPERIMENTAL  DETAILS 

The superlattices were grown in a 50-mm-diam horizontal
quartz chamber using a lamp-heated graphite susceptor. Use
of 20 Torr chamber pressure and 1500 sccm total gas flow
provided gas velocity of ~1 m/s to allow very rapid change
of source gases and controllable growth periods of less than
1 s. The growth parameters are summarized in Table I. The
thin strained P (or As) compound III-V semiconductors were
inserted into GaAs (or InP) at intervals of 180 Å for 30
repetitions.

The multilayer superlattice strain was analyzed with
x-ray-diffraction rocking curves using superlattice peak
structure about the Cu Kα1 (004) peak from (001)-oriented
substrates. Changes in the average lattice parameter are measured
from the strain in the superlattice.


III.  METHOD  OF  INTERPRETATION 

The x-ray diffraction from the (004) peak directly measures
the superlattice strain as

$$(\Delta a_\perp/a)_{SL}\,d_{SL} = (\Delta a_\perp/a)_{L}\,d_{L} + (\text{unknown}), \quad (1)$$

where $(\Delta a_\perp/a)_{SL}$ is the superlattice strain, $d_{SL}$ is the period
thickness, $(\Delta a_\perp/a)_{L}$ is the inserted layer strain, $d_L$ is the
layer thickness, and the unknown is the interface strains multiplied
by the interface thicknesses. Typically in these superlattices
there is strain in excess of what can be accounted
for by the inserted layer alone. By using measured and known
data we can extract the unknown excess strain contribution.
If Vegard's law is then used to relate lattice parameter to
alloy composition, inferences can be made about the As-P
intermixing at the heterojunctions.

The analysis on a sample series is done by looking at the
strain as a function of inserted-layer growth time such as
shown in Fig. 1. Two key parameters used in the interpretation
of the data are the excess strain extrapolated to zero time,
$\Delta\sigma(0)$, and the excess strain as a function of time, $\Delta\sigma(t)$.
$\Delta\sigma(0)$ is thought to represent two strain contributions, one
from the inserted-layer interface as the inserted-layer growth
is initiated, and one from the interface as the inserted-layer
growth is terminated. $\Delta\sigma(t)$ is thought in most cases to be
excess strain within the inserted layer.
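
The extrapolation described above can be sketched as a straight-line fit of the excess strain against inserted-layer growth time, with the intercept read as the zero-time excess strain and the slope as the time-dependent part. All numbers below are hypothetical placeholders chosen only to make the sketch runnable; they are not data from this study, and the helper name is an assumption.

```python
def linear_fit(xs, ys):
    """Ordinary least-squares straight line; returns (slope, intercept)."""
    n = len(xs)
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    sxx = sum((x - x_mean) ** 2 for x in xs)
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, y_mean - slope * x_mean

# Hypothetical measurements: growth time (s) and superlattice strain (%).
growth_times_s = [0.5, 1.0, 2.0, 4.0]
measured_strain_pct = [0.14, 0.18, 0.26, 0.42]

# Hypothetical strain per second expected from the inserted layer alone,
# i.e. layer strain times growth rate divided by the period thickness.
expected_pct_per_s = 0.07

# Excess strain = measured strain minus the contribution of the layer itself.
excess = [m - expected_pct_per_s * t
          for m, t in zip(measured_strain_pct, growth_times_s)]

excess_rate, delta_sigma_0 = linear_fit(growth_times_s, excess)
print(f"excess strain at t = 0: {delta_sigma_0:.3f} %")
print(f"time-dependent excess strain rate: {excess_rate:.3f} %/s")
```

A positive intercept in this picture would correspond to interface strain, while a nonzero slope would indicate excess strain accumulating within the inserted layer.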

Table I. Epilayer growth conditions.

Source    Partial pressure    Layer    Growth rate (Å/s)
PH3       1.3X10“^            InP      2.2
AsH3      8.0X10"’            InAs     2.0
TMIn      ~5 XIO"’            GaAs     2.2
TMGa      ~5 XIO"’            GaP      2.5

The signs of $\Delta\sigma(0)$ and $\Delta\sigma(t)$ are useful to give a quick
idea of the composition which could cause the excess strain.
In InP, only alloys with InAs can give a positive (compressive)
strain. Although the precise InAs$_{1-x}$P$_x$ composition
cannot be known, we assume for simplicity that the composition
is pure InAs and determine what the thickness would
be. Vegard's law ensures that the strain for the As is the same
whether it is an InAs layer or is distributed as an InAsP alloy.
Similarly, for GaAs, only alloys with GaP can give a negative
(tensile) strain.
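
The equivalence invoked here can be illustrated with a small sketch. It assumes commonly quoted room-temperature lattice constants for InAs and InP (values not given in the text) and compares the mismatch-thickness product of one monolayer of pure InAs with that of two monolayers of an alloy containing half as much As per layer. Function names are illustrative.

```python
# Assumed literature lattice constants in angstroms (not from the text).
A_INAS = 6.0583
A_INP = 5.8687

def vegard_lattice(x_p):
    """Vegard's law: linear interpolation for InAs(1-x)P(x)."""
    return (1.0 - x_p) * A_INAS + x_p * A_INP

def mismatch_times_thickness(x_p, n_monolayers):
    """Relaxed mismatch to InP multiplied by the thickness in monolayers."""
    mismatch = (vegard_lattice(x_p) - A_INP) / A_INP
    return mismatch * n_monolayers

pure_inas = mismatch_times_thickness(0.0, 1)     # 1 ML of InAs
diluted_alloy = mismatch_times_thickness(0.5, 2) # 2 ML of InAs0.5P0.5

print(pure_inas, diluted_alloy)
```

Because the lattice parameter is linear in composition, the same amount of As gives the same integrated strain however it is distributed, which is why a pure-InAs equivalent thickness is a convenient bookkeeping choice.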

Several  assumptions  must  be  made  for  this  strain  analysis, 
as  follows. 

(1) Growth rates for thin layers must be the same as has
been measured for thick layer structures. An accurate equivalent
thickness is needed to calculate the expected strain contribution
introduced by the inserted layer. Prior published
work has shown this to be a good assumption.

(2) Vegard's law relates lattice parameters to compositions
of very thin layers. This has also been shown to be a
good assumption, supported by correct correlation of strain
to thickness and mismatch for fractions of a monolayer
(ML).

(3) Excess strain at interfaces can only occur from exchange
of atoms on the group-V sublattice, i.e., only from
As-P. This again is thought to be valid because at the high
V/III ratios being used, essentially all available group-III
atoms are incorporated. Even if the group-III atoms were

Fig. 1. InP superlattices (30 periods of 180 Å) formed at 650 °C by (1) AsH3
exposure of InP, (2) InAs thin layer growth, and (3) GaAs thin layer growth.
The data demonstrate an excess InAs-like strain $\Delta\sigma(0)$ at $t=0$, and an
excess InAs-like time-dependent strain $\Delta\sigma(t)$.


Fig. 2. Strain in InP superlattices (30 periods of 180 Å) formed at 650 °C by
two steps: (1) 1.4 s exposure of InP to AsH3 to form an As-terminated
surface; (2) growth interrupt under (H2+PH3) or H2 only to desorb the
surface As.


intermixing across the heterojunction, there is no excess
strain because there are no excess group-III atoms incorporated.

(4) The strained regions are coherent with the substrate
lattice, and tetragonal distortion perpendicular to the interface
accommodates difference in the lattice spacing. The perpendicular
lattice parameter $a_\perp$ is related to the relaxed lattice
parameter $a$ by the elastic moduli as

$$\Delta a = \Delta a_\perp\,[C_{11}/(C_{11}+2C_{12})] \approx 0.5\,\Delta a_\perp. \quad (2)$$

A.  Implication  1:  As  rapidly  displaces  P,  and  P  slowly 
displaces  As 

InAs-like excess strain components, $\Delta\sigma(0)$ and $\Delta\sigma(t)$,
form when the InP surface is exposed to AsH3 as shown in
Fig. 1, and this occurs for both simple AsH3 exposure of the
InP surface and when layers of InGaAs, InAs, or GaAs are
grown. The As for $\Delta\sigma(0)$ is attributed to As rapidly displacing
P surface atoms when InP is exposed to AsH3, but P not
quickly displacing As surface atoms exposed to PH3.

Deliberate exposure of an As-terminated InP surface to
PH3 at 650 °C as shown in Fig. 2 results in rather slow displacement
of the As, confirming that As atoms are slowly
displaced by P. Removal of As atoms becomes very slow
when the strain is equivalent to a surface coverage less than
a single layer. The As-terminated surface is stable when exposed
to a growth interrupt in a hydrogen atmosphere for up to 25 s.

In Fig. 3 for GaAs exposed to PH3 a partial displacement
of As surface atoms by P to form $\Delta\sigma(0)$ occurs within less
than 1 s, but P incorporation to form $\Delta\sigma(t)$ is very slow.
Rapid displacement of P by As is demonstrated in this case
by establishing P surface atoms with exposure of the GaAs
surface to PH3 for 20 s at 650 °C, then exposing the P surface
to AsH3 for 1 s. The strain dropped to zero and x-ray satellite
peaks disappeared. The P atoms were desorbed under AsH3.
It is evident without the AsH3 interrupt that P atoms remain

Fig. 3. Temperature dependence of strain in superlattices (30 periods of 180
Å) formed by periodically exposing the MOVPE-grown GaAs growing surface
to PH3. $\Delta\sigma(0)$ is attributed to GaP formed by P adsorbed on the GaAs
surface. An additional excess time-dependent GaP-like strain component
$\Delta\sigma(t)$ is slowly formed. The (+) data point results when a 20 s exposure to
PH3 is followed by a 1 s exposure to AsH3.

when a GaAs overlayer is grown using AsH3 in combination
with trimethylgallium.

B.  Implication  2:  As  atoms  displace  P  atoms  and 
reconstruct  as  a  discrete  single  layer  above  600  °C 
and  as  a  double  layer  below  600  °C  when  InP 
surfaces are exposed to AsH3

The InAs-like strain $\Delta\sigma(0)$ from brief AsH3 exposure of
InP is shown in Fig. 4 to be equivalent to ~1 monolayer
InAs for growth temperatures above 600 °C and to be
equivalent to ~2 monolayers InAs for growth temperatures
below 600 °C. The implied As surface reconstruction on the
InP is ~1 layer of As atoms above 600 °C and ~2 layers of
As atoms below 600 °C, with virtually all the surface As
incorporated as InP growth resumes.


AsH3  Exposure  Time  (seconds) 


Fig. 4. Temperature dependence of strain in superlattices (30 periods of 180
Å) formed by periodic exposure of InP surface to AsH3.


GaP  Growth  Time  (seconds) 

Fig. 5. Temperature dependence of strain in superlattices (30 periods of 180
Å) formed by inserting thin GaP layers periodically into GaAs. The dotted
line is the calculated strain $\sigma(t)$ for GaP, and the dashed line is $\sigma(t)$ calculated
for GaAs$_{0.5}$P$_{0.5}$. Unintended incorporation of residual As results in
layers of GaAs$_{1-x}$P$_x$, in which $x$ increases with temperature.


C.  Implication  3:  Interfaces  and  layers  of  different  P 
compounds  on  GaAs  show  the  same 
temperature-dependent (In)GaAs$_{1-x}$P$_x$ composition

Temperature dependence of strain from exposure of the
GaAs surface to PH3 is shown in Fig. 3. The negative Δσ(0)
and Δσ(t) indicate that there is replacement of As atoms in
the GaAs with P atoms. Δσ(0) for 650 °C growth is equivalent
to that of either 0.5 ML of GaP or its equivalent of 1 ML
of GaAs0.5P0.5. As the growth temperature is reduced, Δσ(0)
decreases in magnitude, i.e., becomes more GaAs-like. From the
Δσ(t) behavior, further replacement of As by P is a slow process for
temperatures of 650 °C or less, but is much faster as the
temperature is raised to 675 °C. The fact that the 650 °C
excess strain for long PH3 exposures increases beyond that
equivalent to a single ML of GaP (strain of 1 ML GaP in 180
Å GaAs is −0.10%) suggests that the time-dependent P
substitution for As is not limited to the surface but penetrates
below the top-most layer.
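
The −0.10% figure quoted above for 1 ML of GaP in a 180 Å GaAs period can be approximated with a short estimate. The sketch below is not the authors' calculation, which is not given in this section. It assumes a pseudomorphic GaP layer with tetragonal distortion, and the lattice constants and GaP elastic constants are approximate literature values supplied here as assumptions. The function name `average_strain` is hypothetical.

```python
# Assumed (approximate) room-temperature literature values, not from the article.
A_GAAS = 5.6533   # GaAs lattice constant, angstrom
A_GAP = 5.4505    # GaP lattice constant, angstrom
C11_GAP = 140.5   # GaP elastic constant, GPa
C12_GAP = 62.0    # GaP elastic constant, GPa


def average_strain(n_monolayers, period_angstrom=180.0):
    """Period-averaged perpendicular strain of a thin GaP insert in GaAs.

    Assumes the GaP layer is coherently strained to GaAs in-plane, so its
    perpendicular lattice mismatch is the relaxed misfit times
    (1 + 2*C12/C11), and weights it by the fraction of the period it fills.
    """
    misfit = (A_GAP - A_GAAS) / A_GAAS
    perpendicular = misfit * (1.0 + 2.0 * C12_GAP / C11_GAP)
    monolayer_thickness = A_GAAS / 2.0  # one (100) monolayer, angstrom
    return perpendicular * n_monolayers * monolayer_thickness / period_angstrom


one_ml = average_strain(1.0)
half_ml = average_strain(0.5)
print(f"1 ML GaP:   {100 * one_ml:.3f} %")
print(f"0.5 ML GaP: {100 * half_ml:.3f} %")
```

Under these assumptions the 1 ML result lands close to the −0.10% quoted in the text, and the estimate scales linearly with coverage, which is why 0.5 ML of GaP and 1 ML of GaAs0.5P0.5 are treated as equivalent.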

In Fig. 5 is shown the strain for deliberately grown thin
GaP layers. Δσ(0)=0, at least for T=650 °C. The Δσ(t)
behavior does not correspond to that calculated for the GaP
thickness, but is consistent with alloys of GaAs1-xPx where
x varies with growth temperature. Considerable unintended
As is incorporated during growth of the initial monolayers of
GaP, and the amount of As increases at lower temperatures.
At 650 °C the layer composition is nominally GaAs0.5P0.5.
The existence of a large reservoir of residual As is evident
from the 650 °C growths, which ranged in layer thicknesses
up to 5 ML without significant As depletion. Whether the
residual As is from excess adsorbed on the GaAs surface or
from surrounding reactor deposits is unresolved.

The strain for thin InP layer superlattices in GaAs is
shown in Fig. 6. The InAsP-like Δσ(0) is seen to be temperature
dependent; however, the Δσ(t) over the growth temperature
range 550-650 °C has the same slope as calculated for InP
layer growth. There is no evidence of excess strain due to
residual As incorporating into the InP.

Fig. 6. Temperature dependence of strain in superlattices (30 periods of 180
Å) formed by inserting thin InP layers periodically into GaAs. The dotted
line is the calculated strain σ(t) for InP. Horizontal axis: InP growth time
(seconds).

The results for thin In0.5Ga0.5P layer superlattices in GaAs
are shown in Fig. 7. Rather than the expected Δσ(t) = 0 for
nominally lattice-matched In0.5Ga0.5P, a compressive strain
exists, suggestive of an excess In0.5Ga0.5As component. The
Δσ(0) values are of like magnitude to those of InP in Fig. 6. Δσ(t)
for In0.5Ga0.5P at 650 °C best fits a layer composition
of In0.5Ga0.5As0.5P0.5, showing the same x≈0.5 residual
As incorporation as seen for thin GaP layer growth at 650 °C
in Fig. 5. Again, the amount of residual As available for
incorporation is surprising. Layers as thick as 10 ML
continue to show the same composition without evidence of
depleting the available As.

Fig. 8. Temperature dependence of the group-V sublattice composition x
using the strain data: (a) from Δσ(0) of PH3-formed GaP layers in Fig. 3; (b)
from the σ(t) slope of GaP-grown layer in Fig. 5; (c) from Δσ(0) of InP
layers in Fig. 6; (d) from Δσ(0) of In0.5Ga0.5P layers in Fig. 7; (e) from the
σ(t) slope of In0.5Ga0.5P-grown layer in Fig. 7.

All of the above GaAs superlattices show evidence of
unintended excess As incorporating at the interface boundaries
and/or in the initial monolayers of P compound growth.
The concentration of As and its temperature dependence can
be analyzed, at least qualitatively, for each case as shown in
Fig. 8. For GaAs exposed to PH3 or with inserted layers of
InP or In0.5Ga0.5P, the As vs P composition can be estimated
from Δσ(0). With PH3 exposure it is assumed that Δσ(0) is
due to formation of a single GaAs1-xPx monolayer
sandwiched in GaAs. The layer composition of the embedded
mixed As/P layer is calculated from the strained lattice
parameter of this monolayer using Vegard's law. The InAsP-like
Δσ(0) of both InP and In0.5Ga0.5P consists of the
combined strains of the initial and terminal interfaces bridging
the P compound layer to the GaAs. The interface
compositions are not obvious, but the temperature dependence of
Δσ(0) shows that the As/P composition is similar to that for
PH3 exposure. To estimate the interface As/P composition it
was assumed that for an In0.5Ga0.5P layer the combined
interfaces are equivalent to 2 ML In0.25Ga0.75As1-xPx, where
each interface consists of a monolayer with bonds of
Ga-(As1-xPx)-(In0.5Ga0.5). This same interface composition
was also applied to the InP data to estimate x.
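
The Vegard's-law step described above can be sketched as a round trip between composition and strained lattice parameter. This is an illustrative sketch only: the lattice constants are approximate literature values, the elastic ratio 2·C12/C11 is taken as a single composition-independent constant, and all function names are hypothetical rather than taken from the article.

```python
# Assumed (approximate) literature lattice constants, in angstrom.
A_GAAS = 5.6533
A_GAP = 5.4505


def vegard_lattice(x):
    """Relaxed lattice constant of GaAs(1-x)P(x) by linear interpolation."""
    return (1.0 - x) * A_GAAS + x * A_GAP


def perpendicular_lattice(x, elastic_ratio=0.88):
    """Growth-direction lattice parameter of a coherent layer on GaAs.

    elastic_ratio stands for 2*C12/C11 and is an assumed constant here.
    """
    misfit = (vegard_lattice(x) - A_GAAS) / A_GAAS
    return A_GAAS * (1.0 + misfit * (1.0 + elastic_ratio))


def composition_from_perpendicular(a_perp, elastic_ratio=0.88):
    """Invert the two steps: strained parameter -> relaxed parameter -> x."""
    misfit = (a_perp / A_GAAS - 1.0) / (1.0 + elastic_ratio)
    a_relaxed = A_GAAS * (1.0 + misfit)
    return (A_GAAS - a_relaxed) / (A_GAAS - A_GAP)


a_perp = perpendicular_lattice(0.5)
print(f"a_perp for x = 0.5: {a_perp:.4f} angstrom")
print(f"recovered x: {composition_from_perpendicular(a_perp):.3f}")
```

Because both steps are linear in x under these assumptions, the strain of a single mixed monolayer is proportional to its P fraction, which is consistent with the text treating 0.5 ML of GaP and 1 ML of GaAs0.5P0.5 as equivalent.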

The concentrations of residual As in the GaP and
In0.5Ga0.5P layer growth were determined from Δσ(t) using
the growth rates and compositions of GaAs1-xPx and
In0.5Ga0.5As1-xPx, respectively. The As concentrations
determined from Δσ(0) and Δσ(t) for the above cases are shown
in Fig. 8 to be remarkably similar with virtually identical
temperature dependence. The mechanism for apportioning
the As/P seems to be in common, suggesting that arsenic and
phosphorus concentrations on the growing surface are in
equilibrium.


Fig. 7. Temperature dependence of strain in superlattices (30 periods of 180
Å) formed by inserting thin In0.5Ga0.5P layers periodically into GaAs. There
should be no σ(t) for the nominally lattice-matched layer. Unintended
incorporation of residual As results in layers of In0.5Ga0.5As1-xPx.
Horizontal axis: InGaP growth time (seconds).

IV. SUMMARY

The measured strain in multilayer superlattices formed
both by inserting thin As compound layers periodically into
MOVPE-grown InP and by inserting thin P compound layers
periodically into GaAs has been used to derive information
about As-P intermixing at the heterojunction interfaces.
Implications of the strain behavior are as follows.

(1)  As  rapidly  displaces  P,  and  P  slowly  displaces  As. 
This  results  in  the  dominance  of  As  replacing  P  at  the  het¬ 
erojunctions. 

(2) As atoms displace P atoms and reconstruct as a
discrete single layer above 600 °C and as a double layer below
600 °C when InP surfaces are exposed to AsH3. More
surface-adsorbed As is available to carry over into InP
overgrowth at the lower temperatures.

(3) Interfaces and layers of different P compounds on
GaAs show the same temperature-dependent (In)GaAs1-xPx
composition. The As concentration in the solid appears to be
in equilibrium with residual As that is present following
GaAs growth and P from PH3.

We discuss the structure and formation mechanism of ZnSe/GaAs(100) interfaces as probed by
reflectance difference spectroscopy (RDS). First we describe a simple analytic procedure that
separates the surface and interface contributions in the RD spectra obtained for a heterostructure.
The procedure opens up new possibilities for RDS as an in situ interface probe. We have applied
this technique to characterize the ZnSe/GaAs interfaces prepared on various GaAs(100) surfaces.
The interface-anisotropy spectra thus obtained, together with cross-sectional transmission electron
microscopy, show clearly that the chemical composition of the interfacial layer can be changed by
controlling the reconstruction and the pregrowth Zn and Se treatments of the initial GaAs surface.
© 1996 American Vacuum Society.


I.  INTRODUCTION 

It  is  now  widely  recognized  that  in  situ  diagnostics  is  one 
of  the  key  technologies  for  advanced  semiconductor  process¬ 
ing.  While  there  are  a  number  of  surface  probes  that  can  be 
used  in  processing  environments,  there  are  few  spec¬ 
troscopies  or  microscopies  that  are  practically  applicable  for 
in  situ  interface  diagnostics.  As  such,  a  versatile  in  situ  in¬ 
terface  probe  is  strongly  desired  to  facilitate  our  understand¬ 
ing  and  control  of  the  structure  and  formation  process  of  the 
interfaces.  Reflectance-difference  spectroscopy  (RDS), 
which  measures  in-plane  reflectance  anisotropy  and  requires 
only  a  compact  optical  system,  has  been  increasingly  used  as 
a  surface  probe.  Although  RDS  is  also  a  potential  optical 
probe  for  interface  studies,  its  application  in  this  regard  has 
been limited to the simple cases where the surface contribution
was negligible.

In this article we first describe an analytic procedure that
allows surface-induced and interface-induced anisotropies to
be determined simultaneously when the heterostructure of
interest shows both anisotropies. We have applied the in situ
RDS technique to the ZnSe/GaAs heterostructure. Combined
with ex situ cross-sectional transmission electron microscopy
(TEM), it was used to study the structure of the ZnSe/GaAs(100)
interfaces prepared on various GaAs surfaces. The results
show clearly that RDS is indeed an excellent tool for
interface diagnostics, and illustrate how the ZnSe/GaAs interface
can be controlled at the atomic level by controlling the
reconstruction and pregrowth treatment of the initial GaAs
surface.

The fundamental interest in ZnSe/GaAs lies in its
heterovalent nature, i.e., II-VI on III-V. Since the lattice
mismatch of this system is as small as 0.27%, heterovalency
is expected to play a dominant role in the process of interface
formation. Specifically, rearrangements and/or reconstructions
possibly take place at this interface to satisfy the
electron-counting requirements. On the other hand, control
of the ZnSe/GaAs interface is also technologically important.
We have found that the microscopic structure and formation
chemistry of this interface are strongly related to the formation
of the various grown-in defects. Reduction of such defects is
strongly desired to improve the lifetime of ZnSe-based
light-emitting devices. It was also reported that the valence-band
offset depends on the interface structure.

In  the  following,  we  first  describe  the  mathematical  pro¬ 
cedure  we  have  developed  to  analyze  the  RDS  data  from  the 
heterostructures.  After  discussing  the  effects  of  the  Zn  and  Se 
treatments  on  variously  reconstructed  GaAs  surfaces,  we 
show  the  interface-induced  anisotropy  spectra  and  the  cross- 
sectional  TEM  images.  On  the  basis  of  these  results,  we  dis¬ 
cuss  the  structure  and  formation  process  of  the  ZnSe/GaAs 
interface,  with  an  emphasis  on  the  effects  of  the  pregrowth 
treatments. 

II.  EXPERIMENT 

Our homemade RDS system is similar to the one reported
by Aspnes et al., except for the use of a calcite Glan-Taylor
prism as the incidence polarizer. In RDS, one measures
the difference between the normal-incidence reflectances
for two orthogonal polarization directions in the
surface plane. The results are displayed in terms of
$\Delta\tilde{r}/\tilde{r} = \Delta r/r + i\Delta\theta$, where
$\tilde{r} = r\exp(i\theta)$ is the complex reflectance.
We define $\Delta\tilde{r}$ as
$\Delta\tilde{r} = \tilde{r}_{\bar{1}10} - \tilde{r}_{110}$, where the suffixes
denote the incident polarization vectors. By minimizing the
parasitic anisotropies in the optical system, we were able to
collect reliable Δθ data, which is a prerequisite for obtaining
accurate interface-induced anisotropies by the procedure
described below. In this article we will show only the Δr/r
spectra. The spectra cover an energy range from 1.5 to 5.8
eV, with a resolution better than 1 nm.
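
The statement that the complex RD signal splits into an amplitude part Δr/r and a phase part Δθ follows from differentiating r·exp(iθ), and it can be checked numerically. The sketch below is an illustration only: normalizing by the mean of the two reflectances is an assumption made here, and the function name `rd_signal` is hypothetical.

```python
import cmath


def rd_signal(r_a, r_b):
    """Complex reflectance difference of two polarizations.

    Returns (real part, imaginary part) of (r_a - r_b) / mean(r_a, r_b).
    To first order these are delta_r/r and delta_theta, respectively.
    Normalizing by the mean is an assumption of this sketch.
    """
    mean = 0.5 * (r_a + r_b)
    ratio = (r_a - r_b) / mean
    return ratio.real, ratio.imag


# Build two reflectances that differ slightly in amplitude and in phase.
r, theta = 0.6, 0.3          # arbitrary amplitude and phase (radians)
dr_over_r, dtheta = 2e-3, 1e-3
r_a = r * (1 + dr_over_r / 2) * cmath.exp(1j * (theta + dtheta / 2))
r_b = r * (1 - dr_over_r / 2) * cmath.exp(1j * (theta - dtheta / 2))

amplitude_part, phase_part = rd_signal(r_a, r_b)
print(amplitude_part, phase_part)
```

For anisotropies of the order of 10^-3, the two parts agree with the imposed Δr/r and Δθ well beyond any realistic measurement precision, which is why both parts need to be reliable when the complex signal is used in the analysis.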

The heteroepitaxial growth of ZnSe on GaAs(100) was
carried out in a dual-chamber molecular beam epitaxy
(MBE) system. First, an undoped homoepitaxial GaAs layer
was grown in the III-V chamber to prepare an atomically
flat surface. The high quality of the surface was confirmed in
a separate experiment using a scanning tunneling microscope
connected to the MBE system. The sample was then
transferred in vacuo to the II-VI MBE chamber for the ZnSe
growth and for in situ characterization by RDS and reflection
high-energy electron diffraction (RHEED). We were able to
control the surface reconstruction (in other words, the As
coverage) in two ways: by controlling the As flux (or background
pressure) and the sample temperature during the
sample transfer out of the III-V MBE chamber, and by a
proper anneal prior to the growth in the II-VI chamber.
During the ramp-up of the sample temperature and the pregrowth
treatment, great care was taken to avoid uncontrolled
reaction of the GaAs surface with the background Se in the II-VI
chamber.

In the present study, the initial GaAs surfaces were
prepared by the following procedures: (1) A Zn exposure on an
As-terminated c(4×4) GaAs surface, which is intended to
form a Zn-As layer at the interface. (2) A Zn exposure on an
As-terminated (2×4) GaAs surface, which is known to be
the optimal treatment for reducing the defect density in the
final ZnSe/GaAs structure. (3) A Se exposure on a
Ga-terminated (4×6) surface followed by 798 K annealing for
approximately 2 min. This processing forms a well-ordered
Se-terminated (2×1) surface.

These conditions will be called Zn/c(4×4), Zn/(2×4),
and Se/(4×6), respectively. In each case, the exposure was
continued until the RD signal at 2.3 eV (for the Zn exposure)
or 3.0 eV (for the Se exposure) was saturated, the total
exposure time being typically 60 s. For the surface treatment
and  the  ZnSe  growth,  the  beam-equivalent  pressures  of  Zn 
and Se were (6.7±1.0)×10^-5 Pa and (1.1±0.1)×10^-4 Pa,
respectively.  The  sample  temperature  was  533  K  throughout 
the  Zn  or  Se  treatment  and  the  ZnSe  growth.  During  the 
growth,  the  incident  light  was  blocked  to  avoid  any  possible 
photoassisted  effects. 


III.  OPTICAL  MODEL 

In a heterostructure, there are a number of possible origins
of optical anisotropies, such as anisotropic structures
(reconstructions and steps) of the surface and buried interfaces,
bulk anisotropies induced by the linear electro-optic effect
(LEOE), anisotropic relaxation of the mismatched lattice,
development of anisotropic morphology, and so on. Thus,
we need to use a proper model to sort out the origins of the
observed anisotropies.

In the following, we describe a simple data-analysis
procedure using a three-phase model. As shown later, the RD
spectra from our ZnSe/GaAs samples are significantly
modulated by the interference between the surface and interface
reflections. The procedure we describe below allows us
to eliminate the interference artifacts from the data, and to
identify which process gives rise to the observed anisotropy:
the reflection at the surface, the reflection at the interface, or the
propagation through the overlayer. While a better simulation
of the observed spectra may be obtained using an optical model
with more constituent layers, the primary advantage of the
following approach is its simplicity.

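In the three-phase model that consists of a substrate, an
overlayer, and an ambient, the overall reflectance $\tilde{r}$ is given by

$$\tilde{r} = \frac{r_{oa} + Z r_{so}}{1 + Z r_{so} r_{oa}}, \tag{1}$$

where $Z = \exp(4\pi i n_o L/\lambda)$, $n_o$ is the complex refractive
index of the overlayer, $L$ is the overlayer thickness, $\lambda$ is the
wavelength of light, and $r_{oa}$ and $r_{so}$ are the complex reflectances
at the overlayer-ambient and substrate-overlayer boundaries,
respectively. A total derivative, $d\tilde{r}$, can be expressed using
the partial differentials of $\tilde{r}$ with respect to the three
variables that are potentially anisotropic: $r_{oa}$, $r_{so}$, and $n_o$. By
performing the partial-differential calculations, we obtain the
expression for $\Delta\tilde{r}/\tilde{r}$:

$$\frac{\Delta\tilde{r}}{\tilde{r}} = \frac{1}{A}\left(\frac{\Delta r_{oa}}{r_{oa}}\,B + \frac{\Delta r_{so}}{r_{so}}\,C + \Delta n_o\,C\,\frac{4\pi i L}{\lambda}\right), \tag{2}$$
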

with 


$$A = (1 + Z r_{so} r_{oa})(Z r_{so} + r_{oa}), \tag{3a}$$

$$B = r_{oa}(1 - Z r_{so})(1 + Z r_{so}), \tag{3b}$$

$$C = Z r_{so}(1 - r_{oa})(1 + r_{oa}). \tag{3c}$$

In the following, we call $\Delta r_{oa}/r_{oa}$ and $\Delta r_{so}/r_{so}$ the surface
and the interface anisotropies, respectively. If we neglect the
$\Delta n_o$ term in Eq. (2), which is an adequate assumption in the
ZnSe/GaAs case with L<100 nm, the surface and interface
anisotropies can be determined using two RD measurements
at different thicknesses, $L_1$ and $L_2$: Eq. (2) is then linear in the
two unknown anisotropies, so each measurement supplies one equation.
Alternatively, if the surface
anisotropy is known from separate experiments on homoepitaxial
samples or thick-film samples, we can estimate the
interface contribution by subtracting the surface contribution
from the measured RD signal using Eq. (2). We emphasize
that this procedure involves no parameter fitting, making it
suitable for real-time diagnostics of the surface and
interface.
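
The two-thickness separation described above amounts to solving a 2×2 complex linear system at each photon energy. The following is a minimal sketch under stated assumptions: it uses the standard three-phase reflectance $\tilde{r} = (r_{oa} + Z r_{so})/(1 + Z r_{so} r_{oa})$ with $Z = \exp(4\pi i n_o L/\lambda)$ and the coefficients $A = (1 + Z r_{so} r_{oa})(Z r_{so} + r_{oa})$, $B = r_{oa}(1 - Z r_{so})(1 + Z r_{so})$, $C = Z r_{so}(1 - r_{oa})(1 + r_{oa})$, drops the overlayer-index term, and takes the boundary reflectances as given inputs. The function names and the numerical values are hypothetical, not data from the article.

```python
import numpy as np


def phase_factor(n_o, thickness, wavelength):
    """Z = exp(4*pi*i*n_o*L/lambda); L and lambda in the same length unit."""
    return np.exp(4j * np.pi * n_o * thickness / wavelength)


def total_reflectance(r_oa, r_so, z):
    """Overall reflectance of the ambient/overlayer/substrate stack."""
    return (r_oa + z * r_so) / (1 + z * r_so * r_oa)


def coefficients(r_oa, r_so, z):
    """Coefficients A, B, C of the linearized anisotropy expression."""
    a = (1 + z * r_so * r_oa) * (z * r_so + r_oa)
    b = r_oa * (1 - z * r_so) * (1 + z * r_so)
    c = z * r_so * (1 - r_oa) * (1 + r_oa)
    return a, b, c


def separate(rd_1, rd_2, r_oa, r_so, n_o, l_1, l_2, wavelength):
    """Solve for (surface, interface) anisotropy from two RD measurements.

    rd_1 and rd_2 are the complex RD signals measured at thicknesses
    l_1 and l_2; the overlayer-index anisotropy term is neglected.
    """
    rows = []
    for thickness in (l_1, l_2):
        a, b, c = coefficients(r_oa, r_so, phase_factor(n_o, thickness, wavelength))
        rows.append([b / a, c / a])
    matrix = np.array(rows, dtype=complex)
    surface, interface = np.linalg.solve(matrix, np.array([rd_1, rd_2], dtype=complex))
    return surface, interface


# Synthetic round trip with made-up optical constants (illustration only).
r_oa, r_so, n_o = -0.45 + 0.02j, -0.20 + 0.05j, 2.8 + 0.1j
wavelength, l_1, l_2 = 500.0, 12.0, 21.0   # nm
true_surface, true_interface = 1e-3 + 2e-3j, -3e-3 + 1e-3j
signals = []
for thickness in (l_1, l_2):
    a, b, c = coefficients(r_oa, r_so, phase_factor(n_o, thickness, wavelength))
    signals.append((b * true_surface + c * true_interface) / a)
surface, interface = separate(signals[0], signals[1], r_oa, r_so, n_o, l_1, l_2, wavelength)
print(surface, interface)
```

Since only a linear solve is involved, the separation needs no fitting parameters. In practice it would be repeated at every photon energy using tabulated dielectric functions, and it becomes ill-conditioned when the two thicknesses give nearly the same phase factor.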


IV.  RESULTS  AND  DISCUSSION 
A.  Pregrowth  surface  treatments 

Figure 1 compares the RD spectra of the GaAs surfaces
prepared by the three different procedures identified in
Sec. II. All the beam exposures and the measurements were
carried out at 533 K. The dotted curves in Figs. 1(a), 1(b),
and 1(c) were obtained for the GaAs surfaces reconstructed
into c(4×4), (2×4), and (4×6), respectively. Each of these
curves is in good agreement with the corresponding
spectrum reported by Kamiya et al.

When the c(4×4) surface, which has double-layer As, was
exposed to the Zn beam for 60 s, the spectrum showed a
significant change below 4 eV, indicating that the Zn atoms
reacted with the surface As. The negative peak at 2.8 eV for
the c(4×4) surface has been attributed to the top-layer As
dimers. Following this peak assignment, the reduction of
this peak upon the Zn treatment suggests disappearance of
the top-layer As dimers. This interpretation is consistent with
the RHEED result that the Zn exposure transformed the
surface reconstruction from c(4×4) to (1×4).

Fig. 1. RD spectra of the GaAs surfaces exposed to the Zn or Se beam.
(a) Zn treatment on a c(4×4) surface; (b) Zn treatment on a (2×4) surface;
(c) Se treatment on a (4×6) surface, and the effect of annealing at 798 K.

In contrast to the case of c(4×4), the (2×4) surface,
which is also terminated with As but does not have the
double-layer structure, shows a relatively small change upon
the Zn treatment, as shown in Fig. 1(b). The RHEED pattern
remained (2×4) after the treatment. These results indicate
that Zn did not interact with the majority of the surface As
dimers.

In the run of Fig. 1(c), the broken curve was measured
immediately after the Ga-terminated (4×6) surface was
exposed to the Se beam. The sample was then annealed at 798
K, and the solid curve was measured at 533 K. The RHEED
pattern of the as-treated surface showed a (2×1) reconstruction
with weak half-order streaks, and the subsequent anneal
at 798 K transformed the surface to a well-ordered (2×1)
structure, as reported in the literature. The RD spectrum
showed a dramatic change upon the Se exposure, while the
annealing at 798 K induced only minor changes, specifically
sharpening of the spectral features. Our STM observation on
a similarly prepared sample showed that the surface after the
annealing is atomically flat with well-ordered dimer rows.

Fig. 2. (a) Evolution of the RD spectra of the ZnSe/GaAs heterostructure
with the ZnSe thickness. The data taken under the Se beam are shown. The
initial GaAs surface was the Zn/(2×4) surface, see Fig. 1(b). (b) The surface
(top) and interface (bottom) contributions in the spectra shown in (a).

B.  Separation  of  surface  and  interface  contributions 

Figure 2(a) shows an example of the apparent evolution of
the RD spectra with the ZnSe thickness. For this experiment,
the initial GaAs surface was the one characterized in Fig.
1(b), namely, the Zn/(2×4) surface. The growth was
interrupted at the thicknesses noted in the figure legend, and at
each thickness two spectra were measured, first under the Se
beam and then under the Zn beam. The surface
reconstructions under the Se and the Zn beams were, respectively,
(2×1) and c(2×2), as commonly observed for MBE-grown
ZnSe. In Fig. 2, we only show the spectra measured under
the Se beam. The comparison between the spectra under the
Se and Zn beams was reported in our previous article. The
roughness-induced signal, which usually appears as a broad
baseline, was negligible in all the experiments discussed
in this article.

As clearly seen in Fig. 2(a), the apparent RD spectra show
prominent changes with the ZnSe thickness. This is due to
the interference between the surface and interface reflections.
By applying the analytic procedure described in Sec. III, the
interface and surface contributions were separated, and the
results are displayed in Fig. 2(b). In this figure, we obtained
each spectrum from the corresponding two source spectra in
Fig. 2(a), as noted in the legend. The spectral lineshapes are
essentially independent of epilayer thickness, which confirms
that the interference artifacts have been successfully
eliminated. The dielectric functions used in this calculation were
the 500 K data for GaAs and the 473 K data for ZnSe.
Because the 473 K ZnSe data terminate at 5.1 eV, the
calculated spectra also terminate at the same energy. The
exceptions are the surface-anisotropy spectra for (12, 21 nm) and
(21, 26 nm), where the source spectra had a negligible
interface contribution above 4.3 eV due to strong absorption in
the ZnSe epilayer. The surface-anisotropy spectra [upper half
of Fig. 2(b)] could be simulated fairly well by energy scaling of
the ZnSe dielectric-function spectrum, E Im ε(E),
suggesting that many-body screening is the dominant mechanism for
the anisotropy of the Se-terminated (2×1) ZnSe
surface.

There are two observations that support the interface
origin of the spectra shown in the lower half of Fig. 2(b). First,
switching the surface termination from Se to Zn hardly
changes the spectral lineshape. Second, the spectral
lineshape changes as the structure and chemical composition of
the initial GaAs surface are changed, as shown later in this
article.

Before  discussing  the  interface  structure  in  detail,  we  ad¬ 
dress  the  characteristic  notch  observed  near  2.6  eV  in  the  36 
nm  spectrum  in  Fig.  2(a).  A  similar  feature  was  reported  by 
Yang et al. and was interpreted as the crossover transition
from  the  valence  band  of  ZnSe  to  the  quantum-well  state  at 
the  interface.  We  have  found  that,  for  all  the  samples  pre¬ 
pared  under  different  conditions,  this  feature  becomes  notice¬ 
able  only  when  the  ZnSe  thickness  exceeds  20  nm.  This 
observation  can  be  well  understood  if  this  feature  is  related 
to  the  quantum-well  state  at  the  interface.  For  the  quantum 
well  to  be  formed  at  the  interface,  the  band  bending  needs  to 
be  large  enough,  which  requires  the  ZnSe  thickness  to  be 
more  than  a  certain  value. 

C.  ZnSe/GaAs  interfaces 

In Fig. 3(a), we compare the RD spectra of the ZnSe/GaAs
interfaces formed on the three different GaAs surfaces
characterized in Fig. 1. The source data from which these
spectra were calculated were measured under the Se beam
and processed in the same way as in Fig. 2. The ZnSe thicknesses for
these data were in the range from 7 to 18 nm.

The broad band centered at 2.9 eV changes its peak
intensity with different surface preparations, the Se/(4×6) sample
showing the highest peak. The sharp feature above 4.6 eV is
tentatively interpreted as due to the bulk anisotropy induced
by the linear electro-optic effect at the E1 critical point of
ZnSe. This bulk-induced feature can be separated from the
interface-induced signals by modifying both the optical
model and the dielectric functions of the constituent
materials to take the linear electro-optic effect into account. In the
following, we focus the discussion on the interface-induced
2.9 eV band.

Fig. 3. (a) RD spectra of the ZnSe/GaAs interfaces prepared on the three
different GaAs surfaces (see Fig. 1 for the RD spectra of these surfaces).
(b) Imaginary part of Δ(εd). The curves were obtained by converting the
spectra in (a). Also shown is the surface anisotropy of the Se/(4×6) sample
obtained by converting the solid curve in Fig. 1(c).

The differences between the three interfaces are more
clearly illustrated in Fig. 3(b), where the RD spectra in Fig.
3(a) were converted into anisotropy spectra of the dielectric
response of the thin interface layer, Δ(εd). Shown in Fig.
3(b) is the imaginary part of Δ(εd). This conversion was
done by applying the expression for the surface-induced
optical anisotropy, replacing the dielectric constant of the
ambient with the dielectric function of the overlayer. In brief,
the model assumes an optically anisotropic thin layer at the
boundary between two isotropic media, whose thickness d
satisfies d ≪ λ. Δ(εd) of this thin layer is defined, in the
present case, as $\Delta(\epsilon d) = (\epsilon_{\bar{1}10} - \epsilon_{110})\,d$,
where $\epsilon_{\bar{1}10}$ and $\epsilon_{110}$
are the complex dielectric functions for polarization vectors
parallel to the $[\bar{1}10]$ and $[110]$ directions, respectively.

In Fig. 3(b), the Se/(4×6) sample shows a remarkably
sharp peak at 2.8 eV. Also shown in this figure is the
Im Δ(εd) spectrum of the Se/(4×6) surface prior to the
growth, namely, the conversion of the solid curve in Fig.
1(c). The agreement between the two curves below 3 eV is
striking. This observation indicates that the anisotropic
surface layer formed by the Se treatment is so stable and inert
that it maintained its structure even after the ZnSe epilayer
was grown on it.

In Fig. 4(c), a cross-sectional TEM image of this interface is
shown. We can clearly observe an interfacial layer with
zincblende structure. A high-resolution TEM observation
showed that the thickness of this layer is about 4 monolayers. Li
et al. reported similar TEM results, and proposed that this
layer is a Ga-Se compound with zincblende structure. A
known example of such a compound is Ga2Se3. Our
scattering-factor analysis on 002 and 004 dark-field images
also showed that this layer is a vacancy-containing form of
gallium selenide, although unambiguous determination of its
chemical stoichiometry has not been successful so far.

We note that although bulk Ga2Se3 is known to show
optical anisotropy, the Δ(εd) spectra in Fig. 3 do not exhibit any
features at the critical-point energies of Ga2Se3. For
example, the E0 and E1 transitions of zincblende Ga2Se3 are at 2.1
and 3.9 eV, respectively. We point out that the peak
position in Fig. 3(b) is instead close to the E1 critical point of
GaAs at the growth temperature.

The Zn/c(4×4) spectrum shows nearly zero anisotropy
below 2.8 eV, which indicates that the formation of the
Ga-Se layer was suppressed in this case. The cross-sectional
TEM image of this interface, shown in Fig. 4(a), reveals an
interfacial layer with a contrast opposite to that in Fig. 4(c). A
high-resolution observation showed that this layer is 2-4
monolayers thick. Our analysis of the 002 and 004 dark-field
images suggested that this layer contains Zn and As, which is
consistent with the RDS result. The suppression of the
Ga-Se formation and formation of the Zn-As layer can be
reasonably understood if we recall that the c(4×4) surface
has a double layer of As, which presumably acts as a barrier
for the Ga-Se reaction, and that the Zn atoms react with As
on the c(4×4) surface, as shown in Fig. 1(a).

The Δ(εd) spectrum of the Zn/(2×4) sample falls
between the Se/(4×6) and Zn/c(4×4) spectra. This suggests
that the Ga-Se bonds account for, if at all, only a part of the
interface bonds. On the other hand, in the TEM picture
shown in Fig. 4(b) we observe an interfacial layer of the
same contrast as in Fig. 4(a), but thinner. Thus,
similarly to the case of Fig. 4(a), the interfacial layer seen in
Fig. 4(b) is suggested to contain Zn and As. While the
RDS and TEM data suggest a rather complex interface
structure for the Zn/(2×4) treatment, a quantitative discussion of
the interface structure is difficult at this stage. It is now
widely recognized that the Zn/(2×4) treatment is the most
effective in reducing the density of the stacking faults
originating at the interface. Therefore, the interface on the
Zn/(2×4) surface is of technological interest. We are
continuing the effort to understand the structure of this interface
in more detail, for example by analyzing the chemical
stoichiometry of this interface by x-ray photoemission
spectroscopy.

Fig. 4. 002 dark-field TEM images of the ZnSe/GaAs cross sections. The
samples were prepared on the three different surfaces: (a) Zn/c(4×4);
(b) Zn/(2×4); (c) Se/(4×6). See Fig. 1 for the RD spectra of the initial
GaAs surfaces, and Fig. 3 for the corresponding interface-anisotropy
spectra.

V.  CONCLUSIONS 

We  have  described  the  analytic  procedure  that  separates 
the  surface  and  interface  contributions  in  the  RD  signal  from 
a heterostructure. With this procedure, RDS is now a unique
tool  that  can  characterize  buried  interfaces  in  situ  during  pro¬ 
cessing.  Taking  advantage  of  the  in  situ  characterization  by 
RDS  and  RHEED,  we  discussed  the  structure  and  formation 
mechanisms  of  ZnSe/GaAs(100)  interfaces  on  three  different 
GaAs  surfaces.  The  results  are  summarized  below. 

(1)  When  the  (4X6)  GaAs  surface  was  exposed  to  Se,  a 
stable Se-terminated GaAs surface layer was formed.
Annealing of the Se-treated GaAs surface induced a
well-ordered (2X1) reconstruction. The surface layer thus
prepared remained intact, as far as the optical anisotropy is
concerned, even after the ZnSe layer was grown on it. The
interfacial layer, which is about 4 monolayers (ML) thick, is composed of
Ga  and  Se,  and  also  contains  vacancies. 

(2)  When  the  c(4X4)  GaAs  surface  was  exposed  to  the  Zn 
beam,  both  RDS  and  RHEED  indicated  that  the  reaction  of 
Zn  with  the  GaAs  surface  induced  disappearance  of  the  top- 
layer  As  dimers.  The  RDS  showed  that  when  the  ZnSe  was 
grown  on  this  surface  the  formation  of  the  Ga-Se  layer  was 
suppressed,  being  consistent  with  the  TEM  results  suggesting 
that  the  interfacial  layer  contains  Zn  and  As. 
(3) When the (2X4) GaAs surface was exposed to Zn,
both RDS and RHEED showed that Zn did not react with
the majority of the surface As. Similarly to case (2) above,
the TEM results suggested that the interface layer formed on
this surface contains Zn and As. On the other hand, the
interface RD spectrum showed a sign of Ga-Se bond
formation. These TEM and RDS results can be consistent with
each other if the Ga-Se bonds cover only part of
this interface.
Reflectance anisotropy spectroscopy (RAS) in combination with reflection high-energy electron
diffraction (RHEED) was used to study in situ the initial steps of molecular beam epitaxial growth
of InAs on GaAs(001). Due to the large lattice mismatch, InAs is known to grow in the
Stranski-Krastanov mode, leading to the formation of quantum dots after the transition from the two- to
the three-dimensional growth mode. In this article the precise determination of the growth mode
transition and the subsequent development of the islands have been of particular interest. During the
growth of the two-dimensional InAs layer, the RHEED pattern changed from the c(4X4) of the
clean GaAs to a (1X3) surface reconstruction. Accordingly, the RAS spectra, taken every 0.2 ML,
indicate changes of the As-dimer configuration. At 1.8 ML (spotty RHEED pattern) a saturation of
the intensity of the dimer-related RAS signal around 2.6 eV was found. The relaxation of the InAs
layer and the formation of the quantum dots were followed by time-resolved RAS at 2.6 and 4 eV.
It is shown here that the time constant of this process, the thickness of the InAs wetting layer, and
the equilibrium morphology of the islands are strongly temperature dependent. The remaining
equilibrium InAs wetting layer thickness at the surface was estimated to be about 1 ML (0.8 ML at
625 K and 1.2 ML at 725 K). © 1996 American Vacuum Society.


I.  INTRODUCTION 

The preparation of InAs islands, by relaxation of the first
few monolayers (MLs) grown on GaAs substrates, shows
great promise for the growth of high-quality quantum
dots in optoelectronic devices. Therefore, there is presently a
great  interest  in  understanding  the  mechanism  of  their  forma¬ 
tion  as  part  of  an  attempt  to  find  ways  to  improve  their  uni¬ 
formity,  the  main  stumbling  block  to  their  application  in 
practical  devices.^ 

Targeting a one-step deposition of quantum dots without
any pre-patterning, many investigations have been performed
in nearly all highly lattice-mismatched semiconductor
systems such as InAs/GaAs^ or InP/GaInP.^ The successful
deposition of coherent InAs islands embedded in AlGaAs on
high-index substrates, especially on the (311B)-GaAs
surface,^ proves the high potential of this growth technique.
However, the reproducibility of the dot formation, which is
essential for the growth of dot stacks, still causes many
problems. A search for greater control of the whole process,
including the growth mode transition and the island
development, seems to be necessary for establishing this process as a
reliable technique. The first step toward control is to monitor
the growth mode transition in real time. Reflection high-energy
electron diffraction (RHEED) is capable of monitoring
this through the change from streaky diffraction patterns for
two-dimensional growth to spotty ones for three-dimensional growth.^
However, the subsequent development of the
three-dimensional islands cannot be studied by this technique. In
addition, RHEED is only applicable in ultrahigh vacuum
systems such as molecular beam epitaxy (MBE) and metal
organic molecular beam epitaxy (MOMBE), but not in gas
phase environments as used in metal organic vapor phase
epitaxy (MOVPE). Reflectance anisotropy spectroscopy
(RAS), which is not subject to this restriction, has already
shown its capability for in situ
analysis. The growth rate and the composition of ternary
compounds,  for  example,  were  determined  by  RAS  in  both 
MOVPE and MBE systems.^ In a former MBE study,^ RAS
measurements of InAs grown on GaAs(001) up to a total
coverage  of  100  ML  were  reported.  The  distinctly  different 
spectra  for  clean  and  slightly  InAs  covered  GaAs  surfaces 
yielded  evidence  for  different  surface  reconstructions. 
Growth  beyond  a  total  coverage  of  4  ML  caused  large 
anisotropies  which  were  explained  by  roughness  effects  due 
to  three-dimensional  growth.  This  hypothesis  was  confirmed 
by  effective  medium  calculations. 

Building  on  this  study,  simultaneously  performed  time  re¬ 
solved  RAS  and  spectroscopic  ellipsometry  (SE)  measure¬ 
ments  at  certain  photon  energies  were  successfully  used  in 
MOVPE  to  monitor  the  Stranski-Krastanov  growth  mode 
transition.^  Complete  spectra  taken  over  a  range  from  1.5  to 
5.5  eV  revealed  characteristic  changes  due  to  reconstruction 
and growth mode transition. For continuous deposition of
4 ML InAs on GaAs(001) a large increase in anisotropy with
increasing deposition temperature appeared. It was explained
by the formation of large clusters elongated in the [110]
direction. This result, derived from effective medium theories, was
confirmed by the observation of elongated islands in
high-resolution scanning electron microscopy. Thus RAS might also
allow estimates of island size.

The intention of the investigations presented here was to
study in detail the relaxation of the InAs layer and to monitor
the island formation until the surface equilibrium is reached.
This was done by performing simultaneous RAS and
RHEED measurements. In order to gain a better
understanding of the processes governing the dot formation, different
deposition parameters were chosen.

II.  EXPERIMENT 

The experiments were carried out in an MBE system^
allowing simultaneous RHEED and RAS measurements.
Details concerning the flux measurement have been described
elsewhere.^ The RAS^^ was coupled to the MBE chamber
via a low-strain silica window. The temperature calibration
was made using the (2X4) to (4X2) reconstruction change in
the RHEED pattern. Before InAs deposition a 1200 nm
thick GaAs buffer layer was grown at 820 K (growth rate
r = 1 μm/h).

Three  different  types  of  experiments  were  performed: 

(i)  Stepwise  deposition:  InAs  was  deposited  stepwise 
(0.2  ML  steps)  at  different  deposition  temperatures  (625  K, 
725  K,  and  775  K)  and  RAS  spectra  were  taken  in  a  range 
from  1.5  to  5.5  eV  immediately  after  every  deposition.  These 
experiments  were  carried  out  up  to  an  effective  coverage  of 
4 ML with a growth rate of 0.05 ML/s. The RHEED patterns
were observed simultaneously.

(ii) Continuous deposition monitored in real time: during
the continuous deposition of 2 ML and 3 ML of InAs,
time-resolved RAS measurements were performed at photon
energies of 2.6 eV and 4.0 eV. The measurements were
continued after growth was finished in order to monitor the
formation of quantum dots (QDs) and the approach of the system to its
equilibrium state in real time. The growth rate for this experiment
was 0.025 ML/s and it was performed at different growth
temperatures (625 K and 725 K).

(iii)  Postgrowth  evolution  of  the  surface  towards  equilib¬ 
rium:  For  2  ML  InAs  deposited  at  different  growth  tempera¬ 
tures  the  post-growth  development  of  the  surface  was  also 
investigated  by  taking  RAS  spectra  until  no  changes  in  the 
spectra  could  be  distinguished. 

The  spectra  obtained  for  the  surface  equilibrium  were 
compared  to  those  achieved  for  stepwise  deposition  in  order 
to  determine  the  thickness  of  the  remaining  two-dimensional 
InAs  wetting  layer. 
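
As a quick consistency check on the deposition parameters listed above, the following minimal Python sketch converts the stated growth rates and coverages into deposition times. The function and constant names are illustrative only and are not part of the original work; the 1.8 ML critical coverage is the value reported in this article.

```python
def deposition_time_s(coverage_ml: float, rate_ml_per_s: float) -> float:
    """Time in seconds needed to deposit coverage_ml at a constant rate."""
    if rate_ml_per_s <= 0:
        raise ValueError("growth rate must be positive")
    return coverage_ml / rate_ml_per_s


# Values quoted in the text (names are illustrative).
STEPWISE_RATE = 0.05     # ML/s, stepwise deposition, experiment (i)
CONTINUOUS_RATE = 0.025  # ML/s, continuous deposition, experiment (ii)
STEP_ML = 0.2            # ML deposited per step in experiment (i)
CRITICAL_ML = 1.8        # ML, coverage at which the RHEED pattern turns spotty

time_per_step = deposition_time_s(STEP_ML, STEPWISE_RATE)
steps_to_4ml = round(4.0 / STEP_ML)
time_2ml = deposition_time_s(2.0, CONTINUOUS_RATE)
time_3ml = deposition_time_s(3.0, CONTINUOUS_RATE)
time_to_critical = deposition_time_s(CRITICAL_ML, CONTINUOUS_RATE)

print(f"stepwise: {time_per_step:.0f} s per step, {steps_to_4ml} steps to 4 ML")
print(f"continuous: 2 ML in {time_2ml:.0f} s, 3 ML in {time_3ml:.0f} s")
print(f"critical coverage reached after {time_to_critical:.0f} s of deposition")
```

Under these assumptions the critical coverage is reached about 72 s into the continuous deposition, that is, before the end of both the 2 ML (80 s) and the 3 ML (120 s) runs.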

III.  RESULTS  AND  DISCUSSION 

A. RAS spectra for stepwise deposition of InAs on
GaAs 

Characteristic RAS spectra obtained for stepwise deposition
at 625 K up to an overall coverage of 3 ML are given in
Fig. 1(a). Starting from the well-known c(4X4) As-rich
GaAs reconstruction, significant changes in the As-dimer
related part of the spectrum, i.e., a minimum at 2.6 eV,
occurred already after the deposition of 0.2 ML. The RHEED
pattern changed to an asymmetric (1X3) surface reconstruction.
For additional InAs deposition the RHEED pattern became
clearer and more symmetric. In the RAS spectra a change in


[Fig. 1 panels: (a) axis label "Energy (eV)"; (b) axis label "InAs coverage (ML)"]


Fig. 1. RAS data taken immediately after the deposition of well-defined
amounts of InAs on GaAs(001). (a) RAS spectra: the strong signal change at
2.6 eV reflects the reconstruction change from an arsenic-rich c(4X4)-GaAs
(001) surface to a (1X3)-reconstructed one. The signal change (increase) at
photon energies around 4 eV for coverages beyond the critical thickness
(1.8 ML) originates from the formation of anisotropic dots or clusters of
InAs. (b) Spectral position of the As-dimer related maximum (left axis) and
the RAS signal at 2.6 eV (right axis) [data taken from (a)].


the sign around 2.6 eV takes place, leading now to a
maximum at this energy. The occurrence of the change in the sign
indicates the rotation of the As dimers from the [110] to the
[1̄10] direction. The intensity of the As-dimer related
maximum as a function of InAs coverage is shown in
Fig. 1(b).

Besides the changes in signal intensity, a shift in the
energy position of the maximum also appears. Figure 1(b) reveals
a decrease of its spectral position with increasing InAs
coverage. The lowest energy position is found for a total
coverage of 2 ML. This corresponds to a movement of the
As-dimer related maximum from the typical GaAs-related
position at 2.6 eV to a more InAs-like position. RAS studies
on InAs bulk material have shown a maximum at about 2.3
eV.^ For InAs coverages above 2.2 ML the spectral position
of the maximum shifted back to energies found before for
lower total coverages. At an InAs coverage of 1.8 ML the
appearance of a spotty RHEED pattern indicates the change
from two-dimensional layer-by-layer to three-dimensional
island growth.

Knowing  this,  the  changes  in  the  RAS  spectra  can  be 
interpreted  as  follows:  During  two-dimensional  growth  the 
intensity  of  the  As-dimer  related  signal  increases  because  of 
reconstruction  changes,  but  stays  constant  at  the  growth 
mode  transition,  indicating  a  constant  As-dimer  coverage  and 
orientation at the surface. When the island formation starts
(2 ML to 2.2 ML), and the nucleation of three-dimensional
islands takes place (as previously reported in Ref. 5),
material from the two-dimensional wetting layer is transferred to
the islands, resulting in a decrease of the maximum signal
intensity and a shift of the maximum in energy position that
should be related to the layer thickness of the
two-dimensional InAs layer below the dots. The thicker the InAs
layer, the closer the spectral position of the signal is to that of
InAs. The opposite shift of the maximum position for
coverages above the critical thickness is characteristic of InAs
coverages below the critical layer thickness and indicates a
mass transfer out of the two-dimensional layer towards the
InAs islands. This effect can be used to determine the
residual wetting layer thickness.
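
The interpretation above rests on locating the coverage at which the As-dimer related signal stops rising. A minimal Python sketch of one possible way to locate it from stepwise data follows. The function name, the tolerance, and the toy signal are all hypothetical; the toy values only mimic the qualitative trend described in the text and are not measured data.

```python
def saturation_coverage(coverages, signal, tol):
    """Return the first coverage after which the signal no longer
    rises by more than tol from one deposition step to the next.
    Returns None if the signal keeps rising over the whole range."""
    if len(coverages) != len(signal):
        raise ValueError("coverages and signal must have equal length")
    for i in range(len(signal) - 1):
        if signal[i + 1] - signal[i] <= tol:
            return coverages[i]
    return None


# Toy data (hypothetical): 0.2 ML steps from 0.2 ML to 3.0 ML.
toy_coverages = [round(0.2 * k, 1) for k in range(1, 16)]


def toy_signal(coverage_ml):
    """Rises linearly up to 1.8 ML, stays flat, then drops above 2.0 ML."""
    value = min(coverage_ml, 1.8)
    if coverage_ml > 2.0:
        value -= 0.5 * (coverage_ml - 2.0)
    return value


toy_values = [toy_signal(c) for c in toy_coverages]
estimated_critical = saturation_coverage(toy_coverages, toy_values, tol=0.05)
print("signal saturates at", estimated_critical, "ML (toy data)")
```

With real spectra the tolerance would have to be chosen from the noise level of the RAS signal, and the result should be cross-checked against the appearance of the spotty RHEED pattern.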

Further InAs deposition led to an apparent downshift of
the RAS peak position and the RAS amplitude. The latter
can be understood by comparing the whole spectra for
coverages beyond the critical thickness (3 ML) to those of the
smooth two-dimensional case (1.8 ML) (Fig. 1). A
decreased RAS signal is evident for the whole spectral range
(except for the 4 eV peak), a phenomenon that can be
attributed to an increased surface roughness caused by the
islands. A correlation between the size and morphology of the
anisotropic  islands  and  the  RAS  spectra  was  investigated  in 
Ref.  8.  Performing  effective  medium  calculations  (similar  to 
those  described  in  Ref.  8),  we  found  that  the  increased  RAS- 
signal  at  4  eV  may  be  caused  by  an  increased  island  density, 
while  the  decrease  in  the  RAS-signal  may  be  induced  by  an 
increased  island  anisotropy  (elongation  in  [110]). 

Performing the same experiment at a substrate
temperature of 725 K resulted in behavior similar to that seen at 625 K.
The critical coverage was found to be approximately the
same as for the lower temperature (1.8 ML). Taking an RAS
spectrum for 4 ML deposited at 725 K half an hour after the
last deposition, we observed a significant increase of the
anisotropy at high photon energies (above 4.0 eV). This was a
first hint that the island formation, even at this relatively high
temperature, is a long-lasting process. The equilibrium
properties for different temperatures are studied in detail later
(Figs. 3(a) and 3(b)).

For depositions at even higher temperatures (T = 775 K)
the changes from one 0.2 ML step to the next one were found
to be much smaller than for lower temperatures, and both a
saturation of the dimer-related anisotropy and a spotty
RHEED pattern (indicating three-dimensional growth) occurred at
significantly higher nominal coverages (about 3 ML). Thus it
is concluded that at this temperature reevaporation of indium
from the surface already occurs and that the effective growth
rate is lower than expected.


B.  Time  resolved  measurements  during  continuous 
InAs  deposition 

In  order  to  follow  the  dynamic  changes  of  reconstruction 
and  morphology  on  line,  time  resolved  RAS  measurements 
at  the  characteristic  energies  2.6  eV  (the  As-dimer  related 
energy)  and  4.0  eV  (sensitive  to  roughness)  were  performed 
for  continuous  deposition  of  2  or  3  ML  InAs  at  two  different 
temperatures (625 K and 725 K). After deposition, the RAS
transients were continued for an additional 270 s.

In Fig. 2(a) the dimer-related change of the RAS signal at
2.6 eV from a negative to a positive one, as already found for
the stepwise deposition [Fig. 1(b)], is shown. Its sudden
increase when InAs is deposited proves the immediate change
of the surface reconstruction. For an InAs coverage around
the critical thickness the RAS signal at 2.6 eV saturates,
indicating that the As-dimer coverage now stays constant.
When the growth mode changes, because it becomes
energetically favorable for the surface to form islands, the InAs
layer thickness is reduced, as measured by a slight decrease
of the RAS signal at 2.6 eV. This decrease is found for both
coverages, at both temperatures, but it is most evident for
deposition of 3 ML InAs at 725 K. The preceding increase of
the signal [only observed for 3 ML at 725 K, bottom of Fig.
2(a)], which already starts during growth and continues when
the deposition is stopped, can be interpreted as the
superposition of a roughness-related contribution, since after the
growth mode transition island formation leads to an
increasing surface roughness. This behavior was not found in the
spectra taken for stepwise deposition, possibly because
growth interruptions allowed a smoothening of the surface
between two deposition steps. However, the following
decrease in anisotropy, found in all experiments, should be
related not only to the smoothening of the surface, but also to
a transport of material to the islands (i.e., a decrease of the
As-dimer coverage of the surface), as already discussed in the
previous section.

RAS transients at 4 eV are given in Fig. 2(b). They have
been taken to gain additional information about surface
roughness introduced by island formation. At high photon
energies, because of the shorter wavelengths, light scattering
due to surface roughness is stronger than at 2.6 eV, thus
causing larger changes in the RAS signal.
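
To make the wavelength argument concrete, the two probe energies can be converted to vacuum wavelengths with λ = hc/E. The short Python sketch below does this; the function name is illustrative, and the value of hc in eV nm is the standard physical constant rather than anything taken from the original work.

```python
HC_EV_NM = 1239.841984  # Planck constant times speed of light, in eV nm


def photon_wavelength_nm(energy_ev: float) -> float:
    """Vacuum wavelength in nm of a photon with the given energy in eV."""
    if energy_ev <= 0:
        raise ValueError("photon energy must be positive")
    return HC_EV_NM / energy_ev


for energy in (2.6, 4.0):
    print(f"{energy:.1f} eV corresponds to {photon_wavelength_nm(energy):.0f} nm")
```

The 4.0 eV probe thus corresponds to a wavelength of roughly 310 nm, compared with roughly 477 nm at 2.6 eV.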

For  all  experiments  (2,  3  ML  and  625,  725  K)  during 
two-dimensional  growth  (i.e.,  below  1.8  ML)  the  evolution 
Fig. 2. RAS transients during InAs deposition for two total coverages (2 ML, 3 ML) and two temperatures (625 K, 725 K) measured at a photon energy of
(a) 2.6 eV and (b) 4.0 eV. The horizontal arrows starting at t = 10 s indicate the deposition time. The initial changes (up to 110 s) describe the surface
modification from c(4X4) GaAs to (1X3) InAs. The further signal increase in (b) is caused by surface roughening.


of  the  transients  is  governed  by  changes  of  the  reconstruction 
as  shown  in  Fig.  1(a). 

After  that,  for  deposition  of  2  ML  InAs  at  625  K  only  a 
weak  signal  change  occurs,  even  when  RHEED  already 
showed  three-dimensional  growth.  Thus  surface  roughness 
should  be  low,  i.e.,  only  small  islands  had  formed  on  the 
surface. However, the slight increase that occurred after
deposition was halted shows that the equilibrium surface
state had not yet been reached.

At  the  higher  deposition  temperature  (bottom  of  Fig.  2(b)) 
the  roughness  related  increase  is  enhanced  and  changes  of 
the  surface  toward  equilibrium  seem  to  be  much  faster  indi¬ 
cating  that  kinetic  processes  are  responsible  for  the  island 
formation. 

For  the  3  ML  deposition  of  InAs,  the  response  of  the  RAS 
to  three-dimensional  islands  becomes  evident  by  a  sign 
change  in  the  RAS  signal  that  occurs  already  during  the 
deposition.  Stopping  the  deposition  additionally  increases 
this  effect.  Again  for  the  higher  deposition  temperature  the 
evolution  of  the  surface  towards  the  surface  equilibrium  is 
accelerated. 

Depositing 3 ML InAs at 725 K results in a larger
anisotropy than that occurring for low-temperature
deposition, which can be understood from the surface diffusion
properties. Increasing the temperature means increasing the
diffusion length. Enhanced diffusion leads to an enlargement
of the individual islands and thereby to a lowering of the
island density, because big islands (beyond the critical island
size^^ found in several TEM investigations) grow at the
expense of smaller ones (Ostwald ripening). Thus, taking RAS
transients at 4 eV, we can decide in situ whether large islands
have formed or not by looking at the anisotropy increase
after the deposition is halted.

C.  Post  growth  evolution  of  the  islands 

Since the time-resolved measurements demonstrated that
the evolution of the islands continues after the deposition is
halted, additional investigations of the post-growth behavior
were performed.

For this purpose InAs was deposited to an amount slightly above
the critical layer thickness, and RAS spectra were taken until
no further significant changes were observed and the
equilibrium of the islanded surface was reached. These spectra
are given in Fig. 3(a). A significant drop of the signal at 2.6
eV (dimer related) and a shift of the maximum position, both
toward values found for lower InAs coverages (see Fig. 1),
were measured. This behavior again gives evidence that the
two-dimensional InAs layer is reduced in favor of the
islands. For the lowest deposition temperature under
investigation (625 K), anisotropy changes originating from this
rearrangement of the surface were found to take up to 70 min
Fig. 3. Development of the RAS signal for extended times (up to 80 min) after deposition of 2 ML InAs on GaAs (001) for two temperatures (625 K, 725 K).
(a) The RAS spectra indicate, especially at the lower temperature, that changes (e.g., surface modifications) still occur over times that are long compared with the
deposition time. (b) Time-resolved RAS signal at 2.6 eV taken from (a).
Fig. 4. Comparison of an RAS spectrum obtained 70 min after a one-step
deposition of 2 ML InAs/GaAs (“equilibrium”) with different
submonolayer deposition spectra (0.6, 0.8, 1.0 ML) obtained under the same
conditions: The “equilibrium” wetting layer thickness was determined to be 0.8
ML at 625 K.


after  deposition.  The  process  is  enhanced  for  higher  deposi¬ 
tion  temperatures  and  the  equilibrium  state  is  thus  reached 
earlier  (35  min  at  675  K  and  25  min  at  725  K). 

Figure  3(b)  summarizes  the  whole  island  formation  pro¬ 
cess  by  combining  the  time  resolved  measurements  (Fig. 
2(a))  with  the  2.6  eV  RAS  response  gained  from  the  equilib¬ 
rium  studies  (Fig.  3(a))  at  defined  times  after  deposition  of  2 
ML.  The  whole  process  is  shown,  from  two-dimensional 
wetting  layer  growth  over  island  nucleation  and  coalescence 
until  surface  equilibrium.  This  shows  the  good  agreement  of 
time  resolved  and  spectral  measurements. 

An estimate of the final wetting layer thickness is
obtained by a comparison of the equilibrium spectra (Fig. 3(a))
with the spectra for stepwise depositions (Fig. 1(a)) at the
same temperatures. The comparison is given in Fig. 4.

For low-temperature deposition (625 K) the best
agreement was found for a residual InAs layer thickness of 0.8
ML. For deposition at 725 K, 1.2 ML InAs remained as a
two-dimensional wetting layer. This difference in layer thickness
may be caused by different growth rates for the
submonolayer depositions (due to desorption of indium at 725 K the
growth rate may be lower), but could also be caused by
different equilibrium surface states for different growth
conditions (including different island shapes and sizes for
different temperatures).
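
The comparison described above amounts to finding the stepwise-deposition spectrum that lies closest to the equilibrium spectrum. The following Python sketch illustrates one way this could be done, using a root-mean-square difference as the closeness measure. That measure, the function names, and the toy spectra are assumptions made for illustration; the original text does not state how the best agreement was judged, and no measured data are reproduced here.

```python
import math


def rms_difference(spectrum_a, spectrum_b):
    """Root-mean-square difference of two spectra on the same energy grid."""
    if len(spectrum_a) != len(spectrum_b):
        raise ValueError("spectra must share the same energy grid")
    squared = sum((a - b) ** 2 for a, b in zip(spectrum_a, spectrum_b))
    return math.sqrt(squared / len(spectrum_a))


def best_matching_coverage(target, references):
    """Return the coverage (dict key) whose reference spectrum is
    closest to the target spectrum in the RMS sense."""
    return min(references, key=lambda cov: rms_difference(target, references[cov]))


# Toy spectra (hypothetical): one peak near 2.6 eV whose height
# grows with coverage, sampled from 1.5 eV to 5.5 eV.
energies = [1.5 + 0.1 * i for i in range(41)]


def toy_spectrum(coverage_ml):
    return [(coverage_ml - 0.5) * math.exp(-((e - 2.6) / 0.3) ** 2)
            for e in energies]


references = {cov: toy_spectrum(cov) for cov in (0.6, 0.8, 1.0)}
# Toy "equilibrium" spectrum: the 0.8 ML shape plus a small offset.
equilibrium = [value + 0.01 for value in toy_spectrum(0.8)]

print("best match:", best_matching_coverage(equilibrium, references), "ML")
```

A finer coverage grid, or interpolation between reference spectra, would be needed to resolve wetting layer thicknesses lying between the 0.2 ML deposition steps.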

This behavior is very different from MOVPE
investigations, where the final quantum-dot state was reached
soon after the deposition was halted. This difference may be
Fig. 5. “Equilibrium” studies at elevated temperature (775 K) for 2 ML
InAs deposited on GaAs (001). Due to In desorption at this temperature the
spectra return to spectra measured for a total coverage of only 0.2 ML. The
remaining InAs is found to be very stable: only annealing at an even higher
temperature (1025 K) removes the last InAs and restores the starting
surface, the c(4X4)-GaAs.


due to the higher surface mobility in MOVPE,^^ leading to a
faster transport of material to the islands. In addition, chemical
radicals at the surface may cause a faster reduction of the
wetting layer.

Performing the same studies at an even higher temperature
(Fig. 5), we found that the equilibrium was reached only
hours after growth. Here two effects are superimposed:
desorption of In from the surface and the islands and, in
addition, transfer of material from the wetting layer and the
smaller QDs to the larger islands. However, the final state
shows clearly that reevaporation is the dominating process at
this temperature. Comparing the spectra taken two hours
after deposition with those taken for submonolayer deposition
at this temperature, we found excellent agreement with the
0.2 ML spectra. The RHEED pattern also changes back from
a spotty to a streaky, at first weak and later more pronounced,
asymmetric (1X3) reconstruction. This indicates that all islands
have gone and only small, two-dimensional InAs regions are
left on the surface. Continuation of this experiment for about
2 h gave no further changes, showing that the last 0.2 ML are
very stable at this temperature. Annealing the sample to
790 K and cooling it down to 775 K again yields a clear
c(4X4) reconstruction in both RAS and RHEED that was
comparable to that of the clean GaAs surface. Thus
annealing to 790 K leads to the desorption of the complete InAs
layer.

IV.  SUMMARY 

In this work RAS and RHEED investigations for different
coverages of InAs on GaAs(001) were performed. The
growth mode transition was observed, and the main interest
was in the development of the islanded surface towards
equilibrium. For this purpose time-resolved measurements
during and after growth were correlated with the
reconstructions observed in RHEED, and detailed information
on the initial steps of dot evolution was obtained. For
two-dimensional wetting layer growth the initial change from
the GaAs c(4X4) reconstruction to a (1X3)-reconstructed
InAs surface was confirmed. Analyzing the shifts of the
As-dimer related RAS maximum towards lower energies and the
corresponding RHEED pattern, the critical thickness of the
InAs layer was determined to be 1.8 ML for 625 K
deposition. Time-resolved RAS studies were performed to
monitor not only the transition from two-dimensional to
three-dimensional growth but also, in detail,
the further development of the islands. With
increasing deposition beyond the critical thickness and for
higher growth temperatures (775 K) an increase of the
anisotropy due to roughening of the surface by large islands
was found. A strong temperature dependence of the
equilibrium surface state was found. For a coverage slightly above
the critical thickness the equilibrium for the 625 K
deposition was reached 70 min after deposition, and at 725 K it took
only 25 min. At 775 K reevaporation of In from the surface
was observed and a residual coverage of only 0.2 ML after
about 2 h was estimated.
Reflectance anisotropy spectroscopy study of GaAs overlayer growth
on  submonolayer  coverages  of  Si  on  the  GaAs(001)-c(4x4)  surface 
Reflectance anisotropy spectroscopy (RAS) has been employed to characterize the overlayer growth
of GaAs onto sub to one monolayer coverages of Si δ layers deposited on the GaAs(001)-c(4X4)
surface. The low growth temperature (400 °C), required to avoid spreading of the dopant away from
the δ plane, has meant that the study of a RAS feature related to the linear electro-optic (LEO)
effect is complicated by disordering at the GaAs surface. This disordering is induced not only by the
growth temperature, but also by the presence of the Si δ layer itself. Variable thickness studies
indicate that the LEO-induced signal is dependent on the field profile in the surface layer. It has been
observed that the intensity of the LEO feature, as a function of Si coverage, reaches a maximum at
~0.01 ML (~6.4X10^12 atoms cm^-2), in agreement with previous studies of the site occupancy of Si
δ layers. © 1996 American Vacuum Society.


I.  INTRODUCTION 

Considerable effort has been devoted, over many years, to characterizing
fully the nature of the reconstructions obtained from GaAs surfaces
prepared under different growth conditions. Such studies have formed a key
starting point when trying to predict both the structural and electronic
characteristics of interfaces formed by GaAs with other materials. More
recently, similar attention has been focused on the way in which these
reconstructions are altered after planes of dopant atoms, such as Si or Be,
are deposited on the GaAs surface. The majority of these studies have
employed either reflection high-energy electron diffraction (RHEED), or a
combination of RHEED and scanning tunneling microscopy (STM), as tools to
probe the atomic bonding that gives rise to the observed surface
reconstructions. While these investigations have explored a wide manifold of
possible reconstructions, using a broad range of growth conditions, only a
small number of studies have been aimed specifically at what are probably
the most technologically relevant Si/GaAs interfaces, i.e., those prepared
at low temperature (400 °C and below). At these growth temperatures, the
spread of Si atoms away from the δ plane remains small enough for practical
δ-doping applications.

We have recently used a combination of reflectance anisotropy spectroscopy
(RAS) and RHEED measurements to study submonolayer coverages of Si on the
GaAs(001)-c(4×4) surface, at a substrate temperature of 400 °C. One of the
observations to come out of this and other RAS studies of the GaAs(001)
surface is that there are characteristic spectral features that vary in a
systematic manner with the reconstruction of the GaAs surface. The best
known of these features occurs at an energy of 2.65-2.7 eV and changes in
both shape and sign from a pronounced minimum for the c(4×4) reconstruction,
to a maximum for the (2×4) reconstructed GaAs(001) surface. This change
reflects an alteration in surface bond orientation from along [110] to
[$\bar{1}$10], and makes RAS measurements extremely sensitive to the degree
of dopant coverage. In our case, we have been able to discern overlayer
coverages of as little as 0.005 ML of either Si or Be. In addition, there
are a number of characteristic energies, linked to specific surface
reconstructions, that act as nodal (isoplethic) points through which the RAS
spectra evolve with increasing Si coverage.

In this article, RAS data are presented for GaAs overlayer growth on top of
different submonolayer coverages of Si on the GaAs(001)-c(4×4) surface. This
is a natural progression of our earlier Si/GaAs work and is of direct
relevance to the inclusion of δ layers within a III-V semiconductor device.
The interpretation of the RAS spectra becomes somewhat more complicated in
this case, since the measured anisotropy not only reflects changes in the
surface order but is also influenced by the surface field and the thickness
of the overlayer. The dependence on surface field has been studied
previously for uniformly doped GaAs and has been shown to give rise to a
feature in the RAS spectrum at around 3 eV. This feature has been
attributed, in turn, to the linear electro-optic (LEO) effect, associated
with the $E_1$ and $E_1+\Delta_1$ interband transitions. The strength of the
electric field in the near-surface region can be altered either by changing
the concentration of Si atoms in the δ layer, or by varying the position of
the δ layer with respect to the surface. Hence, it has been our aim in this
study to probe the contribution of the LEO effect to the RAS signal and to
increase our understanding of the surface ordering that takes place when
GaAs is deposited on top of Si/GaAs. Once again, all growth has been
restricted to 400 °C in order to limit the diffusion of Si atoms away from
the δ plane.

II. EXPERIMENT

All growth took place on on-axis GaAs(001) substrates within a VG Semicon
V80H MBE reactor, fitted with a VG LEG 110 RHEED system. Experimental
procedures such as substrate preparation, growth temperature, and MBE flux
calibration details can be found elsewhere. The RAS system, constructed
in-house, was positioned at a strain-free pyrometer viewport situated on the
MBE reactor growth chamber. A description of the integrated MBE/RAS system
can also be found elsewhere, the design being based upon that due to Scholz
et al. and Aspnes et al. The RAS system, which has a working spectral range
from 1.5 to 5.5 eV, measures the difference (Δr) between the anisotropic
complex reflectance (r) along the [$\bar{1}$10] and [110] optical eigenaxes
within the (001) surface crystallographic plane, normalized to the mean
reflectance (r):

$$\frac{\Delta r}{r} = 2\,\frac{r_{[\bar{1}10]} - r_{[110]}}{r_{[\bar{1}10]} + r_{[110]}}$$

Only  the  real  part  of  the  RAS  signal  was  investigated,  since 
even  small  residual  strain  effects,  associated  with  the  pyrom¬ 
eter  viewport,  significantly  affect  the  imaginary  component 
of  the  RAS  signature. 
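
The normalization described above can be sketched in a few lines of Python. This is only an illustration of the definition (difference of the two complex reflectances divided by their mean); the function name and the numerical reflectances are hypothetical and are not taken from the measurements.

```python
def ras_signal(r_110bar: complex, r_110: complex) -> complex:
    """Difference of the two complex reflectances, normalized to their mean."""
    mean_r = 0.5 * (r_110bar + r_110)
    return (r_110bar - r_110) / mean_r


# Hypothetical complex reflectances along the two optical eigenaxes.
r_a = complex(0.600, 0.010)
r_b = complex(0.601, 0.012)

signal = ras_signal(r_a, r_b)
# Only the real part is analysed in this study.
print(f"Re(dr/r) = {signal.real:.3e}")
```

Dividing by the mean rather than by the sum is what introduces the factor of 2 when the expression is written as a single fraction.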

Following the complete thermal desorption of the GaAs surface oxides at
~600 °C under an As$_4$ flux of F [...] molecules cm$^{-2}$ s$^{-1}$, a
sharp, clear GaAs(001)-(2×4)-β reconstruction was observed by RHEED. A 1 μm
undoped GaAs buffer layer was then grown at a rate of 1 μm h$^{-1}$ with an
As$_4$/Ga flux ratio of ~0.8, at a temperature of 580 °C. The sample was
cooled to 400 °C for both the deposition of Si and subsequent overgrowth
with GaAs. The Si was deposited from a standard 40 cm$^3$ VG Mk.II effusion
cell, which had been rigorously calibrated from a large series of Hall
effect and electrochemical C-V profiling measurements. The effusion cell
temperature was set to $T_{\mathrm{Si}}$ = 1120 °C, which yielded a flux of
$2.8\times10^{11}$ atoms cm$^{-2}$ s$^{-1}$, such that a single monolayer
would be deposited in ~0.64 h. To replicate the growth conditions
encountered during δ doping, the As$_4$ flux was incident onto the sample
surface at all stages throughout the experiment. Si coverages of 0.001,
0.0025, 0.005, 0.01, 0.1, and 1.0 ML were investigated in this study. GaAs
overlayer coverages started at 1 ML and doubled with each successive
deposition, up to a maximum of 512 ML. RAS spectra were recorded for the
clean GaAs(001)-c(4×4) surface, at 400 °C, and after each deposition. The
RAS intensities presented here differ from those in our previous
publications, as noted in Ref. 16.
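
The deposition arithmetic in this section can be checked with a short sketch. It assumes a Si flux of 2.8 × 10^11 atoms cm^-2 s^-1 and a monolayer density of 6.4 × 10^14 atoms cm^-2 (the equivalence implied by 0.01 ML ≈ 6.4 × 10^12 atoms cm^-2); the constant and function names are illustrative only.

```python
ML_DENSITY = 6.4e14  # atoms cm^-2 per monolayer (assumed from 0.01 ML ~ 6.4e12)
SI_FLUX = 2.8e11     # atoms cm^-2 s^-1 (assumed value of the Si flux)


def deposition_time_s(coverage_ml, flux=SI_FLUX, ml_density=ML_DENSITY):
    """Time in seconds needed to deposit `coverage_ml` monolayers of Si."""
    return coverage_ml * ml_density / flux


si_coverages = [0.001, 0.0025, 0.005, 0.01, 0.1, 1.0]  # ML, as studied
gaas_overlayers = [2 ** k for k in range(10)]           # 1, 2, 4, ..., 512 ML

for coverage in si_coverages:
    print(f"{coverage:7.4f} ML Si -> {deposition_time_s(coverage):8.1f} s")

print(f"1 ML takes {deposition_time_s(1.0) / 3600:.2f} h")
print("GaAs overlayer sequence (ML):", gaas_overlayers)
```

Under these assumptions one monolayer takes about 0.63 h, in line with the ~0.64 h quoted, and the smallest coverage (0.001 ML) corresponds to only a few seconds of deposition.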

III.  RESULTS  AND  DISCUSSION 

Figure 1(a) shows a typical RAS spectrum for a clean GaAs(001)-c(4×4)
surface, i.e., one where the GaAs buffer layer has been grown at 580 °C and
the sample cooled to the measurement temperature of 400 °C. In our
experiments, both the Si deposition and subsequent GaAs depositions have
been carried out at 400 °C; hence, we need to ascertain the similarity
between GaAs surfaces grown at 400 °C and those grown at 580 °C, then cooled
to 400 °C. Our results indicate that, under the growth conditions employed
here, low temperature (400 °C) growth results in a disordered surface that,
when growth is terminated, slowly recovers to a well-ordered surface once
more. To demonstrate this, Fig. 1(b) displays a time-resolved RAS
measurement at an energy of 2.65 eV, the energy characteristic of direct
excitation of surface dimer species. The initial equilibrium signal, the
different equilibrium reached during the 10 s growth, and the return towards
the initial equilibrium following growth are evident. As can be seen, a
timescale of >1 h is required to recover the intensity of the 2.65 eV
minimum at this growth temperature.

Fig. 1. (a) Typical RAS spectrum obtained at 400 °C from a GaAs(001)-c(4×4)
surface, and (b) time-resolved RAS measurement at 2.65 eV before, during,
and after 10 s GaAs deposition on GaAs(001), at 400 °C.

This is an impractically long time to wait when studying GaAs overgrowth on
Si/GaAs by a cycle of sequential growth and RAS measurements, under UHV
conditions. Hence, we have chosen to systematically record RAS spectra 5 min
(300 s) after each deposition. Since each scan takes 3 min (180 s) to
complete, it is important to determine the degree by which the RAS intensity
varies between 300 and 480 s after each deposition. Reference to Fig. 1(b)
reveals that the intensity of the 2.65 eV minimum changes by 7.5% during
this time interval, an amount that we have taken to be an acceptable
systematic uncertainty in our study of GaAs overgrowth on Si/GaAs.

A. 0.01 ML Si

Figure 2(a) shows a series of RAS spectra that correspond to the initially
clean GaAs(001)-c(4×4) surface, its modification with the addition of
0.01 ML Si, and the first stages of overgrowth with GaAs. The RAS spectra
for GaAs overlayer thicknesses of 2 and 4 ML have been omitted for clarity.
Most noticeably, deposition of 0.01 ML Si is seen to give rise to an upward
shift of the 2.65 eV minimum, which becomes enhanced greatly on overgrowth
with 1 ML GaAs. This is contrary to what one would expect if the deposited
GaAs was simply restoring a well-ordered c(4×4) surface. Increasing GaAs
coverage systematically reverses the shift obtained for 1 ML GaAs (the
omitted RAS spectra for depositions of 2 and 4 ML lie between those shown
for 1 and 8 ML).

Fig. 2. RAS spectra for (a) initial GaAs(001)-c(4×4) surface, following
addition of 0.01 ML Si and the first stages of overgrowth with GaAs, and (b)
increasing GaAs coverage on top of 0.01 ML Si/GaAs.

While it is tempting to attribute the shift of the 2.65 eV minimum solely to
a growth-temperature-induced decrease in surface order, the influence of the
LEO effect and the possible variation of RAS signal with GaAs coverage above
the Si δ layer cannot be totally excluded. The RAS spectrum for 8 ML GaAs on
top of 0.01 ML Si/GaAs shows an inflection at ~2.9 eV, the signature of an
LEO-related feature. Previous studies of the LEO effect in n-GaAs, at 300 K,
have shown the LEO-related feature to be centered around 3.05 eV in energy
and to affect the lineshape of the RAS spectrum only between 2.85 and
3.25 eV. If we apply this finding to our own RAS spectra, measured at
400 °C, then we might reasonably expect the contribution of the LEO feature
to be restricted to the spectral range from 2.6 to 3.2 eV. This could
distort the shape of the 2.65 eV minimum and might cause a displacement of
this minimum to lower energy. However, our observation that there is no
further displacement of the minimum between 1 and 8 ML GaAs coverage, added
to the fact that the LEO feature only provides a weak contribution to the
RAS spectrum for 8 ML, leads us to believe that the spectral changes are
related, in some way, to the overgrowth with GaAs.

Figure 2(b) reveals the effect of increasing GaAs overlayer coverage. Each
spectrum is plotted using the same absolute scale, but displaced vertically
for clarity. The position of the zero line has been included in each case.
The intensity of the LEO feature, identified for 8 ML GaAs/0.01 ML Si/GaAs,
is found to reach a maximum at an overlayer thickness of 64 ML GaAs and then
to decrease with further GaAs coverage. At the same time, a minimum appears
in the RAS spectrum at ~4.7 eV. Such a minimum has been observed previously
for n-GaAs, where the authors concluded that its behavior was independent of
both the degree and type of doping. From Fig. 2(b) it is evident that the
4.7 eV minimum follows the behavior of the LEO feature, indicating that it
is also related to the surface field, although its origin remains unclear.

Previous studies of the LEO effect in n-GaAs have shown that it is possible
to establish a direct correspondence between the integrated "LEO area" and
the degree of bulk doping, since the LEO intensity is proportional to the
field that arises due to pinning of the Fermi level at the GaAs surface. An
accurate determination of the integrated LEO area was facilitated by being
able to subtract the RAS spectrum for an undoped GaAs(001) sample from the
spectra obtained for GaAs layers with different degrees of bulk doping, but
similar surface structure. In the present case, the RAS spectra for GaAs
overgrowth on 0.01 ML Si/GaAs are not identical to that for the clean
GaAs(001)-c(4×4) surface; hence, the integrated LEO area cannot be obtained
by a simple process of spectral subtraction.

By solving Poisson's equation self-consistently, we have calculated that
0.01 ML Si, as a fully electrically active δ layer separated by 64 ML GaAs
from the sample surface, corresponds to a surface field of
~$5\times10^{5}$ V cm$^{-1}$. Moreover, the strength of the surface field
increases, as expected, as we bring the δ layer closer to the surface of the
sample. However, the intensity of the LEO feature is observed to decrease
for coverages both above and below 64 ML GaAs. The decrease for thicknesses
>64 ML GaAs can be explained by a combination of (i) the surface field
decreasing with increasing coverage, and (ii) the δ layer passing beyond the
penetration depth of the incident radiation (~170 Å at 3 eV). One possible
explanation for the decrease in signal below 64 ML is that the RAS technique
averages over the near-surface region. This is beyond the scope of the
present article and will be discussed elsewhere.

Fig. 3. RAS spectra for (a) initial GaAs(001)-c(4×4) surface, following
addition of 0.0025 ML Si and the first stages of overgrowth with GaAs, and
(b) increasing GaAs coverage on top of 0.0025 ML Si/GaAs.
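
To make the comparison between overlayer thickness and optical penetration depth concrete, the sketch below converts the GaAs overlayer sequence from monolayers to ångströms. It assumes that 1 ML of GaAs(001) is half the cubic lattice constant, with a room-temperature value of 5.6533 Å (an assumption not stated in the text; thermal expansion at 400 °C is ignored), and uses the ~170 Å penetration depth quoted above.

```python
GAAS_LATTICE_A = 5.6533              # Å, assumed room-temperature lattice constant
ML_THICKNESS_A = GAAS_LATTICE_A / 2  # Å per GaAs(001) monolayer (Ga + As plane)
PENETRATION_DEPTH_A = 170.0          # Å at 3 eV, as quoted in the text


def overlayer_thickness(n_ml):
    """Depth of the delta layer below the surface, in Å, for n_ml monolayers."""
    return n_ml * ML_THICKNESS_A


for n_ml in [2 ** k for k in range(10)]:  # 1, 2, 4, ..., 512 ML
    depth = overlayer_thickness(n_ml)
    ratio = depth / PENETRATION_DEPTH_A
    print(f"{n_ml:4d} ML -> {depth:7.1f} Å  ({ratio:5.2f} x penetration depth)")
```

With these numbers, 64 ML corresponds to about 181 Å, so the thickness at which the LEO feature peaks is also the thickness at which the δ layer reaches the optical penetration depth.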

B.  0.0025  ML  Si 

Figure 3(a) contains RAS spectra for GaAs overgrowth on 0.0025 ML Si/GaAs.
By examining the minimum at 2.65 eV, it can be seen that it is virtually
impossible to distinguish between the clean-surface c(4×4) spectrum and that
obtained following deposition of 0.0025 ML Si. The RAS spectrum obtained
following overgrowth with 1 ML GaAs is very similar to that shown for the
same coverage in Fig. 2(a). However, the additional vertical shift of the
2.65 eV minimum, for a GaAs overlayer thickness of 8 ML, is reduced by
comparison with Fig. 2(a). In fact, it is a general observation that the
degree of vertical displacement of the 2.65 eV minimum, for overgrowth by
1-8 ML GaAs, decreases with the Si submonolayer coverage. The implication of
this is that there are three effects that act to modify our RAS spectra when
GaAs overgrowth is performed. These are the LEO effect, the disordering
induced by the low growth temperature and, finally, the fact that overgrowth
is performed on top of a Si δ layer. Indeed, the results are consistent with
the presence of a disordering that increases with Si δ-layer coverage and
acts in addition to the growth-temperature-induced disordering.

Figure 3(b) follows the evolution of the RAS spectrum for GaAs thicknesses
in excess of 8 ML. Once again, the LEO feature is observed to develop
initially with increasing GaAs coverage, up to 64 ML GaAs, before starting
to decrease in intensity and disappearing completely by 256 ML GaAs. The
sensitivity of RAS to the presence of dopant atoms, such as Si or Be, on the
GaAs surface has previously been shown to be limited to 0.005 ML, the
minimum coverage that provides a resolvable shift of the 2.65 eV minimum.
The fact that the LEO-related feature is still clearly observable for GaAs
overgrowth on 0.0025 ML Si/GaAs means that the RAS technique shows increased
sensitivity ($1.6\times10^{12}$ atoms cm$^{-2}$), for submonolayer coverages
of planar dopant atoms, under these conditions.

C.  0.001-1.0  ML  Si 

Up until this point we have looked only at GaAs overgrowth on Si coverages
of ≤0.01 ML Si/GaAs, where the RHEED reconstruction is exclusively c(4×4).
It is to be noted, from our previous RAS measurements for Si on GaAs(001),
that a Si coverage of 0.1 ML corresponds to a crossover in the behavior of
the 2.65 eV feature. At coverages of <0.1 ML Si, both c(4×4) and (2×1)
reconstructions appear to coexist and so the overall RAS signal contains
contributions from both surface phases. For Si coverages >0.1 ML, the (1×2)
reconstruction dominates and the 2.65 eV feature shows a positive rather
than a negative peak. In Fig. 4 we display RAS spectra for 64 ML GaAs
overgrowth, the thickness at which the LEO feature is fully developed, on
all the Si submonolayer coverages we have studied. Once again, all spectra
have been plotted with the same absolute scale, but displaced vertically for
clarity. The overall shapes of the RAS spectra are remarkably similar,
excluding the contribution of the LEO-related feature, considering that the
Si coverages span three orders of magnitude from 0.001 to 1.0 ML. It can be
seen that the intensity of the LEO-related feature increases with Si
submonolayer coverage up to 0.01 ML (equal to ~$6.4\times10^{12}$ atoms
cm$^{-2}$), and then decreases slightly with further coverage up to 1.0 ML
Si. A similar correspondence has been reported between the density of
Si$_{\mathrm{Ga}}$ (i.e., Si on donor sites) and the total Si coverage, up
to a coverage of ~$10^{13}$ cm$^{-2}$ (0.016 ML). In that case, the measured
density of Si$_{\mathrm{Ga}}$ then remained approximately constant up to a
coverage of ~$10^{14}$ cm$^{-2}$, in very good agreement with the results
presented here.

It is worth noting that the LEO feature remains evident in the RAS spectrum
for 64 ML GaAs on top of 0.001 ML Si/GaAs, although it only appears as a
kink around 2.85 eV. However, its presence has been confirmed by comparing
RAS spectra that have been differentiated. In this case, the field at the
surface has been calculated to be ~$9\times10^{4}$ V cm$^{-1}$.

Fig. 4. RAS spectra for 64 ML GaAs deposited on Si/GaAs, where the Si
coverages vary from 0.001-1.0 ML.
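
As a rough cross-check of the order of magnitude of the surface field for the lowest coverage, one can use Gauss's law for a fully ionized sheet of donors whose electrons have all transferred to surface states. This is a much cruder estimate than the self-consistent Poisson solution used in the text; the static dielectric constant of 12.9 and the monolayer density of 6.4 × 10^14 atoms cm^-2 are assumptions, and the function name is illustrative.

```python
E_CHARGE = 1.602176634e-19   # C, elementary charge
EPS0 = 8.8541878128e-12      # F m^-1, vacuum permittivity
EPS_R_GAAS = 12.9            # assumed static dielectric constant of GaAs
ML_DENSITY_CM2 = 6.4e14      # atoms cm^-2 per monolayer (assumed)


def depletion_field_v_per_cm(coverage_ml):
    """Field between a fully depleted donor sheet and the surface, in V/cm."""
    sheet_density_m2 = coverage_ml * ML_DENSITY_CM2 * 1e4  # cm^-2 -> m^-2
    field_v_per_m = E_CHARGE * sheet_density_m2 / (EPS0 * EPS_R_GAAS)
    return field_v_per_m / 100.0                           # V/m -> V/cm


print(f"0.001 ML: {depletion_field_v_per_cm(0.001):.2e} V/cm")
```

For 0.001 ML this gives roughly 9 × 10^4 V/cm. For higher coverages the same formula should be read only as an upper bound, since Fermi-level pinning limits the potential drop between the δ layer and the surface and not all donors' electrons reach the surface.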

IV.  CONCLUSIONS 

The RAS technique has been used to characterize the overlayer growth of GaAs
onto sub to one monolayer coverages of Si δ layers deposited on the
GaAs(001)-c(4×4) surface. The low growth temperature (400 °C) required to
avoid spreading of the dopant away from the δ plane has meant that the study
of an LEO-related signal is complicated by disordering at the GaAs surface.
This disordering is induced not only by the growth temperature, but also by
the presence of the Si δ layer itself. Variable thickness studies indicate
that the LEO-induced signal is dependent on the field profile in the surface
layer. Finally, the intensity of the LEO feature as a function of Si
coverage reaches a maximum at ~0.01 ML (~$6.4\times10^{12}$ atoms cm$^{-2}$),
in agreement with previous studies of the site occupancy of Si δ layers.

Using reflectance-difference spectroscopy, we determine surface-induced optical anisotropy (SIOA)
spectra of clean, hydrogenated, and oxidized (113) and vicinal (001) Si surfaces to obtain a better
understanding of the origin of the optical response of surfaces and interfaces. Hydrogenation was
performed either by etching in dilute HF or by exposing clean surfaces to atomic hydrogen.
Hydrogenated and oxidized vicinal surfaces show energy-derivativelike spectra that roughly scale
with offcut angle, indicating step-induced behavior, and exhibiting features near 3.4 and 4.2 eV, the
threshold energies of the ($E_0'$, $E_1$) and $E_2$ interband critical points of bulk Si, respectively. The
appearance of derivativelike line shapes indicates that bulk threshold energies become dichroic near
the surface due to the surface-induced modification of the potential, as supported by model
calculations. However, direct integration yields dielectric functions somewhat different from bulk
values, indicating that the surface affects the near-surface dielectric function. SIOA spectra for clean
surfaces are qualitatively different, exhibiting dielectric-functionlike line shapes that appear to
originate primarily from terraces, although indications of smaller derivativelike contributions are
also found. © 1996 American Vacuum Society.


I.  INTRODUCTION 

In this work, we use reflectance-difference spectroscopy (RDS) to
investigate surface-induced optical anisotropy (SIOA) spectra of a broad
range of clean, hydrogenated, and oxidized Si surfaces with the objective of
obtaining a better understanding of the optical response of surfaces and
interfaces. RDS measures the difference in reflectance between two principal
axes in the surface plane and is, therefore, well suited for isolating
surface optical responses. Since the bulk optical response of cubic
materials is isotropic, it cancels upon subtraction, leaving an RDS signal
originating from the lower-symmetry surface.

Because the Si surface has been well studied and has been shown to be
relatively structurally simple, Si is an ideal material for such
investigations. It has been shown that clean room-temperature (RT) (001)
surfaces form asymmetric dimers that are parallel to each other within a
given terrace. Adjacent terraces separated by single-atomic-height steps
form dimer domains rotated 90° relative to each other. Because the two
sublattices are chemically equivalent, in the absence of any driving
mechanism for the formation of double-height steps, single-height steps will
predominate, the areas associated with the two different domains will be
approximately equal, and no macroscopic RDS response will be observed.
However, for offcut angles greater than approximately 4°, atomically clean,
annealed surfaces are dominated by double-height steps so that dimers in
adjacent terraces are parallel. In this situation, a net RDS response is
obtained that in general contains both terrace and step contributions.

One way to investigate step and terrace contributions is to chemically
modify the surface and observe changes in the RDS signal. In this work, we
used both hydrogen and oxygen to saturate dangling bonds and terminate the
surface. Hydrogen termination is of interest since the H-terminated (001) Si
surface exists in two stable configurations depending on coverage. At low
coverages, a single hydrogen atom bonds to the single dangling bond of each
Si surface atom. In this so-called monohydride phase, dimer bonding is
retained but the initially asymmetric dimers become symmetric. Upon further
exposure, the surface converts to the dihydride phase, where the dimer bonds
are broken and both Si dangling bonds on each atom are saturated by
hydrogen. Other phases are also possible with H exposure; however, they are
not as stable as the mono- and dihydride surfaces. While the exact structure
of the clean and chemically modified surfaces is still a matter of debate,
it is clear that the backbonds of the Si dimer atoms are strained. This
allows the effect of localized strain on optical spectra to be studied.


II.  EXPERIMENT 

SIOA spectra were obtained using an RDS spectrometer that operates at
near-normal incidence from 1.5 to 5.5 eV. The anisotropy of the complex
reflectance $\tilde r$ is given by
$\Delta\tilde r/\tilde r = 2(\tilde r_\alpha - \tilde r_\beta)/(\tilde r_\alpha + \tilde r_\beta) = \Delta r/r + i\Delta\theta$,
where $\tilde r_\alpha$ and $\tilde r_\beta$ are the complex reflectances
for light linearly polarized along the α and β principal axes, respectively.
Here we report data for Δr/r with α defined to be along [$\bar{1}$10], and β
to be orthogonal to the [$\bar{1}$10] direction and the surface normal.
Wafer orientations were verified using low energy electron diffraction
(LEED). Several chemically treated samples were measured on a rotational
stage where spectra were taken at two orthogonal orientations and subtracted
to minimize artifacts in Δr/r due to nonidealities of the optical
components.

We investigated a broad range of Si surfaces, specifically: (a) vicinal
(001) surfaces cut 4°, 6°, and 10° off (001) toward [110]; and (b) the (113)
surface, which can be considered as an (001) surface miscut 25° off (001)
toward [110]. The wafers were lightly p- or n-doped with resistivities from
1 to 10 Ω cm and were chemically prepared by an RCA clean. Further treatment
consisted of a 30 s dip in dilute HF followed by a de-ionized-water rinse to
obtain a hydrogenated surface, or a 30 s dip in H$_2$O$_2$ to obtain an
oxidized surface. Data were taken directly after treatment with the samples
in air. To establish that the surfaces did not degrade during measurement,
we also obtained data on samples immersed in dilute HF or H$_2$O$_2$
contained in a quartz beaker.

For  comparison,  we  measured  the  evolution  of  the  SIOA 
response  of  single-domain  surfaces  exposed  to  atomic  H. 
These  samples  were  prepared  by  an  RCA  clean  with  no  final 
HF  dip,  then  transferred  to  a  UHV  system  with  a  low  10~^^ 
Torr base pressure where the residual oxide layer was desorbed
and the surface reconstructed by annealing for 2 min at
920 °C. LEED patterns exhibited sharp, well-defined spots
with splittings in the direction of the steps, along the β direction.
We measured the evolution of the RDS signal as the
surface  was  exposed  to  atomic  H,  which  was  generated  by 
dissociating  H2  in  the  10“^  mbar  range  with  a  hot  W  fila¬ 
ment. 

III.  RESULTS  AND  DISCUSSION 

Figure 1 shows RDS spectra of (113) and two vicinal (001) Si surfaces
hydrogenated in dilute HF. RCA-cleaned surfaces with no final HF dip show
similar RDS signals. It is important to note that the magnitude of the
signal increases with offcut angle. Since the total area of the terraces
decreases slightly with offcut angle and the areal density of steps
increases, it follows that these RDS signals are dominated by step-induced
contributions, namely (001) double-height steps. This has been addressed
previously in Ref. 11.

These line shapes all exhibit three main features: the first is an
energy-derivativelike structure near 3.4 eV; the second and third form a
double structure near 4.3 eV. These energies are near those of the
($E_0'$, $E_1$) and $E_2$ interband critical points of bulk silicon,
respectively. Previous work has shown that RDS spectra can exhibit
energy-derivativelike features, and can even be dominated by such features,
which can be described as follows. For a thin anisotropic overlayer on an
isotropic bulk, the RDS signal is given to first order in d/λ by

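$$\frac{\Delta\tilde r}{\tilde r} = \frac{4\pi i E d}{hc}\,\frac{\epsilon_\alpha - \epsilon_\beta}{\epsilon_b - 1}, \qquad (1)$$

where E is the photon energy, d is the thickness of the anisotropic
overlayer, λ is the wavelength of light, hc is the product of Planck's
constant and the speed of light, $\epsilon_\alpha$ and $\epsilon_\beta$ are
the components of the surface dielectric tensor along α and β, respectively,
and $\epsilon_b$ is the bulk dielectric function. If we assume that
$\epsilon_\alpha$ and $\epsilon_\beta$ both have the form of $\epsilon_b$,
but with slightly different threshold energies and broadening parameters,
i.e., that the associated critical points are dichroic, then in the vicinity
of a critical point we can write approximately

$$\epsilon_\alpha = \epsilon_b(E - E_g - \Delta E_{g\alpha} + i\Gamma + i\Delta\Gamma_\alpha),$$
$$\epsilon_\beta = \epsilon_b(E - E_g - \Delta E_{g\beta} + i\Gamma + i\Delta\Gamma_\beta),$$

whence

$$\frac{\Delta\tilde r}{\tilde r} = \frac{4\pi i E d}{hc}\,\frac{(-\Delta E_g + i\Delta\Gamma)}{\epsilon_b - 1}\,\frac{d\epsilon_b}{dE}, \qquad (2)$$

where $\Delta E_g = \Delta E_{g\alpha} - \Delta E_{g\beta}$ and $\Delta\Gamma = \Delta\Gamma_\alpha - \Delta\Gamma_\beta$.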
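
The step from the two shifted dielectric functions to the derivative form of Eq. (2) is a first-order Taylor expansion. The sketch below checks this numerically for a toy dielectric function with a single Lorentzian-like pole; the model, its parameters, and the chosen shifts are hypothetical and are not Si data.

```python
def eps_b(energy, shift=0.0, extra_broadening=0.0,
          e_gap=3.4, gamma=0.1, amplitude=5.0):
    """Toy bulk dielectric function of the complex argument E - E_g + i*Gamma."""
    z = complex(energy - e_gap - shift, gamma + extra_broadening)
    return 1.0 - amplitude / z


def exact_difference(energy, d_e_alpha, d_e_beta, d_g_alpha, d_g_beta):
    """eps_alpha - eps_beta evaluated directly from the shifted functions."""
    return (eps_b(energy, d_e_alpha, d_g_alpha)
            - eps_b(energy, d_e_beta, d_g_beta))


def first_order_difference(energy, d_e_alpha, d_e_beta, d_g_alpha, d_g_beta,
                           step=1e-4):
    """(-dE_g + i*dGamma) * d(eps_b)/dE, with a central-difference derivative."""
    derivative = (eps_b(energy + step) - eps_b(energy - step)) / (2 * step)
    delta_e_g = d_e_alpha - d_e_beta
    delta_gamma = d_g_alpha - d_g_beta
    return (-delta_e_g + 1j * delta_gamma) * derivative


def max_relative_error():
    """Worst-case relative error of the first-order form between 3.0 and 3.8 eV."""
    shifts = (0.004, 0.0, 0.001, 0.0)  # eV: dE_alpha, dE_beta, dG_alpha, dG_beta
    worst = 0.0
    for k in range(81):
        energy = 3.0 + 0.01 * k
        exact = exact_difference(energy, *shifts)
        approx = first_order_difference(energy, *shifts)
        worst = max(worst, abs(approx - exact) / abs(exact))
    return worst


print(f"max relative error: {max_relative_error():.3f}")
```

For this model the relative error is of order |ΔE_g|/Γ, so the derivative form holds as long as the dichroic shifts are small compared with the broadening of the critical point.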

Although the assumption is typically made, it is not clear a priori that the
energy derivative in Eq. (2) should be with respect to $\epsilon_b$, or
whether a surface-modified bulk dielectric function may be more appropriate.
Since extra scattering channels are available at the surface, we would
expect at least an increase in broadening of the interband critical point
structures near the surface. To examine this further, we determined whether
we could recover $\epsilon_b$ from RDS data by integrating Eq. (2). We
define the result as a near-surface dielectric constant, $\epsilon_s$, given
by

$$\epsilon_s(E)\,d = \epsilon_s(E_0)\,d - i\,\frac{hc}{4\pi}\,\frac{1}{(-\Delta E_g + i\Delta\Gamma)}\int_{E_0}^{E}\frac{\epsilon_b - 1}{E'}\,\frac{\Delta\tilde r}{\tilde r}\,dE', \qquad (3)$$

where  £"0  is  a  lower  limit  of  integration,  here  equal  to  1 .5  eV. 
For  lxxi(€^'d)  we  assume  an  integration  constant  of  0  which 
is  reasonable  since  Im(  •  d)  is  observed  to  be  nearly  con¬ 
stant  below  3  eV  and  a  constant  nonzero  absorption  is  un¬ 
likely.  We  also  assume  that  AF  is  small  compared  to  A£ 
which  was  found  to  be  80  meV  A  in  the  case  of  Si(113).^^ 
This  point  will  be  discussed  later.  The  resulting  calculated 
spectra  Im(6^'^)  for  hydrogenated  or  oxidized  Si  samples 
cut  10°  off  (001)  toward  (110)  are  shown  in  Fig.  2.  For 
scaling  purposes,  we  divided  lm(6^^d)  by  0.3  nm,  a  reason¬ 
able  thickness  for  an  anisotropic  surface  layer. 
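The integration in Eq. (3) can be sketched as a cumulative trapezoid sum. This is a minimal illustration under stated assumptions, not the authors' procedure: ΔΓ is set to zero, the factor (ε_b − 1) is kept in the integrand as implied by the denominator of Eq. (2), and the function name `integrate_rds`, its arguments, and the choice of Å and eV units are all hypothetical.

```python
import math

HC_EV_ANGSTROM = 12398.42  # h*c expressed in eV*Angstrom


def integrate_rds(energies, dr_over_r, eps_bulk, delta_eg, start_value=0j):
    """Cumulative trapezoid evaluation of Eq. (3) with Delta Gamma = 0.

    energies  : ascending photon energies in eV
    dr_over_r : complex RDS signal on the same grid
    eps_bulk  : complex bulk dielectric function on the same grid
    delta_eg  : dichroic threshold shift in eV (assumed energy independent)
    Returns eps_s * d (in Angstrom) at every grid point.
    """
    prefactor = -1j * HC_EV_ANGSTROM / (4.0 * math.pi) / (-delta_eg)
    integrand = [
        (eb - 1.0) / e * s for e, s, eb in zip(energies, dr_over_r, eps_bulk)
    ]
    result = [start_value]
    for k in range(1, len(energies)):
        width = energies[k] - energies[k - 1]
        area = 0.5 * (integrand[k] + integrand[k - 1]) * width
        result.append(result[-1] + prefactor * area)
    return result
```

With measured data, `start_value` plays the role of the integration constant ε_s(E_0)·d discussed in the text.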

The calculated spectra show a feature near 3.33 eV, which is very near the energies of the (E0', E1) interband critical points of bulk silicon, and a second broad feature near 4.2 eV, which is near the E2 critical point. Figure 3 shows a comparison between the bulk crystalline spectrum Im(ε_b) and the calculated spectrum Im(ε_s·d) from Fig. 2 for the (113) surface. There is qualitative agreement between the two spectra; however, the (E0', E1) structure for the (113) surface is weaker, and the features are somewhat broader and shifted. The different amplitudes for the (E0', E1) and E2 structures may not be unreasonable since it is not apparent a priori why the scaling factor ΔE_g should be the same for the two transitions. In addition, scattering is expected to be enhanced at the (113) surface with respect to the bulk, which would explain the enhanced broadening of the integrated spectrum. We can gain further insight into the effect of the surface on the dielectric response by comparing the same (113) spectrum from Fig. 2 to that of bulk porous Si, which is strongly influenced by the very large internal surface area of its pores. The result is shown in Fig. 4. In this case, there is better agreement near E2, but the (113) spectrum retains some structure associated with the (E0', E1) critical points, although this structure is completely absent in the porous Si spectrum. These results suggest that the correct representation for ε_b in the derivative of Eq. (2) lies somewhere between the two limits of crystalline and porous Si.

Fig. 2. Im(ε_s·d) spectra obtained by applying Eq. (3) to RDS data for hydrogenated and oxidized (113) and 10° vicinal (001) Si surfaces. The hydrogenated data are from Fig. 1. The energy locations of the bulk interband critical points (E0', E1), E2, and E1' are indicated.


Fig. 3. Comparison between the hydrogenated (113) Im(ε_s·d) spectrum of Fig. 2 (solid line) and Im(ε_b) of bulk crystalline Si (dashed line).


Fig. 4. Comparison between the hydrogenated (113) Im(ε_s·d) spectrum of Fig. 2 (solid line) and Im(ε) of porous Si layers (dashed line).

In calculating the spectra of Fig. 2, we assumed that (−ΔE_g + iΔΓ) is essentially real, which means that ΔE_g ≫ ΔΓ. If this were not the case, ε_s(E)·d would be multiplied by a complex phase factor exp(iθ), which would mix real and imaginary parts of dε/dE. Although no strong justification can be given at present for our assumption, we observed no improvement of the fit to the spectra when nonzero values of ΔΓ were used. In addition, the spectra calculated for nonzero values of ΔΓ showed features shifted unreasonably to lower energies. Note that the statement ΔΓ ≪ ΔE_g implies only that ΔΓ_α ≈ ΔΓ_β on the scale of ΔE_g, which still allows a significant increase of Γ with respect to bulk values. We also assumed that both ΔE_g and ΔΓ are essentially independent of photon energy, which is a reasonable assumption near a critical point. However, the complete spectrum contains contributions from several critical points, and thus the use of constant values for ΔE_g and ΔΓ for all critical points is a simplification. The question as to whether ε_b should also be replaced in the denominator of Eq. (2) depends on whether the three-phase (ambient/anisotropic surface layer/isotropic bulk) model is sufficient to describe the optical response. The question has not been resolved; however, it will be discussed further in Ref. 18.
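The mixing of real and imaginary parts by the phase factor exp(iθ) can be made concrete with a short sketch. It assumes that θ is the phase of 1/(−ΔE_g + iΔΓ) relative to the purely real case ΔΓ = 0; the function name `phase_factor` and the numerical values are illustrative, not from the source.

```python
import cmath


def phase_factor(delta_eg, delta_gamma):
    """Unit-modulus factor exp(i*theta) by which the Delta Gamma = 0 result
    is rotated when the prefactor 1/(-delta_eg + i*delta_gamma) is complex."""
    ratio = (-delta_eg) / complex(-delta_eg, delta_gamma)
    return ratio / abs(ratio)


# A purely real input acquires an imaginary part once delta_gamma is nonzero.
purely_real = 2.0 + 0.0j
unrotated = phase_factor(0.03, 0.0) * purely_real
rotated = phase_factor(0.03, 0.03) * purely_real
```

In this sketch ΔΓ = ΔE_g gives θ = π/4, so real and imaginary parts are mixed with equal weight, whereas ΔΓ = 0 leaves the input untouched.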

We now discuss the origin of the threshold energy shifts of the bulk critical points that lead to the derivativelike features. The obvious causes are surface strain and chemical induction, which is a change in the subsurface electron distribution caused by chemisorption. Both cases result in near-surface Si atomic positions that are different from those of the bulk. To estimate the possible effect of strain on critical-point energies near the surface, we consider clean (001) Si using atomic positions reported in Ref. 19 for the c(2×4) reconstruction. Although a (2×1) reconstruction is observed at RT, there is evidence that the c(2×4) unit is still the building block even at high temperatures. From the atomic positions given in Ref. 19, the length of the asymmetric dimer bond is 2.30 Å, which is slightly less than the length of the bulk Si-Si bond (2.352 Å at RT). Due to dimer formation, the Si-Si back bonds are rotated and strained. The back bonds of the lower atom of the asymmetric dimer have a length of 2.315 Å and thus are in 1.6% compressive strain, whereas the back bonds for the higher atom have a length of 2.360 Å and thus are in 0.4% tensile strain. Therefore, the net effect is a compressive strain of the Si-Si back bonds. Using the elastic stiffness constants (c11 + 2c12) = 2.93 Mbar and the fact that hydrostatic strain shifts the (E0', E1) critical point of bulk Si by 5 meV/kbar, we calculate a gap shift of about 0.2 eV to higher energies. Most likely the apparent shift would be averaged over several layers, in which case shifts would be of the order of 0.1 eV Å, which was observed.

Fig. 5. RDS spectra for clean vicinal (001) and (113) Si surfaces. The spectra are offset to yield the value zero at the low energy limit of the figure.
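The strain estimate above can be reproduced approximately with a few lines of arithmetic. The sketch assumes the isotropic hydrostatic relation P = −(c11 + 2c12)·ε between pressure and linear strain, which the text does not spell out, and treats the back-bond strain as if it were a hydrostatic strain; the function names are hypothetical.

```python
C11_PLUS_2C12_MBAR = 2.93      # elastic stiffness combination quoted in the text
SHIFT_MEV_PER_KBAR = 5.0       # pressure coefficient of the critical point
BULK_BOND_ANGSTROM = 2.352     # bulk Si-Si bond length at room temperature


def bond_strain(bond_length, bulk_length=BULK_BOND_ANGSTROM):
    """Linear strain of a bond; negative values are compressive."""
    return (bond_length - bulk_length) / bulk_length


def hydrostatic_shift_mev(linear_strain):
    """Critical-point shift (meV) for an equivalent hydrostatic strain.

    Assumes P = -(c11 + 2*c12) * strain, with 1 Mbar = 1000 kbar.
    """
    pressure_kbar = -linear_strain * C11_PLUS_2C12_MBAR * 1000.0
    return pressure_kbar * SHIFT_MEV_PER_KBAR


lower_atom_strain = bond_strain(2.315)   # back bonds of the lower dimer atom
upper_atom_strain = bond_strain(2.360)   # back bonds of the higher dimer atom
lower_atom_shift = hydrostatic_shift_mev(lower_atom_strain)
```

Under these assumptions the compressed back bonds alone correspond to a shift of a few tenths of an eV to higher energy, the same order as the value of about 0.2 eV quoted in the text; the exact number depends on how the compressive and tensile contributions are averaged.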

The monohydride surface is more difficult to explain using strain arguments. Here the dimer is symmetric, so all back bonds will be strained equally. However, measured and calculated values for the dimer length range from 2.40 Å (Ref. 4) to 3.0 Å (Ref. 23) compared to a bulk separation of 3.84 Å. The strain induced by the back bonds strongly depends on this dimer length. Taking a value of 2.34 Å for the dihydride phase as a lower limit, we estimate a strain of 0.4% and a shift of about 47 meV.

In addition to strain, surface electric fields can create first-derivativelike features for crystals without inversion symmetry. This is the well-known linear electro-optic effect, which is usually related to the electric field of the space charge region generated from Fermi level pinning by surface states. This mechanism does not contribute to the optical response of bulk Si because of its inversion symmetry. However, this symmetry is broken at the surface, and therefore a local electric field may contribute to the RDS signal. Such local electric fields might arise from the charge transfer due to the chemisorbed species. The Pauling electronegativity for H (2.1) is higher than that of Si (1.8), which results in a net charge transfer from Si to H. The charge transfer for a single Si-H bond would be 0.02 e0, which is in good agreement with the calculated value of 0.029 e0. The related electric field, therefore, would be strong enough to create a derivativelike feature.

A change of the threshold energies from their bulk values, as indicated by the observed derivativelike line shapes, can only occur if the states involved are near-bulklike and localized. We discuss this issue in a forthcoming article.

Unlike the hydrogenated and oxidized Si surfaces, the clean surfaces exhibit dielectric-functionlike line shapes, as shown in Fig. 5. A broad feature is apparent near 4.2 eV, similar to that seen for porous Si. The main difference is the finite value between 2 and 3 eV, which is most likely related to the dangling-bond states of the Si dimers. No significant change in signal amplitude is observed for increasing offcut angle for the clean Si surface, which indicates that step contributions are minor. The fact that the smallest signal is observed for the (113) surface, which can be considered to be all steps, is consistent with this interpretation.

Fig. 6. Comparison between the difference between the 10° and 6° vicinal Si(001) RDS spectra (solid line) and the Si(113) spectra (dashed line). The (113) spectra are offset for clarity and the imaginary component has been multiplied by (-1).

To isolate possible step contributions to the RDS spectra of the clean surfaces, we calculate the difference between the (001) spectra for the 10° and 6° miscut angles. The resulting real and imaginary parts are shown in Fig. 6. The result indicates that

$$\frac{\Delta r}{r}(10^\circ) - \frac{\Delta r}{r}(6^\circ) \approx i\,\frac{\Delta r}{r}(113). \qquad (4)$$

The simplest interpretation is that the RDS signal arises from a linear combination of step and terrace contributions:

$$\frac{\Delta r}{r} = A_s\left(\frac{\Delta r}{r}\right)_{\rm step} + A_t\left(\frac{\Delta r}{r}\right)_{\rm terr}, \qquad (5)$$

where A_s and A_t are weighting functions, A_s + A_t = 1, and A_t and A_s are proportional to the areas of terrace and step, respectively. However, this model cannot explain the result expressed as Eq. (4), especially if we assume that the RDS signal for the (113) surface is dominated by steps. Even if the difference between the 10° and 6° spectra has only step contributions, it is not clear why its amplitude should be similar to that of the (113) spectrum. The phase shift indicated by i could be explained if we assume that the RDS spectrum for the clean (113) surface has a derivativelike line shape. This would require that the character of the RDS spectra changes with offcut angle. The appearance of i then indicates that the system evolves from being dominated by a dichroic band gap ΔE_g to being dominated by a dichroic broadening parameter ΔΓ. However, the small feature near 3 eV in the (113) spectrum resembles that shown in Fig. 2 for Im(ε_s·d), although slightly shifted to lower energies. This point needs to be clarified by further experiments.
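The amplitude argument against the linear step/terrace model can be illustrated with a small sketch. The step fractions and the complex step and terrace signals used here are hypothetical placeholders, not measured values; only the linear-combination form with weights summing to one comes from the text.

```python
def rds_mix(step_fraction, step_signal, terrace_signal):
    """Linear combination of step and terrace signals with A_s + A_t = 1."""
    if not 0.0 <= step_fraction <= 1.0:
        raise ValueError("step_fraction must lie between 0 and 1")
    terrace_fraction = 1.0 - step_fraction
    return step_fraction * step_signal + terrace_fraction * terrace_signal


def offcut_difference(fraction_a, fraction_b, step_signal, terrace_signal):
    """Difference between the model spectra of two offcut angles."""
    return (rds_mix(fraction_a, step_signal, terrace_signal)
            - rds_mix(fraction_b, step_signal, terrace_signal))


# Hypothetical complex signals at one photon energy and hypothetical fractions.
step = 1e-3 * (0.5 + 0.2j)
terrace = 1e-3 * (2.0 - 1.0j)
difference = offcut_difference(0.40, 0.25, step, terrace)
```

In this model the difference spectrum equals the change in step fraction multiplied by (step − terrace), so its amplitude is scaled down by a real factor smaller than one. That is the reason a difference spectrum comparable in amplitude to the (113) spectrum is hard to reconcile with the model.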

IV.  SUMMARY  AND  CONCLUSIONS 

We have investigated terrace- and step-induced contributions to the RDS spectra of clean and chemically modified Si surfaces. Hydrogenated and oxidized surfaces show derivativelike line shapes arising from bulk critical points that have become dichroic as a result of a potential that induces either chemical induction or strain at the surface. By integrating these spectra, we obtain near-surface dielectric functions which resemble those of porous Si, a material that is dominated by surface contributions and exhibits features that can be related to significantly broadened bulk interband critical points. In contrast, RDS line shapes of clean surfaces are dielectric-functionlike and exhibit only minor contributions from steps.

Using an ab initio, nonlocal pseudopotential method, we have calculated the reflectance anisotropy spectrum for a particular model of the GaAs(001) c(4×4) reconstructed surface. Excellent agreement with experiment was obtained, supporting the chosen model. The reflectance anisotropy was found to originate from optical transitions between bulklike valence band states and surface states in the conduction band. The nature and distribution of the electronic states involved are discussed. © 1996 American Vacuum Society.


I. INTRODUCTION

When light is reflected at normal incidence from the surface of a cubic crystal, the intensity of the reflected light should not be dependent on its polarization. In practice, a small difference in the reflectance of light polarized along particular symmetry directions is observed. As the complex dielectric function, which describes the optical response of a solid, is, in a centro-symmetric crystal, reduced to a complex scalar, any anisotropy in the normal incidence reflectivity of cubic crystals must arise from the surface region. Reflection anisotropy spectroscopy (RAS), which measures the difference in the reflection for light of two orthogonal polarizations normally incident on a surface as a function of the photon energy, provides useful information about the surface electronic structure. However, to interpret the features observed in the spectrum one has to resort to quite detailed calculations.

Theoretical work has progressed along two different approaches. Mochan and Barrera identified surface local field effects as being responsible for the observed anisotropy. By contrast, the calculations of Selloni et al. were performed within a single-particle picture in which the electronic eigenstates of the crystal were calculated explicitly and the optical functions derived by considering the electronic transitions. Subsequent work by Manghi et al. and Chang et al. based on this latter approach resulted in calculated spectra showing reasonable agreement with experiment, suggesting this to be appropriate when dealing with semiconductor surfaces.

The GaAs(001) surface has been the subject of much study in recent years because of its perceived technological importance. It displays a wide variety of surface reconstructions, the most important being the (4×2), the (2×4), and the c(4×4) reconstructions. The (2×4) reconstruction is As terminated and occurs in three different phases, known as the α, β, and γ phases, depending on the growth conditions. In all three phases the surface is comprised of As dimers directed along the [1̄10] direction. For coverages of As of more than 0.75 of a monolayer, extra As atoms adsorb on the top As layer, breaking the dimer bonds and forming new dimers along the [110] direction. At sufficiently high As coverages the c(4×4) reconstruction forms. The widely accepted model for this reconstruction, as deduced from scanning tunneling microscopy (STM) images and supported by x-ray diffraction experiments, is shown in Fig. 1. Kamiya et al. have shown that the RAS spectral line shapes for the three primary reconstructions of the GaAs(001) surface are quite different from each other, making RAS an effective technique in distinguishing between the different reconstructions.

In our earlier study of the GaAs(001) (2×4) reconstructed surfaces, we demonstrated that ab initio calculations of the electronic structure and optical properties based on the one-electron picture gave RAS spectra that are directly comparable with experimental results. We also showed that a comparison of the calculated and experimental spectra can be used to yield information about the atomic structure of the surface. In particular, we found that the low temperature experimental spectrum of the (2×4) surface could be interpreted using a combination of two structural models for this surface, viz. the α(2×4) and the β2(2×4) structures as denoted by Northrup and Froyen. The quality of the fit between the calculated and the experimental spectra is illustrated by Fig. 2.

In growth studies of GaAs(001) surfaces, the oscillations in the RAS intensity at a photon energy of 2.7 eV are generally ascribed to the As dimers on the surface alternating between the [1̄10] and [110] directions as the surface undergoes transitions between the (2×4) and c(4×4) reconstructions. It is therefore important that any theoretical study aimed at understanding the RAS spectrum gives a good description of both these reconstructions. Following from our study of the (2×4) reconstructed surface, the aim of this paper is to calculate the RAS spectrum of the c(4×4) reconstructed surface and to interpret its features in terms of electronic structure.

II.  CALCULATION  METHOD 

The calculations were performed in a manner similar to that used in our previous work, using ab initio nonlocal pseudopotentials, the local density approximation, and the conjugate-gradient method to minimize the total energy through the relaxation of the electronic and atomic degrees of freedom. The surface was represented using a repeated slab geometry, with a slab containing four layers of GaAs, one layer of hydrogen atoms terminating the "bottom surface," and six extra As atoms on the "top surface" forming the three dimers oriented in the [110] direction as shown in Fig. 1. Six layers of "vacuum" separated the two surfaces, making the total slab thickness equivalent to eight layers of GaAs. Due to the symmetry properties of GaAs, any slab terminated with As on both surfaces has the dimers on each surface perpendicular to the other. Such a construction would be counterproductive as the anisotropies of the perpendicular dimers on either surface would cancel each other out and lead to a null result. As the optical matrix elements are calculated in reciprocal space it is hard to separate the contributions of the two surfaces, so the most practical way to find the anisotropy of the top surface is to ensure that the bottom surface is isotropic. This is the purpose of the hydrogen atoms on the bottom surface, which are positioned in such a way that each hydrogen atom is directly below the Ga atom in the next layer. This has the effect of forcing the dangling bonds to hang vertically downwards, thus eliminating their directionality. Although this arrangement is not physical and does not give the lowest total energy or satisfy electron counting heuristics, it was found, when compared to other possible bottom surface terminations, to give the best results. The concern that the surplus of charge on one of the two surfaces will set up an electric field which will affect the optical, electronic, and structural properties of the other surface was tested by considering another bottom surface termination which had hydrogen atoms positioned below alternate Ga atoms such that half the Ga dangling bonds are occupied, half are empty, and electron counting is satisfied. Although there was slight charge transfer across the slab in both cases, nearly identical optical properties were obtained. As long as the slab was thick enough, an imbalance of charge was found not to be a problem. It was further found that the electronic states which were strongly localized on the hydrogen atoms were close to the Fermi energy and that transitions involving these states were largely in the region below 1 eV, which is not of interest. A plane wave basis set corresponding to plane waves with an energy of up to 136 eV was used. Although this relatively small cutoff does not provide an accurate description of the electronic states, in particular of the higher unoccupied electronic states, realistic values of the optical matrix elements and, more pertinently, the difference in the reflectivity, in the energy range of interest, might be expected. Furthermore, test calculations on the optical properties of bulk GaAs agreed well with experiment for energies up to ~6 eV. Only at higher energies was it found that the matrix elements became unreliable.

Fig. 1. The model for the GaAs(001) c(4×4) reconstructed surface. Legend entries: second layer As; third layer Ga.

Fig. 2. The calculated RAS spectrum (solid line), as obtained from the combination of two structural models, and the low-temperature experimental spectrum (dashed line), for the GaAs(001) (2×4) reconstructed surface.

J. Vac. Sci. Technol. B, Vol. 14, No. 4, Jul/Aug 1996. J. M. Bass and C. C. Matthai: Ab initio calculations of the RAS

Fig. 3. Simulated scanning tunneling microscopy image for the GaAs(001) c(4×4) reconstructed surface.

It was noted, however, that one of the consequences of using a relatively small basis set is that the unoccupied bands are shifted up in energy. This upwards shift in energy of the conduction band states is not to be confused with the error due to the use of density functional theory, which results in an underestimate of the band gap. When the cutoff energy was increased it was found that the energies of the conduction band states move downwards until convergence is reached, and only then is there an underestimate of the band gap. The energy cutoff which we used in these calculations has the novel property that these two opposing errors cancel, so that the band gap of bulk GaAs is in very close agreement with the experimental value. Thus there was no need to apply any kind of "scissors operator," which gave us much more confidence in our results. It should also be noted that by minimizing the total energy only the wave functions of the occupied states could be obtained. In order to get the conduction band energies and wave functions a separate procedure was used: using the self-consistent charge density found in the first part of the calculation, the band wave functions and energies were found by minimization of the eigenvalues (rather than the minimization of the total energy) under the constraint that each band must be orthogonal to all the bands below it. Optical matrix elements were then calculated for each polarization direction, using the momentum operator, and the imaginary part of the dielectric function calculated according to

$$\varepsilon_2(\omega) = \frac{e^2}{\pi m^2 \omega^2}\sum_{v,c}\int_{\rm BZ}\left|\langle\psi_{c,\mathbf{k}}|\,\hat{\mathbf{e}}\cdot\mathbf{p}\,|\psi_{v,\mathbf{k}}\rangle\right|^2\,\delta\!\left(E_c(\mathbf{k}) - E_v(\mathbf{k}) - \hbar\omega\right)d^3k.$$

The real part was then calculated from this equation via a Kramers-Kronig transform. From the dielectric functions it is very straightforward to calculate the complex refractive indices and the reflection coefficients from which the reflectance anisotropy is obtained. It was possible to ignore the nonlocal corrections as these were found to be negligible in the energy range of interest. Care was also taken to ensure that integration of the slab Brillouin zone did not introduce artificial anisotropies. The special point scheme of Chadi and Cohen, with the criterion

$$\sum_{i=1}^{n}\alpha_i A_m(\mathbf{k}_i) = 0, \qquad m = 1, 2, \ldots, N,$$

was applied to generate the four special points used in the Brillouin zone integration.
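The last step described above, going from dielectric functions to a reflectance anisotropy, can be sketched as follows. This is an illustration under simplifying assumptions and not the slab calculation of the paper: it treats a hypothetical semi-infinite medium at normal incidence, uses the Fresnel coefficient r = (1 − n)/(1 + n) with n = √ε, and adopts the common convention Δr/r = 2(r_x − r_y)/(r_x + r_y). The function names and the numerical values of ε are illustrative.

```python
import cmath


def fresnel_r(eps):
    """Normal-incidence amplitude reflection coefficient from vacuum."""
    n = cmath.sqrt(eps)          # complex refractive index
    return (1.0 - n) / (1.0 + n)


def reflectance_anisotropy(eps_x, eps_y):
    """Complex anisotropy 2*(r_x - r_y)/(r_x + r_y) for two polarizations."""
    r_x = fresnel_r(eps_x)
    r_y = fresnel_r(eps_y)
    return 2.0 * (r_x - r_y) / (r_x + r_y)


# Hypothetical dielectric functions differing slightly between polarizations.
eps_mean = 15.0 + 2.0j
eps_split = 0.01 + 0.0j
anisotropy = reflectance_anisotropy(eps_mean + eps_split / 2,
                                    eps_mean - eps_split / 2)
```

For a small splitting Δε this expression reduces to Δε/[n(ε − 1)], which provides a quick consistency check on any implementation.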


Fig. 4. The calculated RAS spectrum (solid line), and the experimental spectrum (dashed line), for the GaAs(001) c(4×4) reconstructed surface.


III.  RESULTS 

Prior to calculating the RAS spectrum we were, with very little extra effort, able to generate some theoretical scanning tunneling microscopy (STM) images of the GaAs(001) c(4×4) reconstructed surface, one of which is shown in Fig. 3. This is a realistic simulation involving a tip consisting of four Al atoms, a fixed tunneling gap of 2 Å, a bias voltage of -3 V, and a tip-surface interaction given by the Bardeen formalism. This image shows the characteristic "brickwork" structure as seen in experiment. Furthermore, each top layer As atom is individually resolved, indicating tunneling from occupied lone pair orbitals on the As atoms. It was reported by Avery et al. that, in their images taken at a bias voltage of -3 V, the central As dimer appears less bright than the outer As dimers. Our simulated image at -3 V does not show this but our images at lower bias voltages do, such that at -1 V the central As dimer is barely visible. In summary, the simulated STM images can be considered to add extra credibility to the model used for the c(4×4) reconstruction.

The calculated RAS spectrum for the GaAs(001) c(4×4) reconstructed surface is shown in Fig. 4 together with the "low" temperature (189 °C) experimental spectrum of Kamiya et al. Note that the magnitude of the calculated spectrum has been scaled to give a similar magnitude to the experimental spectrum. Such a scaling is necessary because the penetration depth of visible light is much greater than the thickness of the slab used in the calculation, resulting in the total calculated reflectance being too small, while the difference in reflectances is largely unaffected, thus causing the reflectance anisotropy to be too large. Test calculations on slabs of different thicknesses have supported this explanation. The agreement between calculation and experiment in the energy range of 1.5-3.5 eV is remarkably good. Beyond that, the calculated spectrum falls to zero and to negative values much more rapidly than is experimentally observed. Part of the reason may be attributable to the various approximations used in the calculation, as a similar departure between calculated and observed RAS spectra was also found for the (2×4) surface (Fig. 2). As it is the low energy structure of the spectra that is used to characterize the two surfaces, the excellent agreement between the calculated and observed spectra in this energy range allows the calculation to be useful in interpreting the spectrum.

In analyzing the spectrum it is useful to note that the anisotropy in the surface dielectric function, Im(ε_[1̄10] − ε_[110]), resembles the RAS spectrum. This has been observed in these calculations and has also been reported in experiment. Thus, features in the reflection anisotropy may be related to specific optical transitions. However, rather than look at specific transitions, ⟨ψ_c,k|p|ψ_v,k⟩, it is more instructive to examine which valence bands, ψ_v, and conduction bands, ψ_c, contribute most to the dielectric function anisotropy in a specific energy range. The two characteristic features in the theoretical RAS spectrum are the dip at 2.55 eV and the peak at 3.65 eV, so we choose an energy range of 0.5 eV centered at each of these energies. The first point to note is that in both energy ranges the contribution of any given valence or conduction band to the dielectric function anisotropy is either positive, zero, or negative. The peak (dip) in the dielectric function anisotropy occurs when the positive (negative) contributions are dominant. We then observe that the set of ψ_v responsible for the largest contributions to the dielectric function anisotropy (either positive or negative) at the lower energy is very similar to that responsible for the largest contributions at the higher energy. The two sets of ψ_c, however, are quite different, with those contributing to the peak at 3.65 eV generally being higher in energy, as would be expected. An examination of the charge density of these various groups of valence and conduction bands reveals that, for both features, the ψ_v which are responsible for the dielectric function anisotropy are only weakly localized in the vicinity of the surface whereas the ψ_c are strongly localized in the vicinity of the surface. The other valence and conduction bands show no preferential localization. These points would indicate that the dielectric function anisotropy, and hence the RAS spectrum, has its origins in transitions between bulklike valence band states and surface states in the conduction band. Furthermore, there seems to be no fundamental difference in the origin of the dip and peak in the RAS spectrum except for the relative magnitude of the transitions involved. A final point to note regarding the spatial distribution of the electronic states, in the plane of atoms parallel to the surface, is that there seems to be a slight localization of the ψ_v which are responsible for the dielectric function anisotropy on the second layer As atoms labeled 1-8 in Fig. 1, whereas the ψ_c appear to be located around the first layer As atoms.

To  further  clarify  the  issue  of  surface  states  we  have  cal¬ 
culated  the  local  density  of  states  (LDOSs)  associated  with 
the  first  layer  of  As  atoms  and  the  GaAs  layer  below  it,  the 
results  of  which  are  shown  in  Fig.  5.  There  are  a  number  of 


Fig. 5. The local density of states (solid lines), plotted against $E - E_F$ (eV), for (a) the
first layer of As atoms and (b) the GaAs layer below it. The density of states of bulk GaAs
is given by the dashed lines.

surface  states  common  to  both  layers  suggesting  that  they 
represent  bonding  states  between  the  top  two  As  layers. 
However,  at  both  valence  and  conduction  band  edges,  there 
are  surface  states  associated  with  the  first  layer  of  As  alone. 
Detailed analysis of the LDOS and optical transition matrix
elements shows that some of the wave functions that are responsible
for the most prominent features in the valence
band of the surface LDOS are not those which contribute
most significantly to the dielectric function anisotropy. However,
the distinct features in the conduction band surface
LDOS at eV and ~1.7 eV above the Fermi level are
attributable to electronic states which do contribute significantly
to the dielectric function anisotropy.

IV.  CONCLUSIONS 

In conclusion, we have shown that RAS spectra for the
GaAs(001) reconstructed surfaces calculated using an ab initio
pseudopotential method within the one-electron framework
can produce results which are directly comparable with
observed spectra at low energies. Such results can be used to
differentiate between different model structures and to yield
information about the electronic wave functions responsible
for, and the origins of, the observed reflectance anisotropy.

Discrete dipole calculations of the double cell type have been used to study the anisotropic reflection
at normal incidence of stepped Si(001) 2×1-type surfaces. Only $D_B$-type steps have been used. The
maximum of the anisotropy turns out to be in a direction rotated with respect to the principal axes,
and the anisotropy itself depends strongly on terrace width. Further, the crossed polarizer
configuration is interesting for experimental work since it has no offset. © 1996 American Vacuum
Society.


I.  INTRODUCTION 

Scanning  tunneling  microscopy  (STM)  has  given  a  con¬ 
siderable  new  impetus  to  surface  science.  As  a  result  many 
detailed  features  of  the  surface  have  become  the  subject  of 
elaborate studies, one of those features being the step. An
extensive overview of all research on stepped surfaces is
beyond the scope of this article, whose prime interest is
the optical properties of stepped surfaces. There is hardly
any literature in this field. The work coming closest is the
study by Yasuda and Aspnes^ of several Si surfaces. Very recent
is the work of Mantese et al. on Si(001) with high miscut angles.^
Further, there exists some theoretical work on the influence
of missing row reconstructions on the surface optical response
of Au (Ref. 4) and Si.^ The aim of this article is to
study the influence of steps on the optical response of
stepped Si(001) (2×1) surfaces and to understand the extent
to which steps can be ignored when measuring and interpreting
the optical anisotropy of this type of surface. This question is
relevant  since  samples  used  in  practice  always  have  some 
kind  of  miscut.  The  steps  are  well-defined  structures  on  top 
of  a  surface.  This  opens  unique  possibilities  to  study  the 
short  range  interactions  by  varying  the  density  of  the  steps. 
Since  steps  are  by  their  very  nature  microscopic  structures  on 
the  surface,  the  best  way  to  study  them  is  by  means  of  a 
microscopic  model.  Unfortunately,  terraces  with  a  width  of 
more  than  10  lattice  spacings  are  beyond  the  capacity  of 
most  microscopic  methods  particularly  electronic  structure 
calculation  methods.  We  will  show  in  this  article  that  discrete 
dipole  types  of  calculations,  using  the  double  cell 
technique,^’^  are  able  to  handle  such  high  Miller  index  sur¬ 
faces.  However  the  cellular  polarizabilities,  which  are  the 
basic  input  for  this  class  of  calculations,  are  not  yet  available 
from  ab  initio  calculations.  The  present  calculations  have 
been  performed  using  model  polarizabilities. 

II.  EXTENDED  DOUBLE  CELL  METHOD 

How  to  calculate  the  optical  response  of  the  surface  of  a 
semi-infinite  crystalline  system  using  the  discrete  dipole 
model  has  been  described  by  us  in  a  number  of  articles.^’^ 
The basic idea is that the entire system is represented
by a number of discrete sources: the point dipoles $\mathbf{p}_i$, located
at sites $\mathbf{r}_i$. Each dipole is linearly induced by the local
field $\mathbf{E}_{\mathrm{loc},i}$ at the dipole position, according to its
polarizability $\alpha_i$. In this way, a set of simultaneous linear equations is
obtained for any system, which must be solved numerically.
Hence very large systems, and especially semi-infinite systems,
cannot be treated in a straightforward way. Only if symmetry
can be taken into account, as is the case for crystalline
systems, can semi-infinite systems be transformed into a
problem of manageable proportions. The double cell
method^’^ is one of the most efficient methods to solve the
discrete dipole equations for an arbitrary choice of crystalline
surfaces.
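
As a rough illustration of the self-consistent dipole equations described above, the following sketch solves $\mathbf{p}_i = \alpha_i(\mathbf{E}_{\mathrm{ext}} + \sum_{j \neq i} T_{ij}\mathbf{p}_j)$ for a small finite cluster. It is not the double cell method: the static dipole tensor, the scalar real polarizabilities, the dimensionless units, and the fixed-point iteration (chosen for brevity in place of a direct linear solve) are all assumptions made for this example, and the function names are hypothetical.

```python
def dipole_tensor(r):
    """Static dipole field tensor T = (3 n n^T - 1) / d^3 for a separation vector r."""
    d = sum(c * c for c in r) ** 0.5
    n = [c / d for c in r]
    return [[(3.0 * n[a] * n[b] - (1.0 if a == b else 0.0)) / d**3
             for b in range(3)] for a in range(3)]


def solve_dipoles(sites, alphas, e_ext, sweeps=200):
    """Iterate p_i = alpha_i * (E_ext + sum_{j != i} T_ij p_j) to self-consistency.

    Converges only for weak coupling (small alpha / d^3), as in the demo below.
    """
    n = len(sites)
    p = [[0.0, 0.0, 0.0] for _ in range(n)]
    for _ in range(sweeps):
        new_p = []
        for i in range(n):
            e_loc = list(e_ext)  # local field starts from the external field
            for j in range(n):
                if j == i:
                    continue
                t = dipole_tensor([sites[i][k] - sites[j][k] for k in range(3)])
                for a in range(3):
                    e_loc[a] += sum(t[a][b] * p[j][b] for b in range(3))
            new_p.append([alphas[i] * e for e in e_loc])
        p = new_p
    return p


# Two identical dipoles one length unit apart along x (illustrative values).
sites = [[0.0, 0.0, 0.0], [1.0, 0.0, 0.0]]
alphas = [0.05, 0.05]
p_parallel = solve_dipoles(sites, alphas, [1.0, 0.0, 0.0])
p_perpendicular = solve_dipoles(sites, alphas, [0.0, 1.0, 0.0])
```

For this pair the induced moment is enhanced when the field lies along the pair axis and reduced when it is perpendicular to it, which is the two-dipole version of the chain behavior invoked later in the discussion of the results.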

Two  kinds  of  symmetry  are  exploited  by  the  double  cell 
method.  The  translational  symmetry  remains  intact  for  direc¬ 
tions  parallel  to  the  surface.  Deep  inside  the  crystal,  how¬ 
ever,  the  influence  of  the  surface  can  be  ignored,  and  there 
effectively,  three-dimensional  translational  symmetry  exists. 
There  the  response  of  all  bulk  dipoles  can  be  efficiently  de¬ 
scribed  by  normal  modes.  For  this  reason,  the  system  is  split 
in  the  double  cell  method  into  a  surface  and  a  bulk  part, 
described  by  two  normal  modes.  Two  complex  numbers, 
governing  the  strength  of  these  normal  modes,  are  the  only 
free parameters for the bulk. All dipole-dipole interactions
of the system are taken into account if we consider the interactions
between the characteristic dipoles $\mathbf{p}_i$ of the surface
layer and the bulk normal modes. The characteristic dipole is
a single dipole which can represent an entire lattice plane of
dipoles as a result of parallel translation symmetry. Upon
solution  of  the  dipole-dipole  (electrodynamic)  interactions 
governing  the  system,  the  individual  dipoles  of  the  system 
are  obtained.  Now  a  rather  straightforward  procedure  can  be 
used  to  obtain  the  remote  optical  response  (reflection). 

Fig. 1. Schematic view of the double cell geometry with thin triclinic bulk
unit cells and the ±Z layers using the one-fold integral transform.

To enable treatment of the optical response of stepped
surfaces, the double cell method needs adaptation to handle
systems having a triclinic bulk unit cell with two or more
dipoles. Until now this method could handle only systems
having a simple monoclinic bulk unit cell (with one dipole)^
spanned by $\mathbf{s}_1$, $\mathbf{s}_2$ and the perpendicular axis. If more than
one dipole had to be taken into account, we reduced them to
effectively one, by means of the commensurability theorem.^
Most of the adaptation is shown in Fig. 1 and concerns only
the true bulk (normal mode) part of the double
cell method. The key quantity for this type of discrete dipole
calculation is the transfer tensor f, governing the interaction
between a dipole and a plane of dipoles. All bulk transfer
tensors were evaluated by means of the Ewald threefold integral
transform. This integral transform expands the transfer
tensor into a series of partial waves controlled by the wave
vectors and transverse projectors. With triclinic bulk
unit cells the distance between neighboring cells in a direction
perpendicular to the surface can become very small,
reducing the efficiency of the threefold integral transform.
Therefore, for a certain number Z, we calculate the
interactions due to planes being at most Z bulk unit cells
below or above the bulk cell of interest V by means of
Ewald’s one-fold integral transform^ (Fig. 1). This requires a
rederivation of the secular matrix/determinant and the interaction
equations. Technically the derivation is almost exactly
as given in,^ but we get a slightly different expression for the
secular matrix (determinant):

where we have used the following definition for the normal
modes:

where is the wave vector, the eigenvector, and the
strength of the normal modes. Further, we use for the bulk
sites of the dipoles the definition ,

where are the sites of the dipoles within the bulk unit cell,
and d^ is the vector connecting the origins of the bulk and surface
unit cells. Equation (1) reduces to the result given in Ref.
6 on setting Z = 0. The crucial difference is in the first summation.
All transfer tensors are calculated now,

using the one-fold integral transform. In a similar way, we
get the expression for the four different types of matrix elements
of the double cell interaction matrix

where $\mathbf{r}_i$ is a surface dipole site, the direction of the analyzer
and is defined as

The superscript labels S and B will always refer to surface
and bulk, respectively. The only place where we see explicit
changes due to the one-fold integral transform is for the SB
type of matrix elements. We will frequently use the anisotropic
azimuth $\Omega$, which in this article is the angle between the electric
field and x. The inhomogeneous vector of the interaction


equations remains unchanged.^ Equations (1)-(7) comprise
the  bulk  of  the  modifications  necessary  to  let  our  software 
handle  oblique  bulk  unit  cells  with  any  number  of  dipoles. 

When  more  than  (effectively)  one  dipole  occupies  the 
bulk  unit  cell,  the  calculation  of  the  normal  mode  parameters 
slows  down  substantially.  Therefore,  the  performance  of  the 
root  searching  procedure,  used  in  the  normal  mode  part  of 
our  discrete  dipole  package,  has  been  improved.  Historically, 
a  general  purpose  Broyden  routine  has  been  used  to  find  the 
roots  from  the  modulus  of  the  determinant,  but  this  turned 
out  to  be  far  too  slow.  In  a  second  version,  we  used  a  qua¬ 
dratic  (complex)  extrapolation  of  the  secular  determinant 
while  searching  for  zeros.  In  the  present  version,  we  have 
used  a  method  which  does  not  use  the  value  of  the  determi¬ 
nant  as  such.  In  this  method,  we  bring  the  secular  matrix  to 
triangular  shape,  using  full  pivoting  of  the  matrix  elements. 
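
The idea of searching for roots through the triangularized matrix rather than through the determinant value can be sketched as follows. This is only an illustration under stated assumptions: the 2×2 matrix `toy_secular` is a hypothetical stand-in for the secular matrix, a plain secant update stands in for the Regula-Falsi-like step, and the pivot ordering is assumed to stay the same near the root.

```python
def last_pivot(matrix):
    """Triangularize a square matrix by Gaussian elimination with full pivoting
    and return the final (lower right) diagonal element."""
    m = [list(row) for row in matrix]
    n = len(m)
    for k in range(n - 1):
        # Find the largest remaining element and move it to position (k, k).
        r, c = max(((i, j) for i in range(k, n) for j in range(k, n)),
                   key=lambda ij: abs(m[ij[0]][ij[1]]))
        m[k], m[r] = m[r], m[k]
        for row in m:
            row[k], row[c] = row[c], row[k]
        # Eliminate column k below the pivot.
        for i in range(k + 1, n):
            f = m[i][k] / m[k][k]
            for j in range(k, n):
                m[i][j] -= f * m[k][j]
    return m[n - 1][n - 1]


def find_root(secular, x0, x1, tol=1e-12, max_iter=50):
    """Secant iteration on the last pivot of secular(x); works for complex x too."""
    f0, f1 = last_pivot(secular(x0)), last_pivot(secular(x1))
    for _ in range(max_iter):
        if abs(f1) < tol or f1 == f0:
            break
        x0, x1, f0 = x1, x1 - f1 * (x1 - x0) / (f1 - f0), f1
        f1 = last_pivot(secular(x1))
    return x1


def toy_secular(x):
    """Hypothetical 2x2 matrix whose determinant is x^2 - 3x + 1.75."""
    return [[x - 1.0, 0.5], [0.5, x - 2.0]]


root = find_root(toy_secular, 0.7, 0.9)
```

Because the large elements are pivoted away first, the element that vanishes at a root ends up in the lower right corner and passes through zero smoothly, so a linear extrapolation is enough to locate it.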
For the triangular secular matrix we know we can write the
determinant as

$$\det M = \prod_{i=1}^{N_s} M_{ii}. \qquad (8)$$

This product will become zero if and only if one of the
diagonal elements becomes zero. We are looking for zeros of
the diagonal elements: since we have done full pivoting, the
zeros will always be found at the lower right corner of the
matrix. The point to notice is that the matrix element which
becomes zero also approaches zero in a linear way, even for
two-fold degenerate matrices. Using a Regula-Falsi-like
technique for the last two diagonal matrix elements, the
relevant zeros were found very fast. The advantages of this
procedure are clear: it handles matrices of any size, uses
linear extrapolation, and creates highly integrated code, since
we can use the triangular matrix at the root immediately to
produce the normal mode eigenvectors.

Fig. 2. Intersection of the 110-001 plane and the conventional cube showing
miscut angle (the integer numbers refer to $N_D$).

III. STEPPED SURFACES AND GEOMETRY

The basic geometry of stepped surfaces on Si(001) has
been given by Chadi,^ and we follow his treatment and arguments
here. We use in this study only double layer stepped
surfaces. Of the two types of double layer steps, only the
$D_B$-type was found to be stable,^’^ so we use this type of step
to create terraces. Terraces enclosed between $D_B$-type steps
have all surface dimers oriented in the same direction, parallel
to the step edge. For this type of stepped surface, the
origin of a change in anisotropy can only be in the number of
step edges. Had we used single step surfaces, there
would have been two types of terraces with dimer orientations
perpendicular to each other; the change in anisotropy
would then depend on both the number of step edges and the ratio
of the surface areas with different orientation.

Chadi gives only isolated steps in his article. A top view of
the $D_B$-type step is shown in the right panel of Fig. 5. This
top view shows the atomic lattice. The atomic lattice as such
cannot be used directly for discrete dipole calculations. Ideally,
each atom should be represented by one dipole; however,
such a correspondence does not work.^^ For semiconductors,
the discrete dipole model overestimates the short range
interactions. This effect can be partially overcome by assigning
one dipole to each pair of atoms, as Mochan and Barrera
did. In this study, we do the same, but this forces us to
devise a suitable dipole lattice. The dipole geometry used
by us is shown in Figs. 3 and 4. The dipole site containing
the rebonded atom is, to a certain extent, questionable. From
the position of the rebonded atom itself, we can choose the
middle of either of two downward directed bonds, both
choices being arbitrary. The reconstruction of the Si(001)
2×1 surface is commonly called the rows of dimers reconstruction,
those rows being perpendicular to the $D_B$ steps. In addition,
we use lines of dimers, a line being all dimers on the same
line parallel to the steps. Rows of dimers belonging to successive
terraces have to shift over a distance $a/4$ parallel to
the step edge. On the next terrace, this shift can be the same

Fig. 3. Location of dipole sites used in the calculations. Left: side view of Si(001), 2 dimer lines, seen from (110). Right: top view of Si(001), 2 dimer lines, seen
from (001).

Fig. 4. Location of dipole sites used in the calculations. Left: side view of Si(113) seen from (110). Right: top view of Si(113) seen from (001).


or  reversed,  resulting  in  two  different  types  of  surfaces.  We 
chose  a  continued  shift,  yielding  an  oblique  surface  unit  cell. 

We use $D_B$-type stepped surfaces, having the miscut angle
in the (110)-(001) plane of the original 2×1 reconstructed
(001) surface, as shown in Fig. 2. Only planes in the
shaded area are considered. The limiting cases are the (001)
and (113) surfaces, the latter plane having only $D_B$-type
steps and no dimers. The terraces themselves were supposed
to have the (001) 2×1 reconstruction. The type of surface
morphology obtained in this way is valid for orientations as
steep as (117). Beyond that point, nonrebonded step edges
are introduced and surface periodicity becomes less well defined
[at least until (114), miscut angle 19.5°].^^ So our modeling
of terraces having a width of less than 2 dimer lines is not
very realistic, but we have added them, nevertheless, to get a
better feeling for possible trends.

To obtain a stepped surface having a terrace width containing
$N_D$ dimer lines requires a miscut angle $\theta$ given by

$$\tan\theta = \frac{\sqrt{2}}{3 + 2N_D}. \qquad (9)$$

For the cases studied here, we list values of $\theta$ in Table I.
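
The tabulated miscut angles can be reproduced with a few lines of code. The sketch below assumes the relation $\tan\theta = \sqrt{2}/(3 + 2N_D)$, which is the form consistent with every $N_D$ entry of Table I; the function name is hypothetical.

```python
import math


def miscut_angle_deg(n_dimer_lines):
    """Miscut angle in degrees for terraces that are n_dimer_lines dimer lines wide,
    assuming tan(theta) = sqrt(2) / (3 + 2 * N_D)."""
    return math.degrees(math.atan(math.sqrt(2.0) / (3 + 2 * n_dimer_lines)))


if __name__ == "__main__":
    for n in range(6):
        print(n, round(miscut_angle_deg(n), 6))
```

In this form $N_D = 0$ corresponds to the (113) limit and $N_D \to \infty$ to the flat (001) surface.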

The coordinates of the sites will be given first in a base
frame spanned by the two orthogonal in-plane (110)-type directions
and the (001) direction. For all surfaces with $0 < N_D < \infty$
there are two dipoles in the bulk unit
cell and $2 + N_D$ dipoles in the free surface unit cell. The two


Table I. Miscut angle $\theta$ as a function of the number of dimer lines $N_D$.

$N_D$      Miscut angle $\theta$ (°)
0          25.239 402
1          15.793 169
2          11.421 754
3           8.930 142
4           7.326 037
5           6.208 545
∞           0.000 000
(111)      35.264 39

bulk  dipoles  are  located  at  (0,0,0)  and  at  ^^(^0,0)  with  respect 
to the origin of the bulk unit cell, where $a = \sqrt{2}\,a_0$, $a_0$ being
the conventional lattice parameter (5.43 Å for Si). Those
sites  will  not  be  affected  by  a  rotation  to  the  surface  oriented 
frame.  The  two  dipoles  marking  the  step  have  sites  r^’^, 
given  in  the  base  frame  by 

T^=a[0,i(l-^),0], 

T|=a[iKl-?v^),0].  (10) 

The $N_D$ terrace dipoles have sites given in the base frame
by 


+  (11) 

Those  sites  are  with  respect  to  the  origin  of  the  free  surface 
unit  cell.  The  origins  of  bulk  and  free  surface  unit  cell  are 
connected  by  At=— ay  and  the  third  vector  spanning  the 
bulk  unit  cell  is  in  the  base  frame  —ay.  From  these  data,  the 
proper  coordinates  in  the  (stepped)  surface  oriented  frame 
are  obtained  by  a  coordinate  transformation  over  as 
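described. In the surface frame the parallel translational symmetry
is governed by the spanning vectors $\mathbf{s}_1$ and $\mathbf{s}_2$:

$$\mathbf{s}_1 = a\,(1, 0, 0), \qquad \mathbf{s}_2 = a\left[\tfrac{1}{4},\ \tfrac{1}{4}\sqrt{2 + (3 + 2N_D)^2},\ 0\right]. \qquad (12)$$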


This description of the geometry does not include the two
limiting cases (001) and (113). The first should be obvious,
and the (113) case basically follows the treatment given here
but is halved in the (110) direction.

Fig. 5. Left: azimuthal plot of the pp reflectance as a function of $\Omega$. Right: top view of a separate step (atomic sites).

The last part of the description of the configuration used
concerns the polarizabilities. All polarizabilities will be derived
from bulk optical data using Clausius-Mossotti in the
following way:

$$\frac{\alpha}{\alpha_0} = \left(\frac{a_0}{a}\right)^3 \frac{3}{16\pi}\,\frac{\epsilon - 1}{\epsilon + 2}, \qquad (13)$$

where $\epsilon$ is the relative dielectric constant, $a_0/a$ depends on
the surface [(113) is different], and $\alpha_0 = 4\pi\epsilon_0 a^3$. The numerical
value for $\alpha_0$ equals $5.039\,857\,3 \times 10^{-38}$ F m$^2$ for all cases
except for (113). There the value is $6.299\,821 \times 10^{-38}$ F m$^2$.
Calculations  are  isotropic  if  only  isotropic  polarizabilities  are 
used  and  show  the  structural  contribution  to  the  anisotropy. 
To  investigate  a  possible  electronic  structure  influence,  an¬ 
isotropic  polarizabilities  are  required.  We  have  restricted  this 
anisotropy  to  the  surface  dipoles,  the  edge  and  terrace  ones. 
Taking the z direction parallel to the dimer axis, we introduce
anisotropy as

$$r_{\mathrm{Ani}} = \alpha_{zz}/\alpha_{xx,yy}, \qquad \alpha_{yy} = \alpha_{xx}. \qquad (14)$$

We have kept this intrinsic anisotropy modest in our calculations,
effectively only 2%. Hence, we use only $r_{\mathrm{Ani}} = 1.00$
(isotropic) and $r_{\mathrm{Ani}} = 1.02$ (anisotropic). The anisotropic
polarizability has to be rotated to its proper position. This
requires interchange of indices for the terrace dipoles. The
edge  dipoles  have  been  rotated  twice  by  means  of  an  Euler 
rotation.  First  they  have  been  given  a  rotation  over  the  Euler 
angles  ^=-45°,  ^=—45°  to  get  the  correct  values  for  the 
base  frame: 

<(^A/)=S  (i5) 

ij 

where $S$ is the Euler transformation matrix as given
by Goldstein. The second rotation, from the base frame to
the surface frame over the miscut angle $\theta$, is applied to the
entire system. After that, the polarizability tensors no longer
contain zero valued components. Numerical values for the
anisotropic components of the polarizability prior to rotation
are $\alpha_{zz} = 1.914\,052 \times 10^{-2} + i\,2.085\,819 \times 10^{-4}$ and
$\alpha_{xx,yy} = 1.876\,522 \times 10^{-2} + i\,2.044\,921 \times 10^{-4}$ in units of $\alpha_0$
[except (113)].
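
The polarizability input can be checked numerically with a short sketch. It assumes $a = \sqrt{2}\,a_0$, the unit $\alpha_0 = 4\pi\epsilon_0 a^3$, and a Clausius-Mossotti polarizability for one dipole per pair of atoms, $\alpha/\alpha_0 = (a_0/a)^3\,(3/16\pi)\,(\epsilon - 1)/(\epsilon + 2)$. The dielectric constant `eps_demo` is a placeholder chosen for illustration, not a value taken from the article, and the function names are hypothetical.

```python
import math

EPS0 = 8.8541878128e-12  # vacuum permittivity in F/m
A0_SI = 5.43e-10         # conventional Si lattice parameter a_0 in m (value quoted in the text)


def alpha_unit(a):
    """Polarizability unit alpha_0 = 4 pi eps0 a^3 in F m^2."""
    return 4.0 * math.pi * EPS0 * a**3


def cm_polarizability(eps, a0_over_a):
    """Clausius-Mossotti polarizability in units of alpha_0 (one dipole per two atoms)."""
    return a0_over_a**3 * 3.0 / (16.0 * math.pi) * (eps - 1.0) / (eps + 2.0)


a = math.sqrt(2.0) * A0_SI
alpha0 = alpha_unit(a)

eps_demo = 25.0 + 2.4j                       # placeholder dielectric constant, illustration only
alpha_xx = cm_polarizability(eps_demo, A0_SI / a)
alpha_zz = 1.02 * alpha_xx                   # modest intrinsic anisotropy of 2%
```

With $a_0 = 5.43$ Å this gives $\alpha_0 \approx 5.04 \times 10^{-38}$ F m$^2$, in line with the value quoted for all surfaces except (113).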


IV.  RESULTS 

Using the previously mentioned geometry and optical input
data, model calculations have been performed for perpendicular
incidence. We have used p-type polarization
for the incoming light, letting the direction of polarization
coincide with the anisotropic azimuth $\Omega$. The frequency for all
calculations has been $\hbar\omega = 3.0$ eV unless stated otherwise.
The polarizabilities were also calculated at this frequency.
The first results of this type (pp combinations) are shown in
Fig. 5, left panel. For those calculations, all surface dipoles
have been given anisotropic polarizabilities. The numbers
shown in boxes in the curves themselves indicate the number
of dimer lines in the reconstruction. The two limiting cases
are given by the label ∞ for (001), and 0 for (113). Since the
anisotropy is weak as compared to the average signal, an
offset of 0.46 has been subtracted to emphasize the results.
The right panel shows the atomic configuration of a single
$D_B$ step, shown in the same orientation as the left panel. It is
immediately clear that the pp plots are not exactly lattice
oriented. Neither the bulk lattice nor the step seems to control
the orientation of the plots.

Fig. 6. Left: top-top modulation of $R_{pp}$ (reference anisotropies indicated by horizontal lines). Right: anisotropic azimuth offset $\Delta\Omega$. Solid line: anisotropic
results. Dashed line: isotropic results.

The  maximum  value  of  the  pp  reflectance  is  obtained  for 
the  polarization  direction  approximately  perpendicular  to  the 
steps.  We  know  from  experience  that  single  linear  chains  of 
dipoles  have  increased  emission  when  the  polarization  direc¬ 
tion  is  parallel  to  the  chains.  If  we  assume  the  dimer  rows 
will  act  like  separate  chains  of  dipoles,  we  can  expect  that  for 
these  stepped  surfaces,  the  maximum  reflectance  is  obtained 
for  polarization  parallel  to  the  rows.  We  will  use  the  anisot¬ 
ropy  of  the  original  (001)  surface  as  a  reference  in  the  re¬ 
mainder  of  this  article.  It  is  surprising  to  see  that  this  refer¬ 
ence  anisotropy  is  not  the  maximum  anisotropy.  By  intuition, 
we  would  expect  the  interruption  of  the  dimer  rows  by  the 
steps  to  decrease  the  reference  anisotropy  of  the  unstepped 
surface.  Apparently  this  is  not  the  case.  Why  the  steps  some¬ 
times  happen  to  increase,  or  to  decrease  the  reference  anisot¬ 
ropy  has  no  simple  explanation.  It  is  clear,  nevertheless,  that 
steps  have  a  strong  influence  on  this  anisotropy.  They  can 
increase or decrease it by a factor of two. Finally, we notice
that the plots are tilted in the same direction as the shift of
the dimer rows belonging to consecutive terraces. This is
also expected.

The maximum terrace width shown here is for $N_D = 6$.
Technically we can go well beyond $N_D = 10$, but at the expense
of highly increased runtimes. The reason for the increase
is the number of dipoles required by the matching layer
between the free surface cell and the bulk normal mode regime.
This number increases from 60 to 90, going from
$N_D = 1$ to $N_D = 6$. Even those large numbers are not enough


to  keep  the  convergence  rate  at  the  level  observed  for  the 
(001)  and  (113)  limiting  cases.  We  have  always  used  such  a 
number  of  matching  dipoles  that  convergence  had  no  influ¬ 
ence  on  our  results. 

To have more direct access to the relevant parameters, we
have made a fit through the calculated data. We have used as
a fit function

$$R_{pp}(\Omega) = A + B\cos(N_\Omega\Omega) + C\sin(N_\Omega\Omega), \qquad (16)$$

where we have taken $N_\Omega = 2$. After performing the least
squares procedure, we arrived at a very good fit. The fit
parameters have subsequently been transformed to more directly
understandable data as follows:

$$R_{pp}(\Omega) = A + \Delta R_{tt}\sin[N_\Omega(\Omega + \Delta\Omega)], \qquad \Delta\Omega = \arctan(B/C)/N_\Omega. \qquad (17)$$
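
A minimal sketch of this fitting step is given below. It assumes the azimuths are sampled on a uniform grid over a full turn, in which case the least squares fit of $A + B\cos(N_\Omega\Omega) + C\sin(N_\Omega\Omega)$ reduces to simple averages, and it assumes the phase convention $A + \Delta R\,\sin[N_\Omega(\Omega + \Delta\Omega)]$, for which $B = \Delta R\sin(N_\Omega\Delta\Omega)$ and $C = \Delta R\cos(N_\Omega\Delta\Omega)$. The synthetic data and function names are hypothetical.

```python
import math


def fit_harmonic(omegas, values, n_omega):
    """Least squares A, B, C for A + B cos(n * Omega) + C sin(n * Omega).

    Assumes a uniform azimuth grid covering a full turn, where the three basis
    functions are mutually orthogonal and the fit reduces to averages.
    """
    m = len(omegas)
    a = sum(values) / m
    b = 2.0 * sum(v * math.cos(n_omega * w) for w, v in zip(omegas, values)) / m
    c = 2.0 * sum(v * math.sin(n_omega * w) for w, v in zip(omegas, values)) / m
    return a, b, c


def modulation_and_shift(b, c, n_omega):
    """Convert (B, C) to a modulation amplitude and an azimuthal shift.

    atan2(B, C) equals arctan(B / C) whenever C > 0 and also resolves the quadrant.
    """
    return math.hypot(b, c), math.atan2(b, c) / n_omega


# Synthetic reflectance with known parameters (illustrative numbers only).
N_OMEGA = 2
grid = [2.0 * math.pi * k / 72 for k in range(72)]
data = [0.46 + 0.01 * math.sin(N_OMEGA * (w + 0.1)) for w in grid]

offset, b_fit, c_fit = fit_harmonic(grid, data, N_OMEGA)
amplitude, shift = modulation_and_shift(b_fit, c_fit, N_OMEGA)
```

The same two functions apply unchanged to a four-period signal by passing `n_omega=4`.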

We will focus upon the top-top modulation $\Delta R_{tt}$ and the
azimuthal shift $\Delta\Omega$. The solid curves in Fig. 6 belong to the
anisotropic cases, shown as azimuthal plots in the left panel
of Fig. 5. From the top-top modulation, we learn that the
anisotropy is not a smooth function of the miscut angle or
terrace width. There is something like an even-odd behavior
superimposed on top of a simple curve having a single maximum.
This maximum shows up for about 3 dimer lines of
terrace width. It turns out that for pp reflectance the top-top
modulation cannot be used experimentally as an indicator for
the terrace width. The same holds for the azimuthal shift.
The influence of the azimuthal shift $\Delta\Omega$ has been explicitly
omitted by Yasuda and Aspnes in their article.^ The only
misoriented (001) surface in their paper has a miscut corresponding
to a terrace width of 9 dimer lines, just beyond the
region where we find the strongest shifts. This justifies their
assumptions. The minimum of the azimuthal shift is at
$N_D = 2$, and these shifts arrive faster at the reference level
(≈0) for increasing terrace width than the corresponding
top-top modulation.

To  figure  out  how  much  of  the  effects  found  is  structural 
(geometry,  dipole  sites),  and  how  much  is  electronic  (polar¬ 
izability),  we  have  repeated  the  calculations  with  isotropic 
polarizabilities  throughout.  Those  isotropic  results  are  also 
shown  in  Fig.  6  as  the  dashed  curves.  Although  the  results 
clearly  change,  the  overall  pattern  remains  unaffected.  Only 
for the azimuthal shift $\Delta\Omega$, we see that for the (113) case the
tilt disappears. This also happens for some terrace widths,
but there the shift is already very low. For the values of $r_{\mathrm{Ani}}$
used in this article (about 1.02), the anisotropy of stepped
surfaces is predominantly structural. For a larger value of
$r_{\mathrm{Ani}}$ stronger electronic contributions will be expected. From
these calculations we get a first impression how a larger $r_{\mathrm{Ani}}$
will  affect  the  anisotropy.  Better  polarizability  data  have  to 
be  obtained  from  other  calculations. 

Since  most  of  the  observed  behavior  has  to  be  understood 
as  structural,  we  have  also  tried  to  figure  out  whether  wig¬ 
gling  plays  a  role.  Those  rapid  oscillations  of  the  dipole 
strength  have  been  observed  by  us  before^  and  we  associate 
them  with  structural  effects.  The  new  phenomenon,  which 
might  play  a  role  here  is  lateral  wiggling,  wiggling  along  the 
surface.  Previous  observations  of  wiggling  were  always  per¬ 
pendicular  to  the  surface.  The  even-odd  behavior  of  the 


Fig. 7. Modulus of dipole strength $|p_i|$ as a function of $y/a$. The $|p_i|$-axis values are
scaled by a power of ten.


top-top modulation might be caused by it. Since wiggling
depends strongly on frequency, we have repeated the anisotropic
calculations with $\hbar\omega = 6.0$ eV. The results indicate that
the  influence  of  wiggling  is  weak.  A  more  direct  observation 
is  given  by  the  individual  dipole  strengths  (in  Fig.  7).  All 
dipole  strengths  of  the  free  surface  layer  are  shown  as  a 
function  of  the  y -coordinate.  Indeed  lateral  wiggling  with  a 
two-fold  period  can  be  observed. 

Discrete  dipole  calculations  depend  critically  on  the 
choice  of  the  dipole  geometry.  To  verify  how  sensitive  our 
present  calculations  are  as  to  this  point,  we  have  repeated  the 
calculations  with  the  edge  dipole  moved  to  the  position  of 


Fig. 8. Left: top-top modulation of $R_{pp}$ (horizontal line: reference). Right: anisotropic azimuth offset $\Delta\Omega$. Anisotropic results, $\hbar\omega = 3.0$ eV, edge dipole at
rebonded atom position.

Fig. 9. Azimuthal plot of the ps reflectance as a function of $\Omega$.


the  rebonded  atom.  To  remove  asymmetry  due  to  the  tilting 
of  the  polarizability  tensor  of  this  site,  we  have  given  it  the 
same  polarizability  tensor  as  the  dimers.  The  results  are 
shown  in  Fig.  8.  There  are  a  few  major  differences.  The 
top-top modulation for the $N_D = 1$ case has increased by a
factor of 3, the anisotropy for the $N_D = 0$ case has virtually
disappeared, and the azimuthal shifts have increased systematically
by a factor of 2. Those changes are large, but the
overall  behavior  has  not  been  affected  by  the  shift  of  the 
edge  dipole.  When  the  changes  are  not  significant. 

For the stepped surfaces treated here, another striking new feature of the calculation is that the polarization no longer remains in the plane of the incoming polarization. For this reason, we have also investigated the ps-type (crossed polarizer) configuration. The azimuthal results for the anisotropic case are shown in Fig. 9. The result is a series of four-lobed patterns, obtained without using an offset. At first glance, those patterns look fundamentally different from the ones shown in Fig. 5, but the difference is only apparent. The major point is that there is really no offset. The complex ps-reflection coefficient has the same two-period behavior as the pp component, but since it is symmetric around zero, the modulus procedure $R = r^{*}r$ turns it into a four-period phenomenon.
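
The period doubling described above can be illustrated with a minimal numerical sketch. It is not the discrete dipole calculation of this article: the amplitudes `r_pp` and `r_ps` below are hypothetical, chosen only so that both have a twofold azimuthal dependence, with `r_pp` carrying a large offset and `r_ps` being symmetric around zero.

```python
import numpy as np

def dominant_harmonic(signal):
    """Index of the strongest Fourier harmonic over one full azimuthal turn."""
    spectrum = np.abs(np.fft.rfft(signal - signal.mean()))
    return int(np.argmax(spectrum))

# Azimuth over one full rotation; endpoint excluded so the FFT sees exactly one turn.
phi = np.linspace(0.0, 2.0 * np.pi, 360, endpoint=False)

# Hypothetical complex reflection amplitudes (illustrative values, not from the article).
modulation = 0.01 * np.exp(0.3j) * np.cos(2.0 * phi)  # twofold, symmetric around zero
r_pp = 0.60 + modulation                              # large offset plus twofold term
r_ps = modulation                                     # no offset at all

# Modulus procedure R = r* r
R_pp = (np.conj(r_pp) * r_pp).real
R_ps = (np.conj(r_ps) * r_ps).real

print(dominant_harmonic(R_pp), dominant_harmonic(R_ps))  # prints: 2 4
```

With an offset, the cross term between offset and modulation dominates and the reflectance keeps two periods per turn; without it, only the squared modulation survives and the reflectance shows four.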

The ps-type reflectances have again been fitted by means of Eq. (16), now using $N_\phi = 4$. The results obtained this way are shown in Fig. 10. Again the solid lines represent the anisotropic cases. A major result shows up already on inspection of the top-top modulation. For $N_D > 2$ this modulation is a monotonic function of $N_D$. The even-odd behavior is absent. Compared to the pp counterparts, the modulation has dropped by two orders of magnitude. Further, the $N_D = \infty$ [original (001)] results turn out to be the highest now. Only in the results for the azimuthal shift is the even-odd behavior still visible, but it is not as pronounced as for the pp cases. The differences between the pp and ps azimuthal shift data are smaller than those between the corresponding top-top modulation data. We have also


Fig. 10. Left: top-top modulation of $R_{ps}$ (horizontal lines: reference). Right: anisotropic azimuth offset. Solid line: anisotropic results. Dashed line: isotropic results.


JVST  B  -  Microelectronics  and  Nanometer  Structures 




de  Boeij,  Wijers,  and  Zoethout:  Anisotropic  optical  reflection 




repeated  the  calculations  using  isotropic  polarizabilities. 
Again  the  influence  is  small. 

The major conclusion from the ps-type reflection coefficients concerns the way experiments can be done. Although the ps-type results are two orders of magnitude weaker than the pp-type results, those data can be obtained without offset. This means that the full sensitivity of the detection system can be used. If our calculations turn out to be reliable, the ps results can be correlated more easily with the terrace width, since the ps-response curve is monotonic.

V.  CONCLUSION 

In this article, we have shown a number of model calculations based on the discrete dipole model for the optical response of stepped surfaces. We have shown that it is possible to perform those calculations for terrace widths of up to 10 dimer lines. Most of the step influence is gone for terraces more than 6 dimer lines wide. Measurements on Si(001) with up to 4° miscut need no correction for those effects. For higher miscut angles, the influence of steps becomes too strong to be ignored. The step contribution to the anisotropy can be as large as the anisotropy of the original Si(001) 2×1 surface. That is the regime where interesting fundamental work can be done. The point where our calculations and the experimental data possibly deviate will give an indication of the effective range of the electrodynamic interaction. This calls for further experimental work in this direction. For perpendicular incidence, as studied here, the main axes of the polarization plots do not follow the crystallography of the surfaces. From our calculations it follows that the easier way to measure the influence of steps on optics is by measuring in crossed polarizer (ps) mode. The expected advantages are better sensitivity and easier interpretation of results.


^1 T. Yasuda, D. E. Aspnes, D. R. Lee, C. H. Bjorkman, and G. Lucovsky, J. Vac. Sci. Technol. A 12, 1152 (1994).
^2 L. Mantese, U. Rossow, and D. E. Aspnes, Proceedings of the ACSI-3 Conference, 1995 (to be published).
^3 G. P. M. Poppe, H. Wormeester, A. Molenbroek, C. M. J. Wijers, and A. van Silfhout, Phys. Rev. B 43, 12 122 (1991).
^4 Wang, W. L. Mochan, and R. G. Barrera, Phys. Rev. B 42, 9155 (1990).
^5 G. P. M. Poppe and C. M. J. Wijers, Physica B 167, 221 (1990).
^6 C. M. J. Wijers and G. P. M. Poppe, Phys. Rev. B 46, 7605 (1992).
^7 G. P. M. Poppe, C. M. J. Wijers, and A. van Silfhout, Phys. Rev. B 44, 7917 (1991).
^8 D. J. Chadi, Phys. Rev. Lett. 59, 1691 (1987).
^9 H. J. W. Zandvliet (private communication).
^10 W. L. Mochan and R. G. Barrera, Phys. Rev. Lett. 55, 1192 (1985).
^11 A. A. Baski, Naval Research Laboratory, Washington DC (to be published).
^12 R. Bersohn, Y. Pao, and H. L. Frisch, J. Chem. Phys. 45, 3184 (1966).
^13 H. Goldstein, Classical Mechanics (Addison-Wesley, Reading, MA, 1974).


J.  Vac.  Sci.  Technol.  B,  Vol.  14,  No.  4,  Jul/Aug  1996 


Reflection anisotropy spectroscopy, surface photovoltage spectroscopy,
and contactless electroreflectance investigation of the
InP/In0.53Ga0.47As(001) heterojunction system

M. Leibovitch, P. Ram, L. Malikova, and Fred H. Pollak^a)

Physics  Department  and  New  York  State  Center  for  Advanced  Technology  in  Ultrafast  Photonic  Materials 
and  Applications,  Brooklyn  College  of  the  City  University  of  New  York,  Brooklyn,  New  York  11210 

J.  L.  Freeouf 

Interface  Studies,  Inc.,  Katonah,  New  York  10536-0982 

L.  Kronik,  B.  Mishori,  and  Yoram  Shapira 

Department  of  Electrical  Engineering,  Tel  Aviv  University,  Ramat-Aviv  69978,  Israel 

A.  R.  Clawson 

ECE  Department-0407,  University  of  California  San  Diego,  La  Jolla,  California  92093 

C.  M.  Hanson 

NCCOCS  RDTE  Div  555,  San  Diego,  California  92152-5070 
(Received  24  January  1996;  accepted  20  April  1996) 

Using the optical methods of reflection anisotropy spectroscopy, surface photovoltage spectroscopy, and contactless electroreflectance, we have conducted an ex situ investigation of (a) the InP/In0.53Ga0.47As(001) heterojunction interface as a function of InP overlayer thickness (50-1000 nm) and (b) the surfaces of n- and p-doped In0.53Ga0.47As(001). All samples were fabricated by organometallic vapor phase epitaxy. The results from these optical probes make it possible to form a comprehensive quantitative picture of the InP/InGaAs heterojunction, including conduction and valence band offsets of 275 and 325 meV, respectively, as well as the (001) surface of InGaAs (surface Fermi level ~200 mV from the conduction band edge). © 1996 American Vacuum Society.


I.  INTRODUCTION 

The technological importance of InP/In0.53Ga0.47As heterojunctions is due to their integral part in InP-based microelectronic and photonic devices. Two microelectronic devices in particular, InP/InGaAs high electron mobility
transistors  (HEMTs)  and  heterojunction  bipolar  transistors 
(HBTs),  have  impressive  microwave  properties  including 
high  frequency  gain,  noise  characteristics,  and  power  han¬ 
dling  capacity.  InP-based  HEMTs  are  being  pursued  in  inte¬ 
grated  circuits  (ICs)  such  as  60  GHz  low  noise  amplifiers,^ 
and  likewise  HBTs  are  of  interest  in  ICs  such  as  high  reso¬ 
lution,  high  speed  analog  to  digital  converters.^  InP-based 
photonic  devices  can  uniquely  meet  today’s  needs  for  optical 
transport  and  access  networks.  Lasers  have  been  pursued 
commercially  for  some  time  and  photonic  integrated  circuits 
(PICs)  are  also  being  developed.  Examples  of  PICs  that  have 
been  recently  reported  are  distributed  feedback  laser  inte¬ 
grated  with  an  electroabsorption  modulator  for  long  haul 
transmission  at  2.5  GHz,  and  electro-optic  integrated  circuits 
for  receivers  capable  of  wide  bandwidths  of  6  GHz. 

The  device  properties  of  InP/InGaAs  heterojunction  de¬ 
vices  are  strongly  influenced  by  the  electronic  and  optical 
properties  of  the  interface.^  Better  understanding  of  this  het¬ 
erojunction  can  potentially  improve  the  performance  of  de¬ 
vices  even  further.  Perhaps  more  importantly  it  can  aid  in  the 


^a) Also at the Graduate School and University Center of the City University
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commercial  pursuit  of  InP-based  devices  where  the  emphasis 
is  on  reproducibility,  reliability,  circuit  yield,  and  increased 
IC  complexity  at  the  lowest  possible  cost. 

Using the optical methods of reflection anisotropy spectroscopy (RAS), surface photovoltage spectroscopy (SPS), and contactless electroreflectance (CER), we have conducted an ex situ investigation of the InP/In0.53Ga0.47As(001) heterojunction interface as a function of InP overlayer thickness (50, 100, 150, 200, 250, 300, and 1000 nm). The results from these optical probes make it possible to form a comprehensive quantitative picture of the InP/InGaAs heterojunction. To gain further information about this heterojunction we also have studied the surfaces of n- and p-doped In0.53Ga0.47As using RAS and CER. In the former system we have (a) determined the conduction ($\Delta E_c$) and valence ($\Delta E_v$) band offsets to be 275 and 325 meV, respectively, and (b) observed the evolution of the interface electric field with InP overlayer dimension. The behavior of the interface electric field with InP coverage indicates a lack of significant interface charge and has been explained by a self-consistent Poisson calculation of the properties of the heterojunction. The CER signal from the direct band gap of the InP confirmed the low doping level in this material. Also, the observed below band gap CER features may be related to the presence of some As in the InP, as suspected from the growth conditions. For the InGaAs surface we have (a) observed that the RAS signal increases linearly with the square root of the doping level (and hence surface electric field) due to the linear electro-optic effect associated with the $E_1$, $E_1 + \Delta_1$ optical features and (b) evaluated the surface Fermi level to be 200 mV below the conduction band edge.

Reflection anisotropy spectroscopy measures the polarization anisotropy of light linearly polarized along the [110] and $[1\bar{1}0]$ principal axes in the plane of the (001) surface of zincblende-type semiconductors. It has been demonstrated that RAS can be employed to determine the sign and magnitude of near surface electric fields from the anisotropy produced by the linear electro-optic effect in the vicinity of the spin-orbit split $E_1$, $E_1 + \Delta_1$ optical features. Our work demonstrates that RAS can be employed to gain information about the electric fields at buried interfaces in addition to its previous applications to (a) the in situ surface chemistry of MBE and OMCVD growth and (b) surface electric fields. SPS employs a Kelvin probe for the measurement of the changes in surface potential as a function of heterojunction overlayer thickness. This method produces a unified picture of the potential profile in the entire structure including band offsets, interface and bulk defect states, as well as net carrier concentrations. CER is a contactless form of electromodulation in which the modulating electric field is applied to the sample using a capacitorlike configuration. The observed spectra, which may display Franz-Keldysh oscillations (FKOs), yield information about band gaps and built-in electric fields (from the FKOs).

II.  EXPERIMENTAL  DETAILS 

The samples studied in this investigation, all fabricated by organometallic vapor phase epitaxy (OMVPE), consisted of two groups of material: (a) 1.3-μm-thick n-(Si)- and p-(Zn)-doped In0.53Ga0.47As/InP(001) and (b) 1-μm-thick n-type Si-doped ($1\times10^{17}$ cm$^{-3}$) In0.53Ga0.47As layers grown on
(1.1X10^^  cm“^)  Sn-doped  (001)  InP  substrates,  with  un¬ 
doped  (expected  unintentional  n  doping)  layers  of  InP  (50, 
100,  150,  200,  250,  300,  and  1000  nm)  on  top  of  the  InGaAs. 
For the former set the doping concentrations were 4, 10, 30, 200, 1000, 5000, and 8000 (in units of $1\times10^{15}$ cm$^{-3}$) for the n-type samples and 10, 20, 300, and 7000 (in units of $1\times10^{15}$ cm$^{-3}$) for the p doping. The former were grown on
(3X10^^  cm“^)  S-doped  InP  substrates  while  the  latter  were 
fabricated  on  n  (2X10^^  cm“^)  InP.  For  the  CER  measure¬ 
ments,  the  modulating  voltage  was  ±500  V  at  200  Hz. 

The samples were grown in a 50-mm-diam horizontal, lamp-heated, quartz chamber reactor at 650 °C, 100 Torr. The sources were trimethyl indium, trimethyl gallium, arsine, and phosphine for the InP and InGaAs growth and silane diluted in H2 and dimethylzinc for the n and p doping, respectively.

III.  EXPERIMENTAL  RESULTS 

A. n- and p-doped In0.53Ga0.47As samples
1. Reflection anisotropy difference measurements

Reflection difference anisotropy experiments have been used to study the surface electric fields in n- and p-doped GaAs(001) and ZnSe(001) as well as InSb and GaAs(110). In zincblende-type semiconductors the RAS signal in the vicinity of the $E_1$, $E_1 + \Delta_1$ optical features [interband transitions along the $\Lambda$ lines ($\langle 111 \rangle$ directions) of the Brillouin zone] contains a linear electro-optic effect. Also, there may be a background signal due to any improper alignment of the modulator and analyzer. Since it is the linear electro-optic effect that provides the information of interest, we would like to eliminate the influence of the background term.

Fig. 1. $(\Delta R/R)^-_i - (\Delta R/R)^-_{1\times10^{16}}$ for a series of n-doped InGaAs samples: $n = 3\times10^{16}$ cm$^{-3}$ (curve a), $2\times10^{17}$ cm$^{-3}$ (curve b), $1\times10^{18}$ cm$^{-3}$ (curve c), and $8\times10^{18}$ cm$^{-3}$ (curve d). The energies of the $E_1$ (2.57 eV) and $E_1 + \Delta_1$ (2.82 eV) optical transitions in In0.53Ga0.47As are denoted by arrows at the bottom of the figure.

We define $(\Delta R/R)_a$ as

$$(\Delta R/R)_a = \frac{R_{[110]} - R_{[1\bar{1}0]}}{R_{[110]} + R_{[1\bar{1}0]}}, \tag{1a}$$

where $R_{[110]}$ and $R_{[1\bar{1}0]}$ are the reflectivities for light polarized along the [110] and $[1\bar{1}0]$ directions, respectively. On the other hand, the signal $(\Delta R/R)_b$ is

$$(\Delta R/R)_b = \frac{R_{[1\bar{1}0]} - R_{[110]}}{R_{[1\bar{1}0]} + R_{[110]}}. \tag{1b}$$

By taking the difference between these two signals, i.e., $(\Delta R/R)^- = (\Delta R/R)_a - (\Delta R/R)_b$, we can eliminate any background terms. We term $(\Delta R/R)^-$ the RAS spectrum.
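
The background cancellation can be made concrete with a short sketch. It assumes, as the text implies but does not state explicitly, that the alignment background adds the same amount to both measured signals; the function name and the numerical values are illustrative only.

```python
def ras_signals(r_110, r_1m10, background):
    """Return the measured signals a and b and their difference.

    r_110, r_1m10 : reflectivities for light polarized along [110] and [1-10]
    background    : hypothetical additive alignment background, assumed to be
                    identical in both measurements
    """
    a = (r_110 - r_1m10) / (r_110 + r_1m10) + background   # Eq. (1a) plus background
    b = (r_1m10 - r_110) / (r_1m10 + r_110) + background   # Eq. (1b) plus background
    return a, b, a - b

# Same anisotropy, two different backgrounds: the difference signal is unchanged.
_, _, diff_clean = ras_signals(0.310, 0.300, background=0.0)
_, _, diff_offset = ras_signals(0.310, 0.300, background=0.02)
print(diff_clean, diff_offset)
```

Because the anisotropic term changes sign between the two signals while the background does not, the difference is twice the anisotropic term and carries no background.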

Plotted in Fig. 1 is $(\Delta R/R)^-_i - (\Delta R/R)^-_{1\times10^{16}}$ in the range 1.5-4 eV for a series of n-doped samples, where $(\Delta R/R)^-_{1\times10^{16}}$ is the RAS signal for the $1\times10^{16}$ cm$^{-3}$ material and $i = 3\times10^{16}$ cm$^{-3}$ (curve a), $2\times10^{17}$ cm$^{-3}$ (curve b), $1\times10^{18}$ cm$^{-3}$ (curve c), and $8\times10^{18}$ cm$^{-3}$ (curve d). The energies of the $E_1$ (2.57 eV) and $E_1 + \Delta_1$ (2.82 eV) optical transitions in unstrained In0.53Ga0.47As are denoted by arrows at the bottom of the figure. Note that the amplitude (peak-to-valley) of the signal increases as the doping concentration (and hence surface electric field) is raised, in a manner similar to that observed in GaAs(001).

In Fig. 2 is displayed $(\Delta R/R)^-_i - (\Delta R/R)^-_{1\times10^{16}}$ for a set of p-type samples with $i = 2\times10^{16}$ cm$^{-3}$ (curve a), $3\times10^{17}$ cm$^{-3}$ (curve b), and $7\times10^{18}$ cm$^{-3}$ (curve c). The energies of the $E_1$ (2.57 eV) and $E_1 + \Delta_1$ (2.82 eV) optical transitions in unstrained In0.53Ga0.47As are denoted by arrows at the bottom of the figure. Again there is a shift in oscillator strength from the $E_1 + \Delta_1$ to the $E_1$ structure as the doping concentration, and hence surface electric field, is increased. However, note that the phase of the RAS signals is reversed in the n- and p-type samples, thus making it possible to determine the nature of the band bending at the surface/interface.

Fig. 2. $(\Delta R/R)^-_i - (\Delta R/R)^-_{1\times10^{16}}$ for a set of p-type InGaAs samples with $p = 2\times10^{16}$ cm$^{-3}$ (curve a), $3\times10^{17}$ cm$^{-3}$ (curve b), and $7\times10^{18}$ cm$^{-3}$ (curve c). Horizontal axis: photon energy (eV). The energies of the $E_1$ (2.57 eV) and $E_1 + \Delta_1$ (2.82 eV) optical transitions in In0.53Ga0.47As are denoted by arrows at the bottom of the figure.

We have found that the amplitude of the RAS signal increases linearly with the square root of the carrier concentration (surface electric field), as reported for GaAs(001) in Refs. 5 and 6.

2.  Contactless  electroreflectance 

In order to obtain more information about the surface electric fields in these samples we also have performed CER measurements in the vicinity of the direct gap ($E_0$) at 300 K. Shown in Fig. 3 is the CER spectrum from the $2\times10^{17}$ cm$^{-3}$ n-doped material. The data exhibit well-defined FKOs from which it is possible to evaluate the surface electric field $F_s$. The positions of the $N$th extrema in the FKOs are given by

$$N\pi = \frac{4}{3}\left[\frac{E_N - E_0}{\hbar\Theta}\right]^{3/2} + \chi, \tag{2a}$$

where the electro-optic energy $\hbar\Theta$ is

$$(\hbar\Theta)^3 = \frac{q^2\hbar^2F^2}{2\mu_\parallel}, \tag{2b}$$

and $E_N$ is the photon energy of the $N$th extremum, $E_0$ is the band gap, $F$ is the electric field, $\mu_\parallel$ is the reduced interband effective mass (0.0376 in units of the free electron mass) in the direction of $F$, and $\chi$ is an arbitrary phase factor. It has been shown that for the nonuniform field in a space charge layer the field evaluated from the FKOs is $F_s$. This is the
Fig. 3. Contactless electroreflectance spectrum of an n-doped ($2\times10^{17}$ cm$^{-3}$) InGaAs sample.

case  when  the  ac  modulating  field  F^^^OASF^ ,  independent 
of the penetration depth of the light. The obtained value is $F_s = 90 \pm 10$ kV/cm.
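
How a field is extracted from FKO extrema with Eqs. (2a) and (2b) can be sketched as follows. Rearranging Eq. (2a) shows that $(4/3\pi)(E_N - E_0)^{3/2}$ is linear in $N$ with slope $(\hbar\Theta)^{3/2}$, so a straight-line fit gives $\hbar\Theta$ and Eq. (2b) then gives $F$. The extrema below are synthetic, generated from an assumed field, and the function names are illustrative; this is not the authors' fitting procedure or data.

```python
import numpy as np

Q = 1.602176634e-19      # elementary charge (C)
HBAR = 1.054571817e-34   # reduced Planck constant (J s)
M0 = 9.1093837015e-31    # free electron mass (kg)

def electro_optic_energy(field, mu_rel):
    """hbar*Theta in joules from Eq. (2b); field in V/m, mu_rel in units of M0."""
    return (Q**2 * HBAR**2 * field**2 / (2.0 * mu_rel * M0)) ** (1.0 / 3.0)

def field_from_fko(extrema_ev, gap_ev, mu_rel):
    """Fit Eq. (2a) to FKO extrema energies (eV, indexed N = 1, 2, ...); return F in V/m."""
    n = np.arange(1, len(extrema_ev) + 1)
    y = (4.0 / (3.0 * np.pi)) * (np.asarray(extrema_ev) - gap_ev) ** 1.5
    slope, _ = np.polyfit(n, y, 1)           # slope = (hbar*Theta)^(3/2) in eV^(3/2)
    theta_j = slope ** (2.0 / 3.0) * Q       # hbar*Theta converted from eV to J
    return np.sqrt(2.0 * mu_rel * M0 * theta_j**3) / (Q * HBAR)

# Synthetic extrema for an assumed field of 90 kV/cm and an arbitrary phase chi.
F_TRUE, MU, E0, CHI = 9.0e6, 0.0376, 0.75, 0.5
theta_ev = electro_optic_energy(F_TRUE, MU) / Q
n = np.arange(1, 6)
extrema = E0 + theta_ev * (0.75 * (n * np.pi - CHI)) ** (2.0 / 3.0)

print(field_from_fko(extrema, E0, MU) / 1e5, "kV/cm")
```

The phase $\chi$ only shifts the intercept of the fitted line, which is why it does not need to be known to obtain the field.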

The surface Fermi level $V_F$ (relative to the conduction band edge) can be evaluated from the relation

$$F_s = \left[\frac{2qN_d}{\epsilon_0\kappa}\left(V_F - V_b - \frac{kT}{q}\right)\right]^{1/2}, \tag{3}$$

where $N_d$ is the donor concentration, $\epsilon_0$ is the permittivity of free space, $\kappa$ (=14) is the static dielectric constant, and $V_b$ is the bulk Fermi position, relative to the conduction band edge. We have neglected any photovoltaic effects. Even though CER does not employ a pump beam (to produce the modulation) there still may be a photovoltaic effect (typically about 50-100 meV at 300 K) due to the probe light. From Eq. (3) with $V_b = 15$ mV we find $V_F = 200 \pm 40$ mV (relative to the conduction band), in reasonable agreement with the barrier height reported by Tiwari and Frank.
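
Eq. (3) is easy to check numerically. The sketch below reads it as the depletion-layer relation $F_s = [(2qN_d/\epsilon_0\kappa)(V_F - V_b - kT/q)]^{1/2}$, where the symbol names $V_F$ (surface Fermi level) and $V_b$ (bulk Fermi position) are labels chosen here, and it assumes a donor concentration of $2\times10^{17}$ cm$^{-3}$ for the sample of Fig. 3. Under those assumptions the quoted 200 mV and 15 mV reproduce a field close to the measured 90 kV/cm.

```python
import math

Q = 1.602176634e-19       # elementary charge (C)
EPS0 = 8.8541878128e-12   # permittivity of free space (F/m)
KB = 1.380649e-23         # Boltzmann constant (J/K)

def surface_field(n_d_cm3, v_f, v_b, kappa=14.0, temp=300.0):
    """Surface field in V/m from Eq. (3); potentials in volts, doping in cm^-3."""
    n_d = n_d_cm3 * 1e6                      # cm^-3 -> m^-3
    barrier = v_f - v_b - KB * temp / Q      # effective band bending (V)
    return math.sqrt(2.0 * Q * n_d * barrier / (EPS0 * kappa))

def surface_fermi_level(field, n_d_cm3, v_b, kappa=14.0, temp=300.0):
    """Invert Eq. (3): surface Fermi level in volts from a measured field in V/m."""
    n_d = n_d_cm3 * 1e6
    return field**2 * EPS0 * kappa / (2.0 * Q * n_d) + v_b + KB * temp / Q

# Assumed inputs: N_d = 2e17 cm^-3, V_F = 0.200 V, V_b = 0.015 V.
f_s = surface_field(2e17, 0.200, 0.015)
print(f_s / 1e5, "kV/cm")                                  # close to 90 kV/cm
print(surface_fermi_level(9.0e6, 2e17, 0.015) * 1e3, "mV") # close to 200 mV
```

The inverse function is the form actually needed in practice, since the field is the measured quantity and the surface Fermi level is the unknown.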

B. InP/In0.53Ga0.47As system

1.  Surface  photovoltage  spectroscopy  measurements 

The  SPS  response  of  the  above  structure  has  been  studied 
numerically  by  simultaneously  solving  the  continuity  and 
Poisson  equations.  The  details  of  the  simulation  program 
have  been  elaborated  elsewhere.  The  calculated  SPS  re¬ 
sponse  for  several  overlayer  thicknesses  is  shown  in  Fig.  4, 
using  values  of  the  absorption  coefficient  and  carrier  lifetime 
taken  from  Ref.  18.  The  spectra  of  Fig.  4  display  two  distinct 
spectral  regions  according  to  the  dominant  absorption  site: 
Below the band gap of the InP overlayer (~1.35 eV), the
photons  are  absorbed  in  the  InGaAs  layer.  Above  that  energy, 
they  are  absorbed  in  both  the  InP  overlayer  and  the  InGaAs 
layer.  With  increased  overlayer  thickness  both  the  shape  of 
the  spectra  and  their  magnitude  undergo  very  significant 
changes.  It  is  of  particular  importance  to  note  that  the 
“knee”  in  the  spectrum  at  about  1.35  eV,  clearly  seen  in  the 
spectra  of  samples  with  relatively  thick  overlayers,  is  com¬ 
pletely  absent  from  the  spectra  of  samples  with  thinner  ones. 

Furthermore, in order to test the sensitivity of the spectral features to the values of the various parameters we have performed additional simulations using parameters (absorption coefficient and carrier lifetime) deviating from the standard ones by plus or minus an order of magnitude. We find that while these quantities may significantly influence the amplitude of the SPS spectrum, they have no appreciable effect on the critical thickness, $W_c$, at which the above mentioned “knee” appears. However, changing the value of the band offsets has a very large effect on $W_c$.

Fig. 4. Numerical simulation of SPS spectra of the InP/InGaAs(001) structure for various InP overlayer thicknesses (for clarity the curve for 200 nm is down shifted by 20 mV). Horizontal axis: photon energy (eV). Inset: dependence of the spectral “knee” at 1.35 eV vs InP overlayer thickness for $\Delta E_c = 275$ meV and $\Delta E_c = 225$ meV.

The magnitude of the above mentioned “knee” as a function of overlayer thickness for both $\Delta E_c = 275$ meV (used for the main portion of Fig. 4) and $\Delta E_c = 225$ meV is shown in the inset of Fig. 4. It is easily seen that this “knee” indeed appears quite abruptly at a critical thickness $W_c$, and also that $W_c$ is critically dependent on $\Delta E_c$. Thus, once $W_c$ has been determined experimentally, the band offsets may be found by numerical fitting.

The experimental SPS spectra for several overlayer thicknesses are shown in Fig. 5. Three regimes are apparent, i.e., photon energies below 0.75 eV ($E_0$ of InGaAs), photon energies between the InGaAs and InP band gaps, and photon energies above the InP band gap. The first regime clearly corresponds to absorption in a defect gap state (it does not appear in the numerical simulation since no specific defect gap states were included in it). The second and third regimes are very similar to those obtained in the numerical simulation. All features of the numerical simulation are also observed in the real experiment. These include a very large contribution from the InGaAs layer, which is not to be expected in the absence of charge transfer into the InP overlayer, and a critical thickness ($W_c = 150$-200 nm) below which the InP-related “knee” does not appear in the spectrum. Further simulations which use $\Delta E_c$ and $\Delta E_v$ as fitting parameters show that the best fit for $W_c$ is obtained for $\Delta E_c = 275 \pm 30$ meV and $\Delta E_v = 325 \pm 30$ meV. The error results mainly from the experimental uncertainty in the precise value of $W_c$. These values are in excellent agreement with values reported in the literature. Note that since the dependence of $W_c$ on the band offsets is very strong, a rather large uncertainty in $W_c$ leads to a rather small error in the band offsets, clearly an advantage of the proposed approach. The full details of such SPS measurements and analysis will be the subject of a future work.

Fig. 5. Experimental SPS spectrum of the InP/InGaAs(001) structure for several InP overlayer thicknesses.

Fig. 6. The CER spectra in the vicinity of $E_0$ of InGaAs for (a) bare n-InGaAs ($n = 1\times10^{17}$ cm$^{-3}$), (b) InP(50 nm)/InGaAs ($n = 1\times10^{17}$ cm$^{-3}$), (c) InP(300 nm)/InGaAs ($n = 1\times10^{17}$ cm$^{-3}$), and (d) bare p-InGaAs ($p = 2\times10^{16}$ cm$^{-3}$).

2. CER measurements

Shown in Figs. 6(a), 6(b), and 6(c) are the CER spectra in the vicinity of $E_0$ of InGaAs for bare n-InGaAs ($n = 1\times10^{17}$ cm$^{-3}$), InP(50 nm)/InGaAs ($n = 1\times10^{17}$ cm$^{-3}$), and InP(300 nm)/InGaAs ($n = 1\times10^{17}$ cm$^{-3}$) samples, respectively. For comparison purposes, we also have displayed in Fig. 6(d) the CER data for bare p-InGaAs ($p = 2\times10^{16}$ cm$^{-3}$). The sign of the CER signal for curves a, b, and c indicates that all the surface/interface fields are “n type,” i.e., upward band bending towards the surface/interface. For the doping levels used in these samples the space charge region is expected to be fully depleted. The fields determined from the FKOs of curves a, b, and c are 50±5 kV/cm, 27±3 kV/cm, and 30±3 kV/cm, respectively. Thus, with increasing InP overlayer thickness the interface electric field decreases, in agreement with a computer simulation. For the bare InGaAs sample, from Eq. (3) with a bulk Fermi position $V_b = 30$ mV we find a surface Fermi level $V_F = 160 \pm 30$ mV, in reasonable agreement with the CER measurement of Sec. III A 2. The discrepancy is probably due to the photovoltaic effect, which will be smaller in the more heavily doped material. However, the trend with InP coverage is clear.

Fig. 7. The CER spectrum for the InP(300 nm)/InGaAs sample in the region of $E_0$ of InP.

In Fig. 7 we display the CER spectrum for the InP(300 nm)/InGaAs sample in the region of $E_0$ of InP. The peak at 1.33 eV corresponds to the direct gap of the InP. The absence of FKOs confirms the low doping level in this material. The criterion for the appearance of FKOs is that the electro-optic energy $\hbar\Theta \geq \Gamma$, the broadening parameter. With $\Gamma = 6$ meV (from Fig. 7) and $\mu_\parallel = 0.069$ (in units of the free electron
mass)  we  find  kV/cm.  The  observed  below  band  gap 

CER  features  may  be  related  to  the  presence  of  some  As  in 
the  InP,  as  suspected  from  the  growth  conditions. These 
resonances  are  the  subject  of  further  investigation. 
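
The field bound implied by the absence of FKOs can be estimated from the electro-optic energy $(\hbar\Theta)^3 = q^2\hbar^2F^2/2\mu_\parallel$ used earlier for the InGaAs analysis. The sketch below assumes the criterion is $\hbar\Theta \geq \Gamma$ (the relation symbol is not legible in the text) and solves $\hbar\Theta = \Gamma$ for the field, using the quoted $\Gamma = 6$ meV and $\mu_\parallel = 0.069$. Under that assumption the threshold comes out at roughly 6 kV/cm, so a spectrum without FKOs would point to a field below that value.

```python
import math

Q = 1.602176634e-19      # elementary charge (C)
HBAR = 1.054571817e-34   # reduced Planck constant (J s)
M0 = 9.1093837015e-31    # free electron mass (kg)

def fko_threshold_field(gamma_ev, mu_rel):
    """Field (V/m) at which hbar*Theta equals the broadening parameter Gamma.

    Assumes the FKO criterion hbar*Theta >= Gamma and
    (hbar*Theta)^3 = q^2 hbar^2 F^2 / (2 mu), with mu = mu_rel * M0.
    """
    gamma_j = gamma_ev * Q
    return math.sqrt(2.0 * mu_rel * M0 * gamma_j**3) / (Q * HBAR)

threshold = fko_threshold_field(gamma_ev=0.006, mu_rel=0.069)
print(threshold / 1e5, "kV/cm")
```

Because the threshold scales as $\Gamma^{3/2}$, the estimate is sensitive to the broadening read off the spectrum; doubling $\Gamma$ raises the bound by nearly a factor of three.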

3.  RAS  measurements 

Displayed in Fig. 8 are the results of the RAS investigation of the InP/InGaAs heterojunction as a function of InP layer thickness. In the spirit of Figs. 1 and 2 we have plotted the parameter $(\Delta R/R)^-_i - (\Delta R/R)^-_{\mathrm{InGaAs}}$, where $(\Delta R/R)^-_{\mathrm{InGaAs}}$ is the RAS signal from the bare InGaAs ($n = 1\times10^{17}$ cm$^{-3}$) surface and $i$ = InP(50 nm) [curve (a)], InP(100 nm) [curve (b)], InP(150 nm) [curve (c)], and InP(200 nm) [curve (d)]. The arrows at 2.57 and 2.82 eV are the $E_1$ and $E_1 + \Delta_1$ optical transitions, respectively, of relaxed InGaAs while the arrow at 3.2 eV denotes $E_1$, $E_1 + \Delta_1$ of InP (in this material the spin-orbit splitting $\Delta_1$ is only about 100 meV). These measurements are in agreement with the above CER results, i.e., increased InP coverage reduces the interface electric field. However, comparison of curve (a) with Figs. 1 and 2 seems to indicate that the interface field is “p type” (downward band bending towards the surface/interface), in contrast to our CER results. This may be due to the differing probe depths of the two techniques. One explanation of this difference is the possibility of a “camel-back diode” type of band bending within the InGaAs near the InP interface. If the maximum conduction band potential is not directly at the interface (denoted as $z = 0$) but is reached a few hundred angstroms beyond the interface ($z = z_{\max}$), there will be two electric fields, i.e., a “p-type” field for $0 < z < z_{\max}$ and an “n-type” field beyond this point. Thus, the much larger “n-type” region may be probed by CER and SPS while RAS, which has a penetration depth of only a few hundred angstroms, senses the former field.

Fig. 8. $(\Delta R/R)^-_i - (\Delta R/R)^-_{\mathrm{InGaAs}}$ for a series of InP/InGaAs samples with InP(50 nm) [curve (a)], InP(100 nm) [curve (b)], InP(150 nm) [curve (c)], and InP(200 nm) [curve (d)]. Horizontal axis: photon energy (eV). The arrows at 2.57 and 2.82 eV are the $E_1$ and $E_1 + \Delta_1$ optical transitions, respectively, of InGaAs, while the arrow at 3.2 eV denotes $E_1$, $E_1 + \Delta_1$ of InP.

IV.  SUMMARY 

We have conducted an ex situ investigation of (a) the surfaces of n- and p-doped In0.53Ga0.47As(001) and (b) the InP/In0.53Ga0.47As(001) heterojunction interface as a function of InP overlayer thickness (50-1000 nm) using the optical methods of RAS, SPS, and CER. The results from these optical probes make it possible to form a comprehensive quantitative picture of the InP/InGaAs heterojunction as well as the (001) surface of InGaAs. In the former system we have (a) determined $\Delta E_c = 275$ meV and $\Delta E_v = 325$ meV from SPS and (b) observed the evolution of the interface electric field with InP overlayer dimension using SPS, CER, and RAS. The behavior of the interface electric field with InP coverage indicates a lack of significant interface charge and has been explained by a self-consistent Poisson calculation of the properties of the heterojunction. The RAS measurement indicated a different type of interface InGaAs electric field in relation to the SPS and CER studies. We have postulated one possible explanation for this apparent discrepancy. The CER signal from the direct band gap of the InP confirmed the low doping level in this material. Also, the observed below band gap CER features may be related to the presence of some As in the InP, as suspected from the growth conditions. For the InGaAs surface we have observed that the RAS signal increases linearly with the square root of the



Leibovitch et al.: RAS, SPS, and CER investigation

doping level (and hence surface electric field) and that there is a change in phase between the n- and p-type material. These observations are due to the linear electro-optic effect associated with the $E_1$, $E_1 + \Delta_1$ optical features. Also, we have evaluated the surface Fermi level to be 200 mV below the conduction band edge using CER.
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Silicon  wafer  bonding  is  achieved  by  joining  two  particle-free  silicon  wafers  and  annealing  to 
elevated temperatures (~1100 °C). We have used multiple internal transmission infrared absorption
spectroscopy  to  probe  the  interface  between  the  wafers  upon  initial  joining  and  also  during 
subsequent  annealing  steps.  For  atomically  flat  hydrophobic  wafers  (H  passivated),  we  observe  a 
pronounced  shift  in  the  Si-H  stretching  frequency  due  to  the  physical  interaction  (van  der  Waals 
attraction)  that  occurs  when  the  surfaces  come  into  intimate  contact.  The  hydrogen  eventually 
disappears  at  high  temperatures  (1000  ^C)  and  Si-Si  bonds  are  formed  between  the  two  surfaces. 

For  hydrophilic  wafers  (oxide  passivated),  we  initially  observe  three  to  five  monolayers  of  water  at 
the  interface  (providing  the  initial  attraction  through  H  bonding),  as  well  as  the  presence  of  hydroxyl 
groups  that  terminate  the  oxide  at  low  temperature.  Upon  moderate  heating  (<400  °C),  the  water 
trapped  at  the  interface  dissociates  and  leads  to  the  formation  of  additional  oxide.  Between  400  and 
800  °C,  the  hydroxyl  groups  disappear,  resulting  in  a  corresponding  increase  in  oxide  and  the 
formation  of  Si-O-Si  bridging  linkages  across  the  two  surfaces.  ©  1996  American  Vacuum 
Society. 


I. INTRODUCTION

Silicon wafer bonding is an important technological process
and represents a significant challenge in interface science.
Direct silicon wafer bonding occurs when two particle-free wafers
are brought into contact and subsequently annealed. Initial joining
is performed at room temperature by applying gentle pressure at the
center so that the two wafers can come into intimate contact by
expelling any ambient between them. Permanent bonding is then achieved
via interface chemical reactions that are initiated by increasing
the temperature. This method is used in industry to produce
silicon-on-insulator materials, replace epitaxy, fabricate
p-i-n diodes and power devices, as well as
micromechanical structures. Despite this, little is known
about the microscopic interface processes that occur in this
constrained environment, effectively hindering precise control
or optimization of the bonding process.

Consequently, the scientific challenge is to probe and understand
such interface phenomena in this novel environment. Initially, the
weak physical interaction between the inner surfaces provides the
force necessary to bring the surfaces together and expel ambient gas.
Yet, the surfaces are not necessarily atomically smooth or
macroscopically flat, so that mechanical strain must also play a
critical role in determining the bonding quality. Upon annealing, the
interface species and inner surfaces can undergo a chemical evolution
that may be unique to this highly constrained environment.
Since no ambient molecule can reach the interface region
after joining, this system is equivalent to a vacuum vessel in
which pressures can locally vary from several atmospheres to
near perfect vacuum. The issues range, therefore, from identifying
chemical species and their interaction at the interface
to mapping and understanding inhomogeneities, such as intrinsic
void formation.

Characterization of the interface is particularly challenging
and may be best achieved using infrared radiation. Traditional
surface science techniques cannot penetrate through
a wafer (0.5 mm thick). In addition, techniques such as x-ray
photoelectron spectroscopy (XPS) and secondary ion mass
spectrometry (SIMS) require grinding and etching prior to
analysis, which is destructive and only possible for well
bonded wafers (i.e., after a high temperature anneal). In contrast,
infrared (IR) spectroscopy is well suited to the study of
both the physical and chemical aspects of the joining at room
temperature, and subsequent bonding by higher temperature
anneal. In general, the IR radiation can easily reach the buried
interface since Si is transparent above 1500 cm⁻¹. As a
result, the characteristic high frequency (>2000 cm⁻¹)
stretching vibrations of hydrogen-containing species such as
Si-H, H2O, OH, NHx, and CHx can be routinely probed in
all of the different environments inherent to these systems.
Furthermore, the lower frequency (<1500 cm⁻¹) modes of
important atoms such as O, C, N, and F can also be studied,
although this frequency region is quite challenging because
of Si lattice absorption, diminishing IR source intensities,
and the lower sensitivity of IR detectors below 1500 cm⁻¹.
Therefore, the sample dimensions become critical if one
wishes to probe the 700-1500 cm⁻¹ spectral range, as shown
in the present work.

In this article, a review of recent IR studies is presented
for both Si-Si bonding, starting from hydrophobic (H-terminated)
surfaces, and Si-SiO2-Si bonding, starting from
hydrophilic (oxidized) surfaces. In Sec. II, the process used
for wafer bonding is first described with a brief outline of the
technical difficulties of this otherwise extremely simple
method. Next, the technique of multiple internal transmission
(MIT) is summarized, with emphasis on its use for spectroscopy
below 1500 cm⁻¹ and on its relative sensitivity for
different optical configurations and polarizations. Following
this, in Sec. III, the bonding of hydrophobic and hydrophilic
surfaces is discussed, with the initial bonding mechanism
being quite different in the two cases. For hydrophobic surfaces,
we focus on the joining of the particularly simple
H/Si(111)-(1×1) surfaces characterized by a nearly perfect,
atomically smooth, ideally H-terminated structure. We then
contrast these results with those obtained for the joining of
two hydrophobic, atomically rough, surfaces. For hydrophilic
surfaces, we study the joining of two chemically oxidized
surfaces, characterized by a thin (~5 Å) chemical oxide.
Finally, Sec. IV emphasizes the physical processes and
chemical reactions taking place at the interface for the
different surface preparations.

II.  EXPERIMENT 
A.  Wafer  bonding 

The method used for cleaning and initially joining the
silicon wafers is remarkably simple. Standard wet chemical
treatments, such as RCA cleans, are performed to obtain oxidized
wafers with negligible metallic and carbon contamination.
The chemical oxide thus formed is approximately
5 Å thick, usually contains some dissolved hydrogen,
and has a lower stoichiometry than high quality thermal
(gate) oxides. Hydrophilic wafers can be joined immediately
following this treatment. In contrast, hydrophobic wafers
require an additional HF or NH4F etch, followed by
thorough rinsing in de-ionized (DI) water. The choice of
etchant depends critically on the crystallographic face, and
controls the final surface morphology. For Si(100), concentrated
HF solutions are used to minimize the facetting and
roughness of the surfaces, whereas highly buffered HF solutions
(e.g., concentrated NH4F solutions) are used to
achieve atomically flat, ideally H-terminated Si(111)
surfaces. The two wafers are then placed on a spinner that
maintains a 3 mm interwafer spacing, while highly filtered
DI water is forced between them for several minutes as they
slowly rotate, to wash away any remaining particulate contaminants.
The wafers are then covered and spun at 3000
rpm for 5 min, while being mildly heated by an IR lamp
(<80 °C), as shown in Fig. 1(a). Immediately following this
step, the wafers are pushed together with a gentle pressure
applied at the center [Fig. 1(b)].

Fig. 1. Schematic representation of the apparatus used for wafer bonding:
(a) the wafers are held in a “spinner” by teflon contacts and spun at 3000
rpm under a low intensity infrared lamp (<80 °C); (b) after the requisite
spin cycle, the wafers are contacted together and pinched at the center to
initiate the “joining.” [Panel labels: Rotate, Pinch.]

The process of joining is fascinating and can be followed
in real time, as the area of intimate contact can be seen
growing outward from the center using an IR imaging
system. With near monochromatic IR radiation (1.2 μm)
transmitted through the Si pair, the contrast imaged by the
camera depends critically on the distance between the wafers.
Initially, the center portion of the wafers (where the
pinching occurred) gives the highest brightness as the wafers
are much closer than the radiation wavelength (d<1 μm).
The rest of the area usually shows interference fringes and a
darker outer portion. The central portion expands to encompass
the whole wafer area (at which point joining is complete),
with a “contact wave velocity” that is dependent on
the particular conditions such as the ambient viscosity, wafer
flatness, and density of physisorbed molecules. At this
stage, dust particles trapped at the interface generate highly
visible, wide spots (1-10 mm diameter) with differing contrast.
Clearly, the presence of voids or other contaminants
will tend to weaken the bonding strength between the wafers
and may compromise the electrical characteristics of such
bonded substrates, so that higher resolution techniques such
as acoustic microscopy or x-ray imaging are also widely
used.
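
The dependence of the image contrast on wafer separation can be illustrated with a simple interference estimate. The sketch below is not taken from the original work: it assumes normal incidence, lossless silicon with n = 3.42 (the mid-IR value quoted later in this article, used here only as an approximation at 1.2 μm), a gap of index 1, and it ignores reflections at the outer wafer surfaces. The function name is hypothetical.

```python
import math

def gap_transmission(d_um, wavelength_um=1.2, n_si=3.42, n_gap=1.0):
    """Airy transmission of a plane-parallel gap between two Si half-spaces.

    Illustrative assumptions: normal incidence, no absorption,
    outer wafer surfaces ignored.
    """
    # Fresnel intensity reflectance of a single Si/gap boundary
    r = ((n_si - n_gap) / (n_si + n_gap)) ** 2
    # Coefficient of finesse of the gap
    finesse = 4.0 * r / (1.0 - r) ** 2
    # One-way phase thickness of the gap
    phase = 2.0 * math.pi * n_gap * d_um / wavelength_um
    return 1.0 / (1.0 + finesse * math.sin(phase) ** 2)

# Gaps from intimate contact up to half a wavelength
for d in (0.0, 0.05, 0.3, 0.6):
    print(f"gap = {d:4.2f} um  ->  T = {gap_transmission(d):.2f}")
```

Under these assumptions the transmission is highest when the gap is much smaller than the wavelength and drops to about 0.29 at a quarter-wave gap, which is qualitatively consistent with a bright contacted center surrounded by interference fringes.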

The macroscopic bonding strength is an important indication
of the quality of the bonding. It can be obtained in a
straightforward manner by inserting a razor blade and measuring
the crack propagation. Such measurements indicate
that wafers are only very weakly bonded after room temperature
joining, with the bond strength of hydrophobic wafers
being significantly less than that of hydrophilic wafers. Consequently,
any handling of the wafers at this stage (particularly
the cutting and edge polishing necessary for IR) must be
done with the utmost caution. However, the bonding strength
is found to increase monotonically with annealing temperature
so that, in general, wafer pairs are safe to handle after a
500 °C anneal. Interestingly, hydrophobic wafers exhibit a
higher bond strength than hydrophilic wafers after a 1100 °C anneal,
as the latter are limited by the properties of the oxide layer.
At intermediate temperatures (600-900 °C), the formation of
intrinsic voids (not related to particles) often occurs, particularly
for hydrophobic wafers. A comprehensive microscopic
picture of the relevant bonding mechanisms, intrinsic
void formation processes, and ultimate interface structures
after high temperature annealing has yet to emerge and is the
main motivation for this work.

J. Vac. Sci. Technol. B, Vol. 14, No. 4, Jul/Aug 1996
Weldon et al.: Physics and chemistry of Si wafer bonding

Fig. 2. Illustration of the multiple internal transmission (MIT) and external
transmission (ET) optical configurations used. [Figure labels: p-pol.;
silicon layer, refractive index n = 3.42.]


B.  Infrared  absorption  spectroscopy 

Probing the interface with sufficient sensitivity to observe
submonolayer coverages requires the use of multiple internal
transmission (MIT) spectroscopy. The geometry is shown
schematically in Fig. 2 and is termed MIT because the IR
radiation is transmitted through the interface rather than reflected,
even under the angular conditions where total internal
reflection is observed for bulk media, because the interface
is much thinner than the wavelength (tunneling).
Importantly, this situation enhances the sensitivity to vibrational
absorption normal to the interface, as conveniently
summarized for p-polarized radiation using a three-layer
model:

$$\frac{\Delta I}{I_0} = \frac{2\pi d}{\lambda}\,\frac{1}{n\cos(\theta)}\left[\cos^2(\theta)\,\mathrm{Im}(\epsilon) + n^4\sin^2(\theta)\,\mathrm{Im}\!\left(-\frac{1}{\epsilon}\right)\right] \qquad (1)$$


In this equation, the absorption (ΔI/I₀) of IR radiation, incident
with internal angle θ on a layer of thickness d and
dielectric function ε, is divided into two terms; one proportional
to the imaginary part of ε and the other to the imaginary
part of 1/ε, which describe the components of absorption
parallel and normal to the interface, respectively. Noting
that the silicon index of refraction (n ≈ 3.42 in the mid IR) is
much larger than that of the interface layer (|ε|^{1/2} ≈ 1.3), two
immediate conclusions can be drawn from this equation. The
first is that the component normal to the interface is enhanced by

$$R_{\perp/\parallel}(\mathrm{MIT}) = \frac{n^4}{|\epsilon|^2}\,\frac{\sin^2(\theta)}{\cos^2(\theta)} \qquad (2)$$

over the parallel component, which corresponds to a factor
of 40-50 for a typical 45° incidence angle. The second consequence
of this configuration is apparent on comparison of
Eq. (1) to that obtained for internal reflection at the outer
surfaces:

$$\frac{\Delta I}{I_0} = \frac{2\pi d}{\lambda}\,\frac{1}{n\cos(\theta)}\left[I_x\,\mathrm{Im}(\epsilon) + I_z\,n_{\mathrm{vac}}^4\,\mathrm{Im}\!\left(-\frac{1}{\epsilon}\right)\right] \qquad (3)$$

where n_vac is now the index of air or vacuum (n_vac ≈ 1), and I_x
(=2.0 for 45°) and I_z (=2.4 for 45°) are the normalized field
intensities on the vacuum side of the outer surfaces. The
normal component of absorption is greater for MIT than for
MIR by a factor

$$R_{\perp}(\mathrm{MIT/MIR}) = \left(\frac{n}{n_{\mathrm{vac}}}\right)^4 \frac{\sin^2(\theta)}{I_z} \qquad (4)$$

which is 28 for 45° and 42 for 60°!

The sensitivity of the MIT configuration is therefore 20 to
40 times that of the traditional MIR configuration. To illustrate
the practical importance of this result, this statement
can be rephrased as follows: using MIT, both the outer surface
absorption (contamination) and the parallel component
of the interface absorption are negligible (at least 20 times
weaker) compared to the perpendicular component of the
interface absorption. These two factors make the interpretation
of the data more straightforward. In fact, the study of
interfaces using the MIT configuration is nearly optimal
since the experiment can be done outside a vacuum chamber
(stable interface), the absorption from the outer surface contamination
is negligible, and the sensitivity to the interface is
over an order of magnitude better than any other surface
configuration! The experimental challenge remains the low
frequency region (<1500 cm⁻¹), where the sample dimensions
become critical.
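
As a numerical check of the enhancement factors quoted above, the following sketch evaluates the two ratios, reading Eq. (2) as (n⁴/|ε|²) tan²θ and Eq. (4) as (n/n_vac)⁴ sin²θ / I_z. The interface-layer index of 1.3 and I_z = 2.4 at 45° are the values given in the text; the function names are illustrative only.

```python
import math

def perp_over_par_mit(n_si, n_layer, theta_deg):
    """Ratio of normal to parallel absorption in MIT, Eq. (2) as read above."""
    theta = math.radians(theta_deg)
    eps_abs = n_layer ** 2          # |epsilon| of a weakly absorbing layer
    return (n_si ** 4 / eps_abs ** 2) * math.tan(theta) ** 2

def mit_over_mir_perp(n_si, theta_deg, i_z, n_vac=1.0):
    """Normal-component gain of MIT over MIR, Eq. (4) as read above."""
    theta = math.radians(theta_deg)
    return (n_si / n_vac) ** 4 * math.sin(theta) ** 2 / i_z

# Values quoted in the text: n = 3.42, layer index 1.3, I_z = 2.4 at 45 degrees
ratio_2 = perp_over_par_mit(3.42, 1.3, 45.0)
ratio_4 = mit_over_mir_perp(3.42, 45.0, i_z=2.4)
print(f"Eq. (2) at 45 deg: {ratio_2:.1f}")
print(f"Eq. (4) at 45 deg: {ratio_4:.1f}")
```

With these inputs the first ratio falls inside the quoted 40-50 range and the second is close to the quoted factor of 28. The 60° case is not evaluated here because the text does not give I_z at that angle.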

The problem with measurement below 1500 cm⁻¹ is the
silicon bulk lattice absorption which, although weak, dominates
the spectrum for typical pathlengths. Just as in the MIR
configuration, the probing IR radiation in the MIT configuration
traverses several centimeters of silicon (typically 7-10
cm) in total. Consequently, the weak multiphonon
absorption in Si leads to orders of magnitude attenuation of
the IR throughput, making it impractical to work in this frequency
range, with such long path lengths. To study the interface,
we would then be restricted to using a one pass external
transmission configuration if it were not for realizing
that the enhancement per pass of internal transmission over
external transmission is substantial. Indeed, in external transmission,
the refraction into the high index Si controls the
internal incident angle to a value between 16.3° (for grazing
external incidence) and 0° (for normal external incidence). In
contrast, using internal reflection, large angles can be
achieved. The enhancement of the component perpendicular
to the interface is then:

$$R_{\perp}(\mathrm{int/ext}) = \frac{\sin^2(\theta_{\mathrm{int}})\,\cos(\theta_{\mathrm{ext}})}{\sin^2(\theta_{\mathrm{ext}})\,\cos(\theta_{\mathrm{int}})} \qquad (5)$$

per pass, where θ_int and θ_ext are the angles of propagation inside the
silicon for the internal and external configurations, respectively.
Practically, the angle achieved with external reflection
is close to 16° (Brewster condition) and the angle chosen
for internal reflection is 60°. The enhancement per pass
is therefore 19, suggesting that the transmission below 1500
cm⁻¹ could be 19 times weaker for a similar S/N ratio. A
convenient geometry is therefore the MIT configuration with
60° incidence in a 1-cm-wide sample. This gives an average
of 5.8 passes through a total of 1.2 cm of Si, yielding an
enhancement of 110 over a single external reflection. If intrinsic
Si is used with a resistivity of 20 Ω cm or greater,
signal levels roughly 10 times weaker than above 1500 cm⁻¹
would be obtained in the 800-1500 cm⁻¹ region, but the
overall gain over a single external reflection is still over a
factor of 10. The MIT configuration using a shorter sample is
therefore theoretically preferable. In practice, precise control
of the sample temperature and IR beam position is also critical
since the overall signal is still dominated by the highly
temperature dependent bulk phonon absorption. In addition,
it is also important to use float-zone silicon instead of
Czochralski-grown silicon to avoid the very strong interstitial
oxygen absorption centered at 1100 cm⁻¹.
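
The per-pass enhancement and the pass count quoted above can be reproduced with a short calculation. This sketch assumes that the enhancement follows the sin²θ/cosθ dependence of the normal term in Eq. (1), that each wafer is 0.5 mm thick (the thickness quoted in the Fig. 3 caption), and that the beam crosses the interface once per traversal of the two-wafer stack. The function names are illustrative.

```python
import math

def per_pass_enhancement(theta_int_deg, theta_ext_deg):
    """Gain of the normal component per pass, internal over external geometry."""
    ti = math.radians(theta_int_deg)
    te = math.radians(theta_ext_deg)
    return (math.sin(ti) ** 2 / math.cos(ti)) / (math.sin(te) ** 2 / math.cos(te))

def mit_passes_and_path(length_cm, wafer_thickness_cm, theta_deg):
    """Interface crossings and total Si path for a bonded pair of wafers."""
    theta = math.radians(theta_deg)
    stack_cm = 2.0 * wafer_thickness_cm                 # two wafers in contact
    passes = length_cm / (stack_cm * math.tan(theta))   # one crossing per traversal
    path_cm = length_cm / math.sin(theta)               # total path in silicon
    return passes, path_cm

gain = per_pass_enhancement(60.0, 16.0)
passes, path_cm = mit_passes_and_path(1.0, 0.05, 60.0)
print(f"per-pass gain = {gain:.1f}")
print(f"passes = {passes:.1f}, Si path = {path_cm:.2f} cm")
print(f"total gain over one external pass = {gain * passes:.0f}")
```

Under these assumptions the calculation gives roughly 19 per pass, about 5.8 passes through about 1.2 cm of silicon, and a total gain near 110, in line with the figures in the text.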

III.  RESULTS  AND  DISCUSSION 

A.  Hydrophobic  silicon  wafer  bonding 

1. Experimental results for the H/Si(111)-(1×1) surface

The best surface to study in order to characterize the processes
occurring during the bonding of hydrophobic wafers
is the atomically flat, ideally H-terminated Si(111) surface
prepared by wet chemical etching. Such H/Si(111)-(1×1)
surfaces are characterized by atomically flat domains extending
over several thousand angstroms, separated by atomically
straight double-layer steps. Over small areas, therefore,
these surfaces have the possibility to come within a few
angstroms. However, over the macroscopic wafer (100 mm
diameter), the flatness is no better than 5000 Å, making it
impossible to be uniformly close without wafer deformation.

Figure 3(a) illustrates the differences between the MIR
spectrum of a H/Si(111)-(1×1) surface and the MIT spectrum
of two such surfaces after room temperature joining.
The most striking observations are (1) the dramatically increased
spectral intensity and (2) the significantly altered
spectral shape after joining. Specifically, the sharp (<1 cm⁻¹
wide) Si-H stretch absorption line at 2083.5 cm⁻¹ of the free
H/Si(111)-(1×1) surface (i) is dramatically broadened to
lower frequency, with the appearance of a new feature centered
at 2065 cm⁻¹, when the wafers are joined at room
temperature (ii). In addition, the integrated Si-H stretching
absorption of the joined wafers is 24.5 times larger than for
the single H/Si(111)-(1×1) surface, in good agreement with
the factor of 28 calculated for a 45° incident angle, using Eq.
(4). Interestingly, we find that the process is completely reversible,
since the spectrum collapses back to a sharp line at
2083.5 cm⁻¹ if the joined wafers are separated, and broadens
again as they are allowed to come back together.

Fig. 3. (a) Demonstration of the profound changes apparent on joining
hydrogen-terminated Si(111) wafers: (i) MIR spectrum of a single
H/Si(111)-(1×1) sample, and (ii) MIT spectrum of two H/Si(111)-(1×1)
wafers at room temperature (outer surfaces are oxide-passivated). Both
spectra were obtained using the same optical setup (thickness=0.5 mm per
Si, sample length=3.8 cm, internal angle of incidence=45°). (b) Comparison
of the spectrum for (i) the 3.8 cm area shown in (a) and (ii) 1 cm
sample area. [Horizontal axis in both panels: Wavenumber (cm⁻¹).]

We propose that the pronounced spectral broadening is
due to the varying degrees of interaction between the macroscopic
surfaces. The shift in frequency of the free-surface
mode at 2083.5 cm⁻¹ is then indicative of the interaction
strength, with 2065 cm⁻¹ reflecting the condition for maximum
shift. To check this hypothesis, we have cut a Si(111)
bonded wafer pair into many smaller pieces and probed each
piece separately, as shown in Fig. 3b(ii). The predominance
of the 2065 cm⁻¹ mode and absence of the unperturbed
free-surface mode at 2083.5 cm⁻¹ confirms that the former
frequency represents a stable configuration that can be achieved
over areas as large as 1 cm and that the spectrum in Fig.
3b(i) is an average over the whole sample, which is comprised
of regions of differing interaction strength. However,
up to this point, we have not discussed the precise physical
or chemical origin of this frequency shift.

Clearly, the spectral shift cannot be caused by a chemical
reaction between the two surfaces, since the changes apparent
on joining at room temperature are completely reversible.
Furthermore, the possibility that a thin layer (~monolayer)
of foreign molecules causes the shift by a hydrogen bonding
or van der Waals type interaction, can also be eliminated. We
have investigated the former possibility by trapping water,
HF, and ethanol between the two H/Si(111)-(1×1) surfaces
prior to joining and found that, in all cases, a systematic
broadening (50-75 cm⁻¹) of the Si-H spectrum occurs, but
with an overall shift to higher frequencies of 10 to 20 cm⁻¹.
Similarly, we find that the feature at 2065 cm⁻¹ in the spectrum
of joined Si(111) cannot be related to the presence of
physisorbed contaminants (such as ambient or CO),
since not only have such interactions been shown to increase
the Si-H frequency on the isolated (111) surface, but the
2065 cm⁻¹ mode persists on annealing the joined samples to
above 800 °C, by which point temperature decomposition of
all molecular species has occurred. In summary, the spectrum
shown in Fig. 3 cannot be reproduced by any of these (contaminant)
species, so we conclude that the perturbation must
arise from the direct interaction of the two clean surfaces.
Specifically, we propose that the two surfaces can come sufficiently
close under the influence of the van der Waals attraction,
and that the opposing hydrogens interact, causing
the vibrational frequency to shift.

The possibility of a direct physical interaction between
the two surfaces is intriguing, as it necessarily requires that
the separation be very small, and that most physisorbed contaminants
are purged from the interface region upon joining.
Clearly, the data suggest that such a clean and intimately
contacted interface is indeed obtained. To provide a theoretical
understanding of the spectral shifts, we have carried out a
series of calculations on ideal H-terminated Si(111) surfaces
in close proximity. These ab initio calculations are based on
the local density approximation (LDA), utilizing separable
norm-conserving pseudopotentials to represent the atoms, and
a plane-wave basis set to expand the wave functions. We
used the Ceperley-Alder exchange-correlation functional, as
parametrized by Perdew and Zunger. The Car-Parrinello
method was used to simultaneously optimize the wave functions
and atomic coordinates, incorporating recently proposed
modifications to accelerate the dynamics.

Fig. 4. Theoretical results for (a) the bonding energy and (b) the asymmetric
Si-H stretch frequency for two joined, ideal H/Si(111)-(1×1) surfaces as a
function of wafer spacing.

The joined surfaces were simulated by two supercells
consisting of 10 and 12 Si[111] layers plus terminating H
layers on each surface. The surface spacing was varied by
changing the lattice constant of the supercell in the surface
normal direction. An in-plane lattice constant of 3.81 Å was
determined by minimizing the energy of the bulk Si using the
same pseudopotentials and cutoffs. For the 10 Si-layer supercell,
the geometry is such that the H bonded to each surface
approaches a three-fold hollow site of the facing H layer. In
contrast, the 12 Si-layer supercell yields “atop” registry, that
is, the opposing H atoms approach head-on. Tests indicated
that a plane wave cutoff of 15 Ry and a k sample of eight
special points in the irreducible wedge of the supercell
Brillouin zone gave well-converged results. The damped
Car-Parrinello dynamics were run until residual forces on
the  atoms  were  less  than  10“^  atomic  units.  After  all  the 
atoms were relaxed, the Si atoms were fixed, and the H atoms
were moved adiabatically in steps of 0.05 Å in the (111)
direction. The forces on the H atoms at five points were fit to
the derivative of a Morse potential, and the force constants
extracted from the fits. The vibrational frequencies were approximated
using the reduced Si-H mass with these calculated
force constants.
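
The final step described above, converting a fitted force constant into a vibrational frequency with the reduced Si-H mass, can be sketched in the harmonic approximation. This is an illustration rather than the authors' code: the Morse fit itself is not reproduced, standard atomic masses are assumed, and the function names are hypothetical. For a Morse potential V(r) = D[1 - exp(-a(r - r0))]², the force constant at the minimum is k = 2Da².

```python
import math

C_CM_PER_S = 2.99792458e10      # speed of light in cm/s
AMU_KG = 1.66053907e-27         # atomic mass unit in kg
M_SI, M_H = 28.0855, 1.00794    # standard atomic masses in amu (assumed)

def reduced_mass_kg(m1_amu=M_SI, m2_amu=M_H):
    """Reduced mass of a two-body oscillator, in kg."""
    return m1_amu * m2_amu / (m1_amu + m2_amu) * AMU_KG

def wavenumber_from_force_constant(k_n_per_m):
    """Harmonic stretch frequency in cm^-1 for a force constant in N/m."""
    mu = reduced_mass_kg()
    return math.sqrt(k_n_per_m / mu) / (2.0 * math.pi * C_CM_PER_S)

def force_constant_from_wavenumber(nu_cm):
    """Inverse relation: force constant in N/m for a frequency in cm^-1."""
    mu = reduced_mass_kg()
    return mu * (2.0 * math.pi * C_CM_PER_S * nu_cm) ** 2

# Compare the force constants implied by the measured and calculated frequencies
k_measured = force_constant_from_wavenumber(2083.0)
k_calculated = force_constant_from_wavenumber(1992.0)
print(f"k(measured)   = {k_measured:.1f} N/m")
print(f"k(calculated) = {k_calculated:.1f} N/m")
print(f"ratio         = {k_calculated / k_measured:.3f}")
```

Because the frequency scales as the square root of the force constant, a calculated frequency about 4% below the measured one corresponds to a force constant about 8.5% smaller.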

The results of our calculations are shown in Fig. 4. The
abscissa is chosen to be “wafer spacing,” so that the hollow-
and atop-registry results can be compared in a physically
reasonable way. We define this quantity as the spacing between
the outermost Si layers of the facing surfaces, assuming
the ideal bulk interlayer spacing. The actual surfaces display
relaxation and slight elastic effects in the outermost
layers. The bonding energy curves in Fig. 4(a) (which are the
total supercell LDA energies referred to the large separation
limit) show weak, but well-defined, attractive wells. The
minima for the atop and three-fold registries correspond to
H-H spacings of 2.11 and 2.38 Å, respectively. The potential
displays a much steeper rise at small wafer spacings in the
atop case. At distances greater than the atop minimum, the
attraction is remarkably independent of registry. The existence
of potential minima in each case may appear rather
surprising, on first inspection, since the Si-H surface is a
“closed shell” system, so that one might expect only repulsive
interactions from a local theory. However, Lang has
previously found that the LDA gives a reasonable account of
rare gas adsorption, with energy minima roughly comparable
in magnitude to those found here, and in rather good agreement
with experiment. The physical mechanism underlying
these results is discussed at some length in his article.

The Si-H vibrational frequency is calculated to be 1992
cm⁻¹ at large spacing, which is 4% below the measured
value (2083 cm⁻¹). This constitutes excellent agreement with
experiment, with the residual discrepancy attributable to anharmonic
effects, basis convergence, or the intrinsic limitations
of the LDA. In Fig. 4(b), we see that there is a small,
but systematic, softening of the optically active asymmetric
stretch vibrational mode. Interestingly, the hollow and atop
registry shifts are found to be very similar in magnitude,
although the atop frequency versus spacing curve is shifted
to greater wafer spacings by ~0.5 Å. The shifts have several
sources. The Si-H bond lengths are calculated to be 0.0046
and 0.0066 Å longer at the spacing of maximum shift than at
large spacings for the hollow and atop sites, respectively.
Based on the anharmonicity of the Morse potential fit at large
spacings, this “static” effect on the Si-H bond accounts for
a large fraction of the total shift in each case. An additional
source of the shifts is the dynamic effect due to the interactions
of the vibrating H atoms on one surface with those on
the other. These interactions also lead to significant splittings
between the optically inactive symmetric stretch mode and
the asymmetric stretch at small wafer spacings.

In summary, we find that there is excellent agreement between
the theoretically predicted, and experimentally observed,
Si-H frequency shifts upon joining. We have not
undertaken a full lineshape analysis since that would require
many assumptions about the effects of steps and other defects
on the spacing and registry of the wafers, as well as a
detailed treatment of the concomitant elastic deformations.
The principal consequence of these effects is that the maximum
calculated shift may not be practically observable. Indeed,
the (shifted) absorbance maximum is observed ≈18
cm⁻¹ lower in frequency than the constituent free-surface
Si-H mode, compared to the 29 cm⁻¹ shift predicted above.
However, a shift of 27 cm⁻¹ has been observed for some
samples after annealing to elevated temperatures, consistent
with the attainment of an even more intimate interaction under
these conditions. Clearly, the magnitude of the maximum
observed shift is only consistent with the calculated model
results if the interwafer spacing is at the atomic level, that is,
within a few Å, over significant areas of the bonded interface
formed upon room temperature joining of two Si(111) surfaces.

2.  Results  for  atomically  rough  hydrophobic  surfaces 

The preceding discussion focused on the interaction of the
atomically smooth Si(111) surfaces, where intimate contact
can be achieved over areas on the order of 1000 Å. The situation
is clearly different if the hydrophobic surfaces are atomically
rough. Such surfaces are prepared by etching Si(111) or
Si(100) in dilute HF (1% to 10%) rather than in NH4F. As a
result of this microscopic roughness, the interaction between
the two surfaces is significantly decreased. In fact, we find
that when these H-terminated surfaces are joined together the
interface spectrum is identical to the clean surface spectrum
(obtained prior to bonding), as shown in Fig. 5 for Si(100)


Fig. 5. MIT spectra of two joined Si(100)-H surfaces, as a function of
annealing temperature: (i) room temperature; (ii) after annealing to 500 °C;
and (iii) 600 °C. Horizontal axis: wavenumber (cm⁻¹), 2000-2350. Note the
presence of oxygen attached to Si-H groups [ν(Si-H) > 2150 cm⁻¹] at room
temperature, which agglomerates to form predominantly (O3)-Si-H on
annealing above 500 °C.


surfaces. Recalling that only the components perpendicular
to the surface are enhanced, the spectrum shows two main
contributions at 2110 and 2140 cm⁻¹, corresponding to the
perpendicular components of dihydride and trihydride vibrations
of H-terminated Si(100) (see Fig. 6 of Ref. 8). There is,
therefore,  no  evidence  for  physical  interaction  between  the 
two  surfaces  over  the  vast  majority  of  the  wafer  area,  at  room 
temperature  (see  Fig.  7). 

The  absence  of  a  strong  interaction  between  the  two 
rough  surfaces  over  the  majority  of  the  interface  region  is 
further  supported  by  the  observed  behavior  upon  annealing 
to 500 °C. At this temperature, the Si-H stretch spectrum
collapses into a single new feature at 2097 cm⁻¹, previously
assigned to the symmetric stretch of the dimerized,
monohydride-terminated H/Si(100)-(2×1) surface [Fig.
5(ii)]. Such an atomic rearrangement of the inner surfaces
exactly mimics the behavior of a single atomically rough,
H-terminated Si(100) surface upon annealing in UHV. The
partial  loss  of  hydrogen  from  the  higher  hydrides  (SiH2  and 
SiH3)  that  occurs  as  the  surface  becomes  smoother  and  as¬ 
sumes  its  most  stable  dimerized  structure,  also  has  important 
mechanistic  implications.  First,  the  occurrence  of  Si-H  bond 
scission  suggests  that,  although  no  interaction  is  apparent  by 
analysis  of  the  infrared  spectrum  alone,  a  minority  of  direct 
Si-Si  linkages  may  in  fact  form  between  the  two  surfaces. 
Second,  since  the  decomposition  almost  certainly  proceeds 
by  evolution  of  molecular  hydrogen  (as  in  UHV),  gaseous  H2 
is  expected  to  be  trapped  at  the  interface  (potentially  leading 
to  intrinsic  voids  or  bubble  formation),  but  with  too  low  a 
concentration and pressure to be detectable using IR. The
former conclusion is confirmed by inspection of data obtained
on annealing to 600 °C [Fig. 5(iii)], which show a
distinct (~4 cm⁻¹) shift in the monohydride stretching
frequency, consistent with perturbation of the isolated surface
Fig. 6. Right panel: Si-H stretching modes of (O2,3)-Si-H species present at the interface of two joined hydrophobic Si(100) samples at room temperature.
Left panel: (i) MIT spectrum of the low frequency (<1500 cm⁻¹) Si-O stretching modes of this surface oxygen; (ii) ET spectrum of 1600 Å of thermal SiO2
(grown at 1100 °C) showing the TO and LO phonon absorption.


structure due to the proximity of the opposing surface at this
temperature.

3.  Oxygen  contamination  and  oxidation  at  the  Si/Si 
interface 

It  is  clear  from  the  spectra  in  Fig.  5  that  some  oxygen 
contamination of the silicon surface remains, even after careful
HF etching and rinsing of the samples. This is well documented
in the literature [and is particularly prevalent for
Si(100)] for isolated silicon surfaces, and is not therefore a
unique  property  of  joined  wafers.  However,  an  interesting 
difference  exists  for  these  highly  constrained  systems;  the 
evaporation  of  SiO  groups  at  elevated  temperature  that  is 
observed  for  free  Si  surfaces  is  obviously  suppressed  in  these 
environments.  Consequently,  although  some  dissolution  of 
the  oxygen  into  the  bulk  silicon  may  occur  upon  annealing  to 
high temperature, there is a significant possibility that some
oxygen will remain trapped at the interface, causing electrical
degradation. We have investigated these possibilities
by monitoring the thermal evolution of the (high frequency)
Si-H modes that are shifted by the presence of oxygen in the
Si backbone. With this approach, we assume implicitly that
these Si-H modes are a sensitive probe of the oxygen
environment, i.e., that the oxygen remains in the Si backbone
and the Si-H bond remains intact. As a result, no information
can be obtained above ~900 °C, the temperature
at which complete loss of interfacial hydrogen is observed.
Consequently,  we  have  also  probed  the  SiO  vibrations  of  the 
oxide  directly,  by  modifying  the  experimental  configuration, 
as  described  earlier. 

Figure  6(i)  shows  the  low  frequency  region  of  the  infrared 
spectrum of a (typical) joined hydrophobic Si(100) sample
that exhibits Si-H stretching modes between 2175 and 2275
cm⁻¹, indicative of oxygen in the silicon backbone (Fig. 6,
right). Accordingly, we observe a prominent, broad absorption
band centered at 1125 cm⁻¹ that is assigned to the Si-O
stretching modes of (Ox)-Si-H surface structures, by reference
to previous (UHV) studies of the initial oxidation of
Si, for which bands at 1000-1200 cm⁻¹ were assigned to


local Si-O-Si modes. Although the precise interpretation
awaits a detailed theoretical treatment, the width and frequency
range of this absorption band are consistent with the
existence of an inhomogeneous array of (predominantly) local
Si-O modes. Furthermore, the absorption maximum is
indicative of (O2,3)-Si-H species, so that some aggregation or
clustering of oxygen must occur, potentially resulting in the
formation of small oxidized islands. Indeed, such islands
would necessarily exhibit extended Coulombic coupling, giving
rise to the highest frequency component of the absorption
band at 1230 cm⁻¹. To illustrate this, the spectrum of a thick
(1600 Å) layer of thermal SiO2 is also shown [Fig. 6(ii)].

For an SiO2 network, the local Si-O modes are coupled
together, giving rise to both transverse (TO) and longitudinal
(LO) optical phonon modes. In the thin film limit (d ≪ λ),
both the TO and LO phonons can be optically excited. Importantly,
the LO (TO) phonon is polarized perpendicular
(parallel) to the plane of the film and the LO phonon exhibits
strong Coulombic coupling that shifts this mode to higher
frequency than the corresponding TO phonon. In addition, as
a result of this extended coupling, the LO mode frequency is
effectively a (weighted) "average" over the extended network
and is therefore not as sensitive to local inhomogeneities
as the TO mode. On the other hand, the LO mode is
sensitive to the layer thickness, displaying a decrease in
frequency with decreasing thickness (of ~1.5 cm⁻¹/Å)
below 30 Å. The separation of the TO/LO modes is clearly
apparent in the spectrum of thick SiO2, for which the LO
mode occurs at 1250 cm⁻¹, whereas the TO mode is observed
at 1080 cm⁻¹ [Fig. 6(ii)]. It is important to note that
this spectrum was obtained for a single oxidized silicon wafer
using the external transmission configuration, so that both
the LO (perpendicular) and the TO (parallel) modes are observed
with good sensitivity. In contrast, the LO phonon absorption
would be 42 times more intense than the TO using
MIT (at 60° incidence), so that it effectively dominates the
observed spectrum. Given the above and the fact that the LO
phonon of a continuous SiO2 layer is predicted to be ~1200
cm⁻¹ for vanishingly small thicknesses, the spectrum of the
Fig. 7. Schematic of proposed mechanism for hydrophobic [Si(100)] wafer
bonding.

low frequency region of joined hydrophobic Si(100) is
clearly more consistent with a local Si-O mode description.
Finally,  we  see  spectroscopic  evidence  for  the  formation  of 
local  Si-O-Si  bridging  bonds  upon  annealing  these  samples 
to  elevated  temperature,  as  illustrated  in  Fig.  7,  which  may 
(in  part)  explain  the  different  electrical  performance  that  is 
sometimes  reported  for  direct-bonded  Si  substrates  from  that 
observed  for  bulk  Si. 
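
The thickness dependence of the LO mode quoted above can be turned into a rough numerical estimate. The sketch below assumes a simple linear interpolation: the LO frequency stays at the thick-film value (1250 cm⁻¹) down to 30 Å and then drops by 1.5 cm⁻¹ per Å. The linear form and the function name `lo_frequency` are illustrative assumptions, not a model given in the text; only the slope, the 30 Å onset, and the thick-film frequency are taken from it.

```python
def lo_frequency(thickness_A, thick_film_lo=1250.0, slope=1.5, onset_A=30.0):
    """Rough LO phonon frequency (cm^-1) of an SiO2 film of given thickness (Angstrom).

    Assumption: linear decrease of `slope` cm^-1 per Angstrom below `onset_A`,
    constant at `thick_film_lo` above it.
    """
    if thickness_A < 0:
        raise ValueError("thickness must be non-negative")
    if thickness_A >= onset_A:
        return thick_film_lo
    return thick_film_lo - slope * (onset_A - thickness_A)


# Print the estimate for a few representative thicknesses.
for d in (0.0, 5.0, 15.0, 30.0, 1600.0):
    print(f"d = {d:7.1f} A  ->  LO ~ {lo_frequency(d):.1f} cm^-1")
```

Under these assumptions the zero-thickness limit comes out near the ~1200 cm⁻¹ value quoted for a vanishingly thin continuous layer, which is a useful consistency check rather than an independent prediction.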

B.  Hydrophilic  silicon  wafer  bonding 

1.  Experimental  results  for  chemically  oxidized 
wafers 

The  growth  of  (thin)  oxides  is  an  important  area  of 
microelectronics.^^  It  is  also  obviously  a  particularly  perti¬ 
nent  issue  in  the  formation  of  SOI  substrates  using  wafer 
bonding.  A  central  concern  is  the  quality  of  the  interface 
when  two  hydrophilic  (oxidized)  surfaces  are  bonded.  We 
have  addressed  this  problem  by  studying  the  bonding  of  two 
wet-chemically oxidized silicon surfaces. The main advantage
of these oxides is that they are very thin (~4-5 Å) so
that the formation of new oxide can readily be identified. The
extension  of  this  work  to  thicker  thermal  oxides  is  necessary 
to  describe  the  actual  SOI  systems,  and  is  currently  under 
way  in  our  laboratory. 

The  spectrum  observed  after  joining  two  hydrophilic 
Si(111) surfaces is shown in Fig. 8, along with the relevant
vibrational assignments. The corresponding spectrum for hydrophilic
Si(100) surfaces is similar since most of the species
(H2O, OH, CHx, and H) are at the interface or inside the
oxide, and the nature of the chemical oxide is similar on both
Si(111) and Si(100). A brief summary of the assignments is
as follows: the broad feature between 2800 and 3500 cm⁻¹


Fig. 8. Typical MIT infrared spectrum of two hydrophilic [Si(111)] wafers
joined at room temperature, plotted against frequency (cm⁻¹). Schematic
representations of the vibrational modes associated with each absorbance
band are shown, for reference. Note the huge (~40%) absorbance due to the
interfacial oxide initially present on these hydrophilic surfaces. The data
were collected in a single acquisition and have not been corrected for
baseline variations. The region at ~1100 cm⁻¹ in the Si-O band is omitted
since the interstitial oxygen present in these CZ wafers completely absorbs
the IR radiation over this frequency range.


and the sharper feature at 1630 cm⁻¹ are characteristic
of H-bonded water molecules adsorbed on the internal surfaces;
the strong shoulder at 3500-3650 cm⁻¹ is due to
H-bonded hydroxyl (-OH) groups that terminate the two oxide
surfaces; the intense absorption band centered at 1205
cm⁻¹ is assigned to the LO phonon of the thin chemical
oxide network and the sharp feature at 2250 cm⁻¹ to H dissolved
in this oxide (chemically bonded to Si in the SiO2
network). Finally, the mode at 2100 cm⁻¹ is best assigned
to the stretching mode of hydrogen bonded to pure Si surfaces,
which are therefore believed to exist at the Si/SiO2
interface. The interface region between the two oxide surfaces
contains an estimated 3-4×10¹⁵ H2O/cm² and
1-2×10¹⁵ OH/cm², all subject to H bonding. Clearly, in order
to establish strong chemical bonds between the two oxide
surfaces,  these  species  must  either  react  or  dissolve  into  bulk 
material,  thereby  allowing  the  initial  oxide  surfaces  to  come 
into  intimate  contact. 

In  order  to  determine  the  precise  fate  of  the  interfacial  OH 
and  H2O  groups,  we  have  spectroscopically  monitored  the 
evolution  of  both  the  molecular  interface  and  the  oxide  layer 
as a function of annealing temperature (30 min in N2 ambient
for each temperature). A few select spectra are shown in
Figs. 9(a)-9(d) for the key annealing temperatures of 400,
600, 800, and 1000 °C, and the complete data are summarized
in Figs. 10(a)-10(c). A substantial loss (~85%) of molecular
water is observed after annealing to 400 °C, with
complete loss evident at 600 °C. Interestingly, a small
amount of reformation of H2O occurs at 800 °C, indicating
that the formal interface still persists at these temperatures,
before complete closure occurs after annealing to 1000 °C.
Correspondingly, there is a manifold (75%) increase in the
absorbance of the oxide on annealing to 400 °C, corresponding
to formation of an additional ~7-8 Å of oxide (3-4×10¹⁵
Fig. 9. MIT spectra of the ratio of p:s polarization for the different spectral
regions of interest for hydrophilic Si(111) as a function of annealing
temperature: (a) Si-O phonon modes; (b) difference spectra (T2 - T1) derived
from the Si-O spectra shown in (a). The solid vertical line indicates the
lower bound of the absorbance attributed to formation of bridging Si-O-Si
groups; the hashed area indicates the spectral region for which no IR
throughput is observed, due to absorption by the interstitial oxygen present
in these CZ samples; (c) Si-H stretching modes; and (d) O-H stretching
modes.
Fig. 10. Plots of the relative integrated absorbance of each of the modes
shown in Fig. 8, as a function of temperature. Top panel: Si-O modes.
"Low frequency oxide" is defined as absorbance between 1140 and 1180
cm⁻¹, whereas "high frequency oxide" is the remainder of the observed
band (>1180 cm⁻¹); "total oxide" is the sum of the two. Middle panel:
water and hydroxyl modes. Bottom panel: Si-H modes. The Si-H data are
the integrated absorbance between 2050 and 2150 cm⁻¹; the (O1,2)-Si-H and
(O3)-Si-H points are the integrated absorbances between 2175-2225 cm⁻¹
and 2225-2300 cm⁻¹, respectively.


O atoms/cm²). In addition, the peak (LO) frequency is
shifted to 1235 cm⁻¹, consistent with growth of a more
thermal-like  (stoichiometric  and  less  defective)  oxide  over 
this  temperature  range. 
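
The link between an areal density of oxygen atoms and an oxide thickness can be checked with a short calculation. The sketch below assumes that the new oxide is stoichiometric SiO2 with a bulk density of 2.2 g/cm³ (a typical value for thermal oxide, not a number given in the text) and that the oxygen supply is of order 3-4×10¹⁵ atoms/cm², i.e., roughly 3-4 monolayers of water. The function name and constants are illustrative.

```python
AVOGADRO = 6.02214076e23      # 1/mol
SIO2_MOLAR_MASS = 60.08       # g/mol
SIO2_DENSITY = 2.2            # g/cm^3, assumed value for thermal-like oxide


def oxide_thickness_angstrom(o_atoms_per_cm2, density=SIO2_DENSITY):
    """Thickness (Angstrom) of SiO2 that contains the given areal density of O atoms."""
    # Each SiO2 formula unit holds two oxygen atoms.
    o_atoms_per_cm3 = 2.0 * density / SIO2_MOLAR_MASS * AVOGADRO
    thickness_cm = o_atoms_per_cm2 / o_atoms_per_cm3
    return thickness_cm * 1.0e8   # 1 cm = 1e8 Angstrom


for areal_density in (3.0e15, 3.5e15, 4.0e15):
    d = oxide_thickness_angstrom(areal_density)
    print(f"{areal_density:.1e} O/cm^2  ->  ~{d:.1f} A of SiO2")
```

With these assumptions the result falls at roughly 7 to 9 Å, the same range as the additional oxide thickness inferred from the absorbance increase.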

Significantly,  the  predominant  molecular  signature  over 
the 400-800 °C temperature range is due to the interfacial
hydroxyl species that terminate the oxide layers [Fig. 9(d)].
Specifically, the feature associated with these hydroxyl
groups first sharpens and shifts to 3740 cm⁻¹ (consistent
with a loss of H bonding) on annealing to 600 °C, then subsequently
broadens as the reformation of interface water occurs
at 800 °C, before completely disappearing after a
900 °C anneal. Correspondingly, a high frequency oxide absorbance
(1250-1260 cm⁻¹) appears over this temperature
range, adding an extra 20%-30% to the original absorbance
value [Fig. 8(b)]. This frequency is well above the highest
LO frequency of the (thermal-like) oxide layer (1235 cm⁻¹),
suggesting that it originates from a distorted Si-O-Si species
associated with closure of the formal interface between
the two surfaces. Above 800 °C, the overall oxide absorbance
remains constant, with a loss of the lower frequency
component of the oxide band (1000-1150 cm⁻¹) and a


sharpening of the LO phonon (1245 cm⁻¹), indicative of the
formation  of  a  good  quality  thermal  oxide  at  these  elevated 
temperatures. 

The  evolution  of  the  Si-H  spectra  provides  additional 
mechanistic information [Fig. 9(c)]. The most relevant fingerprint
is the mode at 2250 cm⁻¹, associated with hydrogen
inside the oxide matrix. The integrated absorbance of this
mode initially decreases upon annealing to 400 °C (the range
over which the molecular water signal is significantly attenuated),
followed by a substantial increase and blueshift
at 700 °C. This increase suggests that a new source of hydrogen
is available in the 400-700 °C range, with the most
obvious candidate being the interface -OH groups, the thermal
evolution of which was described above. Finally, there is
another pronounced shift after a 900 °C anneal so that the
absorbance maximum is now observed at 2285 cm⁻¹, suggestive
of a change in the nature of the oxide, before complete
loss of this feature occurs on annealing to 1000 °C. In
contrast, the integrated absorbance of the 2100 cm⁻¹ feature,
associated with hydrogen on internal Si surfaces at the
Si/SiO2 interface, is essentially constant up to 700 °C and
decreases monotonically above this temperature. The fate of
the hydrogen liberated between 800 and 1000 °C is not known:
one  possibility  is  that  it  simply  migrates  into  the  bulk  (sili¬ 
con)  and  diffuses  away.  However,  a  second  intriguing  possi¬ 
bility  is  that  recombinant  dihydrogen  formation  occurs,  lead¬ 
ing  to  bubble  or  intrinsic  void  formation  in  the  interface 
region;  indeed,  we  see  preliminary  spectroscopic  evidence 
for  such  a  reaction,  although  more  work  is  required  to  sub¬ 
stantiate  these  observations. 
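
The band-integrated absorbances discussed in this section can be computed from a measured spectrum with a simple trapezoidal sum over each frequency window. The following is a minimal sketch, assuming the spectrum is available as two equal-length lists sorted by increasing wavenumber; the window limits follow the Si-H ranges given in the caption of Fig. 10, while the function names and the synthetic Gaussian test spectrum are purely illustrative.

```python
import math

# Integration windows (cm^-1) for the Si-H region, as listed in the Fig. 10 caption.
BANDS = {
    "Si-H": (2050.0, 2150.0),
    "(O1,2)-Si-H": (2175.0, 2225.0),
    "(O3)-Si-H": (2225.0, 2300.0),
}


def integrate_band(wavenumbers, absorbance, lo, hi):
    """Trapezoidal integral of absorbance over segments lying fully inside [lo, hi]."""
    total = 0.0
    for i in range(len(wavenumbers) - 1):
        x0, x1 = wavenumbers[i], wavenumbers[i + 1]
        if x0 >= lo and x1 <= hi:
            total += 0.5 * (absorbance[i] + absorbance[i + 1]) * (x1 - x0)
    return total


def band_areas(wavenumbers, absorbance, bands=BANDS):
    """Integrated absorbance for every named band."""
    return {name: integrate_band(wavenumbers, absorbance, lo, hi)
            for name, (lo, hi) in bands.items()}


# Synthetic example: a single Gaussian line near 2100 cm^-1 on a 0.5 cm^-1 grid.
wavenumbers = [2000.0 + 0.5 * i for i in range(701)]
absorbance = [1.0e-3 * math.exp(-((w - 2100.0) / 10.0) ** 2) for w in wavenumbers]
for name, area in band_areas(wavenumbers, absorbance).items():
    print(f"{name}: {area:.3e}")
```

Dividing each area by its room temperature value would give relative integrated absorbances of the kind plotted against annealing temperature; baseline correction, which real spectra generally need, is left out of this sketch.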


2.  Model  for  oxidation 

The  primary  (novel)  observation  in  this  work  is  the  for¬ 
mation  of  a  thicker  oxide  layer  (and  closure  of  the  initial 
interface  between  the  two  thin  oxide  layers)  upon  annealing 
to  1100  °C,  together  with  the  disappearance  of  water,  hy¬ 
droxyl,  and  hydrogen.  In  addition,  considerable  insight  into 
the  pathways  leading  to  oxidation  can  also  be  inferred  from 
our  data.  First,  there  is  a  direct  correlation  between  the  dis¬ 
sociation  of  water  initially  trapped  between  the  two  joined 
hydrophilic  surfaces  and  the  increased  “thermal-like”  oxide 
below 400 °C. Indeed, the observed loss of ~3-4 monolayers
of water (~3-4×10¹⁵ oxygen/cm²) over this temperature
range is sufficient to form an additional 7-8 Å of SiO2, in
good  agreement  with  experiment.  Second,  the  thermal  evo¬ 
lution  of  the  “hydrogen-in-oxide”  signature  provides  addi¬ 
tional  mechanistic  information:  the  amount  of  hydrogen  ini¬ 
tially  contained  in  the  wet-chemical  oxide  decreases  on 
heating  to  400  °C,  as  the  concentration  of  interface  water 
decreases,  but  dramatically  increases  on  annealing  from  400 
to  700  °C,  as  the  disappearance  of  interface  hydroxyl  groups 
is  observed.  The  former  observation  suggests  that  there  is 
interaction  between  the  oxidizing  species  and  hydrogen  in 
the  oxide  layer,  i.e.,  there  is  migration  of  oxygen-containing 
species  through  the  oxide  layer.  In  contrast,  the  latter  obser¬ 
vation  suggests  that  hydroxyl  decomposition  is  the  source  of 
the  additional  hydrogen,  over  this  temperature  range.  Indeed, 
the further oxidation observed between 400 and 800 °C is
quantitatively consistent with the loss of the estimated
1-2×10¹⁵ hydroxyl groups/cm². The high frequency component
(1260 cm⁻¹) of the oxide band that becomes apparent
over this temperature range is also indicative of the formation
of strained Si-O-Si bridging bonds. The suggested
model is shown schematically in Fig. 11 and summarized as
follows. 

(1)  Below  400  °C,  the  water  molecules  trapped  at  the  in¬ 
terface  react  and/or  diffuse  through  the  thin  wet-chemical 
oxide to form additional oxide at the Si/SiO2 interface. Using
the tabulated values of Ea = 0.79 eV and D0 = 10⁻⁶ cm²/s for
diffusion of water through SiO2, we find that the transport
of water through the thin oxide can take place at temperatures
as low as 200 °C. However, the oxidation rate is predicted
to be extremely slow at these temperatures, by extrapolation
of data reported for conventional thermal
oxidation processes. The precise origin of this anomaly is
not  yet  clear;  it  may  be  that  the  use  of  chemical  oxides  fa- 


Fig. 11. Schematic of proposed mechanism for hydrophilic wafer bonding.


cilitates  the  oxidation  process,  by  providing  a  reactive 
(highly strained or defective) Si/SiO2 interface. Alternatively,
the  constrained  environment  may  promote  the  formation  of 
an  active  oxidizing  species  that  is  unique  to  these  systems. 
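
The statement in item (1) that water can cross the thin chemical oxide at temperatures as low as 200 °C can be illustrated with an Arrhenius estimate of the diffusion length, L = sqrt(D t) with D = D0 exp(-Ea / kT). The sketch below uses the activation energy of 0.79 eV from the text and the 30 min anneal time; the prefactor D0 = 10⁻⁶ cm²/s is an assumed value (the exponent is not legible in every copy of the source), and the function names are illustrative.

```python
import math

K_B_EV = 8.617333262e-5   # Boltzmann constant in eV/K


def diffusivity_cm2_s(temp_c, d0=1.0e-6, ea_ev=0.79):
    """Arrhenius diffusivity D = D0 * exp(-Ea / kT); D0 is an assumed prefactor."""
    temp_k = temp_c + 273.15
    return d0 * math.exp(-ea_ev / (K_B_EV * temp_k))


def diffusion_length_angstrom(temp_c, seconds, d0=1.0e-6, ea_ev=0.79):
    """Characteristic diffusion length sqrt(D * t), converted from cm to Angstrom."""
    return math.sqrt(diffusivity_cm2_s(temp_c, d0, ea_ev) * seconds) * 1.0e8


ANNEAL_SECONDS = 30 * 60   # 30 min anneal at each temperature
for temp_c in (20, 100, 200, 400):
    length = diffusion_length_angstrom(temp_c, ANNEAL_SECONDS)
    print(f"T = {temp_c:3d} C  ->  diffusion length ~ {length:.2g} A")
```

Under these assumptions the diffusion length at 200 °C is far larger than the ~4-5 Å oxide thickness, while at room temperature it is smaller, which is in line with transport not being the limiting step at 200 °C. The estimate says nothing about the oxidation rate itself, which is the anomaly discussed in the text.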

(2)  Above  400  °C,  additional  Si-O-Si  bond  formation 
takes  place,  due  to  the  decomposition  of  interface  hydroxyl 
groups.  The  proposed  mechanism  is  the  elimination  of  mo¬ 
lecular  water  by  reaction  of  surface  silanol  groups: 


-Si-OH + -Si-OH → -Si-O-Si- + H2O,


where some of the -Si-O-Si- linkages so formed provide
the chemical bridge between the two oxide surfaces, thereby
closing the interface. The observation of molecular water
reformation between 700 and 800 °C is clearly consistent with
this reaction scheme. In addition, the appearance of a very
high frequency band (1250-1270 cm⁻¹) between 400 and
800 °C provides strong evidence for the formation of such
bridging -Si-O-Si- linkages, since this frequency range is most
consistent with oxide growth remote from the Si/SiO2 interface
(i.e., at the SiO2/SiO2 interface). Interestingly, this high
frequency  component  of  the  oxide  band  displays  the  most 
inhomogeneity  with  lateral  position  along  the  interface,  indi¬ 
cating  that  substantial  variations  exist  in  the  formal  closing 
of  the  interface.  This  inhomogeneity  may  be  due,  in  part,  to 
the  intrinsic  interface  roughness  or  (possibly)  the  formation 
of  voids  or  (H2)  bubbles.  Studies  are  currently  being  under¬ 
taken  using  thin  thermal  oxide-terminated  substrates,  to  fur¬ 
ther  investigate  these  possibilities. 
IV. CONCLUSIONS

In  this  article,  we  have  presented  an  overview  of  the  field 
of  silicon  wafer  bonding.  While  the  method  is  relatively 
simple,  fundamental  processes  limit  the  quality  of  the  bond¬ 
ing  and  ultimately  its  practical  use.  Experimentally,  it  is  dif¬ 
ficult  to  probe  the  interface  spectroscopically  since  most  of 
the  traditional  surface  spectroscopies  cannot  penetrate 
through the silicon. In contrast, we have shown that IR
absorption spectroscopy not only can be used effectively, but is in
fact a very sensitive way to characterize the chemical and
physical  state  of  the  interface.  The  results  obtained  so  far 
provide  the  first  glimpse  at  the  relevant  bonding  mechanisms 
and  raise  additional  important  questions. 

For hydrophobic wafers, the most exciting results are obtained
on the flat H/Si(111)-(1×1) surfaces, where there is
convincing  evidence  for  strong  physical  interaction  between 
the  two  contamination-free  internal  surfaces.  The  experimen¬ 
tal  challenge  is  to  increase  the  level  of  interface  perfection  to 
the  point  where  registry  issues  can  be  addressed  (azimuthal 
rotation  of  one  wafer  with  respect  to  the  other).  Furthermore, 
a  detailed  understanding  of  the  thermal  evolution  of  the  hy¬ 
drogen  trapped  at  the  interface,  both  as  Si-H  and  possibly 
H2,  as  well  as  the  nature  of  the  bonding  between  the  two 
silicon  surfaces  upon  annealing,  is  necessary  to  assess  the 
validity  of  this  method  for  low  temperature  epitaxy  replace¬ 
ment.  The  actual  bonding  mechanism  for  atomically  rough 
hydrophobic  surfaces  is  difficult  to  establish,  since  it  appears 
to  involve  only  a  small  fraction  of  the  surface,  and  therefore 
requires  increased  sensitivity  to  the  atomic  species  at  the 
interface. Such studies may be possible using the significantly
higher intensity of synchrotron radiation sources
which are optimal for the 300 to 800 cm⁻¹ region, where
the majority of Si-X (X = O, C, N, F) stretching vibrations
are known to occur.

For hydrophilic wafers, the presence and evolution of molecular
species such as H2O, OH, and CHx, and of dissolved hydrogen,
have been well characterized using MIT. Based on
our  observations,  detailed  schematic  models  can  be  pro¬ 
posed.  However,  the  challenge  is  clearly  to  probe  and  inter¬ 
pret  the  IR  absorption  associated  with  silicon  oxide  species. 
While  submonolayer  oxygen  contamination  experiments  to¬ 
gether  with  ab  initio  cluster  calculations  will  help  one  to 
understand  the  local  oxygen  structures  that  are  initially 
formed,  the  characterization  of  the  oxide  films  will  demand  a 
more complex analysis. The complexity arises from the
strong Coulombic coupling within the SiO2 network, image
charge effects for thin films, stoichiometric variations, different
interface structures, and strain due to the mismatch between
the silicon lattice and the SiO2 film. However, meeting
this  challenge  is  critical  since  IR  absorption  measurements 
may  be  the  only  nondestructive  way  to  monitor  oxide  forma¬ 
tion  in  a  wide  variety  of  technologically  relevant  processes. 

Finally,  the  formation  of  intrinsic  voids  at  intermediate 
annealing  temperatures  has  not  been  addressed  in  this  article, 
although  it  remains  a  critical  issue  in  silicon  wafer  bonding 
due  to  the  potential  effect  on  interface  homogeneity.  Indeed, 
while  a  number  of  studies  of  the  size  distribution  and  number 


density  of  interfacial  voids  have  been  reported  in  the 
literature, a comprehensive understanding of the effect of
surface  cleaning  and  rinsing  (i.e.,  surface  passivation,  resi¬ 
dues,  physisorbed  species)  on  the  concentration  and  nature  of 
the  resultant  voids  has  not  yet  emerged.  Therefore,  there  is 
clearly  a  need  to  devise  ways  to  characterize  both  the  inter¬ 
face  homogeneity  and  the  physical  and  chemical  state  of 
such  voids  as  a  function  of  annealing  temperature. 
Strain, dislocations, and electrically active defects at and near the interface of Si/SiO2 and Si/GaP
heterostructures are analyzed by optical second-harmonic spectroscopy. For plasma oxides
deposited on Si(001) and Si(111), time-dependent second-harmonic experiments reveal that
near-interface oxide defects trap charge by the tunneling of photoexcited electrons from the Si
conduction band. The space-charge field-induced second-harmonic transients are resonantly
enhanced by two-photon transitions in silicon. In GaP epilayers grown on Si(001) the bulk
dipole-allowed electro-optical effect is suppressed by the formation of antiphase domains. In
contrast, in GaP films grown on Si(111) and vicinal Si(001) the density of antiphase domains is
considerably reduced, yielding an enhancement of the second-order nonlinear optical response by
two orders of magnitude. © 1996 American Vacuum Society.


I.  INTRODUCTION 

Silicon-based heterostructures are of outstanding importance
for micro- and optoelectronic device technology and of
considerable scientific interest. Device limiting structural
imperfections and process-induced strain at the Si/SiO2 interface
are a main obstacle on the way towards smaller and
faster  device  structures,  which  require  new  approaches  to 
oxide  growth  technology  such  as  low  temperature  plasma 
deposition.  Additionally,  due  to  the  exceptional  perfection, 
purity, and low cost of silicon, great interest exists in developing
Si-based optoelectronic devices which allow the integration
of optical components on conventional electronic
integrated circuits. The main problem encountered in
heteroepitaxial growth of III-V compound semiconductors on
silicon  is  the  high  density  of  dislocations  in  the  epitaxial 
layer  resulting  from  the  formation  of  antiphase  domains 
(APD),  which  can  be  overcome  by  careful  choice  and  special 
treatment  of  the  substrate  material.  The  development  of 
new improved heterostructure devices requires powerful
non-invasive  analytical  techniques  for  the  characterization  of 
the  buried  solid/solid  interfaces,  which  are  difficult  to  probe 
with  traditional  optical  and  electron-based  surface  diagnos¬ 
tics. 

In  recent  years,  optical  second-harmonic  generation 
(SHG)  has  proven  to  be  an  effective  and  sensitive  tool  for 
studies of a large variety of interface properties. An electric
dipole second-harmonic (SH) response is forbidden in
centrosymmetric media such as Si or SiO2, making SHG
intrinsically sensitive to the interfacial region where the
inversion symmetry is broken. Several studies on Si/SiO2
interfaces have employed the tensorial characteristic of SHG to
examine the chemical bonding arrangement at atomic-scale
step and kink sites. These results could be correlated with
electrical  measurements  of  the  midgap  density  of  interface 
traps  on  the  same  samples.  The  spectral  resolution  of  optical 
techniques  has  been  used  to  identify  a  strain  and  disorder- 
induced  redshift  and  broadening  of  the  Si  bulk  direct  band 
gap  £*1  and  E'^  transitions  at  the  Si/Si02  interface.^" Most 
recently,  electric-field  induced  SHG  has  been  used  to  study 
electroinduced  effects  at  Si  based  MOS  structures. SH- 
transients  on  a  time  scale  of  seconds  have  been  observed  on 
native  oxide  interfaces  by  several  groups. However,  the 
interpretation  of  this  effect  attributed  to  “charge  transfer  ef¬ 
fects”  by  Mihaychuk  et  al.  or  explained  entirely  by  a 
thermal -induced  effect  by  Dadap  et  al  remains  controver¬ 
sial. 

In this article we report on dispersion and time-resolved
SHG measurements for the characterization of electrically
active defects at Si/SiO2 interfaces and antiphase domains in
GaP epilayers grown on Si(111) and Si(001). For Si/SiO2
interfaces, a space-charge field-induced SH transient reveals
that a high density of near-interface oxide traps occurs on
plasma-deposited oxides, which can be reduced significantly
by a rapid thermal annealing of the oxide. A tunneling model
is introduced to explain the observed SHG time dependence.
Furthermore, SHG has been used to study the influence of
APDs formed during the growth of Si/GaP heterostructures
on the electrooptical response of the GaP epilayer. The
nonlinear optical response of the GaP epilayers indicates a high
density of APDs on a flat Si(001) substrate due to its
double-domain structure, whereas on Si(111) and vicinal Si(001)
the SH response is enhanced by about two orders of magnitude.

The article is organized as follows: Section II introduces
the experimental technique and the data analysis procedure.
In Sec. III the SH transients and spectra are presented for (a)
the Si/SiO2 interfaces and (b) the Si/GaP epilayers. The
conclusions are given in Sec. IV.


Fig. 1. Schematic view of the experimental setup used for the SHG
measurements.


II.  EXPERIMENTAL  CONDITIONS  AND  SAMPLE 
PREPARATION 

The optical measurements were performed using 710-815 nm
radiation of a passively mode-locked titanium:sapphire
laser generating 100 fs pulses at a repetition rate of
76.5 MHz. The output was attenuated to 30 mW average
power and focused on the sample to a 15-μm-diam spot at a
45° angle of incidence. The experimental setup is shown in
Fig. 1. The reflected light is spectrally filtered by a prism and
glass filters and the SH signal is detected using a single-photon
counting device. All measurements were performed in
air.
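
The beam parameters quoted above fix the pulse energy and the intensity at the sample. The following is a rough sketch only: it assumes a uniformly filled circular spot, a rectangular pulse shape, and no correction for the elongation of the spot at 45° incidence, none of which is stated in the text. The function and variable names are illustrative.

```python
import math

# Values quoted in the text
AVG_POWER_W = 30e-3      # average power on the sample
REP_RATE_HZ = 76.5e6     # repetition rate
PULSE_S = 100e-15        # pulse duration
SPOT_DIAM_CM = 15e-4     # 15 um spot diameter, expressed in cm


def beam_estimates(avg_power_w, rep_rate_hz, pulse_s, spot_diam_cm):
    """Return pulse energy (J), peak power (W), and intensities (W/cm^2).

    Assumptions (not from the text): top-hat spot, rectangular pulse,
    no correction for the 45 degree angle of incidence.
    """
    area_cm2 = math.pi * (spot_diam_cm / 2.0) ** 2
    pulse_energy_j = avg_power_w / rep_rate_hz
    peak_power_w = pulse_energy_j / pulse_s
    return {
        "pulse_energy_j": pulse_energy_j,
        "peak_power_w": peak_power_w,
        "avg_intensity_w_cm2": avg_power_w / area_cm2,
        "peak_intensity_w_cm2": peak_power_w / area_cm2,
    }


est = beam_estimates(AVG_POWER_W, REP_RATE_HZ, PULSE_S, SPOT_DIAM_CM)
print(f"pulse energy      : {est['pulse_energy_j'] * 1e9:.2f} nJ")
print(f"peak power        : {est['peak_power_w'] / 1e3:.2f} kW")
print(f"average intensity : {est['avg_intensity_w_cm2'] / 1e3:.1f} kW/cm^2")
print(f"peak intensity    : {est['peak_intensity_w_cm2'] / 1e9:.2f} GW/cm^2")
```

Under these assumptions the average intensity comes out near 17 kW/cm^2, which lies within the 5 to 20 kW/cm^2 range used later for the intensity-dependent measurements.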

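The electric field at the SH frequency, $E(2\omega)$, is
generated by a nonlinear polarization $P(2\omega)$ determined by
the nonlinear susceptibility tensors and can be expanded in
terms of the electric field:

$$E(2\omega) \propto P(2\omega) = \chi^{(2)}_{d,B}(2\omega=\omega+\omega)\,E(\omega)E(\omega) + \chi^{(2)}_{d,S}\,E(\omega)E(\omega) + \chi^{(2)}_{q,B}\,E(\omega)\nabla E(\omega) + \chi^{(3)}_{d,B}\,\mathcal{E}(\omega\approx 0)\,E(\omega)E(\omega) + \cdots. \qquad (1)$$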

The first term in this expansion, the dipole-allowed bulk
nonlinearity $\chi^{(2)}_{d,B}$, is allowed in non-centrosymmetric
media such as III-V compound semiconductors. In silicon or
SiO2 this contribution is symmetry-forbidden, allowing the
next terms to dominate the SH response: the dipolar surface
term $\chi^{(2)}_{d,S}$, the quadrupolar contribution
$\chi^{(2)}_{q,B}$, and the third-order dipolar bulk term
$\chi^{(3)}_{d,B}$. The surface term $\chi^{(2)}_{d,S}$ appears
because of the broken inversion symmetry at surfaces and
interfaces, and is comparable with the bulk-allowed
$\chi^{(2)}_{q,B}\,E(\omega)\nabla E(\omega)$. The last term in
Eq. (1) gives rise to an electric field-induced SH response
induced by a (quasi)static electric field $\mathcal{E}$ via the
bulk dipole $\chi^{(3)}_{d,B}$, which is also present
in centrosymmetric media. The different susceptibility tensor
elements reflect the sample's surface and bulk electronic
structure and can be reduced to a set of phenomenological
constants describing the SH response as a function of the sample
orientation.


For SH dispersion measurements, we detected the
p-polarized SH response $E^{(2\omega)}_{p,p}$ produced by a
p-polarized fundamental field as a function of the sample
azimuth $\phi$. For a surface of $C_{1v}$ and higher symmetry,
$E^{(2\omega)}_{p,p}(\phi)$ can be written phenomenologically:

$$E^{(2\omega)}_{p,p}(\phi) = \sum_{m=0} a^{(p,p)}_m \cos(m\phi), \qquad (2)$$

where $a^{(p,p)}_m$ are the complex Fourier coefficients of the SH
field and $\phi$ is defined with respect to a crystal mirror plane.
Since the same polarization combination is used throughout
this work, the index p,p is omitted in the following. Using
Eq. (2) to fit the azimuthal dependence of the SH intensity
measured at different wavelengths, the dispersion of $a_m$ is
deduced. These coefficients can be attributed to certain
structural elements of the interface; e.g., one of them is
partially caused by the bonding arrangement at atomic steps on
the substrate exhibiting $C_{1v}$ symmetry.
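
The cosine expansion of the SH field can be evaluated directly. The sketch below is illustrative only: the coefficient values are hypothetical (they are not fitted values from this work), and the intensity is simply taken as the squared modulus of the summed field.

```python
import cmath
import math


def sh_field(phi_rad, coeffs):
    """Evaluate sum_m a_m * cos(m * phi) for complex coefficients a_m."""
    return sum(a * math.cos(m * phi_rad) for m, a in enumerate(coeffs))


def sh_intensity(phi_rad, coeffs):
    """SH intensity, taken as proportional to |E|^2."""
    return abs(sh_field(phi_rad, coeffs)) ** 2


# Hypothetical coefficients for a threefold surface:
# only a_0 and a_3 are nonzero, with a relative phase of 60 degrees.
coeffs_111 = [1.0 + 0.0j, 0.0, 0.0, 0.5 * cmath.exp(1j * math.pi / 3)]

for deg in (0, 30, 60, 90):
    value = sh_intensity(math.radians(deg), coeffs_111)
    print(f"phi = {deg:3d} deg: I = {value:.4f}")
```

With only a_0 and a_3 present, the cos(3φ) term vanishes at φ = 30° and 90°, so the intensity there reduces to |a_0|^2. Fitting measured azimuthal scans would require a least-squares step on top of this evaluation, which is not shown here.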

Optically flat Si(001) and Si(111) standard wafers cut
with misorientations up to 6° in the (110) or (112) direction,
respectively, were used as substrates and submitted to RCA
cleaning before growing the heterostructures. Si/SiO2
interfaces were formed in two different ways, leading to oxide
layers in the range of ≈30 nm. For deposited oxides, the
wafers were subjected first to a preoxidation step at 300 °C
in an oxygen plasma followed by a remote plasma enhanced
chemical vapor deposition (RPECVD) step. Additionally, a
second set of samples was thermally oxidized in dry oxygen
at 850 °C. Some of the samples were subjected to a rapid
thermal annealing (RTA) step (100 s at 1050 °C in argon)
after oxidation. The GaP films were grown by pulsed chemical
beam epitaxy (PCBE) to a thickness of 6-9 nm monitored in
situ by p-polarized reflectance (PRS) and reflectance
difference spectroscopy (RDS).


III.  RESULTS  AND  DISCUSSION 

A. Nonlinear electroreflectance from Si/SiO2 interfaces

Electric field-induced SHG allows the direct observation
of charge trapping at near-interface defects, which alters the
reflected SH signal due to a change of the static electric field
in the space-charge region at the interface. For this
experiment, the samples were rotated to a position where only
$a_0$ is detected [$\phi = 90°$ for Si(111), $\phi = 45°$ for
Si(001)]. In this case,
$\chi^{(3)}_{d,B}\,\mathcal{E}(\omega\approx 0)\,E(\omega)E(\omega)$
contributes to $a_0$, and $a_0$ can be split up into a complex
interface term $a_0^{(i)}$ and the complex electric field-induced
term $a_0^{(\mathcal{E})}$. Thus, using Eqs. (1) and (2), the SH
intensity can be written as

$$I^{(2\omega)} \propto \left| a_0^{(i)} + a_0^{(\mathcal{E})} \right|^2. \qquad (3)$$

Time-dependent studies of the SH response as shown in
Fig. 2 were performed by moving the sample laterally, then
stopping the motor at t = 0, and recording the SH signal as a
function of time. Characteristic of the SH transient is the fast
decay at the beginning, which slows down significantly at
longer times. The time-dependent effect is observed on
plasma-deposited oxides, but after annealing of the interface,
it is reduced by at least two orders of magnitude. A similar
situation prevails for thermal oxides, where there is no
detectable time dependence of the SHG response. Hence we
can exclude laser-induced surface heating as a reason for the
observed SH transient as proposed earlier. The temporal
signal change in the unannealed plasma oxides cannot be
described by a simple exponential decay, but can be well
fitted by a tunneling model (solid line in Fig. 2), in which
photoexcited electrons tunnel directly from the Si conduction
band edge. The tunneling process will be described later.

Fig. 2. Temporal decay of the SH intensity of a plasma oxide grown on
Si(001).

To identify the origin of this effect we performed
time-dependent measurements at different input wavelengths.
Figure 3(a) shows the spectral dependence of the time-dependent
SH signal change for a Si(111)-plasma-oxide interface.
For comparison, and to avoid the influence of a possible
wavelength-dependent oxide trap-induced electric field,
the same spectral regime was measured on a MOS structure,
to which an externally controlled electric field has been
applied. Because a thermal oxide is used, the MOS sample shows
no time-dependent effect. The metallization was made
semitransparent to allow for optical measurements at the
Si/SiO2 interface. We used a metal contact made of 3 nm Cr
and 6 nm Au. The spectral dependence of
$I^{(2\omega)}(U) - I^{(2\omega)}(0\,\mathrm{V})$ for U = 12 V is
shown in Fig. 3(a).
Both spectra exhibit a strong resonance at the direct band
gap energy of 3.4 eV, revealing that bulk electronic
transitions resonantly enhance $\chi^{(3)}_{d,B}$, thus showing
that an electric field-induced effect caused by trapped oxide
charges is responsible for the SH time dependence observed on
plasma-deposited oxides. The spectra are not corrected for the
dispersion of the linear optical parameters, which causes only a
small reduction (reabsorption of the SH radiation) and
redshift (<10 meV) of the resonance features, as calculated
from the dispersion data given in Ref. 22. The trap-induced
alteration in the interfacial space-charge field
$\Delta\mathcal{E}$ is given by

$$\epsilon_{Si}\,\Delta\mathcal{E} = -\Delta Q_{Si} = q \int_0^d n_{ot}\,dz, \qquad (4)$$

where $Q_{Si}$ and $\epsilon_{Si}$ are the space-charge density per
unit area and the static dielectric constant of Si, q is the
carrier (electron or hole) charge, d is the thickness of the
interfacial oxide trap region, and $n_{ot}$ is the volume density
of trapped charges.

Fig. 3. Spectral dependence of the SH signal, plotted against SH
photon energy (3.0-3.5 eV): (a) a plasma oxide grown on Si(111) and a
Si(111) MOS structure biased at U = -12 V; (b) a plasma oxide grown on
Si(111), as grown and after RTA; and (c) a thermal oxide and an
annealed plasma oxide, both grown on Si(001).

Fig. 4. Rotational anisotropy of (•) the SH intensity of a thermally
grown oxide/silicon interface and (▼) the difference
$\Delta I^{(2\omega)} = I^{(2\omega)}(U) - I^{(2\omega)}(0)$ for
U = 3.0 V measured on a MOS structure on Si(111).

To determine the polarity of $\Delta\mathcal{E}$ and thus the sign
of the trapped charge, we measured the rotational SH anisotropy
from the MOS structure described above for a distinct
applied external voltage U. Figure 4 shows the rotational
anisotropy of, first, the SH signal from the thermally grown
Si(111)/SiO2 interface of the MOS structure (•), i.e., without
the metal contact, given by

$$I^{(2\omega)}(\phi) = \left|a_0^{(i)}\right|^2 + |a_3|^2\cos^2(3\phi) + 2\left|a_0^{(i)} a_3\right|\cos\left[\Delta\psi\left(a_0^{(i)}, a_3\right)\right]\cos(3\phi), \qquad (5)$$

and second the difference $\Delta I^{(2\omega)}(U = 4\,\mathrm{V})$
measured from the MOS structure (▼), showing a rotational
dependence of the form

$$\Delta I^{(2\omega)}(\phi) = 2\left|a_0^{(i)} a_0^{(\mathcal{E})}\right|\cos\left[\Delta\psi\left(a_0^{(i)}, a_0^{(\mathcal{E})}\right)\right] + 2\left|a_0^{(\mathcal{E})} a_3\right|\cos\left[\Delta\psi\left(a_0^{(\mathcal{E})}, a_3\right)\right]\cos(3\phi) + \left|a_0^{(\mathcal{E})}\right|^2, \qquad (6)$$

where $\Delta\psi(a,b)$ denotes the phase difference between the
complex numbers a and b. Fourier analysis of both curves
yields the relative phases between $a_0^{(i)}$ and $a_3$, and
$a_0^{(\mathcal{E})}$ and $a_3$, respectively. Figure 2 shows that
$a_0^{(i)}$ and $a_0^{(\mathcal{E})}$ are either in phase or in
antiphase, depending on the polarity of $\Delta\mathcal{E}$.
The rotational anisotropies shown in Fig. 4 clearly reveal
that both isotropic contributions are in phase, leading to
the conclusion that the decrease of the SH signal is caused by
a negative $\Delta\mathcal{E}$, i.e., caused by electron capture
at near-interface oxide traps.
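
The form of the difference signal can be checked by expanding the coherent sum of the contributions. The short derivation below is a sketch that assumes the detected field consists only of an isotropic interface term $a_0^{(i)}$, an isotropic field-induced term $a_0^{(\mathcal{E})}$, and a threefold term $a_3\cos(3\phi)$; the superscript labels are notation chosen here for illustration.

```latex
\begin{align*}
I^{(2\omega)}(U) &\propto \left| a_0^{(i)} + a_0^{(\mathcal{E})} + a_3\cos(3\phi) \right|^2, \qquad
I^{(2\omega)}(0) \propto \left| a_0^{(i)} + a_3\cos(3\phi) \right|^2, \\
\Delta I^{(2\omega)} &= I^{(2\omega)}(U) - I^{(2\omega)}(0) \\
&\propto \left| a_0^{(\mathcal{E})} \right|^2
 + 2\,\mathrm{Re}\!\left[ a_0^{(i)*} a_0^{(\mathcal{E})} \right]
 + 2\,\mathrm{Re}\!\left[ a_0^{(\mathcal{E})*} a_3 \right]\cos(3\phi), \\
\mathrm{Re}\!\left[ a^{*} b \right] &= |a\,b| \cos\!\left[ \Delta\psi(a,b) \right].
\end{align*}
```

The isotropic part of the difference is therefore positive when the two isotropic terms are in phase, and it becomes negative when they are in antiphase and the field-induced term is smaller than twice the interface term.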

The next question is concerned with the charge state of
the observed oxide traps. To clarify whether the defects are
initially positively charged or neutral, SH spectra have been
taken from plasma oxides annealed or as grown on Si(111).
Figure 3(b) shows the SH signal from both samples after
illuminating the samples for at least 10 minutes, thereby
saturating the electric field-induced SH contribution (see Fig.
2). Compared to the annealed plasma oxide, the as-grown
sample reveals a lower SH intensity in the regime of the
resonance at 3.4 eV, while the annealed sample, which
exhibits no SH transient, shows the resonance feature of the
interfacial $\chi^{(2)}_{d,S}$. This indicates that the
near-interface oxide traps are initially neutral.

Figure 3(c) shows a comparison between the dispersion of
a thermal and a plasma oxide, both grown on Si(001). The
plasma-deposited oxide was subjected to RTA, so that neither
sample exhibits an electric field-induced effect by charging
of near-interface oxide traps. However, the peak in the SH
response is significantly shifted for the thermally oxidized
sample. The observed resonance peaks at 3.35 eV for the
deposited oxide, while the thermal oxide peaks at 3.3 eV. In
previous SH studies this redshift from the bulk value (3.37
eV) has been attributed to electronic transitions in a few
monolayers of strained Si-Si bonds in the interfacial region.
The larger redshift in the SH response from the thermal
oxide indicates a greater interfacial strain induced by the higher
thermal budget and thus a different interface structure. This
structure remains different even after annealing at 1050 °C,
which has also been observed by infrared Si-O stretching
frequency measurements.
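
The relation between the laser tuning range and the SH photon energies discussed here follows from E = 2hc/λ. The snippet below is a small worked conversion using the tuning range (710-815 nm) and the peak energies quoted in the text; the function names are illustrative.

```python
HC_EV_NM = 1239.841984  # h*c in eV*nm


def sh_photon_energy_ev(fundamental_nm):
    """SH photon energy (eV) for a given fundamental wavelength (nm)."""
    return 2.0 * HC_EV_NM / fundamental_nm


def fundamental_nm_for_sh(sh_energy_ev):
    """Fundamental wavelength (nm) that produces a given SH photon energy (eV)."""
    return 2.0 * HC_EV_NM / sh_energy_ev


# SH photon energies covered by the 710-815 nm tuning range
for wavelength in (710.0, 815.0):
    print(f"{wavelength:.0f} nm -> SH photon energy {sh_photon_energy_ev(wavelength):.3f} eV")

# Redshift of the quoted resonance peaks relative to the bulk value (3.37 eV)
BULK_EV = 3.37
peaks_ev = {"deposited oxide": 3.35, "thermal oxide": 3.30}
for name, energy in peaks_ev.items():
    shift_mev = (BULK_EV - energy) * 1000.0
    print(f"{name}: redshift {shift_mev:.0f} meV, "
          f"fundamental {fundamental_nm_for_sh(energy):.1f} nm")
```

The tuning range thus corresponds to SH photon energies of roughly 3.04 to 3.49 eV, which brackets the resonance region discussed in the text, and the quoted peaks correspond to redshifts of 20 meV and 70 meV from the bulk value.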

As mentioned before, the time dependence of the SH signal
cannot be explained by a simple exponential decay.
Furthermore, excitation of electrons by reabsorption of
bulk-generated third-harmonic light could be ruled out as a reason
for the observed SH transient, since the decrease in signal is
visible after illumination with both linearly and circularly
polarized light, the latter showing a third-harmonic efficiency
which is three orders of magnitude smaller. Since the photon
energy of our laser radiation (≈1.5 eV) exceeds the silicon
indirect band gap energy of 1.11 eV, one possible mechanism
is tunneling of electrons from the conduction band to the trap
states. In this model, the tunneling time $\tau$ depends
exponentially on the tunneling distance z:
$\tau(z) = \tau_t \exp(\alpha_T z)$, where
$\alpha_T$ denotes the oxide-specific tunneling constant. Thus the
trapping of photoexcited electrons in the oxide can be
described by the following rate equation:


$$\frac{dn_{ot}}{dt} = c_t\,\frac{N_{ot} - n_{ot}}{\tau(z)} - \frac{n_{ot}}{\tau_d(z)}, \qquad (7)$$

where $\tau_d(z) = \tau_d \exp(\alpha_T z)$ is the detrapping time,
and $c_t$ depends on $N_{ot}$ and $n_e$, the volume densities of
oxide traps and photoexcited electrons in the interface region. This
equation and the initial condition $n_{ot}(t_0) = 0$ for the density
of charged traps lead to the following expression for the time-
and z-dependent trapped-electron density:

$$n_{ot}(t,z) = N_{max}\left(1 - \exp\left\{-\left[\tau_d(z)^{-1} + c_t\,\tau(z)^{-1}\right]t\right\}\right), \qquad (8)$$

with $N_{max} = N_{ot}\,c_t\,\tau_d/(c_t\,\tau_d + \tau_t)$. The change
in the electric field at the interface $\Delta\mathcal{E}$ can be
calculated from Eq. (4) by integrating for $d \to \infty$, which is
reasonable as typical tunneling distances are far below the oxide
thickness and thus the density strongly decays away from the
interface. The integration leads to


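$$\Delta\mathcal{E}(t) = M\left[\gamma + \ln\left(\frac{t}{A}\right) + E_1\left(\frac{t}{A}\right)\right], \qquad (9)$$

where $\gamma$ is Euler's constant, $E_1$ is the first-order
exponential integral, and $A^{-1} = \tau_d^{-1} + c_t\,\tau_t^{-1}$.
Inserting $\Delta\mathcal{E}$ in Eq. (3) gives $I^{(2\omega)}$ as a
function of time with the fit parameters, where the constant
prefactors are absorbed in $M$. The solid line in Fig. 2 shows a
best fit using this function.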
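
The appearance of Euler's constant and the exponential integral can be checked numerically. The sketch below assumes, as an illustration rather than a statement of the authors' exact expressions, that the trapped density is proportional to 1 - exp[-(t/A) exp(-αz)] and that it is integrated over z from 0 to infinity; the result should then equal (1/α)[γ + ln(t/A) + E_1(t/A)]. All names and the dimensionless units are choices made here.

```python
import math

EULER_GAMMA = 0.5772156649015329


def exp_integral_e1(x, terms=80):
    """First-order exponential integral E1(x) for x > 0, from the series
    E1(x) = -gamma - ln(x) - sum_{k>=1} (-x)^k / (k * k!)."""
    if x <= 0.0:
        raise ValueError("x must be positive")
    series = 0.0
    power_over_factorial = 1.0  # running value of (-x)^k / k!
    for k in range(1, terms + 1):
        power_over_factorial *= -x / k
        series += power_over_factorial / k
    return -EULER_GAMMA - math.log(x) - series


def closed_form(x, alpha=1.0):
    """(1/alpha) * [gamma + ln(x) + E1(x)], with x = t / A."""
    return (EULER_GAMMA + math.log(x) + exp_integral_e1(x)) / alpha


def numeric_integral(x, alpha=1.0, z_max=40.0, steps=40000):
    """Trapezoidal estimate of the integral over z from 0 to z_max of
    1 - exp(-x * exp(-alpha * z)); the tail beyond z_max is negligible."""
    h = z_max / steps

    def f(z):
        return 1.0 - math.exp(-x * math.exp(-alpha * z))

    total = 0.5 * (f(0.0) + f(z_max))
    for i in range(1, steps):
        total += f(i * h)
    return total * h


for x in (0.1, 1.0, 2.0, 10.0):
    print(f"t/A = {x:5.1f}: closed form {closed_form(x):.6f}, "
          f"numeric {numeric_integral(x):.6f}")
```

For t much larger than A the E_1 term vanishes, so the expression grows only logarithmically with time. This is in line with a transient that is fast at the beginning and slows down at longer times rather than following a single exponential.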

The  detrapping  time  has  been  measured  by  performing 
several  charging  and  discharging  cycles  yielding  a  10^  s. 
For  this  experiment  the  sample  was  illuminated  for  5  minutes 
and  then  kept  in  the  dark  for  times  ranging  from  10  s  to  2 
days. To investigate the intensity dependence of the charging
process, we repeated the measurement shown in Fig. 2 at different
laser intensities ranging from 5 to 20 kW/cm$^2$. Analysis of the
data showed that the saturation amplitude is nearly independent of
the laser fluence, indicating that a saturation of charge trapping
has been reached. Furthermore, the values of the charging rate
shown in Fig. 5 depend linearly on the laser intensity, indicating
that the observed carrier trapping is dominated by tunneling of
photoexcited electrons from the Si conduction band rather than by
multiphoton electron transitions into the SiO2 conduction
band.  From  Fig.  5  a  trapping  time  t^10“^  s  is  deduced, 
which is four orders of magnitude shorter than the detrapping
time, indicating that the band-to-trap tunneling process is
followed by atomic relaxation at the defect.

Fig. 5. Charging rate as a function of laser intensity. The solid
line is the best fit to the data.


B.  Si/GaP 

The integration of silicon and compound semiconductor
devices is of tremendous interest in the context of future
advanced microelectronic circuits, e.g., vertically integrated
circuits, optically interconnected memory, and integrated
sensor circuits. The main problem in growing III-V
compounds on Si substrates is the high density of dislocations in
the epitaxial layer. This has stimulated attempts to grow first
a buffer layer of a lattice-matched compound, e.g., GaP, that
seals the silicon surface, which is followed by the deposition
of an appropriately graded layer of the desired
lattice-mismatched compound. However, despite many
investigations, the mechanisms for the formation and annihilation
of antiphase domains (APDs), which are generated at single atomic
steps on Si, have not been clarified yet. This is due to the
scarcity of experiments and the difficulty of evaluating
APDs.

SHG rotational analysis is a promising technique to reveal
the symmetry characteristics and to investigate the
possibility of APD formation in III-V compound epilayers on
silicon. In contrast to Si, these compound semiconductors
are non-centrosymmetric and, assuming a defect-free
epilayer, should possess an electrooptical coefficient or nonlinear
susceptibility which is orders of magnitude larger than
in Si. For a (001) orientation of a cubic epilayer, e.g., GaX
(X = As, P), this would lead to a rotational SH anisotropy
containing a dominating $a_2$ contribution, while for a (111)
surface $a_0$ and $a_3$ dominate the SH signal.

Fig. 6. SH Fourier coefficients as a function of SH photon energy:
(a) isotropic component of a GaP film grown on Si(001) and (b) on
vicinal Si(001), cut 6° towards (111); (c) the isotropic and threefold
component of a GaP film grown on Si(111) and the twofold component of
a film grown on vicinal Si(001).

The dominant SH Fourier coefficients of the SHG
rotational anisotropy of Si(111)/GaP and Si(001)/GaP epilayers
obtained by fitting Eq. (2) to the data are summarized in Fig.
6 as a function of SH photon energy. For the Si(001)
substrate, the $a_2$ coefficient is below the detection limit of
our apparatus. The SH response is dominated by the isotropic
component $a_0$ shown in Fig. 6(a), and is comparable in
magnitude to the $a_0$ component of the Si/SiO2 interface.
However, the resonance observed at 3.3 eV for Si/SiO2 is missing
here, which can be explained by a broadening and damping of the
resonance due to inhomogeneous strain present at the
interface. The vanishing of $a_2$ can be explained by the
crystalline structure of the Si(001) substrate. Such wafers are
typically oriented to <0.5°, leading to about 1000 monoatomic
steps across our laser spot size. The local $C_{2v}$ symmetry of
the terraces rotates by 90° when a monoatomic step is
encountered, leading to a macroscopic $C_{4v}$ symmetry of the
sample. The growth of a III-V epilayer on flat Si(001) may lead to
the formation of APDs in the epilayer with domain
boundaries located at the monoatomic substrate steps. The
averaging over many APDs cancels the net bulk dipole-allowed
$\chi^{(2)}_{d,B}$ in the GaP film, with fatal consequences
for the application of III-V compound semiconductors
grown on Si(001) as electrooptic materials.
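
The quoted step count can be estimated from simple geometry. The sketch below assumes a monoatomic step height of a/4 on Si(001) with the Si lattice constant a = 0.5431 nm, a miscut at the 0.5° limit, and evenly spaced steps across the 15 μm spot; these simplifications and the function name are illustrative and not taken from the text.

```python
import math

SI_LATTICE_NM = 0.5431                 # Si lattice constant
STEP_HEIGHT_NM = SI_LATTICE_NM / 4.0   # monoatomic step height on Si(001)


def steps_across_spot(miscut_deg, spot_diam_um, step_height_nm=STEP_HEIGHT_NM):
    """Number of evenly spaced steps crossed by a spot of given diameter."""
    terrace_width_nm = step_height_nm / math.tan(math.radians(miscut_deg))
    return spot_diam_um * 1e3 / terrace_width_nm


n_steps = steps_across_spot(miscut_deg=0.5, spot_diam_um=15.0)
print(f"terrace width: {15.0e3 / n_steps:.1f} nm, steps across spot: {n_steps:.0f}")
```

At a 0.5° miscut this gives terraces roughly 16 nm wide and on the order of 1000 steps across the spot; smaller miscuts give proportionally fewer steps.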

For an effective use of silicon-based heterostructures in
optoelectronic devices, this problem has to be overcome.
One possibility is to use vicinal Si(001), which possesses
predominantly diatomic steps, assuring that contributions
of different domains remain in phase. This has two effects on
the measured SH spectra: as shown in Fig. 6(b), the isotropic
component $a_0$ exhibits the redshifted 3.3 eV resonance,
which is still significantly broader than those from Si/SiO2
interfaces (see Fig. 3), indicating that the Si/GaP interface is
more ordered on vicinal than on flat Si(001) substrates.
Additionally, as shown in Fig. 6(c), a significant twofold
contribution $a_2$ is obtained from the vicinal Si(001)/GaP
heterostructure, demonstrating that the density of APDs is
considerably reduced.

Also shown in Fig. 6(c) are the data measured from GaP
films grown on Si(111) substrates. The results reveal a
nonresonant dispersion curve with a signal lying two orders of
magnitude above the Si/SiO2 interface response. Note that
the relative magnitude of the nonlinear optical response is
still almost two orders of magnitude higher than for vicinal
Si(001) substrates, indicating that a single-domain GaP film
is not yet achieved on vicinal Si(001). Although it has been
shown that an annihilation of APDs is expected for thick
films, our film thicknesses are well below the required
value. Surface steps on Si(111) naturally occur as
diatomic steps, solving the problem of APDs in the GaP film
and thus allowing a large electrooptic effect in GaP grown on
Si(111).
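
The effect of antiphase domains on the SH signal can be illustrated with a toy model. The sketch assumes (this is not stated in the text) that the probed area contains only two domain types whose dipole-allowed susceptibilities are equal in magnitude and opposite in sign, that the domains are much smaller than the laser spot so that their fields add coherently, and that all other contributions are neglected. The function name and the area fraction f are hypothetical.

```python
def relative_sh_intensity(f):
    """SH intensity relative to a single-domain film (toy model).

    f is the area fraction of one domain type; (1 - f) is the fraction
    of the antiphase type, which contributes with opposite sign.
    """
    if not 0.0 <= f <= 1.0:
        raise ValueError("f must lie between 0 and 1")
    net_amplitude = f - (1.0 - f)  # coherent sum of +chi and -chi domains
    return net_amplitude ** 2


for f in (0.5, 0.55, 0.75, 1.0):
    print(f"f = {f:.2f}: I / I_single = {relative_sh_intensity(f):.3f}")
```

In this picture, equal area fractions cancel the dipole-allowed signal completely, and a reduction by two orders of magnitude would correspond to f of about 0.55, i.e., a net amplitude of only about 0.1 of the single-domain value.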

IV.  CONCLUSIONS 

In this article we demonstrated that SHG is an effective
tool to study strain, disorder, and defect densities at the
interface of silicon heterostructures. For two different systems
SHG was used to evaluate aspects of interface formation and
transformation induced by the growth process and
post-growth treatment. A detailed analysis of our SH spectra
revealed structural differences between plasma and thermal
oxides, which strongly depend on the thermal budget imposed
by the oxidation process. On Si/SiO2 interfaces we have
directly observed the trapping of photoexcited carriers by
near-interface defect states in the plasma oxide. A strong electric
field-induced SH contribution is caused by electron capture
in the oxide layer, which is resonantly enhanced at a SH
photon energy close to the $E_1$ transitions in silicon. This
effect provides a metallization-free method for characterizing
near-interface trap states. For Si/GaP heterostructures we
assume that the formation of antiphase domains on Si(001)
substrates significantly reduces the nonlinear optical
response of the GaP films. The formation of interface defects
and antiphase domains might be reduced by using vicinal
Si(001) or Si(111) substrates.

Imaging of topography, locally induced photoluminescence, and Fermi-level pinning in adjacent
ordered and disordered domains on a cleaved GaInP sample is performed using a near-field scanning
optical microscope and scanning capacitance microscope at room temperature in air. Highly
localized photoluminescence spectra obtained by the near-field scanning optical microscope on
these domains show spectral peaks at 680 nm (ordered) and 648 nm (disordered) GaInP. The
near-field scanning optical microscope and scanning capacitance microscope data confirm
previously published data, indicating that the electronic surface structure of ordered GaInP is
significantly different from that of disordered GaInP. Both approaches indicate that the Fermi level
at the surface of ordered GaInP is pinned, while the Fermi level at the surface of disordered GaInP
is not pinned. The size, structure, and position of the ordered and disordered domains observed by
the near-field scanning optical microscope and scanning capacitance microscope agree with those
obtained by cathodoluminescence and Kelvin probe force microscopy. © 1996 American Vacuum
Society.


In the past few years, atomic ordering has been observed
for a wide variety of III-V semiconductor alloys. The
ordering in GaInP affects the optical and electrical properties
of the material and is therefore interesting and important for
optoelectronic devices. It is found that under certain growth
conditions, Ga$_x$In$_{1-x}$P lattice-matched to GaAs
spontaneously orders on the group III sublattice with gallium and
indium atoms preferentially occupying alternate {111} planes.
The ordering is identified by the appearance of superspots in
transmission electron diffraction (TED) patterns. Recently,
atomic force microscopy (AFM) and Kelvin probe force
microscopy (KPFM) have been used to study the surface
morphology and surface electronic properties of GaInP. The
ordering of GaInP reduces the energy band gap, as observed
by cathodoluminescence (CL) and photoluminescence
(PL). The measurement of band gap by PL is macroscopic,
while the measurement of CL provides approximately 1
μm spatial resolution. Recently, partially ordered GaInP was
studied by a low-temperature NSOM with a spatial resolution
of 270 nm. In this work, we report the results of direct
imaging of PL from adjacent ordered and disordered
domains in GaInP by NSOM (Ref. 9) at room temperature and
correlate these measurements with scanning capacitance
microscopy (SCM) with a resolution of 150 nm.
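
The size of the ordering-induced band gap reduction can be gauged from the PL peak wavelengths quoted in the abstract (648 nm for disordered and 680 nm for ordered GaInP). The conversion below is a simple worked example; treating the PL peak energy as a proxy for the band gap is an assumption made here for illustration.

```python
HC_EV_NM = 1239.841984  # h*c in eV*nm


def photon_energy_ev(wavelength_nm):
    """Photon energy in eV for a vacuum wavelength in nm."""
    return HC_EV_NM / wavelength_nm


e_disordered = photon_energy_ev(648.0)
e_ordered = photon_energy_ev(680.0)
shift_mev = (e_disordered - e_ordered) * 1000.0

print(f"disordered: {e_disordered:.3f} eV")
print(f"ordered   : {e_ordered:.3f} eV")
print(f"difference: {shift_mev:.0f} meV")
```

The two peaks are separated by about 90 meV, which is well above the energy width of the 10 nm monochromator bandwidth (roughly 30 meV at these wavelengths) used later to select one domain type at a time.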

Figure 1 shows a schematic of the NSOM used in this
work. The NSOM consists of a shear force imaging
(topography) system and a near-field optical imaging system.
The shear force imaging system is used to control the
distance between the tip and sample. The tip is vibrated at its
resonance frequency (15-30 kHz) with an amplitude of
about 10-20 nm. As the tip approaches the sample surface,
the oscillation of the tip is damped because of the tip-sample
interaction via shear force. Keeping a constant force results
in a constant tip-sample distance. At the heart of the
near-field imaging system is the tip, which is a tapered single-mode
fiber made opaque everywhere except at its very end
by deposition of a layer of aluminum (100 nm). The output
of an Ar$^+$ laser (488 nm) is launched into the fiber. This light
exits through the 150 nm fiber aperture, exciting a small
region of the sample and inducing PL from this region. The PL
signal is collected by a 0.25 NA objective lens from the side.
The pump light (488 nm) is filtered out by a notch filter. The
PL signal is then sent to a monochromator and detected by an
avalanche photodiode (APD) with a dark count of 8 counts
per second.

Figure 2 shows a combination AFM and SCM. A
metal-coated silicon tip is brought to the surface of the
sample. A contact mode AFM is used to position and scan
the tip over the surface of the sample. A high-sensitivity
capacitance sensor (operating at 915 MHz) is connected to
the tip to measure the capacitance between the tip and the
sample. The tip is grounded by an inductor (at low
frequencies) and a 10-100 kHz ac bias voltage is applied to the
sample. The tip-sample capacitance change associated with
the ac bias voltage is measured by the capacitance sensor.
When the tip is scanned across the sample, a topography and
a capacitance change image are obtained simultaneously.

NSOM  and  SCM  measurements  have  been  performed  on 
a  GalnP  sample  (AO-98).  The  ordering  in  this  sample  is 
controlled  by  substrate  misorientation.  The  GalnP  layer  is 
grown  by  organometallic  vapor  phase  epitaxy  (OMVPE)  on 
a  grooved  GaAs  (001)  substrate  misoriented  by  9°  toward 
[110].  The  period  of  the  groove  is  40  /xm,  and  the  depth  is  4 
/xm.  The  grooves  run  along  the  [110]  direction.  The  growth 
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Fig.  1.  Schematic  diagram  of  the  near-field  scanning  optical  microscope. 


temperature  is  670®C  and  growth  rate  is  2  yam/hr.  The  thick¬ 
ness  of  the  GalnP  epilayer  is  approximately  5  fim.  The 
sample  is  cleaved  perpendicular  to  the  surface  and  grooves. 
CL  measurements  indicate  the  existence  of  a  highly  ordered 
domain  of  several  micrometer  length  and  approximately  2 
}im  thickness  at  the  bottom  of  the  grooves  close  to  the  GaAs 
interface,  and  a  highly  disordered  GalnP  region  (3  yi6mX2 
yum)  above  the  ordered  region.  In  Fig.  3,  a  sketch  of  the 
sample  cross  section  is  shown  together  with  the  ordered  and 
disordered  regions  which  have  been  identified  by  CL.^^ 
NSOM PL spectra are obtained by locating the fiber probe
on the cross sectional surface of sample AO-98. A pump
beam (488 nm) is sent through the fiber, providing approximately
1 nW at the surface. The collected PL intensity
is then measured as a function of wavelength with the spectrometer.
The measured PL spectra are shown in Fig. 4. The
peak wavelengths are 648 nm (for the disordered region) and
680 nm (for the ordered region), respectively. The two peaks are
well separated. By centering the monochromator passband (10
nm bandwidth) at 648 nm (to select PL from the disordered regions) or
680 nm (to select PL from ordered regions), a map of the
disordered and ordered PL intensity is obtained. The results


Fig.  3.  Schematic  drawing  of  the  groove  structure  of  sample  AO-98  (in 
cross  section). 


are shown in Figs. 5(a) and 5(b), respectively. The scan area is
9.5 μm by 12.0 μm and the full width at half maximum
(FWHM) of the smallest feature in the NSOM images indicates
a spatial resolution of 150 nm. The corresponding topographic
images are also shown in Figs. 5(c) and 5(d), respectively.
These topographic images show the same area
except for a slight shift of 0.4 μm. The maximum height
variation in the topographic image is less than 100 nm. The
black regions in Figs. 5(a) and 5(b) indicate that no photoluminescence
is detected from a large part of the GaInP epilayer
or from the GaAs substrate. The small white region in
Fig.  5(a)  shows  the  photoluminescence  signal  from  the  dis¬ 
ordered  region,  which  corresponds  to  the  highly  disordered 
domain  shown  in  Fig.  3.  It  is  close  to  the  sample  surface  (the 
growth  surface).  The  size  of  the  domain  is  approximately  2 


Fig. 2. Block diagram of the scanning capacitance microscope.

Fig. 4. Near-field photoluminescence spectra of ordered and disordered
GaInP. [Horizontal axis: wavelength (Å).]
Fig. 5. (a) and (c) Near-field photoluminescence intensity map (λ=648 nm, 10 nm bandwidth) of cross sectioned GaInP/GaAs sample (AO-98) and
simultaneously obtained topography (9.5 μm × 12.0 μm). (b) and (d) Near-field photoluminescence intensity map (λ=690 nm, 10 nm bandwidth) of AO-98
and simultaneously obtained topography (9.5 μm × 12.0 μm). All four images have been taken in the same sample area with the same tip.


μm wide and 3 μm long. The small white region in Fig. 5(b)
represents the highly ordered domain, which is between the
highly disordered domain and the GaAs substrate. The size
of this domain is about 2 μm wide and a few micrometers
long. The positions of these two regions are near the lateral
center of the groove, which can be seen by comparison to the
topographical images [Figs. 5(c) and 5(d)]. There also appears
to be a halo of ordered material around the disordered region
as seen in Fig. 5(b). Seven different locations on sample AO-98
have been measured by NSOM, and it is found that
the size and shape of these domains vary slightly from
groove to groove. The NSOM results are in very good agreement
with those of CL (Ref. 13) and KPFM (Ref. 5). A large
part of the GaInP epilayer shown in Figs. 5(a) and 5(b) emits
no photoluminescence. This region (outside the ordered and
disordered domains identified in Fig. 3) is not well characterized
by other methods and requires further study.
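
As a quick check on how far apart the two PL peaks quoted above (648 nm and 680 nm) are in energy, the wavelengths can be converted to photon energies. The following is a minimal sketch, not part of the original measurement; the function name is illustrative, and the only physical input beyond the two wavelengths is the constant hc ≈ 1239.842 eV nm.

```python
# Photon energies of the two near-field PL peaks quoted in the text.
HC_EV_NM = 1239.842  # h*c expressed in eV*nm


def photon_energy_ev(wavelength_nm):
    """Return the photon energy in eV for a vacuum wavelength in nm."""
    return HC_EV_NM / wavelength_nm


e_disordered = photon_energy_ev(648.0)  # peak assigned to the disordered region
e_ordered = photon_energy_ev(680.0)     # peak assigned to the ordered region
shift_mev = 1000.0 * (e_disordered - e_ordered)

print(f"disordered peak: {e_disordered:.3f} eV")
print(f"ordered peak:    {e_ordered:.3f} eV")
print(f"ordered peak lies lower by about {shift_mev:.0f} meV")
```

Under these numbers the ordered-region peak lies roughly 90 meV below the disordered-region peak, which is why a 10 nm passband is narrow enough to separate the two emissions.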

Figure 6 shows two NSOM photoluminescence line cuts,
approximately perpendicular to the GaInP/GaAs interface,
through the centers of the disordered and ordered domains
shown in Figs. 5(a) and 5(b). The PL intensity in the ordered
domain is smaller than that of the disordered domain,
as can also be seen in Fig. 4. Our previous results, using the


KPFM (Ref. 5), have shown that the surface of ordered GaInP is
electrically pinned, indicating a large surface state density.
This is consistent with the fact that the PL intensity is reduced
in the highly ordered region, since a large surface state
density is expected to decrease the PL yield. Note
that the PL intensity drops relatively sharply at the GaAs/GaInP
interface in Fig. 6, because the carriers quickly diffuse
into the GaAs substrate.

The  SCM  is  also  used  to  image  the  sample  (AO-98)  in 
cross  section.  A  capacitance  change  image  and  a  topographic 


Fig.  6.  Line  cuts,  approximately  perpendicular  to  GaInP/GaAs  interface, 
through  the  center  of  both  ordered  and  disordered  regions  shown  in  Fig.  5(a) 
and  5(b). 
Fig. 7. (a) and (b) Simultaneously obtained capacitance change image and
topographic image (12 μm × 12 μm) of cross sectioned AO-98 sample. This
image is not of the same groove as that of Fig. 5.

image obtained simultaneously by the SCM are shown in
Figs. 7(a) and 7(b). The image size is 12 μm × 11 μm. The
applied ac bias voltage between the tip and the sample is 2.5
V peak at 89.9 kHz. The capacitance change is measured
by a lock-in amplifier at that frequency. In Fig. 7(a), the
black color represents a negligibly small capacitance change
signal. Because the GaAs surface is electrically pinned, there
is a negligible amount of capacitance change with voltage.
Just to the left of the GaAs substrate and at the bottom of the
GaAs groove, there is an approximately 2 μm wide and several
μm long region (black color) which corresponds to the
highly ordered GaInP. The capacitance change signal of the
ordered domain is the same as that of GaAs, indicating that
the ordered domain is also electrically pinned. To the left of
the highly ordered region, there is a region approximately 2
μm wide and 3 μm long (gray color) which corresponds to


the  highly  disordered  region  identified  by  NSOM.  This  re¬ 
gion  has  the  largest  capacitance  change.  Compared  with  the 
PL  images  by  NSOM,  the  size,  shape,  and  position  of  these 
two  domains  are  very  similar.  We  believe  that  the  SCM  and 
NSOM  data  strongly  confirm  our  previous  observations  that 
surface  Fermi-level  pinning  is  directly  related  to  ordering  in 
GaInP. 

Since  the  atomic  structure  of  the  ordered  and  disordered 
GaInP  is  different  in  the  bulk,  it  is  not  unreasonable  to  as¬ 
sume  that  ordering  may  also  modify  the  electronic  structure 
at  the  surface  [in  this  case  the  cleaved  (110)  plane].  Our 
observation  that  the  Fermi-level  of  ordered  GaInP  domains  is 
pinned  at  the  surface,  while  highly  disordered  GaInP  do¬ 
mains  are  unpinned,  supports  the  idea  that  ordering  causes  an 
increase  in  the  density  of  surface  states.  We  are  not  aware  of 
the  existence  of  any  detailed  model  of  the  electronic  struc¬ 
ture  of  GaInP  (ordered  or  disordered)  at  the  surface. 

The  SCM  image  also  shows  several  other  clear  variations 
in  the  GaInP  layer  which  are  not  seen  by  NSOM  and  KPFM. 
We  believe  that  the  contrast  seen  by  SCM  may  be  directly 
related  to  the  degree  of  order  in  the  GaInP  epilayer.  Further 
study  is  required  to  substantiate  this  claim. 

In  conclusion,  NSOM  and  SCM  have  been  used  to  study 
GaInP in which ordering was controlled by substrate misorientation.
The measurements have been made in cross section
and  at  room  temperature.  The  results  clearly  show  that 
NSOM and SCM are capable of distinguishing between ordered
and disordered GaInP. A previous result (KPFM) and
these  results  (NSOM  and  SCM)  all  indicate  that  the  Fermi- 
level  in  ordered  GaInP  is  pinned  at  the  surface  in  contrast  to 
that  of  disordered  GaInP. 
PREFACE


The  First  AVS  Topical  Conference  on  Magnetic  Surfaces,  Interfaces,  and  Nanostructures  was 
held  in  conjunction  with  the  42nd  AVS  National  Symposium  in  Minneapolis.  It  provided  an 
excellent  forum  to  address  fundamental  relationships  between  structure  and  magnetic  properties. 
There  were  15  invited  and  40  contributed  talks  that  spanned  three  and  one  half  days  of  sessions. 
The  highlights  included  presentations  on  (i)  the  fabrication  of  novel,  low-dimensional  magnetic 
heterostructures;  (ii)  state-of-the-art  scanning  probe,  microscopic  and  holographic  characterization 
of  magnetic  domains  and  electronic  structure;  and  (iii)  synchrotron  radiation-based  studies  utiliz¬ 
ing  undulators  and  elliptically  polarized  light,  including  the  first  experiments  to  emerge  from  new 
beamlines  at  the  Advanced  Light  Source.  Electronic  structure  theory  and  computational  results  for 
systems  with  rough  as  well  as  ideal  interfaces  were  compared  to  experiment.  A  variety  of  magnetic 
phase  diagrams  and  transitions  were  discussed  in  terms  of  dynamic  and  equilibrium  concepts.  In 
addition, overviews were presented of the opportunities provided by new magnetoresistive materials
(“giant” and “colossal”), and other diverse areas ranging from spin-polarized device structures
to quantum well behavior in metals.

Attendance at the MSIN sessions was excellent, with standing room only for several of the
invited  talks.  A  straw  vote  at  one  of  the  sessions  revealed  that  a  sizable  fraction  of  the  audience 
(between  33%  and  50%  of  the  116  session  attendees)  was  attending  the  AVS  meeting  either  for  the 
first  time  or  specifically  for  this  topical  conference,  revealing  the  strong  interest  in  this  area  and 
the  need  for  a  continuing  presence  in  future  AVS  symposia  to  serve  both  current  and  new  AVS 
members.  The  organizers  (Berry  Jonker — Naval  Research  Laboratory;  Sam  Bader — Argonne  Na¬ 
tional  Laboratory;  and  Brian  Tonner — University  of  Wisconsin,  Milwaukee)  received  many  posi¬ 
tive  comments  on  the  program  itself  and  encouragement  to  include  a  focus  on  magnetism  in  future 
AVS  programs. 

As  a  result,  this  topical  conference  will  be  held  again  in  1996  at  the  43rd  AVS  National 
Symposium in Philadelphia, with joint sessions planned with the Nanoscale Science and Technology,
Thin Film, and Surface Science Divisions. The 1996 MSIN chairs will be Berry Jonker,
Lennie  Klebanoff  (Lehigh  University),  Brian  Tonner,  and  Dieter  Weller  (IBM  Almaden).  Finally, 
special  thanks  are  due  to  the  Office  of  Naval  Research  for  the  generous  financial  support  provided 
for  this  topical  conference  in  1995  and  1996! 


Berry  Jonker 
Co-Chair,  Program  Committee 
X-ray magnetic microscopy and spectroscopy using a third generation
synchrotron  radiation  source 

A. T. Young,a) H. A. Padmore, and N. V. Smith

Advanced  Light  Source,  Lawrence  Berkeley  National  Laboratory,  Berkeley,  California  94720 
(Received  2  October  1995;  accepted  20  May  1996) 

Applications  of  x-ray  magnetic  circular  dichroism  (XMCD)  to  the  study  of  magnetic  materials  are 
described.  XMCD  spectra  can  be  used  to  quantitatively  determine  magnetic  properties  on  an 
element-specific  basis.  These  spectra  are  also  sensitive  to  the  chemical  state  and  environment  of  the 
element  being  probed.  The  dichroism  effect  can  also  be  used  to  produce  images  of  microscopic 
magnetic  structures  and  domains.  Third  generation  synchrotron  light  sources  are  well  suited  to  these 
experiments.  Current  and  planned  facilities  at  the  Advanced  Light  Source,  the  first  of  the  new  light 
sources  in  the  U.S.,  are  described,  focusing  on  a  new  facility  with  specialized  undulators  which  will 
directly  produce  high  flux,  high  brightness  beams  of  circularly  polarized  x-rays.  With  new 
beamlines  which  have  been  optimized  for  either  spectroscopy  or  microscopy,  this  facility  will 
provide  the  capability  to  provide  detailed  information  about  magnetic  materials.  ©  1996  American 
Vacuum  Society. 


I.  INTRODUCTION 

The  application  of  circularly  polarized  x  rays  to  studies  in 
chemistry  and  physics  has  recently  become  a  field  of  intense 
interest.  Studies  of  systems  as  diverse  as  biological 
molecules^  and  monolayer/thin  film  structures^  have  been 
performed  using  circularly  polarized  x  rays.  In  particular, 
studies  of  magnetic  materials  have  been  of  special  interest. 
Starting with the theoretical study by Erskine and Stern^ and
continuing with the first experimental studies by Schütz et al.^ and
Chen and co-workers,^ x-ray magnetic circular dichroism
(XMCD)  has  been  used  to  study  a  variety  of  magnetic  phe¬ 
nomena  such  as  the  measurement  of  magnetic  moments  and 
the  determination  of  magnetic  coupling.  XMCD  has  also 
been  used  to  produce  images.  Magnetic  microscopy  has  been 
used  to  create  element  specific  images  of  the  magnetic  do¬ 
mains  and  structures  of  interest  to  the  semiconductor/ 
magnetic  recording  industry.^ 

The  Advanced  Light  Source  (ALS)  at  Lawrence  Berkeley 
National  Laboratory  is  ideally  suited  for  both  spectroscopy 
and  microscopy  using  circularly  polarized  x  rays.  In  addition 
to  existing  beamlines  which  have  capabilities  for  circularly 
polarized  x  rays,  the  ALS  is  designing  and  building  a 
complement  of  new  insertion  devices  and  beamlines  to  fa¬ 
cilitate  this  research.  Using  insertion  devices  which  directly 
produce  circularly  polarized  light  makes  available  high  flux, 
high brightness beams of x rays at application-specific beamlines.

This  article  first  reviews  a  few  general  properties  of 
XMCD  spectra.  This  review  will  be  brief  and  will  treat  the 
physics  only  qualitatively.  More  detailed  reviews  of  the 
theory  of  XMCD  are  available.^’^  Section  III  then  describes 
some  of  the  applications  of  XMCD  to  magnetic  materials 
research.  Again,  only  a  few  examples  of  interest  to  magnetic 
measurements  are  presented.  The  present  capabilities  of  the 
ALS with circularly polarized x rays are described. New facilities
being built at the ALS are discussed; in particular, a
new facility with circularly polarizing undulators and optimized
beamlines is described. Finally, the end stations will
be described. For more detailed information about synchrotron
radiation sources and optics, the reader is directed to
several recent reviews.^"

a)Electronic mail: atyoung@lbl.gov

II.  GENERAL  PROPERTIES  OF  XMCD 

XMCD  uses  the  property  that  some  materials,  when  mag¬ 
netized,  exhibit  a  different  absorption  coefficient  for  left  cir¬ 
cularly  polarized  x  rays  than  for  right  circularly  polarized  x 
rays.  The  effect  is  dependent  on  the  direction  of  both  the 
magnetization  vector  and  the  polarization  vector  of  the  x 
rays,  which  for  circular  polarization  is  either  parallel  or  an¬ 
tiparallel  to  the  direction  of  photon  propagation.  Two  spectra 
are  possible:  one  where  the  magnetization  and  the  photon 
polarization  vectors  are  parallel,  and  the  other  with  the  mag¬ 
netization  and  the  polarization  antiparallel.  Switching  be¬ 
tween  these  two  spectra  can  be  done  by  changing  either  the 
magnetization  direction  or  the  photon  polarization. 

Excitation by x rays produces atoms in which one of the
inner shell (core level) electrons is promoted to an empty
valence shell energy level. X-ray absorption probes the
population and state density of both the lower state and the
upper state. For 3d transition metal elements, the upper levels
are the 3d energy levels, which are the states responsible
for magnetic properties. The strongest absorption lines into these
levels are the L2,3 lines (edges), which have the 2p levels as
their lower state. The transition energies range from 574 eV
for vanadium to 952 eV for copper. Rare earth elements have
4f levels as the upper states. The strongest lines for these
levels are the M4,5 lines, which have the 3d levels as their
lower states. These transition energies range from 880 to 1600 eV.

Figure  1  shows  typical  absorption  spectra  for  the  L2,3 
lines  of  nickel.  In  the  upper  panel,  the  solid  and  dashed  lines 
are the spectra obtained with the magnetization vector and
the photon helicity parallel and antiparallel, respectively. The lower panel
Fig. 1. X-ray absorption and XMCD spectra of nickel at the L2,3 core level
edges. Upper panel: absorption spectra with photon helicity parallel (solid)
and antiparallel (dashed). Lower panel: XMCD spectrum.

shows the resulting XMCD spectrum, i.e., the difference between
the two absorption spectra. The intensity ratios of both
the  absorption  spectra  and  the  XMCD  spectra  are  informa¬ 
tive.  Sum  rules  which  relate  the  integral  of  the  XMCD  spec¬ 
tra  to  the  orbital  and  spin  magnetic  moments  have  been 
developed.^^’^^ 
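
The construction of an XMCD spectrum as the difference of two absorption spectra can be illustrated with a short numerical sketch. Everything below is synthetic and assumed for illustration only: the Lorentzian line shapes, the peak positions (placed near 853 and 870 eV to resemble the nickel L3 and L2 edges), the widths, and the amplitudes are not data from this article, and the sum rules themselves are not evaluated because they need quantities (such as the number of 3d holes) that are not given here.

```python
# Synthetic illustration: XMCD = (parallel absorption) - (antiparallel absorption).
# All peak positions, widths, and amplitudes are made-up values.

def lorentzian(energy, center, fwhm, amplitude):
    """Lorentzian line with the given full width at half maximum."""
    half = 0.5 * fwhm
    return amplitude * half ** 2 / ((energy - center) ** 2 + half ** 2)


def trapezoid(y, x):
    """Trapezoidal integral of y over x."""
    return sum(0.5 * (y[i] + y[i + 1]) * (x[i + 1] - x[i]) for i in range(len(x) - 1))


energies = [840.0 + 0.1 * i for i in range(401)]  # 840 to 880 eV

mu_parallel = [lorentzian(e, 853.0, 1.5, 1.00) + lorentzian(e, 870.0, 1.5, 0.30)
               for e in energies]
mu_antiparallel = [lorentzian(e, 853.0, 1.5, 1.30) + lorentzian(e, 870.0, 1.5, 0.25)
                   for e in energies]

# Difference spectrum, point by point.
xmcd = [p - a for p, a in zip(mu_parallel, mu_antiparallel)]

# Integrate the difference separately over the two edges (split midway between them).
split = next(i for i, e in enumerate(energies) if e >= 861.5)
area_l3 = trapezoid(xmcd[:split + 1], energies[:split + 1])
area_l2 = trapezoid(xmcd[split:], energies[split:])

peak_index = xmcd.index(min(xmcd))
print(f"largest dichroism at {energies[peak_index]:.1f} eV")
print(f"integrated XMCD, L3 region: {area_l3:.3f}")
print(f"integrated XMCD, L2 region: {area_l2:.3f}")
```

The two edge-resolved integrals are the kind of quantity that the sum rules take as input; with these invented amplitudes the L3 integral is negative and the L2 integral is positive.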

To  use  XMCD  to  produce  images,  the  energy  of  the  ex¬ 
citing  x-ray  beam  is  held  fixed  at  the  center  of  the  XMCD 
peak.  With  a  given  photon  helicity,  the  intensity  of  the  re¬ 
sponse  of  the  material  will  depend  upon  the  magnetization  of 
the  material.  If  the  response  from  the  sample  can  be  spatially 
resolved,  a  map  of  the  magnetization  of  the  material  can  be 
produced. 

There  are  two  primary  schemes  to  image  the  surface.  The 
first  technique  uses  the  x-ray  beam  to  illuminate  a  small  but 
finite  area.  An  electron  microscope  is  then  used  to  create  an 
image  from  the  photoelectrons  that  have  been  liberated.  This 
technique  has  the  advantage  of  acquiring  the  entire  image  at 
once.  Ultimate  resolution  is  determined  by  the  resolution  of 
the electron microscope. The second technique focuses the
x-ray beam to as small a focal spot as possible and records
the signal. Spatial resolution is set by the size of the focal
spot. The spot is then scanned over the entire sample, either
by  translating  the  focusing  element  or  by  moving  the  sample. 
This  technique  has  the  advantage  that  a  variety  of  detection 
techniques  can  be  used,  including  transmission,  fluorescence, 
and  photoelectron  detection. 


III.  APPLICATIONS  OF  XMCD 

There  have  been  a  large  number  of  experiments  using 
XMCD.  These  can  be  divided  into  two  major  categories: 
spectroscopy  and  microscopy.  A  variety  of  systems  has  been 
studied  with  spectroscopy.  Magnetic  multilayer  thin  films 
have been a frequent topic of interest. Monolayer (ML) films
of iron on single crystal copper have been studied, with
large dichroism effects detected. Nickel ultrathin films grown
on Cu(100) have been studied, and it was determined that
the  easy  axis  of  magnetization  varied  with  the  thickness  of 
the  layer.  It  was  determined  that  the  axis  changed  from  par¬ 
allel  to  the  surface  for  5-9  ML  to  perpendicular  for  10-75 
ML.  In  another  experiment,  the  exchange  coupling  between 
two  layers  has  been  studied  as  a  function  of  the  width  of  an 
intervening  layer.  In  this  experiment,  the  coupling  between 
a  cobalt  layer  and  a  nickel-iron  layer  with  an  intervening 
wedge  of  ruthenium  was  determined  using  XMCD.  It  was 
found  that  the  direction  of  magnetization  oscillates  with  the 
thickness  of  the  ruthenium. 

Quantitative  measurements  of  magnetic  properties  have 
also  been  made.  For  example,  element  specific  magnetization 
curves  have  been  obtained.  Iron/copper/cobalt  trilayers 
were  measured  using  XMCD.  Figure  2  shows  the  hysteresis 
curves  obtained  with  XMCD  and  a  conventional  method.  As 
can  be  seen,  the  iron  and  cobalt  hysteresis  curves  are  very 
different,  with  different  saturation  and  coercive  fields.  This  is 
in  contrast  to  measurements  made  on  bulk  alloys,  in  which 
the  hysteresis  curves  for  the  two  elements  are  identical.  The 
sum  of  the  two  curves  yields  the  curve  obtained  convention¬ 
ally.  This  experiment  also  illustrates  the  fact  that  XMCD 
measurements  are  element  specific.  Another  experiment  il¬ 
lustrates  that  the  magnetic  moment  can  be  decomposed  into 
its orbital and spin contributions. Using the sum rules mentioned
above, Chen and co-workers determined that the
orbital moment for a cobalt thin film is 0.14 μB, and the spin
moment is 1.52 μB.
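
The statement that a sum of the two element-specific curves reproduces the conventionally measured curve amounts to a two-parameter linear least-squares fit. The sketch below shows that step on synthetic data; the tanh-shaped branches, the coercive fields, and the 0.6/0.4 weights are assumptions chosen for illustration and are not values from the trilayer experiment.

```python
import math

# Synthetic illustration of fitting a total magnetization curve as a linear
# combination of two element-specific curves. All numbers are invented.

def branch(field, coercive_field, width):
    """One branch of a model hysteresis loop (smooth step at the coercive field)."""
    return math.tanh((field - coercive_field) / width)


def fit_two_components(a, b, total):
    """Least-squares weights (wa, wb) minimizing sum((wa*a + wb*b - total)**2)."""
    saa = sum(x * x for x in a)
    sbb = sum(y * y for y in b)
    sab = sum(x * y for x, y in zip(a, b))
    sat = sum(x * t for x, t in zip(a, total))
    sbt = sum(y * t for y, t in zip(b, total))
    det = saa * sbb - sab * sab
    wa = (sat * sbb - sab * sbt) / det
    wb = (saa * sbt - sab * sat) / det
    return wa, wb


fields = [-100.0 + 2.0 * i for i in range(101)]   # arbitrary field units
fe_curve = [branch(h, 10.0, 5.0) for h in fields]   # "Fe-like" branch
co_curve = [branch(h, 40.0, 8.0) for h in fields]   # "Co-like" branch
total_curve = [0.6 * f + 0.4 * c for f, c in zip(fe_curve, co_curve)]

w_fe, w_co = fit_two_components(fe_curve, co_curve, total_curve)
print(f"fitted weights: Fe {w_fe:.3f}, Co {w_co:.3f}")
```

Because the two model curves switch at different fields, they are linearly independent and the fit recovers the weights used to build the total curve.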

Much  less  work  has  been  done  on  magnetic  microscopy. 
Stöhr et al. have imaged the magnetic bit structure of a magnetic
recording disc.^ Figure 3 shows a disc which has domains
with alternating magnetization direction. The substrate
is a cobalt/platinum/chromium alloy, and the XMCD spectrum
was taken at the cobalt absorption edges. The light and
dark areas correspond to regions where the magnetization
direction is either parallel or antiparallel to the polarization
direction. The narrowest domains are 0.5 μm wide. Another
example of imaging is the work by Tonner et al., who measured
iron/terbium/cobalt thin films. In this experiment, the
images were obtained at the iron and terbium edges.
Fig. 2. Element specific magnetometry using XMCD. The top panel shows
the  Fe  and  Co  L3  line  intensities  as  a  function  of  applied  magnetic  field  for 
a  Fe/Cu/Co  trilayer.  The  bottom  panel  shows  the  comparison  between  an 
overall  hysteresis  curve  obtained  using  conventional  methods  (solid)  and  a 
linear  combination  fit  of  the  two  XMCD  hysteresis  curves  (dotted). 


IV.  CIRCULAR  POLARIZATION  CAPABILITIES  AT 
THE  ALS 

X-ray  science  has  enjoyed  a  renaissance  in  recent  years 
with  the  introduction  of  synchrotron  light  sources.  These 
light  sources  use  the  fact  that  charged  particles  emit  light 
when  bent  by  a  magnetic  field.  The  spectrum  of  light  that  is 
emitted  is  a  function  of  the  mass  and  energy  of  the  particle 
and  the  strength  of  the  magnetic  field.  To  produce  high  en¬ 
ergy  photons  such  as  x  rays,  electrons  accelerated  to  the  GeV 
energy  range  are  used  in  combination  with  magnetic  fields  of 
about 1-5 T. This produces light from the far infrared to x
rays beyond 10 keV (a wavelength of 1.2 Å). In addition, because
these high energy electrons are relativistic,
the photons are emitted in a very narrow,
well-collimated beam, propagating in the same direction as
the electron. Pictorially, this is analogous to a searchlight
(the x-ray beam) sweeping around an arc.
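
Two of the numbers behind this picture are easy to reproduce: the wavelength of a 10 keV photon and the characteristic opening angle of the radiation cone, which is of order 1/γ for a relativistic electron. The sketch below is illustrative only; the 1.9 GeV beam energy is an assumed value that is not stated in this article, and the function names are hypothetical.

```python
# Rough numbers for synchrotron radiation from a relativistic electron beam.
HC_KEV_ANGSTROM = 12.39842      # h*c in keV*Angstrom
ELECTRON_REST_MEV = 0.510999    # electron rest energy in MeV


def wavelength_angstrom(photon_energy_kev):
    """Photon wavelength in Angstrom for a photon energy in keV."""
    return HC_KEV_ANGSTROM / photon_energy_kev


def lorentz_factor(beam_energy_gev):
    """Lorentz factor gamma of an electron with the given total energy."""
    return beam_energy_gev * 1000.0 / ELECTRON_REST_MEV


wavelength = wavelength_angstrom(10.0)
gamma = lorentz_factor(1.9)              # ASSUMED beam energy, not from the text
opening_angle_mrad = 1000.0 / gamma      # ~1/gamma, expressed in mrad

print(f"10 keV photon wavelength: {wavelength:.2f} Angstrom")
print(f"gamma: {gamma:.0f}, opening angle ~ {opening_angle_mrad:.2f} mrad")
```

The first result reproduces the 1.2 Å quoted in the text; the second shows why the emission resembles a searchlight, since the cone is only a fraction of a milliradian wide at GeV energies.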


Fig.  3.  XMCD  images  of  the  magnetic  domains  on  a  magnetic  hard  disc. 
The  light  and  dark  regions  correspond  to  the  domains  having  a  magnetiza¬ 
tion  direction  either  parallel  or  antiparallel  to  the  x-ray  polarization. 


Third  generation  light  sources,  of  which  the  ALS  is  the 
first  in  the  world  designed  for  soft  x  rays,  have  optimized  the 
design  of  the  storage  ring  to  produce  high  quality  photon 
beams.  Third  generation  sources  use  an  electron  beam  which 
has  a  small  cross  sectional  area  and  small  divergence,  giving 
low  emittance.  This  makes  it  possible  to  focus  the  emitted 
photon  beam  to  small  sizes.  In  addition,  the  design  of  the 
electron storage ring is in the shape of a polygon with many
sides  or  straight  sections,  rather  than  a  circle  or  oval.  In  the 
case  of  the  ALS,  there  are  twelve  straight  sections.  At  the 
vertices  of  the  polygon,  bend  magnets  are  placed  which  pro¬ 
duce  synchrotron  radiation  as  described  before.  In  the 
straight  sections,  special  periodic  arrays  of  magnets  can  be 
placed.  These  periodic  arrays,  or  insertion  devices,  produce  a 
much  higher  photon  flux  than  a  single  bend  magnet. 

There are two types of insertion devices: wigglers and
undulators. Wigglers use magnets with a high magnetic field
and  produce  a  high  flux  photon  beam  with  a  broad  energy 
spectrum.  Undulators  use  periodic  magnet  arrays  of  lower 
field;  however,  because  of  constructive  interference  between 
the  photons  in  the  undulator,  the  beam  is  much  narrower  than 
the  bend  magnet  beams,  and  is  quasimonochromatic.  Undu¬ 
lators  thus  produce,  within  a  narrow  bandwidth,  more  flux 
and  have  higher  spectral  brightness  (the  number  of  photons 
per  cross  sectional  area,  angular  divergence,  and  photon  en¬ 
ergy  spread  per  second)  than  bend  magnets.  The  spectral 
brightness  of  typical  ALS  undulators  is  four  to  five  orders  of 
magnitude higher than that from bend magnet sources.

Synchrotron  radiation  is  emitted  with  specific  polarization 
properties.  From  bend  magnets,  the  light  emitted  in  the  plane 
of  the  electron  orbit  is  polarized  horizontally.  As  one  moves 
out  of  the  orbit  plane,  though,  the  polarization  of  the  beam 
becomes elliptical, becoming more and more circular the further
out of plane one goes. The flux of photons drops rapidly,
however. The variation of flux and polarization with
angle  is  a  function  of  the  photon  energy.  Using  apertures,  the 
degree  of  circular  polarization  can  be  selected. 

Undulators  and  wigglers  also  produce  horizontally  polar¬ 
ized  light.  However,  conventional  wigglers  and  undulators 
do  not  produce  circularly  polarized  radiation.  To  produce  cir¬ 
cularly  polarized  radiation,  a  variety  of  special  insertion  de¬ 
vices  have  been  proposed,  although  not  all  designs  allow  for 
complete  control  of  the  energy  or  helicity  of  the  radiation. 
The  ALS  is  designing  and  building  a  set  of  these  special 
undulators  which  allows  for  user-controlled  energy  and  po¬ 
larization. 

A.  Presently  operating  facilities 

Beamline  9.3.2  is  a  bend  magnet  beamline  which  has 
been  optimized  for  high  resolution  spectroscopy.^^  With  an 
energy range from 30 to 1500 eV, it enables research with the
most interesting 3d transition metals and 4f rare earths, including
nickel, iron, cobalt, and neodymium. Equipped with
a  monochromator  which  has  a  resolving  power  greater  than 
7000,  it  is  capable  of  determining  shifts  in  the  absorption 
spectra  which  occur  due  to  differences  in  the  chemical  envi¬ 
ronment  of  the  element  probed. 

To  use  the  circularly  polarized  x  rays,  an  aperture  is  used 
to  select  a  portion  of  the  out  of  plane  radiation.  At  a  modest 
reduction  in  photon  flux,  the  beam  can  have  a  high  degree  of 
circular  polarization.  Typically,  a  degree  of  circular  polariza¬ 
tion  of  about  0.8  is  used,  corresponding  to  collecting  radia¬ 
tion  from  about  0.2  to  0.5  mrad.  Measurements  of  the  polar¬ 
ization  using  a  reflection  polarimeter  have  verified  the 
performance  of  this  system. 

Multiple  end  stations  can  be  accommodated  at  this  beam¬ 
line.  These  include  the  advanced  photoelectron  spectrometer 
diffractometer  (APES)  which  is  capable  of  a  variety  of  sur¬ 
face  science  experiments,  such  as  high  resolution  photoelec¬ 
tron  diffraction;  the  angle-resolved  photoemission  spectrom¬ 
eter  (ARPES)  with  a  rotatable  electrostatic  electron-energy 
analyzer;  and  the  applied  materials  chamber  (AMC).  Using 
the  AMC,  XMCD  measurements  can  rapidly  be  made  on  a 
variety  of  samples. 

In  addition  to  this  facility,  beamline  6.3.2  at  the  ALS  also 
has  a  limited  ability  to  produce  circularly  polarized  x  rays.  It, 
too,  is  a  bend  magnet  beamline  and  can  collect  a  portion  of 
the  out-of-plane  radiation. 

B.  Facilities  under  construction 

1. Beamline 7.3.1 bend magnet endstation optimized
for  microscopy 

Beamline  7.3.1  is  also  a  bend  magnet  beamline.  It  is  op¬ 
timized  for  full  field  photoelectron  microscopy 
applications.^^  Because  the  electron  beam  emittance  from 
third  generation  synchrotrons  is  so  low,  with  careful  optical 
design,  bend  magnet  radiation  can  be  focused  to  produce 
very small illuminated areas, typically 50 μm in diameter. Microscopy
experiments in general require a high photon flux. Because
of this, the optical design of the beamline has been


optimized  to  maximize  the  photon  flux  through  the  beamline 
to  the  endstation.  The  microscopy  applications  generally  do 
not  require  high  spectral  resolution.  This  leads  to  a  simplified 
monochromator design which improves the photon throughput.
A resolving power of at least 1000 will be achieved
throughout the energy range of the system. This range, 275-1500
eV, extends from the carbon 1s absorption edge at the low
end to the M edges of the rare earths at the high end. The
performance  is  optimized  at  approximately  800  eV,  the  en¬ 
ergy  of  the  L  edges  of  iron,  cobalt,  and  nickel. 
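
To put the quoted resolving powers in energy terms, the bandwidth at a given photon energy is simply the energy divided by the resolving power. The short sketch below compares the two beamlines at 800 eV; the function name is illustrative, and the comparison assumes the quoted figures (at least 1000 here, greater than 7000 for beamline 9.3.2) apply at that energy.

```python
def energy_bandwidth_ev(photon_energy_ev, resolving_power):
    """Energy bandwidth dE = E / R for resolving power R = E / dE."""
    return photon_energy_ev / resolving_power


microscopy_bw = energy_bandwidth_ev(800.0, 1000.0)    # microscopy beamline
spectroscopy_bw = energy_bandwidth_ev(800.0, 7000.0)  # spectroscopy beamline

print(f"R = 1000 at 800 eV: {microscopy_bw:.2f} eV bandwidth")
print(f"R = 7000 at 800 eV: {spectroscopy_bw:.2f} eV bandwidth")
```

A bandwidth of about 0.8 eV is adequate for sitting on an absorption edge to form an image, whereas resolving small chemical shifts calls for the roughly 0.1 eV bandwidth of the spectroscopy beamline.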

Selection  of  the  helicity  of  the  circular  polarization  will 
be  performed  using  a  chopper/aperture  to  collect  out  of  plane 
radiation.  To  improve  the  XMCD  performance,  the  chopper 
can  be  used  to  switch  the  helicity  at  rates  up  to  0.5  Hz.  This 
capability  will  allow  for  reductions  in  noise,  as  well  as  a 
modest  capability  to  measure  time  dependent  processes. 

The  endstation  for  beamline  7.3.1  will  be  a  photoemission 
electron microscope (PEEM). Details of this type of microscope
have been described elsewhere. With this type of
microscope, the photoelectrons ejected from the sample by
the x rays are used by an electron microscope to image the
surface.  Sub-micron  spatial  resolution  has  been  demonstrated 
with  this  instrument,  and  with  planned  improvements,  reso¬ 
lution  of  <100  nm  structures  should  be  achievable.  This  ex¬ 
perimental  facility  is  currently  under  construction  and  will  be 
ready  for  use  by  June  of  1996. 


2. Beamline 4.0 undulator facility for spectroscopy
and  microscopy 

Beamline  4.0  is  an  undulator-based  facility  designed  spe¬ 
cifically  for  circular  polarization  applications.  The  insertion 
devices  will  be  located  in  a  single  insertion  device  straight 
section, where two undulator stations will be placed end-to-end.
Small bending magnets will produce a chicane in the
straight  section,  directing  the  electron  beam  first  through  one 
station,  then  through  the  other,  producing  a  1.65  mrad  angle 
between  the  two  optical  axes.  Undulators  can  be  positioned 
and  operated  at  both  stations  simultaneously.  In  addition,  at 
each  station,  a  translation  mechanism  will  allow  either  of 
two  undulators  to  be  placed  in  the  beam.  With  this  arrange¬ 
ment,  a  total  of  four  different  undulators  will  be  available  for 
use  at  the  two  beamlines,  allowing  for  great  flexibility  in  the 
energy  range  or  helicity  switching.  The  output  from  either 
station  can  be  directed  to  either  of  the  two  branchlines,  or  the 
output  from  both  stations  can  be  directed  to  the  same 
branchline. 

To directly produce circularly polarized radiation, an undulator
design similar to that of Sasaki and Carr has been
adopted. The magnetic design of this insertion device has
been modified to produce higher magnetic fields on the axis
of the undulator, leading to an expanded photon energy
range.  The  device  is  capable  of  producing  polarized  light  of 
any  ellipticity:  horizontal,  vertical,  circular,  or  elliptical.  Us¬ 
ers  will  be  able  to  select  the  mode  used.  In  addition  they  will 
be  able  to  change  the  helicity  of  the  circularly  or  elliptically 
polarized light every few seconds.

Table I. Energy range of a 5 cm period elliptically polarizing undulator.

Mode        Peak Bx (T)  Peak By (T)  Min. energy (eV)  Max. energy (eV)
Elliptical  0.228        0.502        100               1500
Helical     0.424        0.424        140               680
Horizontal  0            0.793        85                1500
Vertical    0.502        0            180               1500

The  figure  of  merit  considered  for  the  evaluation  of  per¬ 
formance  is  defined  as 

M = P_c^2 F,

where P_c is the degree of circular polarization, and F is the
flux.  The  merit  function  brightness  is  also  considered  (where 
brightness  is  substituted  for  flux  in  the  above  equation).  Flux, 
brightness, and degree of circular polarization have been calculated
using the formalism described by Kim for the planar
and helical cases and generalized by Marks for the
elliptical  case. 

The  first  undulator  to  be  installed  in  the  ALS  circular 
polarization  facility  will  be  a  5  cm  period,  1.95  m  long  de¬ 
vice  with  37  full  strength  periods.  The  energy  ranges  of  the 
various modes of polarization are summarized in Table I.
These correspond to an electron energy of 1.9 GeV. Figure 4
shows  the  calculated  performance  for  the  merit  function  flux 
and  brightness  for  this  device  for  pure  helical  mode  and  for 
the  first,  third,  and  fifth  harmonics  when  the  undulator  is 
operated  in  an  elliptical  mode.  As  can  be  seen,  operation  in 
pure  helical  mode  gives  slightly  higher  merit  function  per¬ 
formance,  but  with  a  very  restricted  energy  range,  as  pure 
helical  motion  produces  no  higher  harmonics.  In  elliptical 
mode,  this  undulator  produces  usable  output  from  100  to 
1800  eV.  This  energy  range  covers  the  important  core  levels 
for magnetic materials: the L2,3 edges of 3d transition metals
and the M4,5 edges of rare earths. Performance has been
optimized for coverage of the iron, nickel, and cobalt absorption
edges, near 800 eV. These calculations do not include
electron beam energy spread, which will decrease the performance
of the higher harmonics. For this reason, in determining
the high energy performance, only the first and third
harmonics have been considered. To reach even lower energies,
a 7.5 cm period device has also been designed. With
this device, energies as low as 20 eV will be available.

Fig. 4. Calculated performance curves of a 5.0 cm period elliptically polarizing
undulator. The flux (brightness) merit function is plotted, which is the flux
(brightness) x degree-of-polarization (Ref. 2). Electron beam energy is 1.9
GeV.
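
As a rough cross-check of the minimum energies in Table I, the peak fields can be converted to a first-harmonic photon energy with the standard on-axis undulator relations K = 0.934 B[T] lambda_u[cm] and E1[keV] = 0.950 E^2[GeV] / (lambda_u[cm] (1 + Kx^2/2 + Ky^2/2)). These textbook relations and the function names below are assumptions of this sketch; they are not the design calculation used for the device. With the 5 cm period and 1.9 GeV quoted in the text, the horizontal, helical, and vertical rows land within a few eV of the tabulated minima, while the elliptical row comes out well above the tabulated 100 eV, so that entry may be defined differently.

```python
def deflection_parameter(peak_field_t, period_cm):
    """Undulator deflection parameter K = 0.934 * B[T] * lambda_u[cm]."""
    return 0.934 * peak_field_t * period_cm


def fundamental_energy_ev(electron_gev, period_cm, bx_t, by_t):
    """On-axis first-harmonic photon energy in eV at the given peak fields.

    Assumed textbook relation (not taken from the article):
    E1[keV] = 0.950 * E^2[GeV] / (lambda_u[cm] * (1 + Kx^2/2 + Ky^2/2)).
    """
    kx = deflection_parameter(bx_t, period_cm)
    ky = deflection_parameter(by_t, period_cm)
    e1_kev = 0.950 * electron_gev ** 2 / (
        period_cm * (1.0 + kx ** 2 / 2.0 + ky ** 2 / 2.0)
    )
    return 1000.0 * e1_kev


# Peak fields (Bx, By) in tesla, copied from Table I.
TABLE_I_FIELDS = {
    "Elliptical": (0.228, 0.502),
    "Helical": (0.424, 0.424),
    "Horizontal": (0.0, 0.793),
    "Vertical": (0.502, 0.0),
}

if __name__ == "__main__":
    for mode, (bx, by) in TABLE_I_FIELDS.items():
        e1 = fundamental_energy_ev(1.9, 5.0, bx, by)
        print(f"{mode:10s} first harmonic at peak field: {e1:6.1f} eV")
```

Because the fundamental is lowest at the highest field, the value returned at the peak field is the estimate to compare with the "Min. energy" column.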

Emerging  from  the  undulators,  the  two  photon  beams  are 
separated  by  1.65  mrad.  After  passing  through  the  shield 
wall,  the  beams  will  enter  a  mirror  switchyard.  Using  this  set 
of mirrors, the output from either (or both) undulator station(s)
can be directed to either of the branchlines. In the
standard  mode,  each  undulator  beam  would  be  directed  to 
one  of  the  beamlines  by  a  translating  mirror.  To  send  both 
beams  to  the  microscopy  branchline,  the  mirror  for  the  spec¬ 
troscopy  line  would  be  retracted  and  the  beam  allowed  to 
propagate  to  a  second  mirror  which  would  then  direct  the 
beam  down  the  microscopy  line.  A  similar  procedure  is  used 
to  direct  both  beams  to  the  spectroscopy  line.  Sending  the 
output  from  both  undulator  stations  down  the  same  beamline 
will  allow  for  experiments  which  require  rapid  changes  in 
polarization.  For  these  experiments,  undulators  in  the  two 
stations  will  be  set  to  the  same  photon  energy  and  equal  but 
opposite  helicity.  Although  both  beams  propagate  down  the 
same  beamline,  in  some  places  they  are  physically  separated. 
At  one  of  these  points,  a  mechanical  chopper  is  placed  which 
will  alternately  block  one  polarization  or  the  other. 
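
One way data from such a chopper-gated measurement might be reduced is sketched below. It assumes, purely for illustration, that the detector signal is recorded as a list of readings that alternate between the two helicities on successive chopper half-cycles, starting with positive helicity; the function name and the data layout are hypothetical. The asymmetry (I+ - I-)/(I+ + I-) is the usual way of expressing a dichroism signal.

```python
def dichroism_asymmetry(samples):
    """Return (I+ - I-) / (I+ + I-) from helicity-interleaved readings.

    Assumption of this sketch: even-indexed readings were taken with
    positive helicity and odd-indexed readings with negative helicity.
    """
    plus = samples[0::2]
    minus = samples[1::2]
    if not plus or not minus:
        raise ValueError("need at least one reading for each helicity")
    mean_plus = sum(plus) / len(plus)
    mean_minus = sum(minus) / len(minus)
    return (mean_plus - mean_minus) / (mean_plus + mean_minus)


if __name__ == "__main__":
    # Made-up readings, in arbitrary units, for illustration only.
    readings = [10.0, 8.0, 10.2, 7.9, 9.9, 8.1]
    print(f"asymmetry = {dichroism_asymmetry(readings):.4f}")
```

Averaging many rapidly alternated pairs in this way is what lets slow drifts in beam intensity cancel between the two helicities.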

The  conceptual  design  of  the  circular  polarization  beam¬ 
lines  is  shown  in  Fig.  5.  Two  main  beamlines  will  be  built  for 
this  facility.  The  optical  design  of  each  branchline  is  tailored 
to  suit  one  of  the  two  major  research  areas,  microscopy  or 
high  resolution  spectroscopy.  For  spectroscopy  studies,  a 
wide  energy  range  will  be  covered  from  20  to  1800  eV,  with 
a  resolution  of  up  to  10  000  at  100  eV.  For  the  microscopy 
beamline,  a  higher-throughput,  lower  resolution  monochro¬ 
mator  has  been  designed.  It  is  entrance-slitless,  with  an  en¬ 
ergy  range  of  100-1800  eV  and  a  resolution  of  about  1500. 
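
The resolution figures quoted here are resolving powers, E divided by the energy bandwidth. A minimal sketch of the conversion to an energy bandwidth follows; the function name is hypothetical, and the choice of 800 eV as a sample point for the microscopy branch is an assumption made only because that is the region of the iron, cobalt, and nickel edges mentioned in the text.

```python
def energy_bandwidth_ev(photon_energy_ev, resolving_power):
    """Energy bandwidth dE = E / R for a resolving power R = E / dE."""
    if resolving_power <= 0:
        raise ValueError("resolving power must be positive")
    return photon_energy_ev / resolving_power


if __name__ == "__main__":
    # Spectroscopy branch: resolving power up to 10 000 at 100 eV.
    print(f"{energy_bandwidth_ev(100.0, 10000.0) * 1000:.1f} meV")
    # Microscopy branch: resolving power of about 1500, sampled at 800 eV.
    print(f"{energy_bandwidth_ev(800.0, 1500.0):.2f} eV")
```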

The  microscopy  beamline  will  have  provisions  for  two 
end  stations,  each  with  its  own  monochromator  exit  aperture. 
One  end  station  will  be  a  PEEM,  a  similar  but  improved 
version of the instrument used on beamline 7.3.1. The current
generation of PEEMs suffers from resolution degradation
due to chromatic aberration of the lens, which arises from the
spread in the kinetic energy of the photoelectrons. The new
PEEM  will  focus  all  electrons  to  the  image  plane,  indepen¬ 
dent  of  their  kinetic  energy.  Ultimate  resolution  of  an  ad¬ 
vanced  PEEM  should  be  better  than  20  nm. 

The  other  microscope  end  station  will  be  equipped  for 
scanning  Fresnel  zone  plate  microscopy.  Zone  plates  are  fo¬ 
cusing  elements  which  rely  on  diffraction  rather  than  refrac¬ 
tion,  like  lenses,  or  reflection,  like  mirrors.  They  consist  of  a 
series  of  concentric  rings  with  a  decreasing  spacing  as  the 
radius increases. The radii of the rings are chosen so that the
light  passing  through  all  the  rings  is  diffracted  to  a  single 
point.  Zone  plates  are  excellent  optical  elements  for  x-ray 
applications  as,  because  of  absorption,  there  are  no  materials 
which  can  be  used  as  lenses.  Mirrors  for  x-ray  applications 
also suffer from absorption as well as the requirement that
the mirrors need to be used at grazing incidence. A typical
zone plate in use at the ALS has 100 zones (rings), a diameter
of 100 µm, and an outer zone width of 30 nm.

Fig. 5. Conceptual design of the ALS beamline 4.0, a new facility with elliptically polarizing undulators and beamlines optimized for spectroscopy and
microscopy.

The  resolution  of  a  zone  plate  microscope  is  derived  from 
the  size  of  the  focused  spot.  This,  in  turn,  is  approximately 
equal  to  the  width  of  the  outer  zone.  This  is  currently  limited 
by  microfabrication  technology  to  about  30  nm. 
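
The relation between outer zone width and spot size can be made concrete with two standard zone plate formulas, which are assumptions of this sketch rather than statements from the text: the Rayleigh resolution is about 1.22 times the outer zone width, and the first-order focal length is f = D dr / lambda for diameter D, outer zone width dr, and wavelength lambda. The example takes the quoted diameter as 100 micrometers and the outer zone width as 30 nm, and picks 800 eV as an illustrative photon energy.

```python
HC_EV_NM = 1239.84193  # h*c in eV*nm


def wavelength_nm(photon_energy_ev):
    """Photon wavelength in nm for a photon energy in eV."""
    return HC_EV_NM / photon_energy_ev


def rayleigh_resolution_nm(outer_zone_width_nm):
    """Rayleigh resolution of a zone plate, about 1.22 * outer zone width."""
    return 1.22 * outer_zone_width_nm


def focal_length_um(diameter_um, outer_zone_width_nm, photon_energy_ev):
    """First-order focal length f = D * dr / lambda, returned in micrometers.

    D in micrometers times dr in nm divided by lambda in nm leaves micrometers.
    """
    return diameter_um * outer_zone_width_nm / wavelength_nm(photon_energy_ev)


if __name__ == "__main__":
    print(f"resolution   ~ {rayleigh_resolution_nm(30.0):.1f} nm")
    print(f"focal length ~ {focal_length_um(100.0, 30.0, 800.0) / 1000:.2f} mm")
```

The focal length scales linearly with photon energy, so a scanning instrument has to refocus whenever the monochromator energy is changed.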

Detection  in  a  zone  plate  microscope  can  be  via  a  variety 
of  techniques,  depending  upon  the  sample.  Using  x-ray  de¬ 
tectors,  transmission  through  the  sample  can  be  measured,  as 
can  fluorescence  excited  by  the  synchrotron  beam.  Alterna¬ 
tively,  photoelectrons  can  be  collected.  Images  are  collected 
by  scanning  the  x-ray  spot  across  the  sample,  either  by  trans¬ 
lating  the  zone  plate  or  by  moving  the  sample. 

The  spectroscopy  beamline  will  also  have  provisions  for 
two  end  stations.  For  this  beamline,  a  rotating  experimental 
platform  will  allow  two  end  stations  to  be  attached  and 
aligned  to  the  beamline  at  a  time.  Many  of  the  types  of 
experiments  to  be  run  on  this  beamline  are  similar  to  the 
experiments performed on beamline 9.3.2; thus, many of the
endstations  will  be  similar.  However,  the  much  greater  pho¬ 
ton  flux  available  from  the  undulators  will  enable  experi¬ 
ments  which  are  not  currently  possible.  For  example,  experi¬ 
ments  with  samples  which  are  very  dilute  in  the  species  of 
interest  or  which  have  a  finite  lifetime  will  benefit  from  the 
increased  photon  flux.  Typical  end  stations  which  will  be 
used on this beamline will include UHV angle-resolved photoelectron
spectroscopy chambers and general purpose chambers
equipped for surface science and materials research. For
XMCD  measurements,  chambers  with  magnets  which  can 
alternate  their  polarity  will  be  used.  Systems  with  permanent 
magnets  or  electromagnets  can  be  used.  In  particular,  super¬ 
conducting  magnet  chambers  for  measurements  at  fields 
above  1  T  will  be  available. 

V.  SUMMARY 

XMCD has proven, in a short time, to be a valuable technique
for characterizing magnetic materials. Third generation
synchrotron  light  sources  like  the  ALS  can  provide  high  flux, 
high  brightness  beams  of  x  rays  for  this  research.  At  the 
ALS, in addition to presently available bend magnet beamlines,
two new facilities are being constructed: a new microscopy
beamline, and a facility with circularly polarizing undulators
and beamlines optimized for high resolution spectroscopy
and microscopy. These new facilities will provide a new resource for
probing magnetic materials.


Ferromagnetic domains of a single crystal of neodymium-iron-boron, Nd2Fe14B (one of the
strongest  permanent  magnetic  materials  known)  are  imaged  by  focusing  a  beam  of  photoelectrons 
with  electrostatic  optics  in  a  photoelectron  emission  microscope.  Photoelectrons  emitted  from  the 
surface  are  deflected  laterally  into  two  opposite  directions  by  stray  magnetic  fields  that  exist  above 
the  domains.  The  photoelectron  beam  is  partially  split  into  two.  Magnetic  contrast  is  produced  by 
blocking  part  of  the  beam  and  imaging  with  an  edge  of  the  beam.  The  magnetic  contrast  mechanism 
appears  to  be  similar  to  the  type  I  magnetic  contrast  mechanism  known  from  scanning  electron 
microscopy,  in  which  stray  magnetic  fields  above  the  ferromagnetic  domains  deflect  secondary 
electrons  either  towards  or  away  from  the  electron  detector.  Upon  heating  the  sample  above  the 
Curie  temperature,  the  ferromagnetic  domains  gradually  disappear,  as  expected  for  a  second  order 
phase  transition.  They  reappear  upon  cooling.  ©  1996  American  Vacuum  Society. 


I.  INTRODUCTION:  PHOTOELECTRON  EMISSION 
MICROSCOPY 

Photoelectrons  ejected  from  a  solid  sample  into  vacuum 
become  free  electrons.  As  free  electrons,  they  are  deflected 
by  electric  and  magnetic  fields,  and  a  beam  of  photoelectrons 
can  be  focused  by  electrostatic  and  electromagnetic  lenses  to 
produce an electron optical image of a sample. Brüche published
photoelectron images in 1933, just one year after the
first transmission electron microscopy images of Knoll and
Ruska and the first thermionic electron emission images of
Brüche and Johannson. Already in the 1930s, studies on the
adsorption of gases on Pt (Refs. 4 and 5) and Ni (Ref. 6) and
of  K  and  Ba  on  various  metals,  suggested  that  a  single  mono- 
layer  of  adsorbate  could  radically  alter  the  photoemissive 
properties  of  a  surface,  and  that  a  single  adsorbed  monolayer 
could  be  imaged  by  photoelectron  microscopy.  However,  be¬ 
cause  of  poor  vacuum,  it  is  assumed  that  surfaces  were 
quickly  contaminated  by  residual  gases. 

A  breakthrough  occurred  with  the  introduction  of  bakable, 
ultrahigh vacuum instruments. These allowed controlled
studies  of  monolayer  films  and  the  combination  of  photo¬ 
electron  microscopy  with  other  standard  UHV  techniques  of 
surface  science.  If  a  substrate  is  chosen  with  a  work  function 
greater  than  the  maximum  photon  energy  of  the  light  source, 
no  photoelectrons  are  emitted  from  the  substrate.  If,  onto 
such  a  substrate,  a  substance  is  deposited  that  lowers  the 
work  function  below  the  maximum  energy  of  the  lamp,  pho¬ 
toelectrons  are  emitted  from  areas  covered  with  adsorbate.  If 
the  surface  energy  of  the  adsorbate  is  less  than  that  of  the 
substrate,  the  adsorbate  can  wet  the  surface  and  spread  as  a 
monolayer, and single monolayers can be imaged by photoelectron
microscopy. For example, the epitaxial growth of a
single monolayer of copper evaporated onto a (110) face of a
single  crystal  of  molybdenum  was  followed  from  the  initial 
stages  of  nucleation  at  atomic  steps  to  the  completion  of  the 
first  monolayer. Three-dimensional  epitaxial  island 
growth has been seen as well. An important application of
photoelectron  microscopy  in  surface  science  has  been  the 
observation  of  chemical  reaction-diffusion  fronts  in  mono- 
layer  films  on  single  crystal  surfaces  during  catalytic 
reactions. Perhaps  the  most  spectacular  application  has 
been  the  observation  of  chemical  kinetic  oscillations,  chemi¬ 
cal  waves,  and  chemical  chaos  in  monolayer  films.  Nu¬ 
merous  reviews  on  photoelectron  emission  microscopy  have 
been published.
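
The work-function contrast described above reduces to a threshold test: a region emits photoelectrons only where its local work function lies below the maximum photon energy of the lamp. The sketch below applies that test to a small map of work functions. All numerical values (a 5.5 eV substrate, 4.3 eV adsorbate-covered regions, a 4.9 eV lamp cutoff) and the function name are hypothetical and chosen only to illustrate the idea.

```python
def emission_mask(work_function_map_ev, max_photon_energy_ev):
    """Mark the pixels that can emit photoelectrons.

    A pixel emits only if its work function is below the maximum
    photon energy of the light source.
    """
    return [
        [phi < max_photon_energy_ev for phi in row]
        for row in work_function_map_ev
    ]


if __name__ == "__main__":
    SUBSTRATE_EV = 5.5  # hypothetical bare-substrate work function
    COVERED_EV = 4.3    # hypothetical value where adsorbate is present
    LAMP_MAX_EV = 4.9   # hypothetical lamp cutoff

    surface = [
        [SUBSTRATE_EV, COVERED_EV, COVERED_EV],
        [SUBSTRATE_EV, SUBSTRATE_EV, COVERED_EV],
    ]
    for row in emission_mask(surface, LAMP_MAX_EV):
        print("".join("#" if bright else "." for bright in row))
```

In this idealized picture the image is binary; in practice the emitted intensity also depends on how far the photon energy exceeds the local work function.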

In  1957  Spivak,  Dombrovskaia,  and  Sedov^^  published 
photoelectron  images  of  ferromagnetic  domains  of  the 
(0001)  basal  plane  of  cobalt.  The  patterns  of  these  domains 
are  quite  distinct  and  unmistakable  and  were  directly  com¬ 
pared  to  images  produced  by  the  magnetic  colloid  decoration 
technique  of  Francis  Bitter. Schur  and  Pfefferkorn^^  and 
Pfefferkorn  et  al?^  used  photoelectron  microscopy  to  image 
ferromagnetic  domains  on  prismatic  planes  of  cobalt.  They 
concluded  that  contrast  was  produced  when  part  of  the  pho¬ 
toelectron  beam  was  blocked  by  an  edge,  analogous  to 
Topler’s  schlieren  technique,  in  which  part  of  a  beam  of  light 
is  blocked  by  a  knife  edge  to  produce  contrast.  Magnetized 
regions of a magnetic recording disk have also been imaged
using circularly polarized x rays from a synchrotron
source (Refs. 37-39).

In  this  article  we  report  the  observation  of  ferromagnetic 
domains of a single crystal of neodymium-iron-boron
(Nd2Fe14B) using photoelectron microscopy. Neodymium-
iron-boron is one of the strongest permanent magnets.
Moreover,  despite  the  use  of  the  rare-earth  element  neody¬ 
mium,  it  is  cost  effective  and  produces  a  higher  magnetic 
field per unit cost compared to other materials. It has revolutionized
the permanent magnet industry. Neodymium-
iron-boron forms tetragonal crystals with 68 atoms in the
unit cell. It is a uniaxial magnetic material. The direction of
easy magnetization is along the c axis. The Curie temperature
is reported as 585 K. Magnets used in practice have
small crystal grain sizes that are near the single domain size
for this material. It is the small single domain size crystallites
that give the material much of its high coercivity.

II.  EXPERIMENT 

A  single  crystal  boule,  10  mm  in  diameter,  was  grown  at 
the  Institute  of  Crystal  Materials  at  Shandong  University, 
P.R. China, using the Czochralski method from the melt in a
Hukin cold crucible. X-ray diffraction studies verified
that the crystal had the well-known tetragonal structure of
the magnetic Nd2Fe14B phase. A Laue pattern implied that
much of the sample was single crystal, although microscopy
has revealed subgrain boundaries and inclusions. Energy
dispersive x-ray fluorescence analysis performed in a
scanning electron microscope (SEM) showed Si as a minor
impurity. The boule was cut and observations of ferromagnetic
domains were made on a {100} (nonbasal) plane parallel
to the c axis and perpendicular to an a axis of the tetragonal
crystal. The accuracy of the orientation was assumed to
be  between  0.3  and  0.5°  and  the  presence  of  some  miscut 
was  inferred  from  the  fact  that  SEM  observations  of  this 
crystal  revealed  type  I  magnetic  contrast  that  would  require 
slight  misorientation  and  stray  magnetic  fields  above  do¬ 
mains  on  a  nonbasal  plane.  To  minimize  oxidation,  the  cut 
crystal  was  shipped  to  the  United  States  under  a  protective 
layer  of  grease.  The  sample  was  degreased  in  boiling  toluene 
and  in  acetone.  A  final  rinse  in  reagent  grade  absolute  alcohol 
was  made  before  the  sample  was  placed  in  the  vacuum 
chamber  of  the  photoelectron  microscope,  which  was  baked 
overnight  to  a  pressure  of  2X10“^  Torr  (3X10“^  Pa).  No 
further  attempts  were  made  to  clean  the  sample  or  to  keep 
the  surface  free  from  residual  gases.  It  was  assumed  that  the 
surface  contained  carbon  and  other  impurities. 

Electrostatic  electron  optics,  which  were  purchased  from 
Staib  GmbH,  and  assembled  by  Senftinger,  were  used  in  the 
photoelectron  microscope.  Use  of  electrostatic  lenses,  rather 
than electromagnetic lenses, is a great advantage in the study
of  magnetic  materials,  because  electrostatic  lenses  produce 
no  magnetic  field  that  might  alter  the  magnetization  of  the 
sample.  The  Staib  design  was  developed  by  Engel  and 
Kordesch^^  at  the  Fritz  Haber  Institute  in  the  groups  of  Brad¬ 
shaw  and  Zeitler.  It  is  the  same  lens  design  used  by  the 
groups  of  Ertl  and  Block  to  study  reaction-diffusion 
phenomena. The system uses standard UHV components
and a Galileo-type channel plate. Detailed drawings of
the electrostatic lenses are given in the work by von
Oertzen.

Because magnetic fields from Nd2Fe14B magnetized to
near  saturation  deflect  the  emitted  electrons  to  too  great  an 
extent  to  form  useful  images,  the  sample  was  demagnetized 
by  heating  above  the  Curie  temperature,  585  K.  Radiation 
from  a  tungsten  filament  located  below  the  sample  was  used 


as  the  heater.  The  sample  was  cooled  in  the  earth’s  magnetic 
field  to  room  temperature.  As  with  all  ferromagnetic  materi¬ 
als,  the  ferromagnetic  domains  spontaneously  reform  upon 
cooling  below  the  Curie  temperature  and  partially  cancel  the 
external  field.  Ultraviolet  light  was  used  from  a  common  Hg 
lamp  (OSRAM  HBO  100  W/2),  which  produces  a  maximum 
useful  photon  energy  of  ~5  eV  (253  nm).  It  was  partially 
focused with a parabolic mirror (in a PTI model A1010
lampholder) to a ~3 mm diameter spot. The light struck the
surface  75°  from  normal.  No  attempt  was  made  to  polarize 
the  light.  Careful  adjustment  of  the  sample  tilt  relative  to  the 
electron  optical  axis  was  critical  for  the  observation  of  the 
domains.  To  produce  contrast  it  was  necessary  to  tilt  the 
sample  slightly  off  axis.  Deflection  of  the  photoelectron 
beam  with  a  hand-held  magnet  outside  of  the  vacuum  cham¬ 
ber  showed  that  contrast  did  not  occur  if  the  center  of  the 
beam  was  used  and  that  contrast  was  achieved  only  by  using 
an  outer  edge  of  the  beam.  Observations  by  scanning  elec¬ 
tron  microscopy  showed  that  the  features  were  not  due  to  a 
topographical effect. Ferromagnetic domains were observed
in the SEM by type I magnetic contrast, which confirmed that the
features seen by a photoelectron emission microscope
(PEEM) were ferromagnetic domains. Furthermore, heating
of  the  sample  above  the  Curie  temperature  in  the  SEM  led  to 
the  disappearance  of  the  domains  that  reappeared  upon  cool¬ 
ing  (unpublished).  Images  in  the  PEEM  were  photographed 
from  the  circular  channel  plate  screen  with  a  Canon  Eos  Elan 
35  mm  camera  with  exposure  times  of  under  20  s  using  ASA 
100  film. 

III.  RESULTS 

Figure  1  shows  photoelectron  images  of  four  regions  of 
the  sample.  Although  nominally  a  single  crystal,  the  crystal 
also  contained  subgrain  boundaries  and  inclusions.  On  re¬ 
gions  of  the  crystal  that  were  far  from  defects,  alternating 
dark  and  bright  parallel  domains  were  seen  as  in  Fig.  1(a). 
Most  of  the  crystal  contained  these  approximately  parallel 
domains. Near subgrain boundaries and defects, more complex
domain structures were seen, as in Figs. 1(b)-1(d). Images
of domains were diffuse. As the sample was heated in
the  PEEM  to  above  the  Curie  temperature  the  domains 
gradually  disappeared.  They  gradually  reappeared  upon  cool¬ 
ing.  The  heating  and  cooling  cycle  could  be  repeated  indefi¬ 
nitely.  By  deflecting  the  photoelectron  beam  with  a  hand¬ 
held  magnet,  it  was  clear  that  no  domains  were  seen  if  the 
center  of  the  beam  was  used  and  that  contrast  was  achieved 
only  if  one  of  the  opposite  edges  of  the  beam  was  used.  This 
is  represented  schematically  in  Fig.  2. 

IV.  DISCUSSION 

Ferromagnetic domains of Nd2Fe14B of various orientations
have been imaged by other types of microscopy, especially
magneto-optical Kerr microscopy. Comparison of
the images of Fig. 1 to the images taken by the Kerr microscopy
technique, especially those of Luo and Zhang, shows
that the dark and bright stripes are quite similar to known
ferromagnetic domain structures of Nd2Fe14B. They are also
similar to ferromagnetic domain structures seen near grain
boundaries on prismatic planes of cobalt and other ferromagnetic
materials. The gradual disappearance of the
domains (unpublished) as the sample is heated above the
Curie temperature and their reappearance upon cooling is
similar to that seen by Luo and Zhang using Kerr microscopy
and is as expected for a second order phase transition of
ferromagnetic materials.

Fig. 1. Photoelectron images of ferromagnetic domains of Nd2Fe14B (Hg lamp light source). Magnetic contrast is caused by lateral deflection of photoelectrons
out of the aperture by stray magnetic fields that exist above the surface. (a) Parallel domain structure seen on much of the crystal (diameter 500 µm). (b)-(d)
Complex domain structures typical near subgrain boundaries, defects, and inclusions [diameters (b) and (c) 135 µm; (d) 140 µm].

Fig. 2. Schematic representing the partial splitting of the emitted photoelectron
beam. Maximum magnetic contrast is produced by using an edge of the
beam. Little or no magnetic contrast is noted if the center of the beam is
used to form the image.

Ferromagnetic domains have long been imaged by conventional
scanning electron microscopy. Two contrast
mechanisms are recognized. In type I contrast, secondary
electrons are deflected by stray magnetic fields above the
sample. Stray magnetic fields can be present if the sample
is miscut. This is shown very schematically in Figs. 3(a) and
3(b). In the type II mechanism, backscattered electrons are
deflected by the magnetic field within the specimen as shown
schematically in Fig. 3(c). In this case no stray field is necessary.
These mechanisms are well established and are discussed
in textbooks on SEM. We have examined the
Nd2Fe14B sample in the SEM and have found that the domains
are imaged by the type I mechanism. The direction
of deflection of the secondary electrons toward the detector
is as in Fig. 4. The images of domains are diffuse in both
SEM and PEEM, which is consistent with deflection of both
secondary electrons and photoelectrons by a stray magnetic
field. Finally, we have heated the sample in the SEM above
the Curie temperature and observed the disappearance of the
domains. This leaves little doubt that the structures shown in
Fig. 1 are ferromagnetic domains and that they can be imaged
by photoelectron microscopy.

Fig. 3. Schematic representing type I and type II magnetic contrast known
from scanning electron microscopy. The same principles can be applied to
explain deflection of photoelectrons. (a) Schematic of assumed stray fields.
If a crystal is not cut exactly on axis, the magnetization vector is not in the
plane of the surface and stray fields will exist above the sample. The straight
arrows point along the directions of easy magnetization, which is the c axis
of this tetragonal crystal. (b) Projected side view of the assumed stray fields.
Maximum lateral deflection of photoelectrons occurs above domain boundaries.
Opposite directions of stray fields deflect photoelectrons in two opposite
directions, causing partial splitting of the photoelectron beam. Minimum
lateral deflection will occur for photoelectrons ejected from the center of
domains. Variation of the lateral Lorentz force above the domains is responsible
for the diffuse image of the domains in Fig. 1. (c) In type II contrast
magnetic fields within the sample cause lateral deflection of backscattered
electrons in the SEM. No stray field is necessary.

Fig. 4. Stray fields giving rise to diffuse domain structure and schematic of
an image. In type I contrast, lateral deflection of emitted electrons is attributed
to stray magnetic fields above the sample, which explains the diffuse
images of the domains and explains the relative position of the secondary
electron detector. This mechanism occurs for uniaxial magnetic materials
which have stray fields above the sample that close the magnetic flux. Note
that the position of the detector in the SEM requires a stray field to explain
the direction of deflection.

The magnetic contrast mechanism in photoelectron microscopy
appears to be straightforward. Upon ejection from
the sample, the photoelectrons become free electrons. The
photoelectrons are deflected laterally by the Lorentz force as
the electrons travel through the stray magnetic field above
the sample. Opposite directions of the stray field above the
opposite domain types deflect the photoelectrons in two opposite
directions, partially splitting the photoelectron beam
as shown in Fig. 2. Imaging with an edge of the beam reveals
the domains. Because the photoelectrons must encounter the
same magnetic fields as the secondary electrons, the deflection
must be similar and the well-established mechanisms
applied to the SEM (Ref. 66) can be applied to the photoelectron
images. If a stray field configuration due to miscut
of the crystal is assumed as shown in Fig. 3(a) (as adapted
from computer simulations of Harasko, Pfützner, and
Futschik), maximum lateral deflection of the photoelectrons
will occur above the domain boundaries. The domain
boundaries must then be at positions of maximum and minimum
photoelectron intensity in Fig. 1. Images of domains
will not be sharp but will be diffuse because the force of
lateral deflection will vary with position as the photoelectrons
travel through different regions of the stray field.
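
The size of the lateral deflection can be estimated with a small-angle Lorentz-force formula, theta = e B L / (m v), for an electron of speed v crossing a region of length L in which a field B lies perpendicular to its velocity. The sketch below is an order-of-magnitude illustration only: it assumes a uniform field and a constant electron speed (ignoring the accelerating field of the microscope), and the field strength, path length, and kinetic energy used in the example are hypothetical values, not measurements from this work.

```python
import math

ELEMENTARY_CHARGE_C = 1.602176634e-19
ELECTRON_MASS_KG = 9.1093837015e-31


def electron_speed_m_s(kinetic_energy_ev):
    """Nonrelativistic speed of an electron with the given kinetic energy."""
    energy_j = kinetic_energy_ev * ELEMENTARY_CHARGE_C
    return math.sqrt(2.0 * energy_j / ELECTRON_MASS_KG)


def deflection_angle_rad(field_t, path_length_m, kinetic_energy_ev):
    """Small-angle deflection theta = e * B * L / (m * v).

    The sign of the result follows the sign of the field, so opposite
    stray-field directions give equal and opposite deflections.
    """
    speed = electron_speed_m_s(kinetic_energy_ev)
    return ELEMENTARY_CHARGE_C * field_t * path_length_m / (ELECTRON_MASS_KG * speed)


if __name__ == "__main__":
    # Hypothetical inputs: 1 mT stray field over 10 micrometers, 1 eV electrons.
    for field in (+1.0e-3, -1.0e-3):
        theta = deflection_angle_rad(field, 1.0e-5, 1.0)
        print(f"B = {field:+.1e} T -> deflection {theta * 1e3:+.2f} mrad")
```

The antisymmetry in the field direction is what splits the beam into two partial beams, one from each domain type.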

Because stray magnetic fields are an essential component
in the function of magnetic recording media, photoelectrons
can also be deflected by the magnetized regions in
magnetic  video  tapes,  recording  tapes,  computer  disks,  etc. 
We  have  also  imaged  the  magnetized  regions  in  magnetic 
recording  media  by  photoelectron  microscopy.  The  electro¬ 
static  lens  system  appears  well  suited  for  such  applications, 
since a strong magnetic field could erase or distort the magnetized
regions of a recording medium.


The primary objective of this study is to determine the effect of silicon wafer processing parameters
on giant magnetoresistance (GMR). Thermally oxidized silicon wafers serve as substrates for
alternating layers of NiFeCo and Cu (2.0-2.5 nm). SiO2 films prepared in the laboratory are loaded
into  the  sputtering  chamber  along  with  commercially  oxidized  wafers  for  comparison  purposes. 
Four  process  parameters  are  systematically  varied — oxidation  method,  postoxidation  anneal  (POA), 
oxide  thickness,  and  oxidation  temperature.  Preliminary  results  indicate  the  two  most  influential 
processing  parameters  are  the  method  of  oxidation  (dry  oxygen  or  wet  steam)  and  a  POA.  In 
contrast,  GMR  values  do  not  seem  to  vary  greatly  for  various  oxidation  or  POA  temperatures  in  the 
range  800-1000  °C.  We  present  GMR  results  of  NiFeCo/Cu  multilayers  on  oxidized  silicon 
substrates  formed  by  various  processing  methods.  GMR  is  observed  for  all  films,  with  the 
magnitude  of  the  effect  increasing  for  substrates  oxidized  by  “wet”  methods  and  oxidized  substrates 
that  include  a  POA  (1000  °C,  nitrogen  atmosphere,  20  min)  previous  to  sputter  deposition  of  the 
multilayers.  Our  data  suggest  that  the  GMR  of  the  multilayer  stack  is  sensitive  to  changes  at  the 
Si-SiO2 interface. © 1996 American Vacuum Society.


I. INTRODUCTION

A  study  of  the  recent  literature  clearly  indicates  greatly 
increased  effort  in  the  development  of  sensors  employing 
ferromagnetic  thin  films  with  a  magnetoresistance  (MR)  ef¬ 
fect,  for  use  as  magnetic  field  sensors.  “Giant”  magnetore¬ 
sistance  (GMR)  has  been  studied  extensively  since  the  first 
observations  of  large  resistance  changes  induced  by  mag¬ 
netic  fields  in  Fe/Cr  superlattices  with  antiferromagnetic  cou¬ 
pling  between  adjacent  Fe  layers. In  Fe/Cr  multilayers,  it  is 
generally  accepted  that  spin-dependent  scattering  causes  the 
GMR  effect  and  is  related  to  interface  roughness.  Many 
other  systems  have  been  found  to  behave  similarly  to  Fe/Cr, 
such  as  Fe/Cu,  Fe/Mo,  Co/Ru,  Co/Cr,  and  Co/Cu.  All  of 
these systems exhibit oscillatory behavior between antiferromagnetic
and ferromagnetic coupling."^"^ GMR observed in
sputtered Co/Cu multilayers suggests strong antiferromagnetic
coupling.^’^^

The  multilayer  stacks  that  exhibit  GMR  are  typically 
composed  of  alternate  thin  layers  of  magnetic  material  (e.g., 
Co,  Ni,  Fe,  NiFe)  and  nonmagnetic,  conducting  materials 
(e.g.,  Cu,  Ag,  Au).  One  of  the  first  research  groups  to  observe 
GMR  in  NiFeCo/Cu  multilayers  observed  room  temperature 
MR  values  of  35%  and  13%  for  30  bilayer  films  with  Cu 
thicknesses  of  approximately  0.8  and  2.2  nm,  respectively.^^ 
The  addition  of  Co  to  the  NiFe  alloy  was  expected  to  in¬ 
crease  the  MR  magnitude  over  that  of  NiFe/Cu  multilayers 
originally  investigated  by  Parkin. The  room  temperature 
MR values reported by Parkin for NiFe/Cu 14 bilayer films
were  16%  with  a  Cu  thickness  of  0.8  nm  and  under  2%  for 
2.0  nm  Cu  spacers,  with  basically  no  coupling  observed  at 
the  latter  thickness.  Parkin’s  later  work  in  NiFe/Cu  multilay¬ 
ers  with  a  thin  Co  layer  at  each  interface  showed  a  dramatic 
increase in MR to nearly 20% at room temperature with 2.0
nm  Cu  spacers.^^  In  our  laboratory,  we  have  observed  MR 
values  of  10%  in  NiFeCo/Cu  multilayers  with  a  sensitivity  of 
approximately  0.3%  MR/Oe  and  near  zero  hysteretic  profiles 
with  a  Cu  spacer  layer  thickness  of  2.3  nm.^"^ 

In  our  laboratory,  we  first  observed  a  small  increase  in  the 
MR ratio on a set of oxidized silicon substrates (5000 Å)
prepared  by  “wet”  oxidation  methods;  one  sample  included 
a  postoxidation  anneal.  The  multilayer  materials  consisted  of 
four  bilayers  of  sputter  deposited  Co/Cu  (2.0  nm/2.3  nm).  A 
commercially  oxidized  silicon  wafer  was  included  in  the 
sputtering chamber for reference and gave an MR value
of 6.1%. An MR value of 5.35% was obtained for the laboratory
sample that included a POA, compared to a value of
4.5% for the sample with no extra annealing step. The POA
step thus increased the MR ratio by approximately
20% relative to the sample with no anneal.
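
As a quick check of the arithmetic above, the relative gain from the POA can be recomputed from the two quoted MR values. The Python sketch below is illustrative only; the function name is a hypothetical helper and not part of the original work.

```python
def relative_increase_percent(new_value, old_value):
    """Return the percentage increase of new_value over old_value."""
    return (new_value - old_value) / old_value * 100.0


# MR values quoted in the text for the laboratory Co/Cu samples.
mr_with_poa = 5.35      # % MR, sample with postoxidation anneal
mr_without_poa = 4.5    # % MR, sample with no extra annealing step

increase = relative_increase_percent(mr_with_poa, mr_without_poa)
print(f"Relative increase due to POA: {increase:.1f}%")  # prints 18.9%
```

The result, about 19%, is consistent with the "approximately 20%" quoted in the text.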

This study was motivated by an interest in validating
these experimental results and in further investigating the effect
of substrate processing on GMR. NiFeCo and Cu were chosen
as the materials for the multilayers. The ternary alloy
NiFeCo  exhibits  low  magnetostriction^^  and  low  magneto¬ 
crystalline  anisotropy.  In  candidate  materials  for  magnetic 
field  detection  elements,  both  low  magnetostriction  and  mag¬ 
netocrystalline  anisotropy  are  desired  to  reduce  magnetic 
switching  noise. 

II.  EXPERIMENTAL  METHODS 

Single crystal (111) oriented 0.005 Ω cm p-type silicon
wafers are cleaned by a standard RCA cleaning procedure
before the oxidation process. Silicon substrates are thermally
oxidized in a quartz tube using two oxidation methods.
High  purity  oxygen  provides  the  ambient  for  “dry”  oxida¬ 
tion  and  steam  produced  from  boiling  deionized  (DI)  water 
Fig. 1. Schematic illustrating sample preparation. (Legend: NiFeCo layer; Cu layer; SiO2 layer; silicon wafer.)


provides  oxygen  for  “wet”  oxidation.  An  important  process¬ 
ing  technique  routinely  used  to  improve  the  quality  of  the 
interface  between  silicon  and  thermally  grown  silicon  diox¬ 
ide  is  the  postoxidation  anneal  (POA).  The  effect  of  a  high 
temperature POA on the characteristics of the Si-SiO2 interface
has been widely studied.^^"^^ The interest is primarily
due to device scaling for integrated circuits utilizing metal-oxide-semiconductor
(MOS) technology, which involves reducing
the thickness of gate oxides to approximately 100 Å.
POAs have an important effect on device characteristics and
have been found to improve transistor properties.^^ A high
temperature POA can also increase Si-O bond strength at the
interface due to densification, which leads to lower interface
state density (D_it) generation.^^ A POA at various temperatures
in a nitrogen ambient is performed to purposely modify
the Si-SiO2 interface properties. Unless otherwise noted, a
POA  is  done  at  1000  °C  for  20  min  in  a  nitrogen  atmosphere. 

NiFeCo  multilayers  are  sputter  deposited  by  dc  magne¬ 
tron  sputtering  on  oxidized  silicon  substrates  in  a  Vac-Tec 
Model  250  horizontal  batch  sputtering  system  using  ultra- 
high  purity  (99.999%)  argon.  The  magnetic  field  from  the 
permanent  magnet  results  in  higher  sputtering  rates  for  the 
same  power  compared  to  other  dc  sputtering  systems  due  to 
higher  current  densities  near  the  target  surface.^^  This  en¬ 
hanced  current  density  with  magnetron  sputtering  is  possible 
due  to  the  higher  concentration  of  electrons  kept  near  the 
target surface, providing more opportunity for argon ionization.
Additionally, a lower sputtering pressure is made possible
by magnetron sputtering. The predeposition base pressure
of the system is in the range of 2X10“^ Torr. Samples
are deposited under an argon pressure of ~2.3 mTorr. The
films  are  all  deposited  at  ambient  temperature  with  a  perma¬ 
nent  magnet  behind  each  substrate  supplying  an  aligning 
field  of  80  Oe  parallel  to  the  substrate  surface  during  depo¬ 
sition.  A  schematic  illustrating  sample  preparation  is  shown 
in  Fig.  1. 

Commercially oxidized silicon wafers (SiO2 thickness
= 10 000 Å) are loaded into the sputtering chamber along
with wafers thermally oxidized in our laboratory. A
Ni66Fe16Co18 alloy target is used in the sputtering chamber.
Electron microprobe results on granular Ni66Fe16Co18-Ag
sputtered films reveal a ferromagnetic portion composition of
approximately Ni76Fe10Co14.^'^ The NiFeCo/Cu films consist
of six sputtered bilayers. The Cu layers are 2.3-2.5 nm thick


Fig. 2. Magnetoresistance ratio (%) vs applied field (Oe) for wet and dry
oxidation methods. Panel title: steam and dry oxidation at 1000 °C, 1000 Å, POA 20 min in N2 (MR% of commercial SiO2 is 3.8%).

and the alloy layers are approximately 2.0-2.5 nm thick.
Sputtering rates are calculated from thickness measurements
of reference films made with a Dektak IIA surface profilometer.
Film thicknesses are controlled by controlling sputtering
time. In this study the NiFeCo and Cu spacer layer thicknesses
are held constant, as is the number of bilayers, for
comparison purposes.
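
Thickness control by timing amounts to dividing the target thickness by a calibrated rate. The sketch below assumes a constant sputtering rate; the reference film thickness and deposition time are made-up illustration values, not measurements from this study, and the function names are hypothetical.

```python
def sputter_rate_nm_per_s(reference_thickness_nm, reference_time_s):
    """Rate inferred from a profilometer measurement of a reference film."""
    if reference_time_s <= 0:
        raise ValueError("reference_time_s must be positive")
    return reference_thickness_nm / reference_time_s


def deposition_time_s(target_thickness_nm, rate_nm_per_s):
    """Sputtering time needed to reach a target layer thickness."""
    if rate_nm_per_s <= 0:
        raise ValueError("rate_nm_per_s must be positive")
    return target_thickness_nm / rate_nm_per_s


# Hypothetical calibration: a 120 nm reference film deposited in 600 s.
cu_rate = sputter_rate_nm_per_s(120.0, 600.0)

# Time for one Cu spacer layer of 2.3 nm (a thickness used in the text).
cu_time = deposition_time_s(2.3, cu_rate)
print(f"Cu rate: {cu_rate:.3f} nm/s, time per 2.3 nm layer: {cu_time:.1f} s")
```

At rates of this order a layer takes only seconds to deposit, so small timing errors translate directly into thickness errors.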

An  important  parameter  for  evaluating  GMR  materials  is 
the normalized change in resistance, ΔR/R, the MR ratio.
MR  measurements  are  made  with  a  linear  four  point  probe 
technique  with  both  current  and  magnetic  field  in  the  film 
plane  but  perpendicular  to  each  other.  All  measurements  are 
made  at  room  temperature  on  as-deposited  samples  unless 
otherwise  noted.  The  MR  percentage  is  calculated  using  the 
equation 

$$\mathrm{MR}\% = \frac{\rho_{\max} - \rho_{\min}}{\rho_{\min}} \times 100,$$

where $\rho_{\max}$ is the film resistivity at (or near) zero field, and
$\rho_{\min}$ is the resistivity at the maximum applied field.
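
The definition above translates directly into a short calculation. The following Python sketch is a minimal illustration; the function name and the example resistivities are assumptions chosen for demonstration and are not data from this study.

```python
def mr_percent(rho_max, rho_min):
    """MR% = (rho_max - rho_min) / rho_min * 100.

    rho_max: resistivity at (or near) zero field
    rho_min: resistivity at the maximum applied field
    Both values must be in the same units.
    """
    if rho_min <= 0:
        raise ValueError("rho_min must be positive")
    return (rho_max - rho_min) / rho_min * 100.0


# Hypothetical example: resistivity drops from 22.0 to 20.0 (arbitrary units).
example_mr = mr_percent(22.0, 20.0)
print(f"MR = {example_mr:.1f}%")  # prints MR = 10.0%
```

Because the ratio is normalized to the high-field resistivity, the result does not depend on the units of resistivity or on the sample geometry, provided both readings come from the same sample.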

The  variability  observed  in  MR  values  for  multilayers 
prepared  at  different  times  makes  direct  comparison  of  re¬ 
sults  difficult.  This  is  remedied  somewhat  by  comparing  only 
MR  magnitudes  for  substrates  prepared  in  the  same 
multilayer  sputter  deposition.  Commercially  oxidized  wafers 
are  also  included  in  each  sputter  deposition  as  reference 
samples. 
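
One simple way to make such same-run comparisons explicit is to divide each sample's MR value by that of the commercial reference wafer from the same deposition. This normalization is an illustrative assumption and not a procedure stated in the text; the numbers are the MR values quoted in Sec. III A for a single deposition.

```python
def normalize_to_reference(mr_values, reference_mr):
    """Express each MR value as a ratio to the same-run reference wafer."""
    if reference_mr <= 0:
        raise ValueError("reference_mr must be positive")
    return {name: value / reference_mr for name, value in mr_values.items()}


# MR values (%) from one sputter deposition, as quoted in Sec. III A.
same_run = {"dry oxidation + POA": 3.22, "wet oxidation + POA": 5.7}
commercial_reference = 3.8

ratios = normalize_to_reference(same_run, commercial_reference)
for name, ratio in ratios.items():
    print(f"{name}: {ratio:.2f} x commercial reference")
```

Ratios above 1 indicate a substrate that outperforms the commercial wafer in that deposition; here the wet-oxidized substrate does, while the dry-oxidized one does not.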
Fig. 3. Magnetoresistance ratio (%) vs applied field (Oe) for substrates
prepared by wet and dry oxidation methods. Panel title: steam and dry oxidation at 1000 °C, 1000 Å, POA 20 min (MR% of commercial SiO2 is 5.16%).


III.  EXPERIMENTAL  RESULTS 

NiFeCo and Cu are the chosen multilayer materials, and
samples are prepared as described in Sec. II. Four process
parameters are systematically varied: oxidation
method, postoxidation annealing, oxide thickness, and oxidation
and annealing temperature. The two most influential
processing parameters are the method of oxidation and
whether a POA is performed.

A.  Oxidation  method 

Multilayers sputtered on substrates prepared by wet methods
consistently produce higher MR values. As stated before,
because of the variability seen in samples prepared in different
sputter depositions and the strong dependence of MR values
on film thickness in the multilayer stack, experimental results
are compared only with results obtained from samples prepared
in the same run. In one experiment, a SiO2 film oxidized by
dry oxygen (1000 °C, 1000 Å, POA) gives an MR value of
3.22%, and one oxidized by wet methods gives a value of
5.7%. For reference, a commercially oxidized silicon wafer
yielded an MR value of 3.8% for that particular sputter deposition.
Typical MR profiles showing the dependence on oxidation
method are shown in Figs. 2 and 3.

This  effect  is  believed  to  be  related  to  reduced  interface 
roughness  effects.  Atomic  force  microscopy  (AFM)  studies 
Fig. 4. Magnetoresistance ratio (%) vs applied field (Oe) for substrates
prepared by dry oxidation methods and POA. Panel title: dry oxidation at 1000 °C, 1000 Å (MR% of commercial SiO2 is 4.9%).


are  in  progress,  but  preliminary  analysis  indicates  very 
smooth  surfaces  and  no  distinct,  distinguishable  features  be¬ 
tween  substrates  prepared  by  wet  or  dry  oxidation  methods 
or  substrates  that  include  an  additional  annealing  step.  Inde¬ 
pendent  AFM  and  ellipsometry  studies  reveal  that  interface 
roughness  decreases  with  the  extent  of  oxidation,  for  samples 
prepared  with  dry  oxygen,  and  is  relatively  independent  of 


Fig. 5. Magnetoresistance ratio (%) vs silicon oxide thickness (Å) for substrates
prepared by wet oxidation methods.
Table I. Summary of MR values for various silicon wafer processing conditions.

Silicon processing variation          Processing variation   MR value (%)
--------------------------------------------------------------------------------
Oxidation method
  (1000 °C, 1000 Å), no POA           Dry                    8.2
  Commercial MR value 8.7%            Wet                    10.0

  (1000 °C, 1000 Å), POA              Dry                    4.68, 4.34
  Commercial MR value 5.16%           Wet                    6.1, 6.14

  (1000 °C, 1000 Å), POA              Dry                    3.22
  Commercial MR value 3.8%            Wet                    5.7

POA
  Wet (1000 °C, 5000 Å)               POA                    12.3
  Commercial MR value 11.3%           No POA                 11.6

  Dry (1000 °C, 1000 Å)               POA                    (a) 9.1   (b) 6.52
  (a) Commercial MR value 8.7%        No POA                 (a) 8.2   (b) 5.69
  (b) Commercial MR value 4.9%

Oxide thickness
  Wet (1000 °C), no POA               100 Å                  (a) 10.0  (b) 5.85
  (a) Commercial MR value 10.2%       500 Å                  (a) 9.2   (b) 5.39
  (b) Commercial MR value 5.16%       1 000 Å                (a) 10.3  (b) 6.38
                                      5 000 Å                (a) 9.0   (b) 6.18
                                      10 000 Å               (a) 7.5   (b) 6.5

Oxidation temperature
  Wet (1000 Å), no POA                800 °C                 9.8
  Commercial MR value 8.7%            900 °C                 9.6
                                      1 000 °C               10.0

POA temperature
  Wet (1000 °C, 1000 Å)               600 °C                 4.0
  Commercial MR value 5.42%           800 °C                 3.15
                                      1 000 °C               4.82

  Dry (1000 °C, 1000 Å)               600 °C                 5.0
  Commercial MR value 5.42%           800 °C                 4.42
                                      1 000 °C               4.65


crystal  orientation  of  the  substrate.^^  In  that  work,  attempts 
to  measure  changes  in  interface  roughness  were  futile  be¬ 
cause  the  level  of  roughness  was  below  the  noise  limits  of 
the  AFM.  They  examined  purposely  roughened  silicon  sub¬ 
strates  to  allow  AFM  studies  to  reveal  the  decrease  in  rough¬ 
ness  with  increasing  oxidation.^^’^"^ 


B.  Postoxidation  anneal 

The presence of a POA also consistently increases MR
values. POA is well known for reducing the density of
interface states at the Si-SiO2 interface. For one set of
samples, an MR value of 5.69% was obtained for a sample
oxidized by dry oxygen; a similar sample undergoing a POA
yielded a value of 6.52%. Similar results are obtained for
substrates prepared by wet oxidation methods and annealed
for 20 min in nitrogen at 1000 °C. For reference, a commercially
oxidized wafer with no anneal gave an MR value of
4.9%. A typical MR profile for a sample oxidized by dry
oxygen is shown in Fig. 4. The change in MR ratio for
substrates that include postoxidation annealing is consistently
observed.


C. Oxide thickness

Another  interesting  behavior  is  observed  for  substrates 
with various SiO2 thicknesses prior to sputter deposition of
the multilayers. Oxide thickness is varied from 100 to 10 000
Å in an attempt to understand the effect of an ultrathin oxide
film.  Thickness  of  the  underlying  oxide  buffer  will  be  very 
important  in  ultrahigh  density  magnetic  recording  applica¬ 
tions  where  intershield  distances  are  extremely  small  and  in¬ 
sulating  layers  extremely  thin.  Samples  are  prepared  by  wet 
oxidation  methods  at  1000  °C  and  no  POA  is  performed. 
Oxide  thickness  is  varied  by  controlling  oxidation  time.  Two 
sets  of  data  for  samples  from  two  different  multilayer  sputter 
depositions  reveal  only  slight  changes  in  the  magnitude  of 
the  MR  ratio  as  a  function  of  oxide  thickness.  The  MR  ratio 
dependence on oxide thickness is shown in Fig. 5. Comparison
of the data shows a distinct kink in the curve
at thicknesses in the range 100-1000 Å. This phenomenon
is currently under detailed investigation but is quite
reproducible, as evidenced by Fig. 5. Both sets of substrates
are  prepared  using  the  same  methods  but  differ  in  their  rela¬ 
tive  magnitude.  The  reason  for  the  differences  in  MR  values 
between the two sets is merely the variation observed
between separate depositions, which arises from the variation in
calibration of the sputtering target.

D.  Oxidation  and  anneal  temperature 

MR values do not vary greatly for oxidation temperatures
in the range 800-1000 °C. For this particular experiment,
samples are prepared by wet methods and thickness
is held constant at 1000 Å. MR ratios of 9.8%, 9.6%,
and 10% are obtained for oxidation temperatures of 800,
900, and 1000 °C, respectively, while 8.7% is obtained for
the commercially oxidized wafer included in that deposition.
Similar results are seen for samples with POA temperatures
in the range 600-1000 °C. Table I summarizes the experimental
results for each silicon wafer processing variation in
this study.

IV.  DISCUSSION 

MR  values  are  strongly  dependent  on  the  thickness  of  the 
multilayer  films.  The  experimental  results  shown  in  this  ar¬ 
ticle  come  from  measurements  at  room  temperature  on  as- 
deposited  multilayers.  For  one  large  sample  set,  MR  values 
of  approximately  1%  were  observed  for  a  variety  of  samples. 
When the same samples were measured again after 1 month,
a slight increase was observed for most samples. Some, however,
had doubled in value, and a few isolated samples gave
MR values of 4%-5%. This room temperature magnetic aftereffect
has been seen before in our laboratory^^ and is
consistent  with  gradual  change  from  ferromagnetic  to  anti¬ 
ferromagnetic  interlayer  exchange  coupling.  After  annealing 
the  multilayers  at  150  °C,  a  dramatic  increase  is  observed 
and  MR  values  range  from  approximately  3%  to  6%  overall 
with  some  samples  as  high  as  7%.  Further  annealing  at 
250  °C  brings  about  another  slight  increase.  Because  there  is 
an  optimum  thickness  for  efficient  exchange  coupling  be¬ 
tween  the  multilayers,  the  annealing  step  allows  one  to  ob¬ 
tain  increased  MR  values  for  multilayers  that  are  slightly 
thicker  than  optimal.^^ 

V.  SUMMARY 

The  sensitivity  of  GMR  to  processing  conditions  of  a  sili¬ 
con  dioxide  substrate  previous  to  sputter  deposition  of 
multilayer  thin  films  is  studied  in  the  present  work  by  moni¬ 
toring  comparative  magnitudes  of  resulting  magnetoresis¬ 
tance  values.  These  values  are  compared  to  values  obtained 
from  multilayers  sputter  deposited  on  commercially  oxidized 
silicon wafers. The magnitude of the MR ratio increases for oxidized
substrates that include a POA (1000 °C, nitrogen atmosphere,
20 min) and for substrates oxidized by wet methods. MR
values  appear  to  be  relatively  independent  of  the  thickness  of 
the oxide layer, and even films as thin as 100 Å produce
significant MR values. Our data suggest that GMR of the
multilayer stack is sensitive to changes at the Si-SiO2 interface.
Future plans include a more detailed study of the variation
in thickness of the underlying oxide film and investigation
of multilayers deposited on alumina substrates. This
work  can  be  useful  in  finding  optimum  processing  conditions 
and  determining  the  role  of  the  substrate  on  GMR. 
The local spin density approximation (LSDA) is used to investigate the interrelationships between
electronic properties, magnetism, structure, and transport in La1-xCaxMnO3. The LSDA is shown
to provide a realistic description of the endpoint compounds, including magnetic and structural
symmetry. Mn-O hybridization is strong and spin dependent. Supercell calculations for x = 1/3 and
x = 1/4 yield ferromagnetic configurations. Strong local La-Ca cation effects are found, which lead to
localization of the low density of minority spin carriers in this regime. Implications of this
effectively half-metallic electronic structure are discussed.


I.  INTRODUCTION 

The discovery of colossal magneto-resistance (CMR) in
films of La1-xCaxMnO3 and related Mn-containing
perovskites^”"^ raises the question of what is special about
these materials to permit extremely large resistivity reductions
in laboratory fields. The endpoints of this alloy system
are antiferromagnetic (AF) insulators, with different structural
and magnetic symmetries. CaMnO3 is a cubic perovskite
with a lattice parameter of 3.73 Å and G-type (rocksalt) AF
ordering,^ while LaMnO3 occurs in a distorted perovskite
structure (GdFeO3 type, Pnma, No. 62), with A-type AF
order.^’^ In the concentration range 0.2 < x < 0.5, the alloy
occurs in the cubic perovskite structure, and is metallic and
ferromagnetic (F) at low temperatures. It is in this regime
(x ≈ 1/3) that the CMR effect is observed.^’^

Goodenough^ addressed these materials early on with a
discussion of Mn d-O p hybridization leading to “semicovalent
exchange.” His theory led to connections between
magnetic order, crystal structure, and electronic transport.
The endpoints formally contain Mn3+ (d4) and Mn4+ (d3)
ions. Based on the neutron data,^ these are in high spin configurations,
though possibly reduced from Hund’s rule values
due to hybridization. In the perovskite crystal field, the t2g
orbitals are lower in energy than the eg, so that for CaMnO3
the t2g majority orbitals hold the three electrons, and the
minority spin orbitals on the Mn are unoccupied. In LaMnO3
there is one additional Mn d electron, leading to partial occupation
of the two majority spin eg states. This is presumed
to result in a Jahn-Teller instability, accounting for the
Pnma structural distortion.

It should be noted that there are additional driving forces
that may be operative. Rotational instabilities of oxygen octahedra
due to ion size mismatch resulting in this symmetry
are common in perovskites and the La and Mn ionic radii are
consistent with this. However, such a mechanism does not
generally lead to the substantial changes in transition
metal-O distance occurring in LaMnO3.

Zener^^ proposed a double exchange process to explain
the occurrence of a metallic ferromagnet in the phase diagram.
In his picture, the O ions are closed shell (O2-). Nonetheless,
hopping must occur through the O. This occurs
through a correlated electron motion in which an electron
jumps from O to a neighboring Mn while an electron hops
from a different Mn neighbor onto the O. This process requires
that both hopping electrons have the same spin. Since
the Mn are in the high spin configuration, the implication is
that for conduction, the active Mn ions must have the same
spin. While this does nicely account for the relationships
between transport and magnetic order in the (La,Ca)MnO3
alloy, it leaves certain questions unanswered, particularly the
role of hybridization and how the very low resistivities that
have been observed (~100 μΩ cm in films^^) are consistent
with a correlated hopping conduction process.

Here,  local  spin  density  approximation  (LSDA)  calcula¬ 
tions  are  used  to  elucidate  the  extent  and  role  of  the  hybrid¬ 
ization  and  to  establish  the  main  features  of  the  electronic 
structure in the CMR regime. Other electronic structure studies
are those of Sarma et al., Satpathy et al., and Butler
et al. The LSDA description is found to account quite well
for  established  experimental  facts.  Substantial,  spin  depen¬ 
dent  hybridization  is  found  along  with  effectively  half- 
metallic  character.  This  is  related  to  the  CMR  phenomena. 

II.  METHOD 

The calculations were performed with a generalized linearized
augmented plane-wave method. No shape approximations
are made to the potential or the charge or spin
densities in this method, and additionally a flexible basis is
employed in all regions of space. This makes the method
suitable for open structures with low site symmetries, as considered
here. Well converged linear augmented plane wave
(LAPW) basis sets of approximately 400 functions per formula
unit were employed, with the addition of local orbitals
to relax linearization errors and to treat the semicore states
(Ca 3s, 3p; La 5s, 5p; O 2s) on the same footing as the
valence states. The La 4f states were also treated self-consistently
with the valence states, and were found 2-3 eV
above the Fermi energy (E_F). Sphere radii of 1.55 and 2.0
a.u. were used for O and the cations, respectively. All calculations
were performed self-consistently. Scalar relativistic
Fig. 1. Band structure of CaMnO3 with G-type (rocksalt) AF ordering. The
designation 110 indicates a second X point lying outside the first Brillouin
zone.


effects  were  retained  for  the  valence  states,  while  core  states 
were  treated  fully  relativistically  in  an  atomiclike  fashion. 

The Brillouin zone sampling was performed using the
special k points method. A dense set of 42 points in the
irreducible wedge was used for the supercells in the crucial
x = 1/3 CMR regime. Use of a smaller set of 20 points yielded
identical moments to within 1%. The calculations were
performed for a volume per formula unit corresponding to that at
the CMR composition, x = 1/3. However, because of the near
Hund’s rule spin moments and the stiffness, which we find in
fixed-spin-moment calculations, only weak and continuous
variation with volume is anticipated.

III. CaMnO3

As mentioned, CaMnO3 is an AF insulator occurring with
the cubic perovskite structure. Calculations were performed
for this structure with various spin orderings: A-type AF,
G-type AF (observed), and F. The energies of these configurations
relative to a constrained nonspin-polarized calculation
are −0.92 (A-type AF), −0.98 (G-type AF), and −0.86
eV (F). These large energy gains, which are relatively independent
of the ordering, show the robustness of the magnetic
solutions, leading to the expectation that the moments, although
disordered and fluctuating, should persist relatively
undisturbed above the ordering temperature T_N. The Mn d
exchange  splitting  is  eV  for  each  ordering. 
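
The comparison of orderings can be made explicit with a few lines of Python. This is only a bookkeeping sketch built on the three energies quoted above (in eV per formula unit, relative to the constrained nonspin-polarized calculation); the variable names are illustrative.

```python
# Energies relative to the nonspin-polarized calculation, as quoted in the text (eV).
energies_ev = {
    "A-type AF": -0.92,
    "G-type AF": -0.98,
    "F": -0.86,
}

# The ground state is the ordering with the lowest (most negative) energy.
ground_state = min(energies_ev, key=energies_ev.get)

# Energy of each ordering measured from the ground state.
relative_ev = {
    name: round(energy - energies_ev[ground_state], 2)
    for name, energy in energies_ev.items()
}

print(ground_state)
print(relative_ev)
```

The spread between orderings (at most 0.12 eV) is small compared with the roughly 0.9 eV gained by spin polarization itself, which is the quantitative content of the statement that the magnetic solutions are robust.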

The  observed  G-type  AF  magnetic  ordering  has  the  low¬ 
est  energy  (—0.98  eV),  in  agreement  with  the  experimental 
symmetry.  Furthermore,  with  this  ordering  an  insulating 
band  structure  with  a  0.42  eV  gap  is  obtained  (Fig.  1).  This 
direct  gap,  which  occurs  at  the  X  point,  is  between  majority 
and  minority  spin  Mn  t2g  states  and  therefore  is  accessible 
only  through  spin-flip  transitions.  The  measured  optical  gap 
should  be  larger. 

The Mn t2g manifold is essentially completely polarized,
in accord with Hund’s rule. However, the band structure and
projections of the density of states (DOS) show substantial
hybridization with O 2p states, primarily affecting the majority
spin Mn states, which are below E_F and overlap O 2p
bands. One measure of the hybridization is through the reduction
from integral values of the moment. For the G-type
ordering, the moment in the Mn sphere is 2.48 μB. Bearing
in mind that 5% of the Mn d moment may lie outside the 2.0
a.u. sphere, this is in reasonable accord with the neutron
value of 2.65 μB.

The result for CaMnO3 with an imposed F ordering is
remarkable. The calculated magnetization is precisely 3 μB
in spite of considerable hybridization. This occurs because
the band structure is half-metallic (½M), with metallic majority
spin bands crossing E_F but a gap about E_F in the
minority spin bands. The lower lying majority Mn t2g states
are occupied and, as a result, overlap and strongly mix with
O 2p states. The minority Mn bands are 3 eV higher, and lie
above the O bands.

IV. LaMnO3

LaMnO3 occurs in a distorted Pnma crystal structure.
However, before describing calculations for the actual structure,
we discuss the cubic perovskite structure, which may be
more directly related to the CMR alloys. For this structure,
none of the F, A-type (observed), or G-type magnetic
orderings yielded an insulating band structure, and the F ordering
was lowest in energy. The bands for the F ordering
are very similar to those of half-metallic CaMnO3, but due to the
increased band filling the minority spin gap occurs below
E_F, and both spin channels are metallic. The F spin magnetization
is 3.38 μB, which is a considerable drop from the
Hund’s rule value (4 μB). The moment in the Mn sphere is
only 2.89 μB, reflecting again the importance of hybridization.

The distortion to the Pnma structure involves a rotation of
the oxygen octahedra and an additional strong Jahn-Teller-like
distortion that changes the Mn-O bond lengths, which
then differ by as much as 0.27 Å. The energy gain due to this
distortion is calculated as 0.23 eV per formula unit. This
energy difference is close to that obtained by Sarma et al.
Including the distortion, the A-type AF ordering (observed) is
lowered below the F state, and a 0.12 eV insulating gap
develops in the band structure. The removal of minority spin
carriers from E_F results in an increase of the moment to 3.31
μB within each Mn sphere. There is an additional polarization
of 0.07 μB within those O spheres where it is allowed by
symmetry. This is, however, a severe underestimate of the O
contribution, since the 2p states in O ions are very extended
and are not contained by a 1.55 a.u. sphere. In any case, the
results show that the Pnma distortion is necessary to describe
LaMnO3. However, when it is included, the LSDA gives a
quite reasonable description of this material.

V.  CMR  REGIME 

As shown above, the LSDA describes both endpoints of the alloy system,
including the magnetic ordering, insulating character, and structural
distortion. Accordingly, it is reasonable to proceed to the CMR alloys. In
this regime, (La,Ca)MnO3 has the cubic perovskite structure and ferromagnetic
order. We studied this range using ordered supercells with x=1/3 and x=1/4.

Fig. 2. DOS and projections onto the Mn spheres for the x=1/3 supercell with
the actual F ordering (left) and with an imposed ferrimagnetic ordering
(right). Majority (minority) spin is shown as positive (negative). The
projections are on a per atom basis. E_F is denoted by the dashed vertical
lines.


For an ordered x=1/3 supercell, La2CaMn3O9 with periodically repeating
[-La-Ca-La-] layers, F order is obtained with a moment of 3.51 μ_B per Mn. The
artificial cation ordering leads to two distinct Mn sites, one with eight La
neighbors (Mn_La-La), the other with four La and four Ca neighbors (Mn_La-Ca).
The moment on the Mn_La-La site is larger by 0.05 μ_B than that on the
Mn_La-Ca site, as might be expected. This difference, however, is much less
than would be expected in a purely ionic picture. The DOS and projections are
shown in Fig. 2. Flipping the Mn_La-La moment leads to a metastable
"ferrimagnetic" configuration that is 0.07 eV higher in energy.

An ordered body-centered-cubic symmetry supercell at x=1/4, La3CaMn4O12 (each
Mn has six La and two Ca neighbors), with F ordering displays similar
behavior, with a spin magnetization of 3.53 μ_B per Mn. As for the endpoint
compounds, the Mn d exchange splittings are Δ_ex ≈ 3 eV.
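The size of the hybridization-induced moment reduction can be made concrete by comparing the calculated spin magnetizations quoted above with a purely ionic count. The sketch below is illustrative only: it assumes La(3+), Ca(2+), and O(2-) ions, so that Mn carries 4 - x aligned d electrons, which is consistent with the Hund's rule value of 4 μ_B quoted for LaMnO3 but is not itself a result of the calculations. The function names are hypothetical.

```python
from fractions import Fraction


def ionic_moment(x):
    """Spin-only moment per Mn (in mu_B) in a purely ionic picture.

    Assumption of this sketch: La(3+), Ca(2+), O(2-), so Mn has
    valence 3 + x and 4 - x Hund's-rule-aligned d electrons.
    """
    return 4 - Fraction(x)


# LSDA spin magnetizations per Mn quoted in the text (mu_B), for F ordering.
# The x = 0 entry is the cubic-perovskite LaMnO3 value.
calculated = {Fraction(0): 3.38, Fraction(1, 4): 3.53, Fraction(1, 3): 3.51}


def reduction(x):
    """Ionic moment minus the calculated moment, in mu_B."""
    return float(ionic_moment(x)) - calculated[Fraction(x)]


for x in sorted(calculated):
    print(f"x = {x}: ionic {float(ionic_moment(x)):.2f} mu_B, "
          f"LSDA {calculated[x]:.2f} mu_B, reduction {reduction(x):.2f} mu_B")
```

In this counting the reduction shrinks as Ca is added, which is only a restatement of the quoted numbers, not an additional result.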

The DOS and projections show a certain constancy of behavior. The
differentiation of the Mn ions is a local environment effect that will have
its counterpart in the disordered solid. The projected DOS (PDOS) shows
hybridization with O 2p states in the majority bands at and below E_F. The
PDOS are strongly spin differentiated in the range -2 to 0 eV (relative to
E_F), and in this region the moment flip results simply in a flip in the PDOS
of the affected atom, while leaving its shape and the PDOS of the neighboring
Mn ions practically unchanged.

In the x=1/3 supercell, the small number of minority carriers are associated
practically exclusively with the Mn_La-La site; this Mn is surrounded by
trivalent La, and so it is not surprising that it is more attractive for
electrons. The other Mn ions remain with no minority spin carriers at E_F,
similar to F ordered CaMnO3. In the x=1/4 supercell, the minority spin band
edge is again slightly below E_F, and is intermediate in position between the
edges in the minority spin PDOS for Mn_La-La (8 La neighbors) and Mn_La-Ca
(4 La neighbors). The dependence of the minority edge on local environment is
illustrated in Fig. 3, where the minority spin band structure along Γ-X is
shown for the various calculations.

Fig. 3. Minority spin band structure along Γ-X for F ordered La1-xCaxMnO3,
x=0, 1/4, 1/3, and 1. Of the five bands near E_F at x=1/3, the lowest is
purely Mn_La-La derived and the upper pair is derived from the two Mn_La-Ca
sites; the middle pair involves all Mn ions. Note the decrease in minority gap
with Ca content. This arises from raising of the O p bands as the neighboring
cation charge decreases.

VI.  DISCUSSION 

CMR occurs in the cubic perovskite F ordered regime. The above calculations
show that in this case the majority bands have strongly metallic, well
hybridized Mn d-O p character. This is entirely consistent with the low
observed resistivities as temperature is reduced, since ordinary metallic
conduction is expected.

The minority spin band structures show strong variations in the position of
the band edge and number of carriers with x. As mentioned, this is a local
environment effect. F ordered CaMnO3 is a true half-metallic ferromagnet. The
ordered x=1/3 and x=1/4 supercells have low, site dependent carrier densities.
In reality, the A cation site is disordered, and this disorder will couple to
the position of the minority edge through the local effects discussed above,
and through local strain effects. The result will be localization of minority
carriers, where E_F occurs below a mobility edge in this spin channel. Thus F
ordered (La,Ca)MnO3 is predicted to be effectively half-metallic in the CMR
regime. For F alignments the majority spin bands are strongly covalent and
metallic, while the minority bands are insulating. This implies the lack of a
true Stoner continuum, although there is the possibility of new low energy
excitations involving a hole in the majority spin bands and an electron in a
localized minority spin state, or vice versa.

The local environment dependence of the electronic structure, discussed above,
implies that the effective half-metallic behavior will persist near flipped Mn
spins, such as would occur as temperature is raised, near abrupt domain
boundaries, or in locally Ca-rich regions where AF order is favored. This
provides a way to account for high resistivity in zero field near T_c. Due to
the half-metallic conduction, hopping of carriers between F regions with
opposite magnetization directions will be strongly suppressed. This is similar
to physics discussed for heterogeneous giant magnetoresistive (GMR) materials
like Co-Cu, but would be stronger because of the maximum half-metallic
asymmetry. Above T_c there is no net moment. Whether the moments become
disordered on an atomic scale or on a larger scale corresponding to several F
coupled spins, hopping between these units will be strongly suppressed when
the relative orientation differs. We speculate that the conduction may become
nonmetallic due to a combination of bandwidth narrowing (Mott localization)
and disorder (Anderson localization). If the locally aligned regions become
larger, as expected near T_c, it may be feasible to rotate them using applied
fields, providing a mechanism for magnetoresistance.

VII.  SUMMARY 

Clearly, there is a considerable distance to go in understanding the CMR
effect in manganites. Nonetheless, our calculations do provide some insight
into the (La,Ca)MnO3 system. The LSDA has been shown to provide a realistic
description of the materials. Effectively half-metallic transport, with
strongly hybridized conducting majority bands, and local environment effects
are found for the CMR alloys. This provides a ready explanation for the low
resistivity at low T, and suggests possible mechanisms for achieving high
resistivity and field dependence near and above T_c.

Recent spin-resolved x-ray photoelectron spectroscopy (SRXPS) studies of
ferromagnetic metals are reviewed. SRXPS studies of metallic Fe, Co,
Co66Fe4Ni1B14Si15, and Ni demonstrate that core-level photoemission, and the
itinerant electron response to core-hole creation, are highly spin-dependent.
The exchange splitting of the Fe 2p3/2 level is found to be 0.48±0.05 eV.
Lifetime broadening results for the Fe 2p3/2 N↑ (majority spin) and N↓
(minority spin) components indicate conservation of spin in core-hole filling
processes involving the valence band. SRXPS study of the Fe 2p3/2 asymmetry α
reveals a dependence of electron-hole excitation on the spin of the core hole.
Spin analysis of the Fe 3s XPS line shape shows it to be a three-component
spectrum, rather than the two-component line shape assumed previously. A
photon energy dependence of one of the Fe 3s components explains disagreement
among previous Fe 3s XPS results. Comparisons of SRXPS from Co metal and
Co66Fe4Ni1B14Si15 directly demonstrate the effect of a reduced atomic magnetic
moment on the spin dependence of core-level XPS. The behavior of lifetime
broadenings for the N↑ and N↓ Co 2p3/2 components shows that the reduced Co
magnetic moment found in the Co66Fe4Ni1B14Si15 amorphous glass is due to the
transfer of ↑-spin valence electron density to the ↓-spin valence band upon
glass formation. SRXPS also allows investigation of spin-dependent core-hole
screening processes and satellite production, as demonstrated in SRXPS studies
of ferromagnetic Ni. Future directions of SRXPS are also explored. © 1996
American Vacuum Society.


I.  INTRODUCTION 

Spin-resolved x-ray photoelectron spectroscopy (SRXPS) constitutes a local
convergence of two independent fields of research: electron spin polarization
studies and XPS investigations of core and valence electronic structure.
Pierce and Celotta have reviewed electron spin polarization research,
emphasizing spin-polarized electron scattering from surfaces. By the early
1980s, ultraviolet photoelectron spectroscopy (UPS) had been performed with
"spin resolution" via the coupling of electron spin detectors to electron
energy analyzers. Spin-resolved UPS studies probed the occupied spin-dependent
valence electronic structure of ferromagnets, providing unprecedented
information on the quantum mechanical origin of magnetism. However, the
emphasis on spin-resolved UPS tended to concentrate photoemission
instrumentation on synchrotron UV beam lines that did not extend to high
enough energy to allow spin-resolved core-level photoemission studies.

When the construction of the SRXPS laboratory began at Lehigh University in
1988, the importance of conducting spin-resolved photoemission in the XPS
regime was twofold. First, the resolution of electron spin in multiplet-split
line shapes would provide fundamentally new information about the nature of
the intra-atomic exchange interaction, the role of spin in XPS satellite
production, and spin-dependent effects in core-hole screening. For many years,
these fundamentally spin-dependent phenomena had been intensively
investigated, albeit without the benefit of spin analysis.


a)Electronic mail: lek@lehigh.edu


Second, SRXPS would allow element-specific magnetic measurement, a capability
sorely lacking in almost all probes of magnetism. Magnetic films, alloys, and
interfaces composed of two or more magnetic elements (heterogeneous magnetic
systems) are not amenable to spin-resolved UPS measurements because of the
overlap of valence band features from different elements. In contrast,
core-level XPS affords element specificity. A measurement of core-level spin
polarization by SRXPS could, via the core-valence intra-atomic exchange
splitting, provide a measurement of the local magnetic moment magnitude, while
providing the magnetic moment direction directly from the observed spin
polarization direction.

In order for SRXPS to provide even a semiquantitative probe of magnetism, the
fundamental physical processes governing the photoelectron spin polarization
at the instant of excitation needed to be understood. Additionally, the
effects on electron spin of elastic and inelastic scattering during
photoelectron transport out of the solid required study. This article reviews
recent SRXPS results that focus on these foundational issues. Section II
describes the SRXPS apparatus and the method of data acquisition and analysis.
Section III presents and discusses recent SRXPS results from ferromagnetic Fe,
Co, Co66Fe4Ni1B14Si15, and Ni. The future of SRXPS will be examined in Sec. IV.

II. SRXPS APPARATUS

A diagram of the SRXPS apparatus is shown in Fig. 1. The sample is located in
a UHV chamber constructed of mu metal. The x-ray source is an
unmonochromatized single-anode Mg Kα [hν = 1253.6 eV, 0.7 eV FWHM (full width
at half-maximum) natural linewidth] x-ray tube designed to operate
continuously at 800 W power. For SRXPS it is typically operated at 510 W. The
overall instrumental energy resolution is 1.6 eV FWHM. The electron energy
analyzer is a commercial V.G. ESCALAB MkII electron spectrometer (152 mm mean
radius, 150° spherical sector), and delivers energy resolved photoelectrons to
the electron spin detector.

Fig. 1. Schematic diagram of the SRXPS apparatus.

The spin polarization P of electrons photoemitted from a ferromagnetic metal
can be defined as

$$P = \frac{N_\uparrow - N_\downarrow}{N_\uparrow + N_\downarrow}, \tag{1}$$

where N↑ and N↓ are the numbers of photoelectrons with magnetic moments
oriented parallel and antiparallel, respectively, to the sample magnetization.
Due to the negative gyromagnetic ratio of the electron, the electron magnetic
moment lies antiparallel to the electron spin angular momentum. Although the
polarization P defined in Eq. (1) is really a "magnetic moment polarization,"
the term "spin polarization" is conventionally used for the quantity P. The
polarization P refers to the projection of the "spin" along the sample
magnetization direction.

Low-energy diffuse scattering from gold is the basis of the SRXPS
instrumentation. The spin detector relies on the spin-orbit interaction
(V_SO ∝ L·S) in electron scattering to produce a spin-dependent left-right
asymmetry in the intensity scattered off a gold target. The orbital angular
momentum L associated with the scattering lies normal to the scattering plane
at the gold target, and is the spin analysis axis of the detector. The sample
is magnetized in a direction parallel to the spin analysis axis.

The photoelectron spin polarization P can be related to the observed
scattering asymmetry A via the relation

$$P = \frac{N_L - N_R}{S\,(N_L + N_R)} = \frac{A}{S}, \tag{2}$$


where N_L is the number of counts registered in the "left" microchannel plate
detector, and N_R is the number of counts registered in the "right"
microchannel plate detector (Fig. 1). The factor S is the effective Sherman
function. The Sherman function characterizes the sensitivity of the detector
to spin polarization and must be determined through calibration. The
scattering energy for the diffuse scattering detector is 150 eV. A complete
description of the SRXPS spin detector construction and performance can be
found in Ref. 7.
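As a small numerical illustration of Eqs. (1) and (2), the following sketch converts detector counts into a polarization. The Sherman function value used in the example (0.1) and the count numbers are placeholders chosen for illustration; the actual effective Sherman function of the detector must come from calibration, as stated above. The function names are hypothetical.

```python
def polarization_from_spins(n_up, n_down):
    """Eq. (1): polarization from the spin-resolved photoelectron numbers."""
    return (n_up - n_down) / (n_up + n_down)


def polarization_from_detector(n_left, n_right, sherman):
    """Eq. (2): polarization from left/right detector counts.

    `sherman` is the effective Sherman function S of the spin detector.
    """
    asymmetry = (n_left - n_right) / (n_left + n_right)  # A in Eq. (2)
    return asymmetry / sherman


# Illustrative numbers only (not measured values).
S_ASSUMED = 0.1
print(polarization_from_spins(60.0, 40.0))
print(polarization_from_detector(5100.0, 4900.0, S_ASSUMED))
```

Because S is much smaller than one, a modest polarization produces only a small raw left-right asymmetry, which is why counting statistics dominate the measurement time.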

The SRXPS data are collected into four channels N_L^+, N_R^+, N_L^-, N_R^-.
Here, N_L^+ represents the number of electrons diffusely scattered to the left
(L) from the Au target in the spin detector when the sample magnetization is
positive (+). N_R^- is the number of electrons scattered to the right (R) from
the Au target when the sample magnetization has been reversed to the negative
(-) direction. The electron beam polarization P can then be expressed in terms
of these four channel counts.

SRXPS measurements using both (+) and (-) magnetizations remove from the
polarization data apparatus asymmetry effects unrelated to the spin of the
electron beam, and also average over any magnetic dichroism effects. The
polarization data can be separated into individual N↑ and N↓ SRXPS spectra for
the majority-spin (↑-spin) and minority-spin (↓-spin) photoelectrons via the
following equations: $N_\uparrow = \tfrac{1}{2} N (1 + P)$,
$N_\downarrow = \tfrac{1}{2} N (1 - P)$, where $N = N_\uparrow + N_\downarrow$
is the spin-summed intensity.

For SRXPS to reveal the role of spin in photoionization, ideally the spectral
spin polarization should be unaffected by photoelectron transport through the
magnetic solid. Alternatively, a spin-dependent transport process might
provide useful geometric structure information. In SRXPS studies of Fe(011),
the spin-resolved Fe 2p3/2 line shape (kinetic energy = 547 eV) showed no
detectable variation on scanning the emission angle through forward scattering
intensity maxima, corresponding to zeroth-order diffraction through
nearest-neighbor atoms. In addition, for the Fe/W(011) and Fe/Ag(001)
overlayer systems, the nominally unpolarized W 4f photoelectrons (kinetic
energy = 1222 eV) and Ag Auger electrons (kinetic energy = 353 eV) acquired no
detectable spin polarization as they traversed 30 Å of ferromagnetic Fe. These
unpublished results indicate that above 350 eV kinetic energy, ↑-spin and
↓-spin electrons elastically scatter with essentially the same cross section.
Therefore, the spin polarizations observed in SRXPS spectra arise from the
initial optical excitation, and not from electron scattering during
photoelectron transport out of the solid.
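The four-channel data reduction described in this section can be sketched as follows. The explicit expression for P in terms of the four channels did not survive in this copy, so the sketch assumes the geometric-mean "cross ratio" that is commonly used with reversed-magnetization data to cancel detector efficiency and beam intensity differences; this is an assumption, not necessarily the author's exact formula. The overall sign convention, the choice of the summed counts as the spin-summed intensity, and all function names are likewise assumptions.

```python
import math


def cross_ratio_polarization(n_l_plus, n_r_plus, n_l_minus, n_r_minus, sherman):
    """Polarization from four channels via a cross-ratio asymmetry.

    Assumed form (not taken from the text): detector efficiencies and
    the beam intensities for (+) and (-) magnetization cancel in the
    ratio of geometric means.
    """
    a = math.sqrt(n_l_plus * n_r_minus)
    b = math.sqrt(n_l_minus * n_r_plus)
    asymmetry = (a - b) / (a + b)
    return asymmetry / sherman


def spin_resolved(n_total, polarization):
    """Split a spin-summed intensity into N_up and N_down using P."""
    n_up = 0.5 * n_total * (1.0 + polarization)
    n_down = 0.5 * n_total * (1.0 - polarization)
    return n_up, n_down


# Synthetic check: unequal detector efficiencies and beam intensities.
S, P_TRUE = 0.1, 0.3            # illustrative values only
EFF_L, EFF_R = 1.0, 0.8         # hypothetical detector efficiencies
I_PLUS, I_MINUS = 1000.0, 900.0  # hypothetical intensities for (+) and (-)

n_lp = EFF_L * I_PLUS * (1 + P_TRUE * S)
n_rp = EFF_R * I_PLUS * (1 - P_TRUE * S)
n_lm = EFF_L * I_MINUS * (1 - P_TRUE * S)
n_rm = EFF_R * I_MINUS * (1 + P_TRUE * S)

p = cross_ratio_polarization(n_lp, n_rp, n_lm, n_rm, S)
n_up, n_down = spin_resolved(n_lp + n_rp + n_lm + n_rm, p)
print(p, n_up, n_down)
```

In the synthetic example the recovered polarization equals the input value even though the two detectors have different efficiencies, which is the property that motivates measuring with both magnetization directions.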

III.  RESULTS  AND  DISCUSSION 
A.  Fe  metal 

Ferromagnetic iron was the first subject of SRXPS study. Figure 2 shows SRXPS
results for the Fe 2p3/2 level. The lines through the data are simplex fits to
each spin component using a single Doniach-Sunjic (DS) line shape convoluted
with a Gaussian of 1.6 eV FWHM (the instrumental energy resolution). The Fe
2p3/2 core-level binding energy, lifetime width, and solid state line shape
asymmetry all show a strong spin dependence.

Fig. 2. Separate N↑ and N↓ SRXPS spectra for the Fe 2p3/2 majority-spin (▲)
and minority-spin (▼) photoelectrons, respectively. The lines through the raw
data are the result of a simplex fit to each spin component using a single
Doniach-Sunjic line shape convoluted with a Gaussian of 1.6 eV.

The exchange splitting between the N↑ and N↓ components is 0.48±0.05 eV.
Clearly, intra-atomic exchange is important even for deep core-level
photoemission, despite the fact that the 2p-3d exchange integral is limited by
the small (0.15 Å) radial extent of the 2p level. The N↑ SRXPS core-level
components generally have larger Lorentzian broadenings than the associated N↓
components. The Fe 2p3/2 N↑ and N↓ lifetime widths, obtained from the DS line
shapes in Fig. 2, are 0.56±0.04 eV and 0.44±0.04 eV, respectively. This spin
dependence indicates that core-hole filling processes involving the valence
band conserve spin, with the larger N↑ Fe 2p3/2 width attributable to the
preponderance of ↑-spin electrons in the Fe valence band. There have been no
spin-resolved calculations of core-hole lifetimes. Such calculations would be
an interesting area for future theoretical research.
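For orientation, the quoted Lorentzian widths can be converted to core-hole lifetimes with the usual estimate τ = ħ/Γ. This relation is a standard uncertainty-principle estimate and is an addition of this sketch; the text itself quotes only the widths.

```python
HBAR_EV_S = 6.582119569e-16  # reduced Planck constant in eV s


def lifetime_from_width(gamma_ev):
    """Core-hole lifetime (s) from a Lorentzian FWHM in eV, tau = hbar / Gamma."""
    return HBAR_EV_S / gamma_ev


# Fe 2p3/2 lifetime widths quoted in the text (eV).
fe_widths = {"N_up": 0.56, "N_down": 0.44}

for label, gamma in fe_widths.items():
    tau_fs = lifetime_from_width(gamma) * 1e15
    print(f"{label}: Gamma = {gamma:.2f} eV, tau = {tau_fs:.2f} fs")
```

Both lifetimes come out on the femtosecond scale, with the majority-spin core hole decaying faster, in line with its larger width.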

The DS line shape singularity index α is consistently larger for the N↑ SRXPS
components than for the N↓ components. For the Fe 2p3/2 level, the α values
are 0.46±0.02 for N↑ and 0.42±0.02 for N↓. The N↑ SRXPS peaks are therefore
more asymmetric toward higher binding energy than the N↓ peaks. This result is
the first indication that the electron-hole (e-h) pair excitation spectrum
accompanying core-hole creation is intrinsically spin-dependent and is
stronger for the majority-spin core hole. A speculative mechanism has been
offered for the spin dependence, but the problem needs theoretical attention.
Previous calculations of the itinerant e-h pair response to core-hole creation
treat the core hole as spinless.
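The roles of the lifetime width and the singularity index in the fits can be seen in a minimal sketch of a Doniach-Sunjic profile broadened by a Gaussian instrument function. The sketch uses the standard DS functional form with the constant prefactor omitted and a simple discrete convolution; the peak position of 707.0 eV and the energy grid are hypothetical choices for illustration, and no fitting (simplex or otherwise) is performed.

```python
import math


def doniach_sunjic(e_b, e0, gamma_fwhm, alpha):
    """Unnormalized DS line shape on a binding-energy axis.

    gamma_fwhm is the Lorentzian FWHM; alpha is the singularity index.
    The sign of u is chosen so the tail extends to higher binding energy.
    For alpha = 0 this reduces to a Lorentzian.
    """
    hw = gamma_fwhm / 2.0
    u = (e0 - e_b) / hw
    phase = math.pi * alpha / 2.0 + (1.0 - alpha) * math.atan(u)
    return math.cos(phase) / (hw * hw * (1.0 + u * u)) ** ((1.0 - alpha) / 2.0)


def gaussian_broaden(energies, intensities, fwhm):
    """Discrete convolution with a Gaussian, renormalized at every point."""
    sigma = fwhm / (2.0 * math.sqrt(2.0 * math.log(2.0)))
    out = []
    for e in energies:
        w = [math.exp(-0.5 * ((e - ej) / sigma) ** 2) for ej in energies]
        out.append(sum(wi * yi for wi, yi in zip(w, intensities)) / sum(w))
    return out


E0 = 707.0                                           # hypothetical peak position (eV)
energies = [700.0 + 0.05 * i for i in range(401)]    # 700 to 720 eV

# Width and asymmetry values quoted in the text for the Fe 2p3/2 N_up component.
raw_up = [doniach_sunjic(e, E0, 0.56, 0.46) for e in energies]
broad_up = gaussian_broaden(energies, raw_up, 1.6)   # 1.6 eV instrumental FWHM
print(max(raw_up), max(broad_up))
```

A larger α shifts weight into the high binding energy tail, while the 1.6 eV Gaussian lowers and widens the peak, which is why the intrinsic widths must be extracted by fitting rather than read directly from the spectra.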

The spin dependence of the Fe 3s spectrum was first measured by Hillebrecht,
Jungblut, and Kisker (Ref. 16) using 250-eV synchrotron radiation. However,
Qiu and co-workers reported that the Fe 3s photoemission line shape observed
at hν = 250 eV differs significantly from that observed at high photon energy.
These workers speculated that the use of a low photon energy enhances the
surface sensitivity of the measurement, with the varying Fe 3s line shapes
reflecting differences between the Fe surface and bulk magnetic properties.

Fig. 3. Separate N↑ (▲) and N↓ (▼) SRXPS spectra for the Fe 3s photoelectrons,
respectively. Binding energy is plotted against photoelectron counts (left
axis). The line through the raw N↑ data is a second-order, 11-point
Savitzky-Golay (SG) smooth. The line through the raw N↓ data is a
second-order, 15-point SG smooth. The dotted line above the SRXPS data is the
spin-summed spectrum (N↑ + N↓). The spin-summed binding energy is plotted
against intensity (right axis).
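The second-order Savitzky-Golay smoothing used for the curves through the Fe 3s data can be sketched in a few lines. The closed-form weights below are the standard ones for a quadratic (or cubic) local fit over an odd window; leaving the first and last few points unsmoothed is a simplification of this sketch, and the function names are hypothetical.

```python
def savgol_weights(points):
    """Smoothing weights for a quadratic SG filter with an odd window."""
    if points % 2 == 0 or points < 3:
        raise ValueError("window must be an odd number >= 3")
    m = points // 2
    denom = (2 * m + 3) * (2 * m + 1) * (2 * m - 1)
    return [3.0 * (3 * m * m + 3 * m - 1 - 5 * i * i) / denom
            for i in range(-m, m + 1)]


def savgol_smooth(y, points):
    """Apply the filter; edge points without a full window are left as-is."""
    w = savgol_weights(points)
    m = points // 2
    out = list(y)
    for k in range(m, len(y) - m):
        out[k] = sum(wi * y[k + i - m] for i, wi in enumerate(w))
    return out


# The 11-point weights are (-36, 9, 44, 69, 84, 89, 84, 69, 44, 9, -36) / 429.
print([round(429 * wi) for wi in savgol_weights(11)])

# A quadratic is reproduced exactly in the interior, so peak shapes that are
# locally parabolic are not flattened by the smooth.
quadratic = [0.5 * i * i - 3.0 * i + 2.0 for i in range(30)]
smoothed = savgol_smooth(quadratic, 15)
```

The wider 15-point window averages over more channels and so suppresses more noise, at the cost of broadening any feature narrower than the window.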

Figure 3 shows SRXPS data for the Fe 3s level. Figure 4 directly compares
high-photon energy SRXPS Fe 3s results with the 250-eV photon energy
spin-resolved data of Hillebrecht, Jungblut, and Kisker (Ref. 16). For
clarity, only the smoothed N↑ and N↓ curves from Fig. 3 are shown in Fig. 4.
It is clear from Fig. 4 that the overall Fe 3s line shape does vary with
photon energy. As the photon energy increases, component B evolves from a weak
shoulder at low photon energy to a distinct and prominent N↑ component in the
Fe 3s SRXPS spectrum. The agreement between Fe 3s XPS spectra collected at
normal emission and 70° off normal indicates that the differences observed in
Fig. 4 are a true photon energy dependence, and cannot be attributed to a
varying surface sensitivity.

Fig. 4. Thick lines: SG smooth curves for the Fe 3s SRXPS data of Fig. 3,
obtained at hν = 1253.6 eV. The left axis is applicable to the SRXPS data.
Thin lines: SG smooth curves for the Fe 3s 250-eV synchrotron data of Ref. 16.
The right axis is applicable for the data of Ref. 16. The N↑ curves have been
rigidly shifted vertically (offset) to clarify the presentation. The right
axis was scaled so that the two N↓ peaks were of equal height. However, the
relative intensities within a data set are accurate (i.e., N↑/N↓ intensity
ratios for the SRXPS data are accurately represented, as are the N↑/N↓
intensities from Ref. 16).

L. E. Klebanoff: XRP studies of ferromagnetic metals

The increase in the intensity of component B at larger photon energies
explains puzzling line shape results from spin-integrated photoemission
studies. The SRXPS data demonstrate that the "satellite to main peak"
intensity ratio, as gleaned from spin-integrated XPS data, is not a measure of
the N↑/N↓ intensity ratio, since ↑-spin component B lies on the high binding
energy side of ↓-spin component A. The increased prominence of component B at
high photon energy causes the "satellite to main peak" intensity ratio to
decrease with increasing photon energy, as indicated in previous two-peak fits
to the spin-integrated Fe 3s line shape. It is probable that experimental
variations in the Fe ΔE_3s values (from 4.2 to 4.9 eV) inferred from two-peak
fits to the Fe 3s spectrum are attributable in part to the photon energy
dependence of component B. The correct value of ΔE_3s for metallic Fe is
4.5±0.1 eV.

Since component B lies only 0.83 eV above component A, we do not believe peak
B represents a 3d "shakeup" transition in which the Fe 3d electron count is
decreased. Rather we believe peak B corresponds, qualitatively, to the
excited  atomic  term  of  the  atomic  4s^3d^  valence  con¬ 
figuration.  If  we  couple  (35^  ^S)  with  (4s^3d^,  ^G),  we 
form  a  single  ^G  final-state  configuration.  The  theory  of 
Dembczynksi^^  places  the  {4s^3d^,  ^G)  term  0.75  eV 
higher  in  energy  than  the  (45^3  ^G)  configuration,  close 

to  the  energy  separation  between  the  Fe  35  components  A 
and  B.  One  would  expect^^  this  ^G  final  state  to  be  acces¬ 
sible  only  by  the  photoemission  of  j-spin  35  electrons,  con¬ 
sistent  with  the  t-spin  polarization  of  component  B,  Since 
peak  B  represents  a  higher-energy  term  within  the  4s^3d^ 
valence  configuration,  its  intensity  may  turn  on  more  slowly 
with  photon  energy  than  components  A  and  C  which  derive 
from  multiplet  splitting  from  the  lowest-energy  final-state 
term. 

B.  Co  metal 

Ferromagnetic  cobalt  metal  has  a  magnetic  moment  of 
l.llfjbB  per  atom.^^  The  amorphous  ferromagnetic  metallic 
glass Co66Fe4Ni1B14Si15 belongs to a class of important technical
ferromagnetic materials extensively used in transformer technology. The Co
atoms in the magnetic glass have a reduced magnetic moment due to chemical
bonding with the metalloid (B,Si) components. Thus a comparison of Co
core-level SRXPS spectra from Co metal and the Co-based glass can reveal the
role of a varying valence spin polarization on an element's core-level SRXPS
spectra.

Figures 5 and 6 display SRXPS spectra for the Co 2p3/2 level for metallic Co
and the Co glass, respectively. The solid lines represent single DS fits to
the N↑ and N↓ components over the limited binding energy range 781-774 eV. The
2p3/2 fit is limited in energy because the low intensity Mg Kα3,4 x-ray
satellites extending up to 10 eV from the Co 2p1/2 peak at ~793 eV interfere
slightly with the spectral intensity from the 2p3/2 level above 782-eV binding
energy. The single-component DS fits are quite satisfactory and for the metal
(Fig. 5) yield the following spin-dependent values for the spectral binding
energy (BE↑,↓), the singularity index (α↑,↓), and the FWHM Lorentzian
broadening (Γ↑,↓). For the N↑ component: BE↑ = 777.84±0.03 eV, α↑ = 0.39±0.02,
and Γ↑ = 0.73±0.10 eV. For the N↓ component: BE↓ = 777.66±0.03 eV,
α↓ = 0.36±0.02, and Γ↓ = 0.49±0.10 eV. The SRXPS data reveal a Co metal 2p3/2
spin-dependent (exchange) splitting of 0.18±0.05 eV. The 2p3/2 SRXPS spectra
for the Co-based glass are shown in Fig. 6. The DS fit results can be
summarized as follows. For N↑: BE↑ = 777.88±0.03 eV, α↑ = 0.35±0.02, and
Γ↑ = 0.65±0.1 eV. For N↓: BE↓ = 777.82±0.03 eV, α↓ = 0.31±0.02, and
Γ↓ = 0.64±0.1 eV. The reduced exchange splitting for the Co-glass 2p3/2 level,
0.06±0.05 eV, compared with that of the metal, 0.18±0.05 eV, clearly
demonstrates that as the valence electron spin polarization decreases, the Co
2p3/2 exchange splitting decreases.

Fig. 5. Separate N↑ (▲) and N↓ (▼) SRXPS spectra for the Co 2p3/2 level of Co
metal. The lines through the data are the result of a simplex fit to each spin
component using a single Doniach-Sunjic line shape convoluted with a Gaussian
of 1.6 eV FWHM.

Fig. 6. Separate N↑ (▲) and N↓ (▼) SRXPS spectra for the Co 2p3/2 level of
Co66Fe4Ni1B14Si15. The lines through the data are the result of a simplex fit
to each spin component using a single Doniach-Sunjic line shape convoluted
with a Gaussian of 1.6 eV FWHM.
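The exchange splittings follow directly from the fitted binding energies quoted above, and the same bookkeeping shows how the spin dependence of the lifetime width collapses in the glass. The short sketch below only re-derives differences from the quoted fit parameters; the dictionary layout and function names are illustrative.

```python
# DS fit parameters quoted in the text: (binding energy in eV, Lorentzian FWHM in eV).
co_fits = {
    "Co metal": {"up": (777.84, 0.73), "down": (777.66, 0.49)},
    "Co glass": {"up": (777.88, 0.65), "down": (777.82, 0.64)},
}


def exchange_splitting(fit):
    """Binding-energy difference between the N_up and N_down components (eV)."""
    return fit["up"][0] - fit["down"][0]


def width_difference(fit):
    """Difference of the N_up and N_down Lorentzian widths (eV)."""
    return fit["up"][1] - fit["down"][1]


for name, fit in co_fits.items():
    print(f"{name}: splitting {exchange_splitting(fit):.2f} eV, "
          f"width difference {width_difference(fit):.2f} eV")
```

Both the splitting and the width difference shrink on going from the metal to the glass, and the two glass widths lie close to the average of the two metal widths (0.61 eV).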

As the Co 3d magnetic moment decreases in going from the metal to the glass,
the spin dependence of the 2p3/2 core-hole lifetime width Γ diminishes, with
the spin-resolved values Γ↑ and Γ↓ converging toward the average of Γ↑ and Γ↓
for the metal. Since the absolute spin-resolved core filling rates are
determined by the ↑-spin and ↓-spin populations in the 3d valence band, the
convergence of Γ↑ and Γ↓ indicates that there exist (per Co atom) fewer ↑-spin
3d electrons and more ↓-spin 3d electrons in the glass than found in the
metal. This is, of course, consistent with a lower magnetic moment for the Co
atoms in the glass, but also suggests that the reduction of the magnetic
moment is achieved by the transfer of ↑-spin 3d valence electron density into
the ↓-spin 3d valence band, thereby producing fewer ↑-spin electrons and more
↓-spin electrons per Co atom. This is direct confirmation that Co 3d
hybridization with the metalloid valence orbitals weakens the 3d
electron-electron interaction in the glass, thereby causing spin pairing in
the valence band. The data are inconsistent with a reduced moment caused by
the transfer of charge into the unoccupied minority-spin band from the
metalloid components, since this mechanism would not vary the ↑-spin valence
electron population, and therefore would not affect the Γ↑ value. In this way,
SRXPS linewidths can provide valuable element-specific information on
mechanisms by which valence electronic structure is modified in alloys.


C.  Ni  metal 

SRXPS is a powerful probe of the itinerant-electron response of the $N-1$ electron system to
core-hole creation. This is clearly shown in recent SRXPS studies of ferromagnetic Ni. The core
and valence XPS spectra of metallic Ni display a prominent "main" component that is accompanied
by satellites located at higher binding energies. The existence of satellites suggests a
variable final-state 3d electron count, influenced in part by core-hole screening.
Theoretically, the strong electron-electron interaction in Ni leads to an increased probability
for atomiclike valence-level excitations. Hüfner and Wertheim reported the absence of exchange
splitting in the main component of Ni 3s XPS, leading these workers to attribute the main 3s
core-level XPS component of Ni to an atomiclike final-state electron configuration of
$c3d^{10}$ ($c$ = core hole). Metallic Ni has a ground-state valence electronic configuration
of ~$(4sp)^{1}3d^{9}$. However, a $c3d^{10}$ final state could be theoretically generated from
the ~$(4sp)^{1}3d^{9}$ initial state upon the creation of a core hole. This could occur by
extra-atomic "d-wave" core-hole screening in which a previously unoccupied 3d level is pulled
below the Fermi level by the core-hole potential and occupied. The main components of
core-level XPS spectra of Ni metal have been extensively assigned to the $c3d^{10}$ final-state
configuration.

Fig. 7. Separate $N_\uparrow$ (▲) and $N_\downarrow$ (▼) SRXPS spectra for the Ni 3s
photoelectrons. The lines through the data are the result of an 11-point second-order SG smooth
through the raw $N_\uparrow$ and $N_\downarrow$ data points. The arrows mark energy positions
6.0 and 14.0 eV from the main $N_\uparrow$ component. Sample error bars are shown for both the
$N_\uparrow$ and $N_\downarrow$ spectra.
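The Fig. 7 caption mentions an 11-point second-order "SG" smooth. Taking SG to mean Savitzky-Golay (an assumption, since the caption does not spell it out), the sketch below shows what such a smooth does to a list of counts. The function names and the choice to leave the end points unsmoothed are illustrative and are not taken from the original analysis.

```python
def savgol_quadratic_weights(half_width):
    """Convolution weights of a second-order Savitzky-Golay smooth over 2m+1 points."""
    m = half_width
    norm = (2 * m + 3) * (2 * m + 1) * (2 * m - 1)
    # Closed-form weights of a least-squares quadratic fit evaluated at the window centre.
    return [3.0 * (3 * m * m + 3 * m - 1 - 5 * i * i) / norm for i in range(-m, m + 1)]


def savgol_smooth(counts, half_width=5):
    """Smooth interior points; the first and last `half_width` points are left unchanged."""
    m = half_width
    weights = savgol_quadratic_weights(m)
    smoothed = list(counts)
    for k in range(m, len(counts) - m):
        smoothed[k] = sum(w * counts[k + i] for w, i in zip(weights, range(-m, m + 1)))
    return smoothed


if __name__ == "__main__":
    # For an 11-point window the weights are (-36, 9, 44, 69, 84, 89, ...) / 429.
    print([round(w * 429) for w in savgol_quadratic_weights(5)])
```

Because the weights come from a local quadratic fit, any purely quadratic trend passes through the smooth unchanged, which is why peak positions are preserved better than with a simple moving average.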

More recently, many-body 3d configuration interaction (CI) calculations employing the Anderson
impurity model have described the Ni ground state as a mixture of $3d^{10}$, $3d^{9}$, and
$3d^{8}$ valence configurations with relative weights of ~32%, ~53%, and ~15%, respectively.
The relative weights can vary when a core-level electron is photoemitted. The final-state
configuration associated with the "main" core-level component has not been calculated, but a
simple estimate suggests that ~10% $c3d^{9}$ and ~90% $c3d^{10}$ could be typical for this
model.

SRXPS data for the Ni 3s level are shown in Fig. 7. The main Ni 3s ↑-spin component
($A_\uparrow$) is displaced 0.38 ± 0.06 eV to higher binding energy and has a smaller intensity
than the ↓-spin 3s component ($A_\downarrow$). These 3s data demonstrate that the final-state
Ni valence configuration upon core-hole creation is not purely $3d^{10}$, in disagreement with
previous assignments, since the nonmagnetic $3s^{1}3d^{10}$ configuration cannot produce the
Ni 3s main-component exchange splitting evident in Fig. 7.

Although the Ni SRXPS data demonstrate conclusively that the main-line component is not purely
$c3d^{10}$, the main Ni core-level XPS components could have a mixed $c3d^{9}$, $c3d^{10}$
character. An empirical argument can be constructed to qualitatively evaluate this possibility.
As discussed previously, there is strong evidence that the 3d electron count (and therefore the
spin polarization) in metallic Fe and Co does not change upon the creation of a core hole.
Figure 8 plots $\Delta E_{3s}$ versus the initial-state spin polarization (in electrons) for
Cu, Ni, Co, and Fe metals. The solid line is a linear least-squares fit to the Cu, Co, and Fe
data, excluding the Ni 3s data. Figure 8 can be used qualitatively to derive by interpolation
the metallic Ni 3s exchange splitting expected were the Ni 3d spin polarization to remain the
same as in the ~$3d^{9}$ initial state (0.56 electrons). For the Ni 3s level, this interpolated
splitting is 1.18 eV. The measured Ni 3s exchange splitting (0.38 eV) is ≈30% of the expected
value. If this decrease in core-valence exchange is completely attributed to a reduced valence
spin polarization in the photoemission final state, then a final-state 3d spin polarization of
0.17 electrons is inferred, corresponding to a 30% $c3d^{9}$ and 70% $c3d^{10}$ final-state
mixture. This estimate is probably an upper limit to the nonmagnetic $c3d^{10}$ contribution,
since the Ni 3p peak has a surprisingly large spin-dependent splitting and the magnitudes of
the SRXPS spin polarizations for Ni are [when compared to Co (Ref. 24) and Fe (Ref. 11)] more
consistent with a Ni final-state spin characteristic of the $(4sp)^{1}3d^{9}$ initial state.
Therefore, we believe that the true final-state Ni 3d valence composition following core
electron ionization is one of reduced (but not zero) magnetization, and is most likely ~50%
$c3d^{9}$ and ~50% $c3d^{10}$. This final-state composition estimate is consistent with an
itinerant-electron screening model proposed in Ref. 40.

Fig. 8. A plot of the 3s multiplet splitting $\Delta E_{3s}$ for Cu, Ni, Co, and Fe metals vs
the initial-state spin polarization in electrons. The line is the result of a linear
least-squares fit to the Cu, Co, and Fe data, excluding the Ni data. The Fe and Co metal
$\Delta E_{3s}$ values are taken, respectively, from Ref. 16 and from D. G. Van Campen et al.,
Phys. Rev. B 49, 2040 (1994).
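The interpolation argument reduces to simple arithmetic, sketched here with the numbers quoted in the text. It assumes, as the argument does, that the 3s exchange splitting scales linearly with the 3d spin polarization and that the $c3d^{10}$ configuration carries no polarization. The function name is illustrative only.

```python
def final_state_estimate(measured_ev, expected_ev, initial_polarization):
    """Return (splitting ratio, inferred final-state 3d spin polarization in electrons).

    Assumes the exchange splitting is proportional to the 3d spin polarization, so the
    ratio is also the c3d^9 fraction of a c3d^9 / c3d^10 final-state mixture.
    """
    ratio = measured_ev / expected_ev
    return ratio, ratio * initial_polarization


if __name__ == "__main__":
    ratio, polarization = final_state_estimate(0.38, 1.18, 0.56)
    print(f"measured/expected splitting: {ratio:.2f}")
    print(f"inferred final-state polarization: {polarization:.2f} electrons")
```

The unrounded ratio is slightly above 0.30; rounding it to 30% before multiplying by 0.56 electrons gives the 0.17 electrons quoted in the text.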

The spin polarization of the Ni main components may, alternatively, have an explanation in
localized models employing Anderson impurity theory. In these models, a main-component spin
dependence could arise from the admixture of $3d^{9}$ and $3d^{10}$ "atomiclike" valence
configurations in the final state associated with the Ni main component. The composition of
this admixture in the final state for the "main" Ni core-level component has not been
calculated, but a simple estimate suggests that it is primarily (~90%) $3d^{10}$ in character,
with a "small" (~10%) $3d^{9}$ admixture. The itinerant-electron screening model suggests that
the $c3d^{9}$ configuration survives at the ~50% level. Since the two models differ
quantitatively, but not qualitatively, in their predictions for the main components, the
main-component SRXPS data alone do not indicate a decisive preference for either model. The
main-component and satellite SRXPS data are, as a whole, more consistent with the
itinerant-electron screening model.

The "6-eV satellite" and a secondary satellite located ~14 eV from the main 3s peak (referred
to hereafter as the "14-eV satellite") both display a very large (nearly 100%) majority-spin
polarization. The large ↑-spin polarizations for the Ni 3s 6- and 14-eV satellites, combined
with the prominence of these satellites in Ni and the relative weakness of satellites in Cu
(a known $3d^{10}$ system), indicate that a 3d shakeup from a $3d^{10}$ configuration cannot
generate strong and discrete satellite structure. Thus, we believe that the $c3d^{9}$
final-state component in the main line, which results from extra-atomic screening via the 4sp
states above $E_F$ (Fig. 4 of Ref. 40), is the starting point for the 6- and 14-eV satellites
in Ni.

A central issue that distinguishes the core-level satellite problem from the valence satellite
problem is the extent to which the 3d valence electron participating in the "shakeup" is
coupled to the remaining and highly localized core-level electrons after core ionization. The
SRXPS data suggest that upon core-level ionization, the excitation of a Ni valence 3d electron
(initially in a ~$3d^{9}$ configuration) to an unoccupied state above $E_F$ does reduce the 3d
valence charge within the radial extent of the remaining core electrons. From the "point of
view" of the remaining electrons in the ionized core level, the 3d shakeup reduces the "local"
3d valence charge from ~$3d^{9}$ to ~$3d^{8}$. This mechanism differs conceptually from valence
3d ionization (which literally produces a $3d^{8}$ final state), for which the 3d shakeup is
thought not to change the total 3d count in the final state, and the 6-eV valence satellite is
interpreted with the $3d^{8}$ final-state valence configuration.

In the core-level scenario above, the 6-eV satellite appearing in Ni 3s SRXPS can be understood
(using atomic terminology only qualitatively appropriate for the metallic problem) as derived
from the dominant $^{1}G$ term of the Ni $3d^{8}$ final-state valence configuration. For 3s
photoemission, the 6-eV satellite would then be associated with the $^{2}G$ final-state term
resulting from the coupling of $3s^{1}(^{2}S)$ with $3d^{8}(^{1}G)$. In cases where LS coupling
is valid (i.e., spin-orbit interactions are zero or negligible), the spin component along the
magnetization direction is a conserved quantity. Conservation of this spin component suggests
that

$$\underset{(3d^{9})}{M_S^{i}} = \underset{(3d^{8})}{M_S^{f}} + \underset{(3d^{1})}{M_S^{su}} + \underset{(3s)^{1}}{M_S^{pe}}. \qquad (4)$$

Here, $M_S^{i}$ is the initial-state spin component along the magnetization direction (i.e.,
the eigenvalue for $S_z$ in the initial state), which in the $3d^{9}$ atomic case would be
$+\frac{1}{2}$ in units of $\hbar$. $M_S^{f}$ is the spin component of the final ionic state
($3d^{8}$) excluding the shakeup electron; $M_S^{su}$ is the spin component of the 3d electron
($3d^{1}$) involved in the shakeup (su); and $M_S^{pe}$ is the spin component of the detected
3s photoelectron (pe). If we assume that the 3d electron participating in the shakeup is a
↓-spin electron ($M_S^{su} = -\frac{1}{2}$), then the $^{2}G$ term (with
$M_S^{f} = +\frac{1}{2}$) can only be produced by ↑-spin 3s photoelectrons
($M_S^{pe} = +\frac{1}{2}$), i.e., $+\frac{1}{2} = +\frac{1}{2} - \frac{1}{2} + \frac{1}{2}$
from Eq. (4). These considerations explain the nearly 100% ↑-spin polarization observed for the
Ni 3s 6-eV satellite in Fig. 7.
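The bookkeeping in Eq. (4) can be checked with exact fractions. The sketch below solves the conservation relation for the photoelectron spin component, using the values assumed in the text; the function name is illustrative.

```python
from fractions import Fraction

HALF = Fraction(1, 2)


def photoelectron_spin(m_initial, m_final_ion, m_shakeup):
    """Solve Eq. (4), M_i = M_f + M_su + M_pe, for the photoelectron spin component M_pe."""
    return m_initial - m_final_ion - m_shakeup


if __name__ == "__main__":
    # 3d^9 initial state (+1/2), final ionic term with M_f = +1/2, spin-down shakeup electron.
    print(photoelectron_spin(HALF, HALF, -HALF))
```

With a ↓-spin shakeup electron the relation leaves only $M_S^{pe} = +\frac{1}{2}$, which is the ↑-spin photoelectron required by the argument above.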

We believe the secondary satellite at 14 eV is qualitatively derived from the $^{1}S$ term of
the $3d^{8}$ atomic multiplet. The Ni 3s 14-eV satellite would then be associated with the
$^{2}S$ final-state term arising from the coupling of $3s^{1}(^{2}S)$ with $3d^{8}(^{1}S)$.
Given the assumptions above, the $^{2}S$ peak (with $M_S^{f} = +\frac{1}{2}$) is also allowed
only by the photoemission of ↑-spin ($M_S^{pe} = +\frac{1}{2}$) Ni 3s electrons, as suggested
by Eq. (4) and observed in Fig. 7. The $^{2}S$ assignment for the 14-eV satellite explains its
large ↑-spin polarization and the higher binding energy for the $^{2}S$ state relative to the
$^{2}G$ term. However, its displacement of ~8 eV beyond the 6-eV satellite is rather large.
Calculations for atomic Ni by Combet-Farnoux and Ben Amar predict a $^{1}S$-$^{1}G$ ($3d^{8}$)
term separation of 4.7 eV. Theoretical results by Liebsch for metallic Ni place the $^{1}S$
peak only ~2 eV beyond the 6-eV satellite. The gas phase measurements of Schmidt et al. suggest
that the $^{1}S$ peak lies 2.3 eV to higher binding energy than the $^{1}G$ state in atomic Ni.
Thus the large $^{2}S$-$^{2}G$ separation is not adequately understood.

Calculations using Anderson impurity theory have considered the Ni 3s 6-eV satellite to consist
of $^{1}D$ and $^{3}D$ terms of the $3s^{1}3d^{9}$ final-state configuration. The essentially
100% ↑-spin polarization observed for the Ni 3s 6-eV satellite is inconsistent with a
$3s^{1}3d^{9}$ final-state configuration assignment for the satellite, since $^{1}D$ and
$^{3}D$ terms would lead to both ↑-spin and ↓-spin components in the Ni 3s 6-eV satellite,
which are not observed.

IV.  FUTURE  OF  SRXPS 

SRXPS has already provided valuable information on the spin-dependent nature of core-level and
valence photoemission, as well as on the magnetism of bulk ferromagnets. To fully exploit the
information content of SRXPS, comparison must be made with advanced theory of the SRXPS
process. The paper presented by Dr. Jose Menchero at this conference will report remarkable
calculations of Ni SRXPS that agree almost quantitatively with the Ni SRXPS experimental
results. Similar theoretical efforts extended to other ferromagnetic metals and to magnetic
interfaces are needed and will greatly increase the utility of SRXPS measurements. Calculations
of core-level exchange splittings, lifetimes, and asymmetries will be particularly helpful.

To date, these high photon energy SRXPS measurements have been conducted with a high-power
Mg Kα x-ray anode. This apparatus remains the only SRXPS facility that allows spin analysis of
deep core-level electrons while maintaining a reasonably high photoelectron kinetic energy and
a decent energy resolution (1.6 eV FWHM). SRXPS studies will greatly benefit from the
third-generation synchrotron sources being commissioned that will allow high-flux and
high-energy-resolution SRXPS in the photon energy range 1200-1800 eV.

SRXPS will be very profitably extended to investigations of heterogeneous magnetic interfaces
involving one monolayer of a magnetic metal deposited on a magnetic substrate. The element
specificity of SRXPS will allow the monolayer properties to be probed selectively, and the
magnitude and direction of the overlayer magnetic moment to be quantitatively determined. Such
SRXPS studies, when combined with determinations of interfacial geometric structure, will
greatly clarify questions of magnetic moment enhancement and creation for monolayer magnetic
systems.

The high kinetic energies of SRXPS spectra afford two important advantages: (1) the spin
polarization is unaffected by photoelectron transport through the solid, and (2) the inelastic
mean free path of the photoelectrons is 10-20 Å. These attributes allow SRXPS to probe magnetic
layers "buried" beneath up to 30-40 Å of another material. The effective depth sensitivity of
SRXPS can be varied by changing the angle of electron detection with respect to the surface
plane, providing SRXPS with a magnetic depth profiling capability. These attributes should
allow SRXPS to probe internal interfaces in multilayer magnetic films of technological
importance.
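The angle dependence of the depth sensitivity can be illustrated with the usual straight-line exponential attenuation law. That law is an assumption brought in for illustration and is not stated in the text; it ignores elastic scattering. The function name and the 15 Å mean free path (the middle of the quoted 10-20 Å range) are likewise illustrative.

```python
import math


def escape_fraction(depth_angstrom, imfp_angstrom, takeoff_deg):
    """Fraction of photoelectrons from a given depth that escape without inelastic loss.

    takeoff_deg is the detection angle measured from the surface plane, so the path
    length through the overlayer is depth / sin(takeoff angle).
    """
    path = depth_angstrom / math.sin(math.radians(takeoff_deg))
    return math.exp(-path / imfp_angstrom)


if __name__ == "__main__":
    for angle in (90.0, 30.0):
        fraction = escape_fraction(30.0, 15.0, angle)
        print(f"take-off angle {angle:.0f} deg: unscattered fraction {fraction:.3f}")
```

Under this assumption a layer buried 30 Å deep still contributes a measurable signal at normal emission, while moving toward grazing detection suppresses it strongly, which is the basis of the depth profiling described above.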
We evaluate the possibility of using dichroic electron energy loss spectroscopy (DEELS) as an
alternative to x-ray magnetic circular dichroism (XMCD). It is well known that electron energy loss
spectroscopy and x-ray absorption spectroscopy provide similar information. A simple semiclassical
model suggests that reflection DEELS might have a magnetic sensitivity similar to that of XMCD.
This sensitivity will be reduced, however, by multiple scattering of the probe electron before and
after the energy loss event. Thus, it is difficult to predict the magnitude of the DEELS effect.
Experiments were performed at the L edges of polycrystalline Fe, Co, and Ni thin-film samples
prepared in situ with a uniaxial magnetic bias. Even in these most favorable cases, the DEELS effect
is limited to less than one-tenth of related effects in XMCD. © 1996 American Vacuum Society.


Electron energy loss spectroscopy (EELS) is in many ways highly analogous to x-ray absorption
spectroscopy (XAS), as is well known. For example, near-edge reflection EELS spectra have been
shown to provide the same chemical information (projected densities of states) that can be
obtained from x-ray absorption near-edge spectroscopy. Similarly, the well-known structural
analysis technique of extended x-ray absorption fine structure spectroscopy has its EELS
analog: extended energy loss fine structure. In this article, we explore yet another reflection
EELS variant. This variant, which we call dichroic electron energy loss spectroscopy (DEELS),
is analogous to x-ray magnetic circular dichroism (XMCD).

To understand how EELS techniques are similar to XAS techniques, consider that XAS measures the
probability that a photon will be absorbed by a given material as a function of the photon
energy. In comparison, EELS measures the probability that a relatively high-energy electron
will lose a quantum of energy when it interacts with a material, as a function of the loss
energy. Put another way, photons in XAS induce transitions of core-level electrons to energies
above the Fermi level. In EELS, high-energy electrons induce similar transitions, and in so
doing, lose energy. But recall that electron-electron interactions are mediated by virtual
photons in the Coulomb interaction. Thus EELS may be considered a photoabsorption process,
where the photon is created very close to the absorbing atom. This is at the heart of the
XAS/EELS analogy.

There are two main characteristics which distinguish EELS techniques from XAS techniques: the
excitation source and the selection rules for energy absorption. In XAS, the excitation source
must be an energy-tunable photon source, usually a synchrotron radiation source. By comparison,
EELS requires only a simple laboratory electron gun source, which is relatively inexpensive and
widely available. The other distinguishing characteristic, the selection rules, determines
which transitions are observed. In XAS, dipole selection rules apply to a very good
approximation, and current techniques permit complete control of the photon polarization. In
EELS, however, the selection rules are not dipole limited, and the selection rules and virtual
photon polarization depend on the electron scattering geometry. This latter aspect can make the
interpretation of EELS more complicated than that of XAS. Nevertheless, by manipulation of the
EELS scattering geometry, it is sometimes possible to simulate dipole selection rules to a very
good approximation, and, as we explain below, also to control the polarization of the virtual
photons.

In recent years, a powerful new magnetic and element-specific XAS technique known as XMCD has
been developed and has been highly successful in the study of magnetic materials. Here, the
absorption coefficient of circularly polarized x rays shows large changes depending on the
relative orientations of the photon helicity and the local magnetic moment of a particular
atomic species in a material. XMCD thus provides information about those local magnetic
moments. In the present article, we propose that similar effects should be observable in EELS:
i.e., that for certain EELS scattering geometries, there is a net circular polarization of the
virtual photons that are exchanged between electrons. Thus the EELS scattering cross section
should depend on the orientation of local atomic moments relative to this circular
polarization.

A semiclassical model elucidates this process. Consider a classical point-charge electron with
energy $E_0$ incident on an Fe atom with impact parameter $b$. The electron interacts with the
screened Coulomb potential of the atom nucleus and undergoes scattering, as shown in Fig. 1. We
are interested only in electrons which undergo 90° scattering. We wish to estimate the virtual
photon amplitude caused by this incident electron near the atom core.

Fig. 1. Scattering geometry for the semiclassical model. An electron is incident upon an atom
at an angle of 135° with respect to the x axis and scatters into an angle of 225°. The electric
field at the atom varies in both magnitude and direction as a function of time.

At this point, it is already possible to see how this scattering geometry gives rise to
circular polarization. At the time when the electron makes its closest approach to the atom
($t = 0$), the electron is moving perpendicular to its radius from the atom center, along a
path well approximated by a circular arc. At this moment, the electric field at the atom
position is constant in magnitude but is rotating with some angular velocity $\omega$. This is
very similar to the situation when an Fe atom is immersed in a circularly polarized photon
field with energy $\hbar\omega$.

It was shown by Slater that the Fe 2p electrons move in a potential which is reasonably
approximated as $Z'/r$, where $Z' = Z - 3.15 = 22.85$, since for the 2p electrons the nuclear
potential, $Z/r$, is somewhat screened by the 1s, 2s, and other 2p electrons. Because we are
interested in transitions of 2p electrons, we assume in our model that the incident electron
moves in the same potential. In any case, the exact form of the scattering potential does not
qualitatively change the arguments presented here.

For a particle moving in a $1/r$ potential, the scattering path of the particle can be
integrated exactly, provided it scatters elastically. To make the problem more tractable,
therefore, we assume that the scattering path of an electron with energy $E_0$ and which loses
energy $\Delta E$ can be approximated by the elastic scattering path of an electron with energy
$E^{*}$, where $E^{*} = (E_0 + E_0 - \Delta E)/2$. This approximation is most valid when
$\Delta E \ll E_0$.

Now, the core electron experiences a time-dependent electric field $\mathbf{E}(t)$ due to the
incident electron, with

$$\mathbf{E}(t) = \frac{e}{R^{3}}\,\mathbf{R}(t), \qquad (1)$$

where $\mathbf{R} = [\mathbf{r}_{e}(t) - \mathbf{r}_{\mathrm{core}}]$, and $\mathbf{r}_{e}$ and
$\mathbf{r}_{\mathrm{core}}$ are the position vectors of the incident and core electrons,
respectively. The core electron is assumed to be at rest at the atom center.

To obtain the "virtual photon amplitude," we Fourier transform $\mathbf{E}(t)$. From the
resulting x and y components, which have a well-defined phase relationship, we construct the
right- and left-handed circularly polarized virtual-photon amplitudes.

Fig. 2. Plot of the "virtual photon amplitudes" of right- and left-handed circular polarization
(solid lines), and the net circular polarization (dashed line) of the electric field at the
atom. For an incident electron kinetic energy of 1100 eV, the position of the Fe 2p core energy
loss feature is marked.

The results of this simple calculation are shown in Fig. 2 for $E_0 = 1100$ eV. The abscissa is
chosen to be in units of $E_0$ since we find that the curves are almost independent of $E_0$
(for simpler geometries this is rigorously true). It is seen that low-energy virtual photons
(i.e., $\Delta E \ll E_0$) have almost equal right- and left-handed amplitude (linear
polarization). This comes from the fact that in the long-wavelength (low-frequency) limit,
$E_x(\omega) \to$ constant, while $E_y(\omega) \to 0$. The circular polarization $P$ increases
monotonically with increasing virtual photon energy.
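The semiclassical model can be sketched numerically. The code below is an illustrative reconstruction, not the authors' calculation: it follows a 90° Coulomb hyperbola in the $Z'/r$ potential at the mean energy $E^{*}$, Fourier transforms the field of Eq. (1) at the nucleus, and forms the two circular amplitudes. Atomic units, the trapezoid integration, all function names, and the amplitude-based definition $P = (A_{+} - A_{-})/(A_{+} + A_{-})$ are assumptions made here.

```python
import math

HARTREE_EV = 27.211386    # 1 hartree in eV
ECC_90 = math.sqrt(2.0)   # eccentricity 1/sin(theta/2) of a Coulomb hyperbola, theta = 90 deg


def reduced_frequency(e0_ev, loss_ev, z_eff=22.85):
    """Dimensionless frequency nu = omega * sqrt(a**3 / Z') in atomic units (hbar = 1)."""
    e_star = 0.5 * (e0_ev + (e0_ev - loss_ev)) / HARTREE_EV   # mean energy E*
    omega = loss_ev / HARTREE_EV
    return omega * z_eff / (2.0 * e_star) ** 1.5              # uses a = Z' / (2 E*)


def field_transforms(nu, ecc=ECC_90, max_phase=200.0, steps=20000):
    """Return (C, S): cosine transform of E_x and sine transform of E_y at the nucleus.

    The orbit is parametrised by F: x = a(ecc - cosh F), y = a*b*sinh F,
    r = a(ecc*cosh F - 1), t ~ ecc*sinh F - F, and dt = r dF in reduced units.
    The common prefactor of the field is dropped; it cancels in the polarization.
    """
    b = math.sqrt(ecc * ecc - 1.0)
    f_max = math.asinh(max_phase / (nu * ecc))   # stop once the phase nu*t is about max_phase
    h = f_max / steps
    c_sum = 0.0
    s_sum = 0.0
    for k in range(steps + 1):
        f = k * h
        ch, sh = math.cosh(f), math.sinh(f)
        r = ecc * ch - 1.0
        t = ecc * sh - f
        weight = 0.5 if k in (0, steps) else 1.0   # trapezoid end-point weights
        c_sum += weight * (ecc - ch) / (r * r) * math.cos(nu * t)
        s_sum += weight * b * sh / (r * r) * math.sin(nu * t)
    return 2.0 * h * c_sum, 2.0 * h * s_sum


def circular_polarization(nu):
    """Net circular polarization from the two circular amplitudes |C + S| and |C - S|.

    E_x(omega) = C is real (even in time) and E_y(omega) = iS is imaginary (odd in time),
    so the circular combinations (E_x -/+ i E_y)/sqrt(2) are proportional to C +/- S.
    """
    c, s = field_transforms(nu)
    a_plus, a_minus = abs(c + s), abs(c - s)
    return (a_plus - a_minus) / (a_plus + a_minus)


if __name__ == "__main__":
    nu_fe = reduced_frequency(1100.0, 710.0)   # Fe 2p loss at E0 = 1100 eV
    print(f"nu = {nu_fe:.3f}, P = {circular_polarization(nu_fe):.2f}")
```

In this sketch the x component of the field is even in time and the y component is odd, so their transforms are 90° out of phase. The two circular amplitudes are equal as the frequency goes to zero and separate as it grows. The sketch is not guaranteed to reproduce the curves of Fig. 2 in detail.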

For a 710 eV (0.65 $E_0$) energy loss, sufficient to excite an Fe 2p core electron, the
effective photon polarization is >60%. Although for such large energy losses our approximation
$\Delta E \ll E_0$ breaks down, we believe that Fig. 2 at least qualitatively describes the
polarization behavior for such scattering processes. It is known that with 100% circularly
polarized light, the XMCD signal at the 2p core energy of 3d transition metals can vary by as
much as 100%. Based on this simple semiclassical model, one might hope to see
magnetization-dependent changes of an EELS cross section as large as 60%. Such changes should
be easily measurable using standard techniques.

One obvious flaw with this model is that it neglects elastic scattering from all the other
atoms which surround the atom of interest. In a reflection EELS experiment at these kinetic
energies, it is well known that the incident electron undergoes multiple elastic scattering
before and after the inelastic EELS scattering event. What this means is that the incident and
outgoing directions of the incident electron are no longer well defined by the position of the
electron gun and detector. Clearly, multiple scattering will reduce the magnitude of the
dichroic effects, "blurring" them out as a function of scattering angle. Such blurring may not
be complete, however, and a residual dichroic effect probably remains. A priori, it is
difficult to predict the magnitude of such blurring effects. While it is possible, in
principle, to make such predictions using state-of-the-art multiple-scattering calculations, it
is in fact easier to simply attempt the experiment and measure the effect.

Fig. 3. Magnetization loops of a Si/ZrO$_2$ 1000 Å/Ni$_{80}$Fe$_{20}$ 100 Å/Fe 25 Å/Au 5 Å
sample taken along the easy and hard axes of its uniaxial bias.

To test these ideas, experiments were performed on thin films of Fe, Co, and Ni. All the
experiments were performed at the IBM Almaden Research Center in San Jose, CA. In order to see
dichroic effects, it is necessary to ensure that the films are uniformly magnetized. This was
accomplished by depositing them on magnetically biased Ni$_{80}$Fe$_{20}$ films, 100 Å thick,
deposited on polycrystalline 1000 Å ZrO$_2$ films on Si wafers. The Ni$_{80}$Fe$_{20}$ films
were biased by growing them in a magnetic field using a small permanent magnet mounted directly
behind the sample. Such polycrystalline films have a uniaxial magnetic anisotropy field of
about 10 Oe, which ensures single-domain magnetization. On top of these Ni$_{80}$Fe$_{20}$
films, elemental films of ~25 Å Fe, Co, and Ni were deposited. Preliminary experiments showed
that over the course of the EELS experiments, Ni films became contaminated with C and O due to
electron-beam decomposition of residual chamber gases. For this reason, subsequent depositions
of Fe and Co films were capped with ~5 Å Au and Ag films, respectively, before transfer to the
analysis chamber. Then the backing magnet was removed, and the samples were transferred, in
vacuo, into an EELS analysis chamber with a base pressure of 2X10~^^ Torr.

Figure 3 shows typical magnetization loops from a sample with structure Si/ZrO$_2$ 1000
Å/Ni$_{80}$Fe$_{20}$ 100 Å/Fe 25 Å/Au 5 Å, after removal from the ultrahigh vacuum system.
These loops were taken along the easy and hard directions of the film's uniaxial bias. The
easy-axis loop shows almost 100% remanence, with a coercive field of ≈3 Oe. This coercivity is
larger than any expected stray magnetic fields in our apparatus. Thus, the samples were
effectively 100% magnetized during the DEELS measurements.

We present data from two experiments involving Fe and Co films, taken with incident electron
energies of ~1110 and ~1210 eV, respectively. These incident kinetic energies were chosen to
(a) maximize the right-handed polarization amplitude (following Fig. 2) and (b) eliminate any
interference from nearby Auger features. The EELS intensity was measured using a Vacuum
Generators hemispherical electron energy analyzer with an angular acceptance of a 5° half-angle
cone and 1 eV energy resolution, using standard lock-in techniques. Measurements were made in
both first and second derivative modes, and we present second derivative data since in this
case peak positions are more easily compared with XAS spectra. Comparing derivative spectra
should preserve the magnitude of observable dichroism effects.

To see dichroism, it is necessary to vary the relative directions of circular polarization and
magnetization. This was accomplished through a rotation of the sample before the electron
energy analyzer. Electrons are incident on the sample at about 20° with respect to the surface
plane and with an in-plane projected direction perpendicular to the sample magnetization M.
Electrons are measured at normal emission. This configuration puts the scattering plane
perpendicular to M, and thus the net circular polarization (helicity) is parallel to M. After a
spectrum is taken at this position (0°), the sample is rotated 180° about the surface normal
(under computer control), and a second spectrum is taken with the helicity antiparallel to M.
These two orientations should show the greatest changes due to magnetic dichroism. In addition,
spectra were taken in geometries with M parallel to the scattering plane (90° and 270°). The
latter two spectra should show no circular dichroism effects.

Figure 4 shows the results of our experiments. We plot only the spectra measured at 0° and
180°. It is seen that no significant differences are observed between the spectra for either
the Fe or the Co film. Furthermore, these spectra are not significantly different from the
spectra obtained along either the 90° or 270° directions. In addition to these measurements, we
have obtained similar results for incident electron kinetic energies up to 2.5 keV on various
Fe, Co, and Ni films, and also at the 3p absorption edge of Ni. The results from the Ni films
were similar, but of somewhat lower quality due to the sample contamination mentioned above.
Based on these data, we estimate that while magnetic dichroism effects probably do exist in the
EELS of these materials, they must amount to less than 5% of the total absorption coefficient.

This is the main result of the present article. A semiclassical
calculation suggests the possibility that EELS can provide a
convenient chemically sensitive probe of local atomic
magnetization. However, the absolute magnitude of these effects is
difficult to predict. We have shown empirically that these effects
are small (<5%) even for the most advantageous cases (i.e., Fe,
Co). We believe it is likely that some dichroic effects do exist
for these materials, and more sensitive measurements of EELS would
likely prove their existence. However, because these effects are
small, and because of the complications caused by electron multiple
scattering, we feel it is unlikely that the DEELS technique can
effectively compete with XMCD as a local probe of atomic moments.

Fig. 4. Electron energy loss spectra from Fe and Co films taken
with the magnetization parallel (solid) or antiparallel (dashed) to
the net helicity of the electric field for this scattering
geometry. Difference spectra are plotted below, in each case. It is
seen that any dichroism effects are smaller than 5% of the total
EELS cross section for these films.

A new group of techniques predicated on the availability of a tunable source of circularly polarized
soft x-rays shows tremendous potential for advancing the understanding of magnetic nanostructures.
These techniques involve the use of circularly polarized radiation as a probe in x-ray absorption,
x-ray photoemission, and photoelectron diffraction studies, and provide unique information about the
element specific magnetic moments and magnetic structure of thin films, interfaces, and surfaces.
Examples of each technique will be discussed with emphasis given to the recently demonstrated first
observation of spin-dependent photoelectron diffraction using circularly polarized x-rays. This
technique promises the direct, element specific determination of local magnetic structure on an
atomic scale, and will be of tremendous utility in the effort to establish structure-property
relationships in magnetic nanostructures. Circularly polarized x-rays are used to produce
spin-polarized photoelectrons from the Fe 2p spin-orbit doublet. Intensity asymmetries of up to
~3% are observed for the 2p3/2 sublevel. These asymmetries depend on the relative orientation of
the x-ray polarization vector and the sample magnetization as well as upon the photoelectron kinetic
energy and emission direction. Multiple scattering calculations are found to reproduce both the
observed energy and angular asymmetry variations. Strategies to increase the measured effect are
also discussed. © 1996 American Vacuum Society.


I.  INTRODUCTION 

The requirements of the magnetic storage industry coupled with the
development of modern thin film growth techniques have led to the
production of new artificially structured nanoscale magnetic
materials with unique magnetic properties.^ The preparation and
ultimately the control of the properties of these systems depend
critically on the development of a better understanding of the
magnetism of surfaces, interfaces, and thin films. A central issue
in this development is the subtle but critical relationship between
structure and magnetic properties. In order to better understand
these issues, a number of new sophisticated sample characterization
techniques have been developed. One such set of techniques is based
upon the recent availability of circularly polarized x-rays over a
wide energy range at synchrotron radiation sources. These
techniques fall generally under the category of magnetic x-ray
circular dichroism (MXCD) techniques, which include the application
of circularly polarized x-rays in absorption^ and photoemission
studies.^

In x-ray absorption studies MXCD provides element specific
information about the magnetic moments (both spin and orbital) of
magnetic materials.^’"^ Also, pioneering spin-polarized x-ray
absorption fine structure studies using MXCD techniques have
demonstrated a sensitivity to magnetic structure in bulk magnetic
systems.^ Core level photoemission, utilizing circularly polarized
x-rays, probes the spin-orbit and exchange interactions and their
effect on the core level spectral features.^ This technique is
complementary to spin-polarized photoemission studies that measure
the spin of the emitted electrons.^ All these experimental
techniques draw heavily on calculational support for proper
interpretation.

This article details the application of MXCD techniques to the
field of photoelectron diffraction. Specifically, the circularly
polarized x-rays are used to create an “internal” source of spin
polarized photoelectrons. By comparing the angular and energy
dependent intensity variations of oppositely spin-polarized
photoelectrons it is possible to isolate the magnetic contribution
to the scattering cross section for the photoelectrons. This
technique can therefore be viewed as spin-polarized photoelectron
diffraction (SPPED), and, in the same way that conventional
photoelectron diffraction has been demonstrated to be sensitive to
local atomic structure,^ is sensitive to local magnetic structure.
Furthermore, SPPED is surface sensitive and element specific, and
as such will ultimately prove to be a valuable tool in the study of
nanoscale magnetic systems.

The remainder of this article will detail the general development
of MXCD techniques, particularly where they impact the development
of MXCD-SPPED. The issues critical to the MXCD-SPPED experiment
will be discussed extensively, and first results from fcc Fe films
grown on Cu(001) will be presented. The interpretation of these
results hinges on comparison to spin-polarized, multiple-scattering
calculations. The calculational details along with comparison to
experimental results are presented. Sensitivity to magnetic
structure is demonstrated by varying structural parameters in the
model calculations. Finally, the results of calculations for the
complete, or double, polarization experiment, in which circularly
polarized x-rays are used to produce an internal source of
spin-polarized photoelectrons and the spin of the emitted
photoelectron is measured, are presented, and the implications for
future SPPED studies are discussed.

II.  MAGNETIC  X-RAY  CIRCULAR  DICHROISM 

All MXCD techniques are predicated on the availability of a tunable
source of circularly or elliptically polarized x-rays. The most
common usage of circularly polarized x-rays is in x-ray absorption
spectroscopy where the photon energy is scanned through an
absorption threshold exciting a core electron into an unoccupied
valence state.^ For transition metals, magnetic properties are
determined primarily by the d-electrons, so most MXCD absorption
studies of transition metal magnetism have concentrated on the L2
and L3 absorption edges that involve 2p to d transitions, and yield
information about the empty d-states in the material. In MXCD
absorption, the polarization vector, or helicity, of the incident
photons (along the photon propagation direction for left circularly
polarized photons and opposite the propagation direction for right
circularly polarized photons) is aligned or anti-aligned with the
sample magnetization. In this arrangement, it is a simple matter to
show that the important dipole selection rules are Δm_j = +1 for
right circularly polarized x-rays and Δm_j = −1 for left circularly
polarized x-rays. If the d-band is not exchange split and an equal
number of majority and minority spin vacancies exist, then
excitation with circularly polarized x-rays produces 62.5% (37.5%)
minority spin electrons from the 2p3/2 (L3 edge) and 25% (75%)
minority spin electrons from the 2p1/2 (L2 edge) for excitation by
right (left) circularly polarized radiation.^ In ferromagnetic
materials, the exchange interaction leads to an imbalance in the
occupation of majority and minority spin states and will therefore
favor transitions involving one spin state. Thus the absorption
intensities at the L3 and L2 edges will vary for different relative
orientations of photon helicity and magnetization, and these
variations contain information about the spin and orbital
components of the magnetic moment of the sample.
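The minority-spin fractions quoted above translate directly into a net spin polarization for each sublevel. The following is a minimal sketch of that arithmetic only; the function name and the sign convention (positive means majority-spin excess) are assumptions made for illustration and are not taken from the article.

```python
# Minority-spin fractions quoted in the text for right circularly polarized
# (RCP) x-rays; the left circularly polarized (LCP) values are the ones the
# text gives in parentheses.
MINORITY_FRACTION = {
    "RCP": {"2p3/2": 0.625, "2p1/2": 0.25},
    "LCP": {"2p3/2": 0.375, "2p1/2": 0.75},
}


def spin_polarization(minority_fraction):
    """Return P = (N_maj - N_min) / (N_maj + N_min) for fractions summing to 1.

    Sign convention (an assumption of this sketch): P > 0 is a majority excess.
    """
    if not 0.0 <= minority_fraction <= 1.0:
        raise ValueError("minority_fraction must lie between 0 and 1")
    majority_fraction = 1.0 - minority_fraction
    return majority_fraction - minority_fraction


if __name__ == "__main__":
    for helicity, levels in MINORITY_FRACTION.items():
        for level, fraction in levels.items():
            print(helicity, level, f"{100 * spin_polarization(fraction):+.0f}%")
```

With this convention the two sublevels carry opposite polarization (25% and 50% in magnitude), and reversing the helicity reverses both signs.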

In MXCD applied to core level photoemission, one again relies on
the effects of proper orientation of the photon helicity and
magnetization. Here, however, instead of exciting the core
electrons to empty levels just above the Fermi energy, the photon
energy is selected so that the excited photoelectron escapes the
solid with kinetic energy typically 100 eV or greater. Thus, there
is no spin imbalance in the available final states, and the large
intensity variations observed in MXCD absorption are absent.
Instead one observes (small) shifts in the kinetic energy of the
emitted electrons that depend upon the relative orientation of the
photon helicity and magnetization.^ The origin of these shifts
again rests upon the production of oppositely spin-polarized
photoelectrons from the 2p3/2 and 2p1/2 levels of a ferromagnetic
material by circularly polarized radiation. For example, for
antiparallel orientation of helicity and magnetization, the 2p3/2
photoemission peak is derived from primarily majority spin
electrons and the 2p1/2 peak from primarily minority spin
electrons. The opposite spin polarization in each peak is achieved
for a parallel orientation of helicity and magnetization. The
effect of the magnetic exchange interaction is then to shift the
2p3/2 photoemission peak to lower kinetic energy and the 2p1/2 peak
to higher kinetic energy than is observed for the parallel
orientation. These shifts have been observed in both bulk and
monolayer Fe systems.^ The initial investigations of MXCD core
level photoemission concentrated on angle integrated effects, but
subsequent studies, both experimental and theoretical, demonstrated
that the spectral features were also dependent upon photoelectron
emission angle.^ These angular variations can be used to further
elucidate details of the spin-orbit and magnetic exchange
interactions; however, the primary interest of the studies
described herein is the spin dependence of the photoelectron
scattering processes in magnetic materials. Ultimately it is this
information that provides the sensitivity to the local magnetic
structure.


III.  SPIN-POLARIZED  PHOTOELECTRON 
DIFFRACTION 

The first studies of SPPED involved the use of s core level
multiplet splitting in transition metal antiferromagnetic salts as
a source of internally spin-polarized photoelectrons.^ For example,
the relative intensities of the oppositely spin-polarized Mn 3s
doublet states were observed to exhibit temperature and emission
direction dependent fluctuations which were used to examine short
range antiferromagnetic ordering at the surface of KMnF3. The
extension of this technique to metal surfaces, interfaces, and thin
films has proved a difficult task that has suffered from the small
3s cross section and the absence of sufficiently resolved multiplet
states in many materials. Thus, the present approach to SPPED was
developed as an alternative. In this approach, excitation of a
ferromagnetic transition metal 2p core level with circularly
polarized radiation is used to produce a source of internally
spin-polarized photoelectrons. The mechanism is the same as that
outlined in the previous section and relies on the proper relative
orientation of photon polarization and sample magnetization. Recall
that in an atomic picture with equal numbers of minority and
majority spin state vacancies the 2p3/2 and 2p1/2 peaks consist of
62.5% (37.5%) and 25% (75%) minority spin electrons for excitation
using right (left) circularly polarized x-rays. In this approach,
the photoelectron spin polarization can be reversed by reversing
either the sample magnetization direction or the helicity of the
exciting radiation.


JVST  B  -  Microelectronics  and  Nanometer  Structures 


3154 


Waddill et al.: Surface and thin film magnetic structure

Following photoexcitation, the spin-polarized electrons scatter
from near neighbor atoms. This scattering contains spin-dependent
terms that vary according to the orientation of the photoelectron
spin with respect to the net spin of the scattering atom. The
resulting emission pattern of photoelectrons will therefore reveal
details of the local magnetic as well as geometric structure due to
the presence of exchange terms in the interaction Hamiltonian. The
spin-dependent part of the scattering process can then be isolated
by reversing the relative orientation of the photon helicity and
sample magnetization. This is accomplished by reversing either the
photon helicity or the sample magnetization. Comparison of the
results of both methods provides a crucial internal consistency
check of the results since the spin-dependent diffraction processes
depend only on the relative orientation of the photon polarization
and sample magnetization, and not on the absolute orientation of
either.
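The consistency check described above rests on a simple bookkeeping fact: only the product of the two signs matters. A minimal sketch, assuming helicity and magnetization are each reduced to a sign (+1 or -1) along the common axis; the function name is hypothetical.

```python
def relative_orientation(helicity, magnetization):
    """Return +1 for parallel and -1 for antiparallel alignment.

    Both arguments are signs (+1 or -1) along the shared axis; this reduction
    to signs is an assumption of the sketch, not the article's formalism.
    """
    if helicity not in (1, -1) or magnetization not in (1, -1):
        raise ValueError("helicity and magnetization must be +1 or -1")
    return helicity * magnetization


if __name__ == "__main__":
    # Reversing either quantity flips the relative orientation;
    # reversing both restores it.
    for h in (1, -1):
        for m in (1, -1):
            print(f"helicity={h:+d} magnetization={m:+d} ->",
                  relative_orientation(h, m))
```

Spectra taken in the two configurations that give the same product should agree if the measured effect is purely magnetic.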

IV.  EXPERIMENT 

The experiments were performed at the Stanford Synchrotron
Radiation Laboratory (SSRL) using a spherical grating monochromator
capable of delivering ~90% circularly polarized photons over the
energy range from approximately 100 eV to 1000 eV. Four monolayer
(ML) Fe films grown on a Cu(001) substrate held at 150 K resulted
in a relatively poorly ordered metastable fcc Fe overlayer as
evidenced by a diffuse p(1×1) low-energy electron diffraction
(LEED) pattern consistent with previous studies of this system.^
These films exhibit a magnetic easy axis along the sample normal,^
and were magnetized in situ by means of an electromagnetic coil
capable of producing fields up to 3 kOe. All measurements were made
in remanence. The Fe 2p spectra were collected with an
angle-resolving hemispherical analyzer with angular acceptance of
±3°. The perpendicular magnetic axis dictated that the
photoelectron polarization is maximized for normal photon
incidence. The analyzer position was then adjusted to the desired
electron emission angle. Emission along two high symmetry
directions was investigated. In the first, the scattering plane was
defined by the sample normal and the (110) azimuth with emission
along the [111] direction, and in the second the scattering plane
was defined by the sample normal and the (100) azimuth with
emission in the [110] direction. Both geometries are in mirror
planes where, as will be discussed shortly, nonmagnetic photon
polarization effects vanish. Also, in both cases the scattering
plane coincided with the electron orbital plane in the storage ring
(horizontal). Photon energies between 800 and 860 eV were employed,
giving rise to Fe 2p3/2 kinetic energies of about 90-150 eV.
Angular alignment was determined using combined LEED and laser
reflection measurements.
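The photon energies and kinetic energies quoted above are related by a constant offset. As a rough sketch, assuming a single effective offset of 710 eV (inferred here from the quoted pairs 800 to 90 eV and 860 to 150 eV, and standing in for the Fe 2p3/2 binding energy plus any work function term; it is not a value stated in the article):

```python
# Assumed effective offset in eV, inferred from the photon-energy and
# kinetic-energy ranges quoted in the text; not a value given by the authors.
FE_2P32_OFFSET_EV = 710.0


def kinetic_energy(photon_energy_ev, offset_ev=FE_2P32_OFFSET_EV):
    """Approximate photoelectron kinetic energy: photon energy minus offset."""
    energy = photon_energy_ev - offset_ev
    if energy <= 0:
        raise ValueError("photon energy is below the assumed threshold")
    return energy


if __name__ == "__main__":
    for hv in (800.0, 835.0, 860.0):
        print(f"hv = {hv:.0f} eV -> kinetic energy ~ {kinetic_energy(hv):.0f} eV")
```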

Typical Fe 2p spectra for emission along the [110] (lower panel)
and [111] (upper panel) directions are shown in Fig. 1. These
spectra were collected using a photon energy of 835 eV giving a
kinetic energy of ~125 eV at the Fe 2p3/2 peak. The spectral
differences discussed below were found to depend only on the
relative orientation of photon polarization and sample
magnetization, and not on the absolute orientation of either. In
both sets of spectra, the apparent spin-orbit splitting is reduced
on going from a parallel to an antiparallel orientation of photon
and minority electron spin. As discussed previously, these energy
shifts are due to the magnetic exchange interaction which, for the
antiparallel orientation, shifts the predominantly majority spin
photoelectrons in the 2p3/2 peak to higher binding energy and the
predominantly minority spin electrons in the 2p1/2 peak to lower
binding energy. Also apparent for emission in the [111] direction
is an intensity variation in the 2p3/2 peak. These intensity
variations are found to be angle and energy dependent, and, as we
will demonstrate, are consistent with calculations of
spin-dependent diffraction.

Fig. 1. Fe 2p spectra for 4 ML Fe/Cu(001) along the [110] direction
(lower panel) and the [111] direction (upper panel) for parallel
(solid curves) and antiparallel (dashed curves) photon and minority
electron-spin orientations. The energy shifts are due to the
magnetic exchange interaction on the oppositely spin-polarized
electrons in each 2p sublevel. Note the intensity variation
apparent in the 2p3/2 peak for the [111] direction and its absence
in the [110] direction.

In order to isolate spin-dependent diffraction effects in our data,
we have used the following scheme to eliminate exchange interaction
energy shifts from the data and to concentrate exclusively on the
intensity variations in the 2p3/2 peak for parallel and
antiparallel photon and minority electron spin orientations. The
energy shifts are removed by artificially separating the Fe 2p
spectra for the antiparallel orientation in the region between the
2p3/2 and 2p1/2 peaks. Each peak is then displaced in energy (the
2p3/2 to lower binding energy and the 2p1/2 to higher binding
energy) until their centroids align with the like 2p sublevel
centroid in the spectrum obtained for parallel orientation of
photon and minority electron spin. This is shown schematically in
Figs. 2(a) and 2(b) for the data from the upper panel of Fig. 1.
Here the two spectra are offset for clearer presentation. The
spectra [in Fig. 1 and in Fig. 2(a)] were also normalized to the
low- and high-energy backgrounds in order to extract the proper 2p
photoemission peak intensity differences. Following peak alignment,
a difference spectrum is formed by subtracting the two spectra and
normalizing to the sum of the two on a channel-by-channel basis
[see Fig. 2(c)]. Because the spin polarization in each peak is
reversed upon going from a parallel to antiparallel photon and
minority electron spin orientation, this difference spectrum
isolates the spin-dependent scattering intensities or diffraction
effects. We are effectively isolating the following term,

$$\frac{I_{3/2}^{\uparrow\uparrow} - I_{3/2}^{\uparrow\downarrow}}{I_{3/2}^{\uparrow\uparrow} + I_{3/2}^{\uparrow\downarrow}},$$

where the superscripts denote the parallel (↑↑) and antiparallel
(↑↓) orientations. We note that this data extraction procedure is
equivalent to that obtained by independently integrating each 2p
sublevel (after background removal) for both spin orientations and
forming the normalized difference of these values.

Fig. 2. A demonstration of the removal of exchange shifts in the
data to isolate diffraction derived intensity variations. The data
in the [111] direction from Fig. 1(b) are reproduced in (a) and the
energy correction to the spectrum for the antiparallel orientation
is demonstrated in (b). The intensity asymmetry for this spectral
pair is shown in (c). The asymmetry at the 2p3/2 maximum is ~5%.
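The channel-by-channel normalized difference and the centroid used for peak alignment can be sketched as follows. This is an illustration under stated assumptions, not the authors' analysis code: the spectra are taken to be background-normalized, already aligned, and sampled on the same energy grid, and all function names are hypothetical.

```python
def centroid(energies, counts):
    """Intensity-weighted mean energy of a peak (used to align like sublevels)."""
    total = sum(counts)
    if total == 0:
        raise ValueError("peak has zero total intensity")
    return sum(e * c for e, c in zip(energies, counts)) / total


def channel_asymmetry(parallel, antiparallel):
    """Normalized difference (P - A) / (P + A) for each energy channel."""
    if len(parallel) != len(antiparallel):
        raise ValueError("spectra must share the same energy grid")
    result = []
    for p, a in zip(parallel, antiparallel):
        total = p + a
        # Channels with no counts carry no information; report zero asymmetry.
        result.append((p - a) / total if total else 0.0)
    return result


if __name__ == "__main__":
    energies = [123.0, 124.0, 125.0, 126.0, 127.0]
    parallel = [100.0, 210.0, 420.0, 205.0, 100.0]      # made-up counts
    antiparallel = [100.0, 200.0, 380.0, 195.0, 100.0]  # made-up counts
    print("centroid:", centroid(energies, parallel))
    print("asymmetry:", channel_asymmetry(parallel, antiparallel))
```

The example counts are invented solely to exercise the functions and do not reproduce the measured spectra.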

The spin-dependent intensity asymmetry (SIA) is calculated for each
of the two emission directions investigated for all photon energies
employed. This is shown for the [111] direction and hν = 835 eV in
Fig. 2(c) where the observed asymmetry varies from −2 to +5% over
the entire 2p manifold. We note that the normalization scheme
described above essentially equates the intensity in the Fe 2p1/2
peak for both spin orientations and places all spin-dependent
asymmetry effects within the 2p3/2 peak. The effective equalization
of the intensities of the 2p1/2 features (which should also exhibit
spin-dependent scattering effects) is due to both the higher
secondary electron background underlying this peak and the
Coster-Kronig broadening of the peak. This combination of effects
makes extraction of the spin-dependent 2p1/2 photoelectron
scattering prohibitively difficult.

Fig. 3. Calculated (solid curve: d12 = d23 = 1.8 Å, rP = 0.19;
dashed curve: d12 = 1.9 Å, d23 = 1.7 Å, rP = 0.17) and measured
asymmetries along the [111] direction as a function of photon
energy. The oscillatory behavior in the curves is due to
spin-dependent photoelectron diffraction.

Figure 3 shows the results of integrating the SIA curves over a 10
eV energy window centered on the 2p3/2 peak for 11 different photon
energies and emission in the [111] direction. The rationale for
this integration is to allow proper comparison with calculations of
spin-dependent scattering intensities for the entire 2p3/2 or 2p1/2
manifolds that will be discussed in the following section. Here the
experimental SIA (broken curve connecting data points, +) varies
from about −0.5% to 2% and exhibits oscillations as a function of
photon energy. These oscillations are analogous to those observed
in traditional (spin-integrated) photoelectron diffraction, and
represent the first observation of spin-dependent photoelectron
diffraction using circularly polarized photons to produce an
internal source of spin-polarized photoelectrons. Due to the rather
weak nature of these oscillations (down roughly one order of
magnitude from spin-integrated photoelectron diffraction
oscillations^), proper comparison to spin-dependent multiple
scattering calculations is critical in assessing the experimental
data. This is the subject of the following section.
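One way to form a window-integrated asymmetry is sketched below. It uses the variant the text describes as equivalent (integrate each spectrum over the window, then take the normalized difference) and assumes a uniform energy grid, so that the channel width cancels in the ratio. The function name and the sample numbers are illustrative only.

```python
def integrated_asymmetry(energies, parallel, antiparallel, center, width=10.0):
    """Normalized difference of intensities summed over an energy window.

    Channels with |E - center| <= width / 2 are included. A uniform energy
    grid is assumed, so plain sums stand in for integrals.
    """
    half_width = width / 2.0
    sum_parallel = 0.0
    sum_antiparallel = 0.0
    for energy, p, a in zip(energies, parallel, antiparallel):
        if abs(energy - center) <= half_width:
            sum_parallel += p
            sum_antiparallel += a
    total = sum_parallel + sum_antiparallel
    if total == 0:
        raise ValueError("no intensity inside the integration window")
    return (sum_parallel - sum_antiparallel) / total


if __name__ == "__main__":
    energies = [118.0, 122.0, 125.0, 128.0, 132.0]
    parallel = [10.0, 20.0, 40.0, 20.0, 10.0]      # made-up counts
    antiparallel = [10.0, 18.0, 38.0, 20.0, 10.0]  # made-up counts
    sia = integrated_asymmetry(energies, parallel, antiparallel, center=125.0)
    print(f"integrated SIA = {100 * sia:.2f}%")
```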

V.  THEORY 

The calculation of spin-dependent multiple scattering photoemission
intensities combines conventional photoemission and spin-polarized
LEED methods. Calculation of the excitation matrix element uses the
nonrelativistic approximation, with the dipole approximation used
for the interaction Hamiltonian. In this article, the Dirac matrix,
α, is replaced by the Pauli matrices, σ. The selection rules for
excitation by circularly polarized light restrict transitions to
Δm_j = +1 for right circular polarization and Δm_j = −1 for left
circular polarization. Transitions from the spin-orbit split Fe 2p
core sublevels are governed by these selection rules.

Following excitation, the internally spin-polarized photoelectrons
are multiply scattered inside the crystal in a manner similar to
that experienced by spin-polarized LEED electrons. The single site
scattering matrix is calculated using the Dirac equation with
spin-polarized potentials generated by a self-consistent linear
augmented plane wave band calculation. This scattering matrix is
converted to the (lms) representation and used to construct layer
diffraction matrices. The calculation then proceeds as in
conventional photoemission calculations except that the dimension
of the layer diffraction matrices is doubled to include
spin-dependent scattering and spin flip effects. Note that both
spin-orbit coupling and exchange effects are accounted for since
off-diagonal matrix elements are nonvanishing. In the calculation,
the inner potential is set to 10 eV and inelastic scattering is
simulated by an imaginary potential of 4.5 eV. Due to the low
kinetic energies utilized in the experiment only terms up to l = 4
are used in most calculations, and convergence has been checked
using terms up to l = 6 with no significant differences found.

The calculated spin-dependent intensities are for an entire 2p3/2
or 2p1/2 manifold, and comparison to the experimentally extracted
SIA requires the formation of a calculated SIA defined as follows:

^  +  Iy2ihv)/i;/^h  vn  ’ 

where, for example, $I^{+}_{3/2}$ is the integrated intensity in
the 2p3/2 manifold for a parallel (+) orientation of photon and
minority electron spin. In Fig. 3, we show the calculated asymmetry
(solid curve) along the [111] direction for a model with fcc Fe
spacing (d12 = d23 = 1.8 Å) as a function of incident photon energy
compared to the integrated SIA extracted from the experimental
data. The agreement is qualitative, with the oscillations as well
as the change in sign at ca. 840 eV well reproduced. We stress that
it is the position of the intensity oscillations rather than the
magnitude that is of most fundamental importance in photoelectron
diffraction. Also, we demonstrate in Fig. 3 the sensitivity of the
data to variations in structural parameters. Varying the values of
d12 and d23 yields a best fit to the data for d12 = 1.9 Å and
d23 = 1.7 Å (dashed curve). This fit has a Pendry r factor of 0.17,
which is 9% better than the fit for the unrelaxed overlayer (solid
curve). Variations of greater than ±0.2 Å destroy the agreement
between calculation and experiment. These results agree with our
previous determinations of the structure of the Fe overlayer using
conventional photoemission.


Fig. 4. The effect of inclusion of disorder on the calculation of
the asymmetry along the [111] direction for the relaxed (dashed
curve in Fig. 3) Fe layer. The calculated values are scaled by a
factor of (1 − x) where x is the fraction of disorder present in
the Fe layer. The experimental results are shown with crosses
connected by a broken line.

Further improvement of the agreement between calculation and
experiment can be gained by introducing the effects of partial
disorder within the overlayer. Figure 4 shows a comparison of
calculations of the relaxed Fe overlayer with 50% disorder compared
to the experimental data. As can be seen, the primary effect of the
inclusion of disorder on the calculated integrated SIA is a
reduction of the asymmetry. This brings the calculation into better
agreement with the experiment. The precise origin of the disorder
is not investigated here since its introduction at this level
essentially represents a rescaling of the calculated SIA; however,
previous investigations of the Fe/Cu(001) system suggest
substantial structural disorder together with Fe/Cu intermixing at
room temperature and below.^ The good agreement between our
calculations and experimental data, coupled with the consistency of
those results with a number of previous studies of the Fe/Cu(001)
system using a variety of techniques, confirms that the intensity
variations observed in the experiment are indeed due to
spin-dependent photoelectron diffraction.
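Because disorder enters here only as a rescaling, its effect on a calculated asymmetry curve can be sketched in a few lines. The function name and the sample values are hypothetical; the (1 − x) factor is the one given in the Fig. 4 caption, with x taken as a fraction between 0 and 1.

```python
def scale_for_disorder(calculated_sia, disorder_fraction):
    """Scale calculated asymmetries by (1 - x), x being the disordered fraction."""
    if not 0.0 <= disorder_fraction <= 1.0:
        raise ValueError("disorder_fraction must lie between 0 and 1")
    factor = 1.0 - disorder_fraction
    return [factor * value for value in calculated_sia]


if __name__ == "__main__":
    calculated = [0.04, -0.01, 0.03]  # made-up asymmetries, not the article's
    print(scale_for_disorder(calculated, 0.5))
```

The scaling changes the magnitude of the oscillations but not the photon energies at which they occur or change sign.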

VI.  DISCUSSION 

At this point it is necessary to address a number of issues that
influence the details of proper experimental configurations as well
as affect the proper interpretation of the experimental results.
These discussions are intended to outline the known limitations of
this technique as well as to speculate about possible extensions
and future investigations.

First,  dichroism  has  been  observed  in  nonmagnetic  sys¬ 
tems  with  reversal  of  incident  photon  polarization.^^  The  cal¬ 
culated  2/7 3/2  intensity  for  excitation  from  fee  Fe  assuming 
no  magnetization  by  left  and  right  circularly  polarized  x-rays 
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Azimuthal  angle 

Fig.  5.  Circular  dichroism  in  non-magnetic  fee  Fe.  The  calculated  dichroism 
away  from  mirror  planes  arises  from  the  effects  of  experimentally  induced 
chirality  by  orientation  of  the  crystallographic  axes,  photoelectron  emission 
direction,  and  photon  helicity. 


is  shown  in  Fig.  5.  Away  from  high  symmetry  mirror  planes 
one  sees  the  presence  of  large  photon  polarization  depen¬ 
dence  in  the  excitation  cross  section  that  is  obviously  not  of 
magnetic  origin.  In  studies  of  magnetic  properties  using  cir¬ 
cular  dichroism  it  is  necessary  to  avoid  this  complication  in 
data  interpretation  by  performing  measurements  only  in  mir¬ 
ror  planes  where  the  nonmagnetic  dichroism  vanishes.  The 
origin  of  the  nonmagnetic  dichroism  lies  in  the  imposition  of 
an  experimentally  induced  chirality  in  which  the  photoelec¬ 
tron  wavevector,  the  sample  normal,  and  the  photon  helicity 
define  a  coordinate  system  with  a  definite  handedness.  When 
all three vectors are coplanar and lie in a mirror symmetry
plane, this induced chirality vanishes, as can be seen, for
example, at azimuthal angles of 0°, 45°, and 90° [(110) and (001) planes] in
Fig. 5. It is important to note that the experimentally induced
dichroism  depends  upon  the  absolute  orientation  of  photon 
helicity  while  the  magnetic  dichroism  depends  only  on  the 
relative  alignment  of  helicity  and  magnetization.  MCXD  dif¬ 
fraction  asymmetries  will  necessarily  be  equivalent  for  all 
possible  parallel  versus  antiparallel  configurations. 

Next,  we  discuss  the  variation  of  photoelectron  spin- 
polarization  with  emission  angle.  The  simple  atomic  model 
discussed previously predicts a spin-polarization of ±25%
and +50% from the 2p3/2 and 2p1/2 sublevels, respectively,
for excitation to empty d-levels. In angle-resolving experiments,
the angular-momentum character of the photoelectron final state, and
therefore the photoelectron polarization, will depend upon the
photoelectron emission angle and kinetic energy. Thus, a
proper interpretation of the experimental data must account
for this variable spin-polarization. In Fig. 6 we show the
results of multiple scattering calculations of the Fe 2p3/2
spin-polarization versus polar emission angle in the [110]
plane at a kinetic energy of 125 eV. Here the spin-up and
spin-down  intensities  for  parallel  (solid  curve)  and  antiparal¬ 
lel  (dashed  curve)  orientation  of  helicity  and  sample  magne- 


4  ML  Fe/Cu  (001) 
Spin  distribution  of  Fe 


Fig. 6. Photoemission cross section calculations for the Fe 2p3/2 level showing
the spin polarization of the 2p3/2 peak for excitation with right circularly
polarized  light  as  a  function  of  polar  angle  in  the  [110]  plane  for  the  direct 
(upper  panel)  and  total  (lower  panel)  terms.  All  calculations  are  for  4  ML 
Fe/Cu(001)  assuming  pseudomorphic  growth,  and  for  a  photoelectron  ki¬ 
netic  energy  of  125  eV.  Note  the  additional  diffraction  related  structure  in 
the  total  term.  The  solid  and  dashed  curves  are  for  parallel  and  antiparallel 
orientation  of  helicity  and  sample  magnetization,  respectively. 


tization  are  shown.  The  top  panel  is  the  so-called  direct  term 
which  is  essentially  atomic  emission  incorporating  the  appro¬ 
priate  crystalline  symmetry  as  well  as  proper  surface  bound¬ 
ary  conditions. In  the  bottom  panel,  the  effects  of  multiple 
scattering  are  included.  One  sees  that  there  is  a  variable  pho¬ 
toelectron  spin-polarization  even  in  the  absence  of  diffrac¬ 
tion  effects  (upper  panel),  but  that  this  spin-polarization  is 
strongly  modulated  by  diffraction  effects  (bottom  panel).  It  is 
this modulation that gives spin-dependent photoelectron diffraction
its sensitivity to local magnetic structure. It is, however,
critical to properly account for these angular photoelectron
spin polarization variations when interpreting the
MXCD-SPPED data.

Finally, we turn to a discussion of possible future extensions
for this technique. Clearly, the small magnitude of the
spin-dependent scattering asymmetries observed here is likely
to limit future studies using this technique. In order to
generate a more widely applicable technique, some mechanism
for increasing the spin-dependent asymmetry in the

Fig. 7. Integrated SIA using Eq. (1) for scattering along the (110) azimuth
(solid  curve)  as  a  function  of  polar  emission  angle.  For  the  dashed  curve  the 
results  for  linear  polarized  incident  photons  and  external  photoelectron  spin 
detection  are  shown.  Note  the  similarity  in  the  size  of  the  spin-dependent 
oscillations  to  those  for  excitation  using  circularly  polarized  light  and  no 
external photoelectron spin detection (solid curve). Finally, the spin polarization
of the Fe 2p3/2 photoemission peak for excitation by right circularly
polarized light is shown. This arrangement requires the double or complete
polarization experiment that uses excitation by circularly polarized photons
together with external photoelectron spin detection.

diffraction  data  must  be  arrived  at.  In  Fig.  7  we  show  the 
results  of  calculations  that  suggest  that  such  improvements 
may  well  be  within  reach  in  the  very  near  future.  This  figure 
shows  calculations  of  the  spin  dependent  asymmetry  as  de¬ 
fined  in  Eq.  (1)  (solid  line)  as  a  function  of  polar  emission 
angle along the (110) azimuth with hν = 835 eV for fcc Fe.
This  result  is  obtained  by  varying  the  incident  photon  helic- 
ity  and  detecting  all  photoelectrons  emitted  in  a  given  direc¬ 
tion.  Also  shown  are  the  results  of  calculations  using 
linearly-polarized  photon  excitation  at  the  same  energy,  but 
now  detecting  the  spin-polarization  of  the  photoelectrons 
emitted  in  a  given  direction  (dashed  line).  Here  too,  one  sees 
polarization  oscillations  that  reflect  a  sensitivity  to  the  local 
magnetic structure; however, these oscillations are again only
about  ±1-2%.  This  is  further  complicated  by  the  relative 
inefficiency  of  photoelectron  spin-detection  which  is  prob¬ 
ably  at  least  two  orders  of  magnitude  less  efficient  than  spin- 
integrated  photoelectron  detection.  In  comparison,  the  emit¬ 
ted  photoelectron  intensity  for  excitation  by  circularly 
polarized  radiation  produced  at  a  bending  magnet  monochro¬ 
mator  is  only  reduced  from  that  for  linear  polarization  by 
about a factor of 5. However, if one combines both excitation
with circularly polarized photons and external photoelectron
spin detection in a so-called complete or double polarization
experiment, the spin-dependent oscillations increase to approximately
8-10%. These oscillations are on top of a
smoothly increasing background which arises from angle-dependent
spin polarization in the direct term (see Fig. 6).
Here  we  plot  the  following: 

$$\frac{I_{3/2}(\uparrow) - I_{3/2}(\downarrow)}{I_{3/2}(\uparrow) + I_{3/2}(\downarrow)}$$
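
As a hedged illustration of the plotted quantity, the normalized difference of the spin-up and spin-down 2p3/2 intensities can be evaluated as in the sketch below. The function name and the sample intensities are hypothetical and chosen only for illustration; they are not values from the calculations.

```python
def spin_polarization(i_up, i_down):
    """Return the normalized difference (I_up - I_down) / (I_up + I_down)."""
    total = i_up + i_down
    if total == 0:
        raise ValueError("total intensity must be nonzero")
    return (i_up - i_down) / total


# Hypothetical spin-resolved intensities in arbitrary units (illustration only).
p = spin_polarization(0.545, 0.455)
print(f"spin polarization = {100 * p:.1f}%")
```

Because the quantity is a ratio, any common scale factor in the two intensities (detector efficiency, photon flux) cancels, which is why it is a convenient measure when absolute intensities are hard to calibrate.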

In  order  to  perform  the  double  polarization  experiment,  the 
reduced  efficiency  of  both  the  production  of  circular  versus 
linear  polarized  x-rays  and  the  requirement  of  external  pho¬ 
toelectron  spin  detection  must  be  overcome.  This  capability, 
using  an  insertion  device  for  the  production  of  circularly 
polarized x-rays, has been demonstrated recently by Starke
et al. for tungsten 4f photoemission. The magnitude of the
calculated spin-dependent diffraction oscillations in the
double polarization experiment of Fig. 7 is very similar to
that observed in spin-independent photoelectron diffraction,
and should prove tractable in extracting valuable information
about local magnetic structure from any number of
surface  and  thin  film  magnetic  systems. 

VII.  CONCLUSIONS 

We  have  presented  the  results  of  the  first  measurement  of 
spin-dependent photoelectron diffraction using circularly polarized
x-rays in investigations of fcc Fe films grown on
Cu(001). The resulting spin-polarized Fe 2p photoemission
peaks  exhibit  angle-  and  energy-dependent  intensity  varia¬ 
tions  due  to  spin-dependent  diffraction  which  provides  sen¬ 
sitivity  to  the  details  of  local  magnetic  structure.  The  impor¬ 
tance  of  relativistic,  multiple  scattering  calculations  in  the 
proper  interpretation  of  the  experimental  data  is  demon¬ 
strated.  The  results  demonstrate  sensitivity  to  magnetic  struc¬ 
ture  details  at  the  sub-Angstrom  level.  Factors  which  cur¬ 
rently  limit  the  experimental  technique  as  well  as  dictate 
details  of  experimental  configurations  were  discussed.  Fi¬ 
nally,  the  results  of  calculations  for  double  polarization  ex¬ 
periments  which  combine  excitation  by  circularly  polarized 
x-rays  with  external  photoelectron  spin  detection  suggest  that 
this will become a powerful technique for the study of surface,
interface, and thin film magnetism in the near future.

Calculations of structural stabilities for thin iron layers as grown on Cu(001) are reported. It is found
that a free-standing fcc(001) monolayer is unstable with respect to 3×1 and 4×1 distortions. The
former instability is directly related to the Pitsch transformation from fcc(001) to bcc(110), and
calculations along the path find bcc(110) as the stable structure. Inclusion of the Cu substrate in the
calculation tends to stabilize fcc(001). The calculated instabilities are of similar form to the
superstructures observed for Fe grown on Cu(001). The relation to the experimental situation is
discussed. © 1996 American Vacuum Society.


I. INTRODUCTION

The fact that iron was observed to grow in an fcc structure
on Cu(001) has attracted a large number of experimental
studies on this system. It is now well established that Fe
grown on the fcc (001) Cu surface actually has a very rich
phase diagram. As a function of overlayer thickness and
growth conditions, it has been conveniently divided into
three regions (I-III) according to the overlayer structure.
All of these phases are characterized by superstructures of
the basic square structure of a pure (001) layer: Phase I
shows 4×1 or 5×1, phase II 2×1, and phase III 3×1-like
superstructure patterns in low-energy electron diffraction
analyses. Changes in magnetic ordering, e.g., ferro- (F) or
antiferromagnetic (AF) ordering, and spin reorientations
(i.e., changes of easy axis) are closely connected to the structural
changes. This phase diagram indicates that Fe is never
stable as a pure fcc or fct (strained 1×1 fcc) overlayer on
Cu(001). This article reports on a theoretical study of this
structural instability for some simple systems by means of
density-functional based first-principles calculations.

The experimental situation seems to be quite consistent.
The magnetic orderings of phases I and III are ferromagnetic.
It has been observed that phase III, which occurs for thicker
Fe films, is actually bcc-like. The 3×1 pattern arises from
the fact that the phase transformation with growing layer
thickness, from the fcc-like structures of phase I and II to
phase III, is martensitic of the type described first by
Pitsch. In this transformation, an fcc(001) layer transforms
into a bcc(110) one, with the latter as an approximate 3×1
superstructure of the former. The structure of the very thin
iron layers of phase I, less than 5 monolayers (ML), has very
recently been determined. It is found that the 5×1 phase is
characterized by a distortion of the fcc(001) layers due to
sinusoidal shifts and bucklings of [110] atomic rows. This is
also accompanied by an increase of the layer spacing. At
intermediate thicknesses, phase II has been observed, but the
range for which it exists is sensitive to the growth temperature.
In fact, when the Fe layer is grown at 100 K, there
seems to be a direct transformation from phase I to III without
any intermediate phase. When phase II has been observed,
it is found to have an AF ordering between all layers
except between the two top layers, which are F aligned. The
2×1 pattern reflects a reconstruction of the surface layer.

There have been some theoretical calculations for Fe/Cu(001).
The first study was for 1 and 2 ML of Fe on
Cu(001), where large magnetic moments were found. These
magnetic moments are much larger than what is calculated
for fcc Fe with the same lattice constant and even larger than
those of bcc Fe. Recently, calculations have been reported
for thicker Fe layers, 4-11 ML, which are in good accord
with what is experimentally observed for phase II. There
has also been a theoretical study of different magnetic orderings
within fct Fe for different c/a, incorporating both bcc
and fcc as two special cases. Although it was suggested to
be of direct relevance for Fe/Cu(001), this Bain path of the
fcc-bcc transformation is not what is experimentally observed.
None of this work allowed for any reconstructions of
the in-plane structure. The present work elucidates the instability
against such distortions and the effects on the electronic
and magnetic structure.

II.  COMPUTATIONAL  METHOD 

The calculations were performed with the linearized augmented
plane wave (LAPW) method within the local-spin
density approximation (LSDA). This method makes no approximation
to the shape of the densities or the potential. The
surface was simulated by a supercell technique, where the
gap between the slabs corresponds to three layer distances.
Brillouin zone integrations were performed using 54 special
k points in the irreducible wedge for the unreconstructed
ML. For the larger structures, this number was reduced in a
way which keeps the k-point density in reciprocal space constant.

Fig. 1. The fcc(001) (small gray circles) and the corresponding reconstructed
3×1 bcc(110)-like (large black circles) structures are shown. The
unit cells of the unreconstructed square lattice (full line), the fully reconstructed
bcc(110)-like (dashed line), and the distorted lattice used in the
calculations (dash-dotted line) are indicated.

III. RESULTS
A.  Structure 


Fig. 2. The total energy per atom is plotted against the atomic displacement
x for a 3×1 reconstruction. The energy is set to zero for the unreconstructed
layer. Also shown is the total energy for a true bcc(110) layer (dashed line)
with the same area per atom as the fcc(001) layer.

A  three-dimensional  phase  transformation  of  Pitsch  nature 
also  involves,  in  addition  to  the  sinusoidal  shifting  and  the 
subsequent  skewing  to  get  a  rectangle  within  each  layer,  a 
sliding  of  every  second  layer  a/2  along  [10].  The  interlayer 
distance  is,  however,  kept  constant  if  there  is  no  accompany¬ 
ing  volume  change. 


From the description of the different phases in Sec. I, it is
clear that they are mainly formed from reconstructions
within the fcc(001) layers. For instance, in Fig. 1 a 3×1
superstructure is displayed. This corresponds closely to the
Pitsch-transformed bcc(110) layer, although it is not yet rectangular.
Instead of having a right angle and a ratio between
the sides of √2, the reconstructed unit cell indicated in Fig. 1
has the angle and ratio given in Table I. This reconstruction
can be formed by a continuous distortion from the square
lattice by sinusoidal shifts of atoms in a [01] row, which
corresponds to the [110] row of the fcc lattice. The bcc(110)-like
structure shown in the figure corresponds to a shifting of
atoms by a third of the square lattice constant a along [10]. It is
possible in a similar way to construct bcc(110)-like structures
corresponding to 4×1 and 5×1 superstructures. These
are of a similar closeness to a rectangular form as the 3×1
structure, as can be seen in Table I. However, they are not
continuous in the same way, i.e., they are not martensitic, so
the corresponding bcc(110)-like structure is not directly related
to any sinusoidal shifts of 4×1 or 5×1 character along
the [110] rows.


Table I. Geometrical data for the bcc(110)-like distortions of the fcc(100)
layer compared to those of a truly rectangular bcc(110) layer.

Structure    Angle    Ratio of the sides
3×1          86.8°    1.387
4×1          88.4°    1.442
5×1          89.0°    1.473
bcc(110)     90.0°    1.414
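
The 3×1 entry of Table I can be checked with a short geometric sketch. It rests on an assumed reading of the construction, not on a statement in the text: the fully shifted 3×1 structure (row shift a/3 along [10]) is taken to be the oblique lattice spanned by b1 = (a, 0) and b2 = (a/3, a), and the bcc(110)-like cell is taken to be the centered cell spanned by b1 + b2 and b2 - b1. The function name is hypothetical. The sketch does not attempt the 4×1 and 5×1 entries, whose construction the text does not spell out.

```python
import math


def centered_cell(shift, a=1.0):
    """Angle (degrees) and side ratio of the centered cell for a given row shift.

    `shift` is the displacement between successive rows in units of the
    square lattice constant `a` (assumed geometry, see the lead-in).
    """
    b1 = (a, 0.0)
    b2 = (shift * a, a)
    # Edges of the centered, nearly rectangular cell.
    c1 = (b2[0] + b1[0], b2[1] + b1[1])
    c2 = (b2[0] - b1[0], b2[1] - b1[1])
    dot = c1[0] * c2[0] + c1[1] * c2[1]
    len1 = math.hypot(*c1)
    len2 = math.hypot(*c2)
    angle = math.degrees(math.acos(dot / (len1 * len2)))
    ratio = max(len1, len2) / min(len1, len2)
    return angle, ratio


angle, ratio = centered_cell(1.0 / 3.0)
print(f"3x1: angle = {angle:.1f} deg, ratio = {ratio:.3f}")
# prints "3x1: angle = 86.8 deg, ratio = 1.387"
```

Under these assumptions the result agrees with the 3×1 row of Table I; for zero shift the same function returns 90° and a ratio of 1, the centered cell of the undistorted square lattice.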

B.  Free-standing  monolayer 

To start, we studied the simplest possible system which
still has some relevance to the experimental observations: a
free-standing monolayer (FSM) of Fe with a lattice matching
Cu(001). This is a reasonable model system since the hybridization
effects due to the Cu substrate are known to be quite
small. The calculated magnetic moments are, however,
slightly larger than for an overlayer, 3.1 vs 2.8 μB/atom.

The results of total energy calculations for different
atomic shifts corresponding to the 3×1 reconstruction of Fig.
1 are shown in Fig. 2. The square FSM is highly unstable
with respect to a Pitsch transformation. The energy minimum
is given by a distortion x = a/3, which corresponds exactly to
the bcc(110)-like structure shown in Fig. 1. Also given in
Fig. 2 is the extra energy gained in completing the Pitsch
path to form a true bcc(110) layer with the same unit area.
The energy of a 4×1 bcc(110)-like structure is found to be
4.2 mRy higher than the 3×1 version and 17.1 mRy lower
than the undistorted fcc(001) layer.
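
The two quoted energy differences fix the ordering of the three structures, and the arithmetic can be laid out as a small sketch. The variable names are hypothetical, and the Rydberg-to-eV factor is the standard constant, which is an addition here and not a value from the text.

```python
RY_TO_EV = 13.605693  # 1 Ry in eV (standard constant, not taken from the text)

# Energies in mRy relative to the undistorted fcc(001) layer.
e_fcc = 0.0
e_4x1 = e_fcc - 17.1   # 4x1 lies 17.1 mRy below the undistorted layer
e_3x1 = e_4x1 - 4.2    # 3x1 lies a further 4.2 mRy below the 4x1 structure


def mry_to_mev(energy_mry):
    """Convert mRy to meV (1 mRy = RY_TO_EV meV)."""
    return energy_mry * RY_TO_EV


for name, e in (("fcc(001)", e_fcc), ("4x1", e_4x1), ("3x1", e_3x1)):
    print(f"{name:9s} {e:7.1f} mRy  {mry_to_mev(e):8.1f} meV")
```

The 3×1 bcc(110)-like structure therefore lies 21.3 mRy, or roughly 290 meV, below the undistorted layer.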

In order to illustrate the interplay of magnetism with this
instability, the second derivative of the total energy with respect
to the amplitude of the distortion is calculated when the
magnetic moment is constrained to different values. This is
done both for a small 3×1 sinusoidal distortion as well as for
a 4×1 one, and the results are tabulated in Table II. It is
found that the FSM is unstable for a wide range of magnetic
moments for both types of distortions. For smaller magnetic
moments the inequivalent atomic moments of the larger unit
cells start to arrange in a ferrimagnetic or even an AF fashion
in order to fulfill the constraint on the total magnetic moment
while retaining large local moments, instead of decreasing in
a smooth fashion. This leads to an enhanced instability, so
these cases have been excluded from Table II. The 3×1 instability
is larger over the full range but is decreasing with
decreasing magnetic moment, while the 4×1 instability has a
maximum at a lower moment. These two sinusoidal distortions
are identical to transverse phonons with different wave
vectors. The instabilities might hence be viewed as due to
softening of these phonons.

Table II. The second derivative of the total energy per Fe atom with respect to the amplitude of two different
sinusoidal distortions is given in units of mRy/(a.u.)². A positive value indicates an instability. The values are for
a free-standing ML with different magnetic moments m, for an overlayer, and for bulk fcc Fe. The magnetic
moments are given in μB per atom.

Structure   m=3.1   m=2.1   m=lA   m=2.1   Overlayer   Bulk
3×1         5.18    4.83    3.99           -12.78      -5.30
4×1         3.90    4.03    3.04   0.90

The nature of the instability is not clear, other than that it
relates to the stability of bulk bcc Fe over fcc. It is known
that such instabilities can be driven by Fermi surface properties.
In order to see whether this is the case here, we show
the Fermi surface of the FSM. It is formed by two bands, one
of each spin. For the majority spin, for which the 3d states
are filled, the Fermi surface is close to circular, i.e., free-electron-like.
The Fermi surface of the minority spin is
shown in Fig. 3. This shows some rather squarelike features,
which would indicate good nesting. In fact, the nesting vectors
are actually quite close to 1/3 of a reciprocal lattice
vector, in accordance with the 3×1 instability calculated.
Although this is suggestive, more careful analysis is needed
before it can be concluded that this is the driving force responsible
for the instability. The shape of this minority spin
Fermi surface inevitably has to change with the magnetic
moment, leading to a less squarelike structure or other nesting
vectors. But as we have seen, the instability is rather insensitive
to the exact value of the magnetic moments, which
indicates that this nesting can only be a part of the full story.
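
The link between a nesting vector of one third of a reciprocal lattice vector and a 3×1 superstructure can be made explicit with a short worked relation. The symbols q (magnitude of the nesting vector), G (shortest reciprocal lattice vector of the square lattice with lattice constant a), and λ (real-space period of the modulation) are notation introduced here for illustration and are not taken from the text.

```latex
G = \frac{2\pi}{a}, \qquad q \approx \frac{G}{3} = \frac{2\pi}{3a}
\quad\Longrightarrow\quad
\lambda = \frac{2\pi}{q} \approx 3a
```

A modulation with this wave vector repeats every three lattice constants along one in-plane direction, which is the periodicity of a 3×1 pattern.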

A major simplification made when ignoring the substrate
is the neglect of repulsive forces when the displaced Fe atoms
approach substrate atoms. The FSM might, however,
still be viewed as the limiting case of very large adsorbate-substrate
distance.

Fig. 3. The majority spin Fermi surface of a free-standing ML with its
self-consistent magnetic moment.

C.  Overlayer  and  bulk 

When going to more realistic systems, reconstructions inevitably
involve more structural parameters. We present here
some results when the distortions are only allowed in a subspace
of this full parameter space. First, to study the effect of
the Cu substrate, calculations for the same 3×1 distortions as
for the FSM have been done for a slab of Fe/Cu3/Fe. The
layer distances between the Cu layers are taken to be given
by the Cu lattice constant, while the Fe-Cu distance is relaxed.
The calculated Fe height, 1.74 Å, and magnetic moment,
2.82 μB, are in good agreement with earlier
calculations, 1.75 Å and 2.85 μB. As can be seen in Table
II, we find the square overlayer to be clearly stable. So the
Cu substrate is able to stabilize the Fe overlayer. This is
probably due to a combination of the fact that the hybridization
stiffens the Fe layer and that the substrate atoms repel
displaced Fe atoms. However, there are two things to notice:
(1) the LSDA-theoretical Fe-Cu distance is smaller than that
experimentally observed, which means that the effect of the
hybridization is overestimated in the calculation, and (2) any
buckling out of the plane of the distorted atoms, which
would be a way to avoid the underlying Cu atoms, has not
been allowed for. Any attempt to correct for these two facts
would work in favor of an instability, as observed.

It was suggested in Ref. 5 that the superstructure of phase
I implies that ferromagnetic bulk fcc Fe is intrinsically unstable.
To test this possibility we have performed calculations
for ferromagnetic bulk fcc Fe at the Cu lattice constant and
find that it is stable with respect to a 3×1 distortion. It
should be noted that the magnetic moment is comparatively
small in this case, 1.1 μB, which means that, in contrast to
the cases above, the majority d band is far from filled and
hence contributes significantly to the bonding. If the full
Pitsch path were followed it would of course have been
found that the end-point bcc Fe is more stable, but it should
be remembered that the present calculation only involves a
small in-plane sinusoidal distortion. The sliding of the (001)
planes was ignored.

IV.  DISCUSSION  AND  CONCLUSIONS 

These calculations have been performed within LSDA.
This suggests some caution since it is well known that this
approximation incorrectly predicts bulk fcc Fe to be more
stable than bcc. However, if gradient corrected approximations
work in this case in the same way as for bulk Fe, i.e.,
favor a bcc structure, the tendency toward an instability
would increase. A large problem for LSDA in the bulk case is
that fcc is nonmagnetic while bcc is ferromagnetic, so a comparison
between these two structures involves a large difference
in spin-polarization energies. The same problem does
not exist in this case where the two types of layer structures
have similar spin-polarization energies.

It has been shown that calculations for a FSM give a
strong Pitsch instability of the fcc(001) lattice toward a 3×1
reconstruction. This reconstruction is related by a shear to a
bcc(110) layer which is slightly more stable. The fcc(001)
FSM is also shown to be unstable toward sinusoidal 4×1
distortions. These two types of instabilities are similar to the
experimentally observed superstructures of phases III and I,
respectively: In phase III the full Pitsch transformation is
observed, while in phase I only sinusoidal 4×1 or 5×1 distortions
are observed. The 3×1 distortion is close in nature
to the 4×1 (or 5×1) distortion for small amplitudes, but it
differs in that it continuously gives rise to a Pitsch reconstruction
for large amplitudes.

Calculations  for  more  realistic  overlayers  find  that  the  Cu 
substrate  stabilizes  the  1  ML  (001)  Fe  overlayer.  This  is  due 
to  the  small  Fe-Cu  distance  used  in  these  calculations.  In¬ 
creasing  this  will  lead  to  instabilities  since  the  FSM,  which 
corresponds  to  very  large  distances,  is  unstable.  The  question 
is  for  which  interlayer  distance  these  instabilities  occur.  Cal¬ 
culations  are  in  progress  for  the  Fe/Cu(001)  system  to  inves¬ 
tigate  this.  Also,  the  effect  of  more  than  one  Fe  layer  should 
be studied in order to get closer to the experimental situation.

The structural and magnetic properties of metastable face-centered-cubic (fcc) Fe thin films on fcc
Co(100) substrate were studied using wedged samples. Fcc Co(100) was chosen for the substrate
because it is structurally very similar to Cu(100) but is ferromagnetic at room temperature.
Reflection high energy electron diffraction and low energy electron diffraction characterizations
confirm that epitaxially grown (MBE) Fe on Co(100) is structurally very similar to Fe on Cu(100):
face-centered-tetragonal (fct) for d_Fe < 6 ML, fcc for 6 ML < d_Fe < 11 ML, bcc for d_Fe > 11 ML. In
situ surface magneto-optic Kerr effect measurements show that at room temperature the fct and bcc
regions are ferromagnetic, while the fcc region is nonferromagnetic with some magnetic live layers.
All magnetizations are in-plane. Oxygen absorption experiments suggest that these live layers are at
the Fe/Co interface. Low temperature grown Fe/Co(100) shows a Kerr signal that increases linearly
with d_Fe and suggests that the magnetic moments for fcc Fe and bcc Fe are the same. To further
study the magnetic properties of the nonferromagnetic “fcc” phase, we used metastable fcc Fe as a
spacer layer between two Co layers. The Co/fcc Fe/Co on Cu(100) sandwiches exhibit
ferromagnetic coupling, strong antiferromagnetic coupling (AFC) and weak AFC. An oscillation in
the strong AFC was found by artificially lengthening the thickness range of the nonferromagnetic
fcc phase. © 1996 American Vacuum Society.


I.  INTRODUCTION 

Investigation of magnetic ultrathin films has developed
rapidly in the last decade after the discovery of antiferromagnetic
coupling (AFC) and giant magnetoresistance in Fe/Cr
multilayers. One important experimental method is to perform
a thickness dependent study. Related topics using this
method include the three- to two-dimensional phase transition,
spin-reorientation transition, and the oscillations in
magnetic coupling, etc. Because of the short-range character
of the magnetic exchange interaction, precise control of the
film thickness is needed in order to explore the novel magnetic
behavior that varies on the atomic scale. Prior to
wedged samples, thickness-dependent studies were performed
on uniform thickness samples. A large number of
samples with different thicknesses are usually required for
one project. There are a number of difficulties using the uniform
thickness samples. First, such investigations are time
consuming. Typically, a separate sample must be made for
each film thickness. This is particularly significant for molecular
beam epitaxy (MBE) systems, where the sample production
is slow (~1-2 samples per day per substrate). Second,
variability in the growth conditions from sample to
sample introduces uncertainty. If two samples of differing
thicknesses exhibit differing behavior, it might be due to the
thickness. However, it might alternatively be due to slightly
different growth conditions (e.g., chamber pressure, growth
rate, temperature, substrate cleaning procedure). This makes
it more difficult to definitively separate the thickness dependence
from other effects. Third, when given two samples of
nearly equal thickness, one cannot be sure which sample is
actually thicker because of uncertainties in the growth rates.

To  overcome  the  difficulties  of  the  uniform  thickness 
samples,  wedged  samples  were  developed  and  applied  to  the 
study  of  magnetic  thin  films  in  1991.^  In  conjunction  with 
local  magnetization  measurement  techniques,  wedged 
samples  allow  for  the  efficient  exploration  of  thickness  de¬ 
pendent  phenomena  on  a  single  sample.  The  wedge  tech¬ 
nique  basically  removes  all  the  difficulties  of  the  uniform 
thickness  samples.  First,  many  thicknesses  are  present  on  a 
single  wedge  so  that  the  time  consumed  in  film  growth  is 
greatly  reduced.  Second,  since  all  the  thicknesses  are  on  one 
sample,  they  are  all  subject  to  the  same  growth  conditions.  If 
the  substrate  has  been  tested  for  uniformity,  any  differences 
in  behavior  can  be  attributed  to  thickness.  Third,  even  though 
there  is  uncertainty  in  the  growth  rate,  the  overall  thickness 
always  increases  with  position  as  one  moves  from  the  thin 
side  to  the  thick  side  of  the  wedge.  Hence,  given  two  mea¬ 
surements,  one  always  knows  which  corresponds  to  the 
thicker  film. 

Figure  1  illustrates  how  to  make  a  wedged  layer.  In  an 
MBE  system,  a  wedge  is  made  by  putting  a  “knife  edge” 
mask  in  the  molecular  beam  and  translating  the  substrate 
behind  it  with  uniform  velocity  V.  If  the  evaporation  rate  is 
r,  the  produced  wedge  will  have  a  slope  of  r/V.  With  a 
typical evaporation rate of ~1 Å/min and a substrate translation
speed of ~1 mm/min, the resulting slope is ~1 Å/mm, or
approximately a million atoms per monolayer step. Hence,
the film can be considered locally flat. The thickness resolution
using wedged samples depends on the beam size of the
magnetic probe. For the surface magneto-optic Kerr effect
(SMOKE) measurement, the beam size is ~0.1 mm so that
the thickness resolution with the above slope can reach ~0.1
Å. This is good enough to explore any fine structures on the
atomic scale. A 1-cm-long wedge is then equivalent to ~100
uniform thickness samples.

Fig. 1. For MBE systems, wedge samples are made by placing a “knife
edge” mask in the molecular beam and translating the substrate behind it
with a constant velocity V. The wedge slope is r/V, where r is the deposition
rate.
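
The wedge arithmetic above can be checked with a short calculation. This is only a sketch using the typical values quoted in the text (~1 Å/min, ~1 mm/min, ~0.1 mm beam, 1 cm wedge); the function names are illustrative and not part of the original work.

```python
# Sketch of the wedge geometry; all names are illustrative assumptions.

def wedge_slope(rate_A_per_min: float, speed_mm_per_min: float) -> float:
    """Wedge slope r/V, in angstroms of thickness per mm of substrate travel."""
    return rate_A_per_min / speed_mm_per_min

def thickness_resolution(slope_A_per_mm: float, beam_mm: float) -> float:
    """Thickness change (in angstroms) across one probe-beam diameter."""
    return slope_A_per_mm * beam_mm

def equivalent_samples(wedge_length_mm: float, beam_mm: float) -> int:
    """Number of non-overlapping beam spots that fit along the wedge."""
    return round(wedge_length_mm / beam_mm)

# Typical values quoted in the text.
slope = wedge_slope(rate_A_per_min=1.0, speed_mm_per_min=1.0)
resolution = thickness_resolution(slope, beam_mm=0.1)
n_samples = equivalent_samples(wedge_length_mm=10.0, beam_mm=0.1)

print(f"slope = {slope} A/mm, resolution = {resolution} A, "
      f"equivalent uniform samples = {n_samples}")
```

The same functions apply to the steeper wedges (~2-10 Å/mm) used later in the experiment, where the thickness resolution per beam spot is correspondingly coarser.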

Wedged  samples  were  first  applied  to  study  the  short- 
period  oscillations  of  magnetic  coupling  in  magnetic 
multilayers, and then were extended to study the spin-reorientation
transition and many other topics. In this article,
we report our results on the investigation of metastable
fcc Fe films by using wedged samples. fcc Fe exhibits
rich structural and magnetic phases that depend sensitively
on the film thickness. Thus, wedged samples are well suited
to exploring the fine features associated with
each phase.

II.  MOTIVATION 

The  unique  property  of  transition  metal  magnets  is  that 
there  is  a  high  density  of  states  of  3d  magnetic  electrons  at 
the  Fermi  level.  This  property  makes  it  possible  to  manipu¬ 
late  the  magnetic  phases  of  transition  metals  by  changing  the 
crystal  structure.  Fe  is  the  best  choice  for  this  kind  of  study. 
First,  Fe  is  the  most  common  and  the  most  intensively  stud¬ 
ied  magnetic  element.  Second,  Fe  is  between  the  antiferro¬ 
magnetic  Mn  and  the  ferromagnetic  Co  in  the  periodic  table. 
Thus, it has the intrinsic instability to fall into different magnetic
phases. It is well known that bulk Fe has the bcc phase at
room temperature and the fcc phase (γ-Fe) at high temperatures
(>1180 K). The room temperature lattice constant (~3.59
Å) of fcc Fe (extrapolated from high temperature) is very
close to that of Cu (3.61 Å). This lattice matching led to the idea of
stabilizing fcc Fe at low temperatures, either as small γ-Fe
precipitates in a Cu matrix or as an epitaxial ultrathin film
on Cu(100). While the antiferromagnetic γ-Fe precipitates
show a low magnetic moment (~0.7 μB) with a Néel temperature
T_N=67 K, the magnetic phase of the epitaxial fcc
Fe film is very complicated and seems to depend
strongly on the substrate temperature. For example, while
antiferromagnetic and nonmagnetic phases were reported
for fcc-Fe/Cu(100), a ferromagnetic phase of fcc-Fe/Cu(100)
was also identified. Particularly striking is the recent report
of the (n×1) superstructure on the surface of Fe/Cu(100)
and a ferromagnetic live surface layer for ~5-11 ML
Fe/Cu(100) grown at room temperature. The sublayer below
the “live layer” was also identified to be antiferromagnetic
with T_N ~200 K. This rich variety of properties indicates
that fcc Fe on Cu(100) may just be at a magnetic
instability point. Band structure calculations showed that
fcc Fe has more magnetic structure than bcc Fe. For r_WS
(Wigner-Seitz radius) in the range of 2.3-2.8 a.u., the total
energy for bcc Fe has only one minimum, corresponding to a
ferromagnetic state; the total energy for fcc Fe has two
minima, corresponding to a low-spin (LS) state and a high-spin
(HS) state. The instability point between the LS and HS
states occurs at r_WS≈2.66 a.u. Noting that the Wigner-Seitz
radius is 2.652 a.u. for fcc Fe and 2.667 a.u. for Cu, and that
the substrate temperature determines the interlayer mixture,
which in turn affects the overlayer lattice relaxation, it is
therefore very interesting to ask if the complicated magnetic
phases of fcc Fe/Cu(100) are driven by the LS-HS instability.
The experimental difficulty is that the effect of the interfacial
electronic hybridization mixes with the effect of the
interfacial lattice relaxation. The fcc Fe overlayer shows a
very complicated lattice structure, making it extremely difficult
to separate the different effects.
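
The Wigner-Seitz radii quoted above follow from the lattice constants through the standard fcc relation (4/3)πr³ = a³/4, since the cubic fcc cell holds four atoms. The sketch below reproduces them to within rounding; the function name and the rounded value of the Bohr radius are assumptions made for illustration.

```python
import math

BOHR_IN_ANGSTROM = 0.529177  # 1 a.u. of length, rounded

def wigner_seitz_radius_fcc(a_angstrom: float) -> float:
    """Wigner-Seitz radius (in a.u.) of an fcc lattice with cubic constant a.

    The fcc cubic cell contains 4 atoms, so the volume per atom is a^3 / 4,
    and the Wigner-Seitz sphere satisfies (4/3) * pi * r^3 = a^3 / 4.
    """
    volume_per_atom = a_angstrom ** 3 / 4.0
    r_angstrom = (3.0 * volume_per_atom / (4.0 * math.pi)) ** (1.0 / 3.0)
    return r_angstrom / BOHR_IN_ANGSTROM

r_fe = wigner_seitz_radius_fcc(3.59)  # fcc Fe, extrapolated lattice constant
r_cu = wigner_seitz_radius_fcc(3.61)  # Cu
mismatch = (3.61 - 3.59) / 3.61       # fractional lattice mismatch

print(f"r_WS(fcc Fe) ~ {r_fe:.3f} a.u., r_WS(Cu) ~ {r_cu:.3f} a.u., "
      f"mismatch ~ {100 * mismatch:.2f}%")
```

Both radii land on either side of the calculated LS-HS instability point at about 2.66 a.u., which is why a small lattice relaxation could plausibly tip the film between the two states.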

In  light  of  these  results,  there  are  two  natural  directions  of 
investigation.  The  first  possibility  is  to  slightly  alter  the 
structure of the substrate while holding its magnetic properties
fixed. This direction has already received attention. The
second possibility is to alter the magnetic properties of the
substrate while holding the structural properties fixed. This is
the subject of our present study. In an attempt to alter the
magnetic properties of fcc Fe without further changing the
structural properties observed in Fe/Cu(100), Fe was epitaxially
grown on an fcc Co(100) substrate. The fcc Co(100)
substrate was made by depositing several monolayers of Co
on top of Cu(100). Since fcc Co has a similar lattice constant
to Cu, they are structurally very similar. However, fcc Co is
ferromagnetic at room temperature.

To further study the nature of fcc Fe, we used it as a
spacer layer between two Co layers. This was made possible
because of the existence of a nonferromagnetic phase of fcc
Fe (described below). Since the discovery of the AFC of two
ferromagnetic  layers  across  a  nonferromagnetic  spacer  layer, 
there  has  been  extensive  work  on  the  coupling  across  many 
types  of  spacer  layers.  Antiferromagnetic  materials 
(Cr,Mn),^^  semiconductors  (Si),^^  and  many  nonmagnetic 
transition  metals  have  been  used  as  spacer  layers  to  provide 
AFC  and  coupling  strength  oscillations.  The  coupling  of  the 
Co/Fe/Co  system  could  be  compared  to  these  systems  to  help 
determine  the  magnetic  structure  of  the  interlayer. 


III.  EXPERIMENT 

All  samples  were  epitaxially  grown  in  an  ultra-high 
vacuum  (UHV)  chamber  with  a  base  pressure  of  —2X10”^^ 
Torr.  The  UHV  system  is  equipped  with  four  effusion  cells 
for  MBE,  quartz  thickness  monitor,  reflection  high  energy 
electron diffraction (RHEED), low energy electron diffraction
(LEED), Auger electron spectroscopy (AES), Ar+ ion
sputtering, and an in situ SMOKE setup. A sample manipulator
provides a temperature range of 120-900 K and permits
the  use  of  all  the  above  techniques  at  different  stages  of  the 
chamber. With this system, we are able to perform the
growth,  characterization,  and  the  magnetic  measurement  in¬ 
side  a  single  chamber. 

A  single  crystal  Cu(lOO)  disk  of  cm  diameter  and 
mm  thickness  serves  as  the  starting  substrate.  The  substrate 
was mechanically polished down to 0.25 μm diamond paste
and was finished with vibration polishing using 0.05 μm
Al2O3 powder. The substrate was then cleaned in UHV with
cycles of Ar+ ion sputtering at 2-5 keV and annealing at
920 K for several hours. AES was used to check for the
presence of impurities to ensure the cleanliness of the surface.
For the growth of Fe on a dirty substrate, we observed
as many as 40 RHEED oscillations accompanied by a 2×2
reconstruction in the LEED pattern, consistent with the report
of Li et al. A uniform thickness Co layer was grown
first on Cu(100) to serve as the substrate. Then, Fe wedges
were grown on Co using the method shown in Fig. 1. A
typical evaporation rate for Co and Fe is ~0.5-1.0 Å/min. A
typical slope of the Fe wedge is ~2-10 Å/mm.

Magnetic  properties  of  the  films  were  measured  by 
SMOKE  with  an  intensity  stabilized  He-Ne  laser.  The 
SMOKE  setup  consists  of  two  pairs  of  electromagnets  that 
can  apply  a  magnetic  field  up  to  1.9  kOe,  either  perpendicu¬ 
lar  to  the  film  plane  to  generate  the  polar  signal,  or  in  the 
film  plane  to  generate  the  longitudinal  signal.  The  incident 
angle of the laser is fixed at ~45° to allow for both measurements.
The incident beam is usually p polarized and is focused
by an optical lens onto the sample surface. After reflection,
the beam intensity is measured by a photodiode
behind a linear polarizer, which is set at ~1° from extinction,
to generate the magnetic hysteresis loop. There is also a
quarter-waveplate in the reflection path to cancel the birefringence
of the UHV windows. The sample is translated up
and down along the wedge for the thickness dependent study.
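
To illustrate why the analyzer is set slightly off extinction, the following sketch models the detected intensity with Malus's law alone. It assumes a pure Kerr rotation (ellipticity and the quarter-waveplate are ignored), and the Kerr angle used is a hypothetical placeholder, not a measured value from this work.

```python
import math

def analyzer_intensity(delta_rad: float, kerr_rad: float, i0: float = 1.0) -> float:
    """Intensity behind an analyzer set delta from extinction.

    Assumes the reflected beam is linearly polarized and rotated by the
    Kerr angle only (no ellipticity): I = I0 * sin^2(delta + kerr).
    """
    return i0 * math.sin(delta_rad + kerr_rad) ** 2

def relative_change(delta_rad: float, kerr_rad: float) -> float:
    """Fractional intensity change caused by the Kerr rotation."""
    base = analyzer_intensity(delta_rad, 0.0)
    return (analyzer_intensity(delta_rad, kerr_rad) - base) / base

delta = math.radians(1.0)   # analyzer offset quoted in the text (~1 degree)
kerr = 1.0e-4               # hypothetical Kerr rotation in radians

up = relative_change(delta, +kerr)    # magnetization along +H
down = relative_change(delta, -kerr)  # magnetization reversed

print(f"relative change: {up:+.4f} / {down:+.4f}; "
      f"small-angle estimate 2*kerr/delta = {2 * kerr / delta:.4f}")
```

In this simplified picture the intensity change is approximately linear in the Kerr rotation (about 2φ/δ for φ much smaller than δ) and changes sign when the magnetization reverses, which is what lets the photodiode signal trace out a hysteresis loop.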

IV.  RESULTS 
A.  Fe/Co(100) 

First we needed to determine how many layers of Co
should be deposited onto Cu(100) to allow it to be considered
a “Co(100) substrate.” We did this by growing identical
Fe wedges on Co layers of different thicknesses (2-20 ML).
The longitudinal Kerr signals of two of these samples (6 and
10 ML Co) are shown in Fig. 2. The two curves have a
similar shape, but are offset from each other due to the Kerr
signal from the uniform Co layers. If the Co Kerr signal is
subtracted from both, the curves coincide precisely. Therefore,
6 ML of Co is enough for all the features of the Fe/Co/Cu(100)
system to be present. The choice of 6 ML as the Co
thickness for most of our experiments was purely arbitrary.
We will discuss the magnetic properties in detail later.

Our next concern was to test the validity of our conjecture
that Fe/Co(100) should have structural properties similar to those of
Fe/Cu(100). The structural properties of the room temperature
growth Fe/Co(100) system were investigated by
RHEED and LEED.

Fig. 2. Longitudinal Kerr signals of Fe wedges grown on different thicknesses
of fcc Co(100) at room temperature. A study of the effect of Co
thickness on the Fe/Co(100) system was performed by growing identical Fe
wedges on different thicknesses of fcc Co(100). Two typical curves for 6
and 10 ML of Co are shown. Only in-plane magnetization was found in
these samples throughout the whole thickness range.

RHEED intensity was monitored during the growth of
uniform thickness Fe/Co(100) and Fe/Cu(100) films. Figure
3 shows that the oscillations for both systems are similar.
Three regions can be identified in both samples: a region of
erratic oscillations (region I), followed by a region with
fairly periodic and constant oscillations (region II), which
gives way to a third region where the oscillations disappear
and the RHEED intensity is greatly decreased (region III).
The RHEED diffraction patterns for regions I and II are consistent
with an fcc-like crystal structure, while the pattern for
region III is consistent with bcc islandlike growth. Studies
performed on the Fe/Cu(100) system have yielded
similar RHEED oscillation patterns. Other studies performed
on the Fe/Cu system have shown that region I for that system
has a pseudo-fct crystal structure, while regions II and III are
fcc and bcc, respectively.

Fig. 3. Typical RHEED intensity variations during Fe deposition on fcc
Co(100) and Cu(100), plotted against Fe thickness (ML). Samples were grown
at room temperature with a growth pressure <6X10“^ Torr. Different
oscillatory behaviors identify three distinct regions. RHEED and LEED
patterns show no difference between the Fe/Co(100) and Fe/Cu(100) systems.

Fig. 4. (a) The same Kerr signals as in Fig. 2 with the Co backgrounds
subtracted. The curves are identical for both Co thicknesses. Measurements
were taken at room temperature. Note that the magneto-optic response in
regions I and III falls roughly on the same line. (b) Cusps in the coercivity
of the SMOKE loops were observed to correspond to the transitions between
regions.

A study of the surface structure in regions I and II was
performed using LEED. A sample in which layers of uniform
Fe were grown on top of Co(100) was used, taking a LEED
picture after each successive growth. LEED patterns for 0,
3.6 Å (2 ML), 6.3 Å (3.5 ML), and 10.8 Å (6 ML) Fe on Co(100)
were carefully studied. 0 and 3.6 Å Fe (region I) showed a
sharp 1×1 reconstruction. The LEED pattern for 6.3 Å Fe
(region I/II transition) showed a 4×1 or 5×1 reconstruction.
Finally, the pattern for 10.8 Å Fe (region II) showed a 1×1
or a weak 2×1 reconstruction. Wedged samples of Fe on
Co(100) were also studied with LEED, yielding similar results.
Figure 4 includes a summary of the results for the
LEED study. Experiments performed on the Fe/Cu(100) system
yield similar results: 1×1 reconstruction for region I,
4-5×1 reconstruction for the region I/II transition, and 2×1
reconstruction for region II.
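
The thickness pairs quoted in the LEED study (3.6 Å for 2 ML, 6.3 Å for 3.5 ML, 10.8 Å for 6 ML) imply a conversion of 1.8 Å per monolayer, which matches half the fcc lattice constant (~3.6 Å), the interlayer spacing along [100]. A minimal conversion sketch, with illustrative names that are not from the original work:

```python
# Conversion implied by the thickness pairs quoted in the text:
# 1 ML of fcc Fe(100) ~ a/2 ~ 1.8 angstroms.
ANGSTROM_PER_ML = 1.8

def angstrom_to_ml(thickness_angstrom: float) -> float:
    """Convert a film thickness in angstroms to monolayers (ML)."""
    return thickness_angstrom / ANGSTROM_PER_ML

def ml_to_angstrom(thickness_ml: float) -> float:
    """Convert a film thickness in monolayers (ML) to angstroms."""
    return thickness_ml * ANGSTROM_PER_ML

for t in (3.6, 6.3, 10.8):
    print(f"{t:5.1f} A  ->  {angstrom_to_ml(t):.1f} ML")
```

The same factor relates the region boundaries given in angstroms and in monolayers elsewhere in the paper, after rounding to the nearest monolayer.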

The  similarity  between  the  Fe/Cu  and  Fe/Co  systems  in 
the  RHEED  and  LEED  studies  is  consistent  with  the  asser¬ 
tion  that  Fe/Co(100)  is  structurally  similar  to  Fe/Cu(100). 
Hence, we believe that regions I, II, and III of Fe/Co(100)
have fct, fcc, and bcc crystal structures, respectively, as in
the Fe/Cu(100) system. This also suggests that the magnetic
properties of the substrate do not affect the growth of the fcc
Fe film.

Magnetic  measurements  were  performed  using  the  in  situ 
SMOKE  technique  (Fig.  4).  Each  of  the  three  structural  re¬ 
gions  was  observed  to  have  a  distinct  magnetic  behavior.  In 
contrast  to  the  room  temperature  growth  Fe/Cu(100)  system, 
where the magnetization is out-of-plane in the first two regions
with an ordering temperature below room
temperature, the room temperature growth Fe/Co(100)
system exhibits in-plane magnetization throughout
the thickness range with an ordering temperature well above
room temperature.

Fig. 5. Low temperature growth Fe/Co(100). Room temperature SMOKE
measurement shows longitudinal Kerr signal increasing linearly with Fe
thickness. The nonferromagnetic phase seen in the room temperature growth
sample does not exist. The linear dependence across the fcc-to-bcc transition
suggests that the magnetic moments of fcc Fe and bcc Fe are the same.

Region I, which goes from 0 to 10 Å (6 ML) of Fe, is
ferromagnetic, with the Kerr signal increasing linearly with
the Fe thickness (d_Fe). The Kerr signal drops to a small and
constant level in region II, which goes from 10-20 Å (6-11
ML) of Fe. In region III, the Kerr signal increases and again
acquires a ferromagnetic character, again increasing linearly
with d_Fe. Since all three regions exhibit in-plane magnetization,
we are able to compare directly the magnitudes of the
Kerr signals in these three regions. It is interesting to note
that the magneto-optic responses for regions I and III fall
roughly on the same line, suggesting that the Fe film retains
the magnitude of the magnetic moment as its structure
changes from fcc to bcc.

Since  both  Co  and  Cu  produce  the  same  crystal  structure 
for  the  Fe  overlayer,  we  attribute  the  different  magnetic  be¬ 
havior  of  the  Fe/Co(100)  and  Fe/Cu(100)  systems  to  the 
magnetic  nature  of  the  Co  substrate.  We  believe  that  it  is  the 
direct  polarization  of  the  Fe  by  the  Co  moment  at  the  inter¬ 
face  that  causes  the  in-plane  magnetization  and  the  enhanced 
ordering temperature in the Fe/Co system as compared to the
Fe/Cu system. To support our idea about direct coupling between
Fe and Co, we investigated the shape of the magnetic
hysteresis loop of the Fe/Co film. If the Fe overlayer were
not coupled to the Co underlayer, a hysteresis loop for a
Fe/Co film would have a step at the fields corresponding to
the coercivities of Fe and Co. What we observe, however, is
a single square loop with the coercivity shifted from that of
Co to a new value as d_Fe increases. The coercivity of the
Fe/Co system versus d_Fe is shown in Fig. 4. Similar to the
Fe/Cu system, as reported by Li et al., we observed cusps
in the coercivity at the transitions between the three structural
regions. Recalling that the area enclosed by the hysteresis
esis  loop  is  the  work  done  by  the  external  magnetic  field,  the 
observed  cusps  in  the  coercivity  are  a  signature  of  the  energy 
dissipation  associated  with  the  phase  transition. 

We also studied low temperature growth of Fe/Co(100).
After growing the Fe film at 120 K and warming up the
sample to room temperature, a SMOKE measurement was
performed (Fig. 5). Again, only the longitudinal signal is
present. Unlike the room temperature growth film,
the low temperature growth film shows a linear increase of
the Kerr signal through the entire range of the Fe film thickness,
indicating that only the ferromagnetic phase is present.
This result is similar to that of the low temperature growth
Fe/Cu(100) system, further confirming our speculation
that the structural properties of the fcc Fe overlayer are the
same for both the Fe/Co(100) and the Fe/Cu(100) systems.
In addition, the same linear dependence of the SMOKE signal
as the Fe film crosses the fcc-to-bcc boundary suggests
the same magnetic moment of fcc Fe as compared with bcc
Fe.

Fig. 6. Oxygen absorption experiment performed to ascertain the origin of
the magnetic live layers in region II. SMOKE scans before and after oxygen
absorption are shown. The shift in region I is evidence that the oxygen
reacted with the surface. The magnitude of the Kerr signal in region II
remains unchanged after absorption, implying that the magnetic live layers
in this region are not at the surface.

B.  Oxygen  absorption  on  Fe/Co(100) 

For  the  room  temperature  growth  Fe/Co(100),  region  II 
seems  to  be  the  most  interesting  of  the  three  regions  found  in 
this system. The fact that the Kerr signal is constant throughout
the whole region seems to imply that there are only a few
layers that are magnetically live. A similar live layer was also
found  in  the  Fe/Cu  system  below  room  temperature  and 
seems  to  come  from  the  surface  layer  of  the  Fe  film.  Since 
the  live  layer  in  our  Fe/Co  system  was  observed  at  room 
temperature  and  there  exists  a  direct  magnetic  coupling  be¬ 
tween  Fe  and  Co,  we  speculate  that  the  live  layer  in  the 
Fe/Co  system  originates  from  the  interface  of  Fe  and  Co 
instead  of  from  the  surface  layer  of  the  Fe.  A  simple  experi¬ 
ment  was  performed  to  determine  whether  or  not  the  mag¬ 
netic  live  layer  is  at  the  surface  of  the  film.  An  Fe  wedge 
grown  on  6  ML  Co  was  prepared  and  SMOKE  measure¬ 
ments  were  performed  just  as  described  previously.  A  small 
amount  of  oxygen  was  leaked  into  the  chamber  so  that  it 
would  react  with  the  surface  of  our  sample  (a  few  minutes  at 
10“^  Torr).  In  principle,  the  oxygen  would  react  with  a  few 
surface  layers  of  our  sample  and  if  the  magnetic  live  layers 
of the fcc region were located at the surface, we would see a
change  in  the  Kerr  signal.  A  complete  SMOKE  scan  across 
the  wedge  was  performed  after  the  oxygen  absorption.  The 
results  of  the  SMOKE  scans  both  before  and  after  oxygen 
absorption  are  shown  in  Fig.  6.  First,  we  notice  that  there  is 
a reduction of magnetization in region I, indicating that the
absorbed  oxygen  indeed  kills  the  surface  magnetism.  Sec¬ 
ond,  the  transition  thickness  from  region  I  to  region  II  in¬ 
creases.  This  is  not  surprising,  since  both  regions  I  and  II  are 
metastable  phases  and  the  transition  between  them  is  very 
sensitive  to  the  surface  structure,  so  that  the  oxygen  atoms  at 
the  surface  should  effectively  change  the  transition  thickness. 
The  fact  that  the  transition  to  region  III  does  not  shift  too 
much  can  be  attributed  to  the  fact  that  surface  changes  are 
less  likely  to  cause  a  shift  from  the  more  stable  bcc  phase. 
The  most  salient  result  is  that  the  Kerr  signal  in  region  II 
remains  the  same,  supporting  our  speculation  that  the  mag¬ 
netic  live  layers  for  this  region  are  from  the  Fe/Co  interface 
rather  than  from  the  Fe  surface. 

C. Coupling in Co/fcc Fe/Co on Cu(100)

Fe  normally  cannot  be  used  as  the  spacer  layer  to  study 
interlayer  coupling.  Because  Fe  is  a  ferromagnetic  material, 
the  interlayer  coupling  will  always  be  ferromagnetic  and  the 
coupling  strength  will  not  depend  on  the  spacer  thickness. 

On the other hand, we have seen that fcc Fe possesses a
thickness range where it is not ferromagnetic (region II).
Thus, it is feasible to use it as a spacer layer. Because fcc Fe
can be grown on fcc Co, the appropriate system to study is
Co/Fe/Co on Cu(100). Using the extensive work on AFC as
a reference, this system may provide clues as to the magnetic
nature of the fcc Fe.

“Symmetric” Co/Fe/Co sandwiches on Cu(100) with Co
thicknesses  of  13,  15,  20,  and  40  ML  were  fabricated  at  room 
temperature.  The  Co  layers  were  of  uniform  thickness  and 
the  Fe  layer  was  a  wedge. 

Figure  7  shows  representative  longitudinal  SMOKE  hys¬ 
teresis  loops  observed  in  a  Co(15  ML)/Fe/Co(15  ML) 
sample at room temperature. Figure 7(a) shows ferromagnetic
coupling of the Co layers for d_Fe corresponding to region
I of Fig. 4. Figure 7(b) shows strong AFC in the region
I/II transition. The AFC is strong enough that we cannot
saturate the magnetization with our 1.9 kOe magnetic field.
Figure 7(c) shows weak AFC for d_Fe in region II. It was
found that the splitting of the loops changed very little as d_Fe
varied from ~10 to ~15 Å. Figure 7(d) shows a “stretched
out” AFC loop, for d_Fe in the thicker part of region II. This
loop will be discussed later.

Loops  7(a)  and  7(b)  were  easily  reproducible,  but  loop 
7(c)  was  not  easily  reproducible.  Typically  the  loop  was  not 
well split. We hypothesized that structural instabilities of the
fcc Fe might allow restructuring of the Fe as the Co overlayer
is deposited. We based this on several observations.

First, we believed that a structural instability would not
arise from the Co underlayer because the structural and magnetic
properties of Fig. 4 were easily reproducible. In fact,
for each of our sandwiches, we performed quick SMOKE
measurements before putting on the Co overlayer. We found
that for all samples, the qualitative features of Fig. 4
persisted: a linear increase in region I, and a drop to a
roughly constant signal in region II. In addition, the LEED
patterns of the Fe/Co/Cu(100) systems were sharp. During
the growth of the Co overlayer, however, we were never able
to obtain RHEED oscillations. This contrasts with the Co
underlayer growth, where we were able to get ~30 oscillations
with very little amplitude drop. Also, after growth, the
LEED patterns became diffuse.

Fig. 7. Longitudinal SMOKE hysteresis loops for Co(15 ML)/Fe/Co(15
ML). Many Fe thicknesses (d_Fe) were studied by using a wedged Fe layer.
Four representative loops are shown: (a) d_Fe=2.5 Å. Ferromagnetic coupling
(region I). (b) d_Fe=^2 Å. Strong AFC (region I/II transition). We were
not able to saturate the magnetization with our electromagnets (H<1.9 kOe).
(Note: H greater than 600 Oe caused our sample manipulator to move. We
subtracted out the background signal due to this motion.) (c) d_Fe~\2J Å.
Weak AFC (region II). (d) d_Fe=15.2 Å. The weak AFC is beginning to get
“stretched out,” signifying either the beginning of the fcc-bcc transition or a
strong AFC oscillation. Loops (a) and (b) were easily reproducible, while (c)
and (d) were not.

To further support our idea, we investigated the properties
of Co on fcc Fe without the Co underlayer. This allowed us
to remove any effect that the Co underlayer may have had,
thus isolating the effect of the Co overlayer. We first made an
Fe wedge on Cu(100) and then grew a uniform Co layer on
top of it. Figure 8 shows the result of a longitudinal SMOKE
measurement for 15 ML Co. The middle region is flat, like
region II of Fe/Co/Cu(100) (Fig. 4). The thicker region has
increasing Kerr signal, as in Fig. 4. However, the thinner
region does not have a linearly increasing signal like region
I of Fig. 4. This result is consistent with the idea that the Co
overlayer somehow changes the structural properties of the
underlying fcc Fe. However, more studies are needed to determine
the effect of overlayers on fcc Fe.

Fig. 8. Longitudinal Kerr signal vs Fe thickness for Co(15 ML)/Fe/Cu(100).
Comparing with Fe/Co (Fig. 4) shows qualitative differences. For Fe/Co, the
Kerr signal increases linearly with the Fe thickness in region I. For corresponding
Fe thicknesses in Co(15 ML)/Fe, the Kerr signal is approximately
constant.

Fig. 9. Longitudinal SMOKE measurement of M(H=200 Oe) vs Fe thickness
for the Co(15 ML)/Fe/Co(15 ML) sample. 200 Oe is sufficient to saturate
the magnetization for the weak AFC and the ferromagnetic coupling
across fcc Fe. However, it does not saturate the magnetization for the strong
AFC or in the fcc-bcc transition region. Hence, dips in M(H=200 Oe)
signify the onset of the strong AFC or the fcc-bcc transition. The first dip at
~6 Å is strong AFC. The second dip at ~16 Å is due to either strong AFC
or the fcc-bcc transition.

D.  Oscillations  in  coupling 

Returning to the Co(15 ML)/Fe/Co(15 ML) sample, Fig. 9
shows the magnetization (Kerr signal) at H=200 Oe vs Fe
thickness, taken from SMOKE hysteresis loops. This provides
an efficient way of representing our data: 200 Oe is
sufficient to saturate the magnetization for the weak AFC and
for the ferromagnetic coupling across fcc Fe, but it does not
saturate the magnetization for the fcc-bcc transition region
(due to the cusp in the coercivity) or for the strong AFC.
Hence, dips in M(H=200 Oe) signify either strong AFC or
the fcc-bcc transition. The first dip at ~6 Å is strong
AFC. The second dip is ambiguous because of the onset of
the fcc-bcc transition.

Thus,  loop  7(d),  which  lies  at  the  beginning  of  the  second 
dip,  poses  an  interesting  question.  Does  the  “stretching  out” 
of  the  loop  signify  the  beginning  of  the  fcc-bcc  transition,  or 
is  it  a  superposition  of  a  weak  AFC  loop  and  a  strong  AFC 
loop?  If  it  is  the  latter,  perhaps  there  will  be  an  oscillation 
back to strong AFC. Unfortunately, we cannot find out because
the fcc-bcc transition begins and the structure of the
Fe changes drastically.

To examine this issue, we made a compromise. We know
from Li et al. and our own experience that if the Cu(100)
substrate is "dirty" (i.e., not sufficiently cleaned by sputtering
and annealing), the fcc phase of Fe can persist to a higher
thickness. (For detailed information, please see Refs. 19 and
22.) Of course, the fcc Fe on a "dirty" substrate will be
different from the fcc Fe on a clean substrate, but we assume

Fig. 10. Longitudinal SMOKE measurements of M(H=200 Oe) vs Fe
thickness for Fe/Co(6 ML) and Co(6 ML)/Fe/Co(6 ML) on a "dirty"
Cu(100) substrate. The "dirty" substrate allows the fcc structure to persist at
higher Fe thicknesses, allowing us to search for an oscillation back to strong
AFC as suggested in Fig. 9. The Fe/Co measurement verifies that region II
is longer than usual; it normally ends at A. The Co/Fe/Co measurement
shows a second dip at A, signifying an oscillation back to strong AFC.

the difference is negligible with regard to the existence of a
second strong AFC peak. We do not know the amount of
impurity on the "dirty" substrate since the impurity level is
too low to be detected by our Auger electron spectroscopy.
The signature of a "dirty" film in practice is that region II in
Fig. 4 gets bigger and that the surface reconstruction is 2×2
instead of 2×1. Nevertheless, we studied a "dirty" Co(6
ML)/Fe/Co(6 ML) sandwich. Figure 10 shows the Kerr signal
before and after the Co overlayer was grown. The Fe/Co
signal shows that the fcc region is indeed longer. To study
the oscillations of the magnetic coupling in the Co/Fe/Co
sandwich, we measured the magnetization (Kerr signal) at a
field of 200 Oe, as before. Two dips were observed, each
signifying the strong AFC. We conclude that we have found
an oscillation back to the strong AFC with a periodicity of
~12 Å. It is interesting to note that the first AFC occurs at
the transition point between regions I and II. We do not know
if this is a coincidence or if it is somehow associated with
the structural transition (fct-fcc).
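
The way a periodicity is read off from dips in M(H=200 Oe) can be sketched as follows. This is only an illustrative sketch: the helper names `find_dips` and `mean_period` and the sample numbers are hypothetical, the data are synthetic rather than taken from Fig. 10, and the simple local-minimum test ignores noise that real Kerr data would require smoothing for.

```python
def find_dips(thickness, signal):
    """Return the thickness values at strict local minima of the signal."""
    dips = []
    for i in range(1, len(signal) - 1):
        if signal[i] < signal[i - 1] and signal[i] < signal[i + 1]:
            dips.append(thickness[i])
    return dips


def mean_period(dips):
    """Average spacing between successive dips, or None for fewer than two dips."""
    gaps = [b - a for a, b in zip(dips, dips[1:])]
    return sum(gaps) / len(gaps) if gaps else None


# Synthetic example (arbitrary thickness units, not measured data).
thickness = [2, 4, 6, 8, 10, 12, 14, 16, 18]
signal = [1.0, 0.9, 0.4, 0.9, 1.0, 0.95, 0.5, 0.9, 1.0]
dips = find_dips(thickness, signal)
print(dips, mean_period(dips))
```

With only two dips, as in the measurement described above, the "period" is simply their separation, so the estimate carries no information about whether the oscillation persists.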

V.  SUMMARY 

We have successfully grown and characterized fcc Fe on
fcc Co(100). Structurally, Fe/Co(100) is very similar to
Fe/Cu(100). The magnetic phases (e.g., ferromagnetic,
paramagnetic, etc.) are similar in both systems, but Fe/Co(100)
differs from Fe/Cu(100) because it has in-plane magnetization
for all film thicknesses. The nonferromagnetic phase has
a magnetic live layer. Oxygen absorption experiments show
that the live layer is not at the surface, so it is most likely at
the Fe/Co interface. Fe/Co(100) grown at low temperature
exhibits a longitudinal Kerr signal that grows linearly with Fe
thickness, even across the fcc-to-bcc boundary. The same linear
dependence suggests that the magnetic moments of fcc Fe
and bcc Fe are the same. Studies of the coupling in Co/fcc
Fe/Co/Cu(100) sandwiches yield ferromagnetic coupling,
strong AFC, and weak AFC. We find that Co/Fe/Cu(100) is
different from Fe/Co/Cu(100), so the Co layers do not act
symmetrically; the difference is possibly caused by a structural
instability. By using a "dirty" substrate to artificially
lengthen region II, we find an oscillation in the strong AFC.

The magnetic structure of nanoscale alloy films has been probed using magnetic x-ray linear
dichroism in photoelectron spectroscopy and magnetic x-ray circular dichroism in x-ray absorption
spectroscopy. FeNi and CoFe epitaxial films were grown in situ on Cu(001) using MBE
techniques. Because soft x-rays were used to probe the 2p and 3p core levels, both elemental
selectivity and magnetic sensitivity were achieved simultaneously. These magnetic measurements
will be correlated with compositional and structural information. Ultimately, studies
such as this should make a complete determination of the structure-property relationships achievable.

I. INTRODUCTION

A full elucidation of the underlying principles driving
magnetic properties in complex systems will require the
application of probes that couple elemental specificity and
magnetic sensitivity. Two important classes of magnetic
devices, spin valves and GMR materials, are typically
composed of several nanoscale layers or aggregations of
different elements or alloys. The ultrathin nature of these
films or clusters further complicates the picture by introducing
interfacial effects, including pseudomorphic strain and
spin-specific scattering at the interfaces. One
avenue to address such issues is to build epitaxial ultrathin
alloy films and probe them directly using techniques that are
both elementally selective and magnetically sensitive. Here
we report the beginnings of such a study, using CoFe and
FeNi films grown with molecular beam epitaxy (MBE)
techniques and investigated with magnetic x-ray linear
dichroism (MXLD) in the core-level photoelectron spectroscopy
of the 3p states and magnetic x-ray circular dichroism (MXCD)
in the x-ray absorption of the 2p states (Fig. 1).

II.  EXPERIMENT 

The x-ray absorption experiments (Fig. 2) were performed
at the Stanford Synchrotron Radiation Laboratory (SSRL)
using Beamline 8-2, part of the University of California/
National Laboratories facilities. Beamline 8-2 is based on a
spherical grating monochromator with only three optical
elements reflecting the x rays, including the gratings.
Circularly polarized x rays were extracted either by
raising or lowering the first mirror or by inserting a vertical
slit to block part of the beam. As illustrated in Fig. 1, in
x-ray absorption an electron moves from a core level into the
unoccupied density of states immediately above the Fermi
energy. The combination of circularly polarized excitation (x
rays), strong dipole selection rules, and spin-polarized valence
bands (i.e., a magnetized atom) gives rise to strong dichroic
effects in these spectra, as we will see below. In this
experiment, total yield detection was utilized. Here, the cascade
of electrons, including Auger decay electrons but predominantly
low-energy secondary electrons, causes a net
loss of electrons by the sample. Hence, we can measure the
absorption of ultrathin films simply by monitoring the
replenishing current through a picoammeter (A). Because we
are using ultrathin films only a few monolayers thick,
x-ray attenuation and electron escape depth effects will
be minimized, if not totally eliminated. From the resulting
intensity variations, and within a localized picture, it is possible to
extract, with varying degrees of precision and accuracy, the
spin and orbital components of the elemental magnetic
moments. For more detail concerning MXCD absorption,
please see Ref. 8 and references therein. Sample cleanliness
and composition were monitored with photoelectron
spectroscopy using photons of about 500 eV for excitation.

The MXLD photoemission measurements (Fig. 3) were
made at the Spectromicroscopy Facility (Beamline 7) of the
Advanced Light Source at Lawrence Berkeley Laboratory.
Extraordinarily bright, linearly polarized x rays were generated
by the U 5.0 undulator, and wavelength selection was
achieved using the spherical grating monochromator, with a
resolving power of over 8000. The photoelectrons were detected
using the angle-resolving, multichannel, 5.4 in. radius,
Perkin-Elmer hemispherical deflector system. Sample alignment
(including pseudomorphic growth), cleanliness, and
composition were measured using the hemispherical deflector
and a separate Mg Kα (photon energy = 1254 eV) source,
thus freeing up the beamline for other uses during our periods
of sample preparation. The actual MXLD measurements

Fig. 1. Shown here is the excitation of a core-level electron in a magnetically
polarized atom, i.e., the valence bands are spin polarized. For illustrative
purposes only, a circularly polarized photon is shown, with a minority
electron moving into the unoccupied valence (conduction) band and a majority
electron being ejected. In x-ray absorption (light vertical arrow) near-edge
measurements, the electron moves into the strongly polarized valence
states (e.g., the 3d states), giving rise to strong dichroic effects at the
l + 1/2 and l - 1/2 edges. In photoelectron spectroscopy (heavy vertical arrow with
e-), the electron moves into states well above the strongly polarized conduction
bands and is collected in an energy analyzer.


used the highly linearly polarized synchrotron radiation and
were performed with a total instrumental energy resolution
of less than 100 meV and angular resolution of 2° or less.
The angle of incidence of the x rays was 30° relative to the
surface plane. The electrons were collected along the surface
normal, i.e., "normal emission." Typically, the magnetic alloy
was magnetized in the plane of the surface but perpendicular
to the plane containing the emission direction (surface
normal) and the Poynting vector and electric
polarization vector of the x rays. Thus, the "transverse-chiral"
condition necessary for MXLD was achieved: reversing
the magnetic field produces two mirror-image configurations
that are equivalent but not superimposable. By
comparing spectra from these configurations, it is possible to
directly probe the magnetic perturbations of the elementally
specific electronic structure of the systems.

Fig. 2. Shown here is a schematic representation of the MXCD absorption
experiment. A total yield method has been used. The picoammeter (A) monitors
the grounding current, giving a measure of the absorption in an ultrathin
film. It is necessary to align and antialign the magnetization and helicity or
circular polarization, hence grazing incidence (15°) is used with in-plane
magnetized samples (see Ref. 8).


Fig. 3. The MXLD experimental configuration is schematically shown here.
The grazing angle of incidence of the linearly polarized x rays was 30°,
relative to the surface plane. Electrons ejected perpendicularly to the surface
were collected by the angle-resolving electron energy analyzer (i.e., normal
emission). The plane containing the ejected electron and the Poynting vector
of the x ray also contained the electric (linear polarization) vector of the x ray.
The magnetization vector was perpendicular to this experimental plane, but
in the plane of the surface of the single-crystal Cu(001) substrate. To achieve
MXLD, comparisons were made between spectra in which only the direction
of the magnetization was reversed; all else was kept the same.


III.  OVERLAYER  GROWTH:  COMPOSITION, 
THICKNESS,  AND  ORDERING 

The alloy systems under consideration were CoFe and
FeNi. Ultrathin films of each were grown on Cu(001) at
room temperature, using well-developed MBE techniques.
All samples were about 2-5 monolayers thick. Studies of the
bulk electronic structure have been performed, but the
properties of these pseudomorphic overlayers are further
complicated by epitaxial strain and other nanoscale effects.
An important issue is growth control, including thickness,
composition, and ordering. At the ALS, we can monitor
thickness and composition by performing x-ray photoelectron
spectroscopy (XPS) and element-specific ordering with
x-ray photoelectron diffraction (XPD), using the Mg Kα
x-ray source. Examples of XPS spectra are shown in Fig. 4.
The XPD analyses are illustrated in Figs. 5 and 6.

Figure 4 illustrates the type of quantitative information
that can be gleaned from XPS. Here, three major sets of peak
structure are immediately apparent: the Cu 2p (B^F=933,
952 eV), the Co 2p (B^F=778, 793 eV), and the Fe 2p
(B^F=707, 720 eV). (B^F is the binding energy with respect to
the Fermi level.) The relative intensities of the Fe and Co
lines give a zeroth-order estimate of composition, assuming
chemical disorder within the alloy overlayer. The relative
intensity of the Cu 2p lines versus the sum of the Co and Fe
lines gives a measure of overlayer thickness, if there is no
substrate intermixing or other such complications. A quantitative
estimate of thickness requires assumptions concerning
x-ray penetration depths and electron escape depths. Quantitative
estimates in general necessitate corrections for photon-energy-dependent
cross sections and angles of emission. After
properly including such effects, it is possible to arrive at
composition and thickness estimates such as those shown in
Fig. 4.
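
The kind of estimate described above can be sketched with the standard uniform-overlayer attenuation model. This is a minimal sketch under stated assumptions, not the procedure actually used for Fig. 4: the function names, the single attenuation length `att_length` shared by all lines, and the sensitivity factors `s_*` (which stand in for the cross-section corrections) are all hypothetical, and intermixing is ignored.

```python
import math


def composition_fraction(i_a, i_b, s_a, s_b):
    """Zeroth-order atomic fraction of element A in a chemically disordered A-B alloy.

    i_a, i_b: measured peak intensities; s_a, s_b: assumed sensitivity factors.
    """
    a = i_a / s_a
    b = i_b / s_b
    return a / (a + b)


def overlayer_thickness(i_over, i_sub, s_over, s_sub, att_length, emission_deg=0.0):
    """Thickness of a uniform overlayer from the overlayer/substrate intensity ratio.

    Model (an assumption): substrate signal ~ exp(-d / (L cos t)),
    overlayer signal ~ 1 - exp(-d / (L cos t)), with the same attenuation
    length L for both lines and emission angle t measured from the normal.
    The result has the units of att_length.
    """
    ratio = (i_over / s_over) / (i_sub / s_sub)
    return att_length * math.cos(math.radians(emission_deg)) * math.log1p(ratio)
```

For example, `overlayer_thickness(i_CoFe, i_Cu, s_CoFe, s_Cu, att_length)` would take the summed Co and Fe intensity as `i_over` and the Cu 2p intensity as `i_sub`; the answer is only as good as the assumed attenuation length and sensitivity factors.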

A  key  question  in  the  growth  of  epitaxial  films  is  that  of 
the  degree  and  nature  of  ordering,  including  strain.  A  means 
to  probe  ordering  in  ultrathin  films  in  an  elementally  specific 

Fig. 4. XPS analysis of various Fe(1-x)Co(x) alloy overlayers on
Cu(001). Spectral structure includes the Fe 2p (B^F=707, 720 eV), Co 2p
(B^F=778, 793 eV), and Cu 2p (B^F=932, 952 eV). The sample in (c) may
be slightly Fe rich at the interface; this effect was not included in the
concentration estimate.

fashion is the application of forward focusing in x-ray
photoelectron diffraction. Here, buried atoms emit
core-level photoelectrons when excited with x rays, and
intensity maxima correspond to positions of nearby atoms


Fig.  5.  XPD  azimuthal  scans  are  shown  here,  for  a  series  of  FeCo  alloy 
overlayers,  with  the  energy  set  to  the  core  levels  of  either  Co  or  Fe.  The 
polar  angle  is  45°:  the  diffraction  peaks  demonstrate  the  degree  to  which  the 
overlayer  atoms  are  occupying  ordered  lattice  sites. 


Fig.  6.  XPD  polar  scans  are  plotted  in  this  figure.  The  azimuth  is  set  on  a 
peak  and  the  polar  angle  is  scanned.  For  a,  b,  and  d,  the  plane  of  rotation  is 
a  {110}  plane.  For  c,  it  is  a  {100}  plane. 


above the emitters. This is because these nearby atoms can
act as lenses to focus electrons of fairly high kinetic energies
(hundreds of eV or more). Thus, one can immediately identify
bond axes or nearest-neighbor directions. Of course,
complications exist: long chains of atoms can cause defocusing,
multiple scattering events can arise, and bulk signals can
overwhelm those of a near-surface atom of the same elemental
nature as the bulk. However, many of these complications
are minimized in ultrathin overlayers. In fact, the elemental
specificity of the source core-level electrons allows for a
separation of overlayer (here, Co and Fe) forward focusing
from the bulk (Cu) effects and a separation of overlayer
components from each other (e.g., Co vs Fe). Thus one can
directly probe overlayer ordering on an element-by-element
basis, independent of substrate ordering effects.

In FeCo alloy films, it is expected that the growth will be
FCC-like with the Cu(001) serving as a template. Two tests
of this ordering are shown: in Fig. 5 are azimuthal scans and
in Fig. 6 are polar scans. FCC growth will give rise to strong
forward focusing peaks at a 45° polar angle relative to the
normal (0°): this is along the (101) and (011) directions (i.e.,
the face diagonal). Because of the width of the forward
focusing peaks, at 45° polar angle one may also observe part of
the (112) and (111) families of peaks, centered at polar
angles of 35° and 55° from the normal (0°), respectively.
Thus, an azimuthal scan (polar angle of 45°) will give rise to
a series of peaks separated by 45° azimuthally, but possibly
with an intensity period of 90° azimuthally. That is what is
observed for FeCo9, FeCo, and FeCo3 in Fig. 5, indicating a
high degree of ordering. The FeCo2 seems to be more disordered,
with only weaker maxima. While a high degree of
ordering is indicated, it still needs to be proven that it is

Fig. 7. Photoelectron spectra of 2 ML (Fe[1]Co[9])Cu(001), taken using a
photon energy of 130 eV. Also shown is the difference for these two spectra,
a measure of the MXLD effect.


Fig. 8. Photoelectron spectra of 2 ML (Fe[1]Ni[4])Cu(001), taken using a
photon energy of 160 eV. Again, a difference spectrum is also shown, as a
measure of the MXLD effect.


FCC-like. Unfortunately, BCC growth could give rise to a
similar azimuthal pattern, at least qualitatively. A further test
of FCC growth is to perform polar scans in the {100} and
{110} planes. In the {100} plane, the maximum associated
with the (101) should be at 45°: an example of this result is
shown in Fig. 6(c). In the {110} plane, the (112) maximum
should be at 35° and the (111) maximum at 55°. Figures 6(a),
6(b), and 6(d) show examples of peaks at or near 35°, with
possible shoulders at 55°. Normal emission peaks (0°) are
also expected and evident in all of the subsections of Fig. 6.
In the case of FeCo3 [Fig. 6(c)], the order appears to be
stronger around the Co emitters than the Fe emitters, consistent
with Fig. 5(c). A complete analysis would, of course,
include a full computational simulation and explanation of
all features and effects. Nevertheless, the presence of strong
maxima indicates a high degree of FCC ordering. It should
be noted that while strong structural ordering is indicated
here and the source of the photoelectrons can be specified
elementally, the scattering events do not have such elemental
specificity. Moreover, we assume that while these films are
structurally ordered they are chemically disordered, a natural
by-product of the simultaneous co-evaporation of Fe and Co
onto Cu(001).
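
The polar angles quoted above for an undistorted cubic lattice follow from elementary geometry, which the short sketch below reproduces. The function names are illustrative only, the surface normal is taken as [001], and any tetragonal distortion from pseudomorphic strain (which would shift these angles) is ignored.

```python
import math


def polar_angle_deg(direction, normal=(0, 0, 1)):
    """Angle in degrees between a lattice direction and the surface normal."""
    dot = sum(a * b for a, b in zip(direction, normal))
    len_d = math.sqrt(sum(a * a for a in direction))
    len_n = math.sqrt(sum(b * b for b in normal))
    return math.degrees(math.acos(dot / (len_d * len_n)))


def azimuth_deg(direction):
    """In-plane angle in degrees of the direction's projection, measured from [100]."""
    x, y, _ = direction
    return math.degrees(math.atan2(y, x))


# Forward-focusing directions discussed in the text, for a cubic lattice.
for hkl in [(1, 0, 1), (1, 1, 2), (1, 1, 1)]:
    print(hkl, round(polar_angle_deg(hkl), 1), round(azimuth_deg(hkl), 1))
```

The (101) direction lies at a polar angle of 45° and azimuth 0°, while the (112) and (111) directions lie near 35.3° and 54.7° at azimuth 45°, which is why an azimuthal scan at 45° polar angle can show peaks every 45° with a 90° intensity period.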

IV.  MAGNETIC  X-RAY  DICHROISM 

Now, let us turn our attention to the MXLD results. Examples
of our spectroscopic MXLD photoemission results
are shown in Figs. 7 and 8. In this energy regime, the cross
sections for the Fe, Co, and Ni 3p are strongly photon-energy
dependent: hence, the peaks do not scale with concentration
without further correction. Along with the photoemission
spectra are difference curves. From these we
can calculate an asymmetry, using the definition
ASYM = (I^+ - I^-)/(I^+ + I^-), where I^+ and I^- denote the
intensities measured for the two opposite magnetization
directions. Setting each prepeak region
equal to zero, asymmetry values of approximately 15% for
Co and 40% for Fe in FeCo and 5% for Fe and 2% for Ni in
NiFe are obtained.
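
The asymmetry evaluation just described can be sketched as below. This is a hedged illustration, not the authors' analysis code: the function names and the parameter `n_prepeak` (the number of leading points treated as the prepeak region) are assumptions, and points where the summed intensity vanishes are simply reported as zero.

```python
def subtract_prepeak(spectrum, n_prepeak):
    """Shift a spectrum so that the mean of its first n_prepeak points is zero."""
    baseline = sum(spectrum[:n_prepeak]) / n_prepeak
    return [value - baseline for value in spectrum]


def asymmetry(i_plus, i_minus, n_prepeak):
    """Pointwise (I+ - I-)/(I+ + I-) after zeroing the prepeak region of each spectrum."""
    plus = subtract_prepeak(i_plus, n_prepeak)
    minus = subtract_prepeak(i_minus, n_prepeak)
    result = []
    for p, m in zip(plus, minus):
        total = p + m
        # Avoid dividing by zero where both background-subtracted intensities vanish.
        result.append((p - m) / total if total != 0 else 0.0)
    return result
```

How a single asymmetry value is then quoted (for instance the maximum magnitude across the peak) is a further choice that the text does not specify.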


Presently, we are pursuing studies of these systems where
we vary the thickness and composition of the magnetic
overlayers and probe the 3p levels of each element. In essence,
we are attempting to use the MXLD measurements as
element-specific, surface magnetometers. This is not quite as
straightforward as it might seem: it is clear that simple
perturbative models (Fig. 9) often fail. For example, if it is
possible to work in the regime where ΔE_so, the spin-orbit
splitting, is large versus the exchange splitting ΔE_ex, then
ΔE_ex can be treated as a perturbation upon ΔE_so. Under
these conditions, it seems possible that the variations in
asymmetry will correlate with the elemental magnetic moment.
A fair candidate for this condition is Ni, where the
stronger spin-orbit splitting is evidenced by the peak and
shoulder appearance in both spectra in Fig. 8. Fe and Co are
probably more difficult cases, where the exchange and spin-orbit
splittings need to be dealt with on an equal footing, in a
nonperturbative approach. However, it may be possible to
follow variations around a central value in a perturbative

Fig. 9. The effect of spin-orbit and exchange splitting upon a p core level is
illustrated here. Two perturbative approaches are shown: large spin-orbit and
small exchange splitting (left) and small spin-orbit and large exchange splitting
(right). The Ni 3p would fall to the left while the Fe 3p is near the
center, where spin-orbit and exchange effects are nearly equal and a
nonperturbative approach is required. State mixing is indicated by dashed and
solid lines.

Fig. 10. An example of MXCD absorption is shown here. The 5 ML
(Fe3Co1)Cu(001) sample was magnetized in-plane. The degree of circular
polarization was approximately 70%. This lower-than-usual value was chosen
for ease of operation and enhanced counting rates. Corrections for nonideal
polarization (<100%) can be made following Ref. 8. Regardless,
strong dichroisms are easily observable here.

fashion. Moreover, we can correlate the MXLD photoemission
by comparing that technique with the results from
MXCD absorption. An example of our MXCD absorption
results is shown in Fig. 10. Here, a strong dichroism is observed
at both the Fe 2p and Co 2p peaks. Within certain
limitations, it is possible to extract from such spectra both
the spin and orbital components of the projection of the
elementally specific magnetic moments along the axis of
quantization. (Again, it should be noted that the ultrathin nature
of our samples minimizes the impact of electron escape or
x-ray attenuation effects.) Although our initial results are
only preliminary and further study is warranted, they are
promising in terms of understanding key effects in magnetic
nanoscale films, including such possibilities as
strain-modified invar quenching.
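
The two perturbative limits and the intermediate regime discussed in connection with Fig. 9 can be illustrated with a textbook single-particle model of a p core level. The sketch below is an assumption-laden illustration, not the calculation behind Fig. 9: it takes H = xi L·S + delta_ex S_z with the exchange field along z, and the names `p_level_energies`, `xi`, and `delta_ex` are hypothetical. In this model the spin-orbit splitting of the p level is 3 xi / 2 and the exchange splitting in the absence of spin-orbit coupling is delta_ex. Because m_j is conserved, the 6x6 problem reduces to two 1x1 and two 2x2 blocks that can be solved in closed form.

```python
import math


def p_level_energies(xi, delta_ex):
    """Sorted eigenvalues of H = xi L.S + delta_ex S_z for l = 1, s = 1/2."""
    # m_j = +3/2 and -3/2: pure states |m_l = +1, up> and |m_l = -1, down>.
    levels = [xi / 2 + delta_ex / 2, xi / 2 - delta_ex / 2]
    # Spin-orbit coupling between the two states in each m_j = +-1/2 block.
    coupling = xi / math.sqrt(2)
    # m_j = +1/2 block, basis |0, up>, |+1, down>.
    half_diff = delta_ex / 2 + xi / 4
    root = math.hypot(half_diff, coupling)
    levels += [-xi / 4 + root, -xi / 4 - root]
    # m_j = -1/2 block, basis |0, down>, |-1, up>.
    half_diff = xi / 4 - delta_ex / 2
    root = math.hypot(half_diff, coupling)
    levels += [-xi / 4 + root, -xi / 4 - root]
    return sorted(levels)


# Limiting cases: pure spin-orbit (2 + 4 levels) and pure exchange (3 + 3 levels).
print(p_level_energies(2.0, 0.0))
print(p_level_energies(0.0, 1.0))
```

When `xi` and `delta_ex` are comparable, all six levels are distinct and the m_j = ±1/2 states are strong mixtures of spin-up and spin-down character, which is the situation in which neither perturbative picture applies.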

V.  SUMMARY 

We are using the elementally specific and magnetically
sensitive techniques of MXLD photoelectron spectroscopy
and MXCD absorption to probe the structure-property
relationships of nanoscale magnetic alloys. These ultrathin films
are the building blocks of many new magnetic devices,
including spin-valve and GMR materials. The variation of
classical magnetic effects, such as invar quenching, due to
pseudomorphic strain and other nanoscale perturbations, is
being investigated.

The polar ellipticity spectra of the interfaces of epitaxial Fe/Au(001) wedges have been measured.
The interface spectra resemble neither the reflection Faraday spectrum of Au nor the thin-film
spectra of thin Au-sandwiched Fe. The existence of low- and high-energy peaks suggests that
magneto-optical transitions occur from and to the interface states. © 1996 American Vacuum
Society.


I. INTRODUCTION

The study of the magneto-optical (MO) properties of ultrathin
films and multilayers is important from both an application
and a theoretical point of view. The dielectric
properties of ultrathin films are different from those of the bulk
material. Strain caused by the difference in lattice parameters
between thin film and substrate influences the band structure
and can change the MO transitions. Calculations on Ni
showed that the position of the lower sp band relative to the
Fermi level depends on the lattice parameter. Changing the
effective lattice parameter by alloying with a paramagnetic
element causes a shift of the corresponding MO peaks. The
difference in crystal potential of bulk and thin film causes the
electrons to reflect at the interfaces; because of the finite
dimensions of the thin film perpendicular to the substrate,
some electrons will be confined in quantum well states
(QWSs). Such discontinuous electron dispersion causes
new peaks in the MO spectra, and the Kerr rotation of
ferromagnetic and nonferromagnetic thin films oscillates
as a function of thickness. Extended overviews on this subject
can be found in Ref. 13. In addition to strain and finite
size effects, it is also expected that the interfaces themselves
will contribute to modifying the MO properties. The
redistribution of the free carriers and/or the occurrence of interface
electronic states might change the effective dielectric
properties of the entire thin-film system.

Measurements on polycrystalline Co films sandwiched
between Pd and Pt have provided evidence of the existence
of such an interface contribution. A nonvanishing Kerr
rotation and Kerr ellipticity exist when extrapolating the
results to zero film thickness. Furthermore, the existence of
an interface MO effect is evident from the MO spectra of
multilayers and ultrathin films. Xu found that the Kerr
spectra of Fe/Ag multilayers could only be understood
by assuming a band splitting in the paramagnetic Ag.
Visnovsky et al. investigated the MO properties of
Au-sandwiched ultrathin Co films. Although most of their results
could be explained by classical electromagnetic wave
theory (Maxwell equations), evidence for the existence of
interface MO contributions exists. Their interface spectra
resembled the reflection Faraday spectrum of pure Au.


Current address: NHMFL/Department of Materials Science, University of
Florida, Gainesville, Florida; Electronic mail: wgeer@silica.msLufl.edu

In this article, we report on a systematic investigation of
the MO interface effects of Au-sandwiched (001) Fe films.
To our knowledge, this is the first time the MO interface
effect has been studied on epitaxial thin films.


II.  EXPERIMENTAL  PROCEDURE 

Thin Fe wedges were grown by molecular beam epitaxy
(MBE). Cleaning of the substrates and the deposition conditions
are described extensively elsewhere. The total preparation
procedure was optimized so that flat films with sharp interfaces
were obtained. Because of the perfect matching of the lattice
parameters of Au and Fe, the small difference in surface free
energy of Au and Fe, and the possible surfactant effect of the
Au atoms, high-quality films could be grown. Two wedges
were prepared with the following structure (from surface to
substrate): Sample No. 1: Au(20 Å)/Fe(0-20 Å)/Au(2000
Å)/Ag(2000 Å)/polished MgO [(001) orientation]; Sample
No. 2: Au(20 Å)/Fe(0-10 Å)/Au(20 Å)/Ag(3000 Å)/polished
MgO [(001) orientation]. Figure 1 shows their structure. In
both samples, the Fe layer is sandwiched by Au. By changing
the thick buffer layer from Au (sample No. 1) to Ag
(sample No. 2), we can check optical effects on the spectra
(the average penetration depth of the light is about 200 Å).

MO experiments were performed using a system based on
the piezobirefringent modulator technique (Jasco-2500
system). The polar Kerr ellipticity was determined along
the wedge (at 80 different positions) for different photon
energies (1.6, 1.8, ..., 6 eV). The applied field exceeded 1.8 T to
ensure saturation of the magnetization for all points along the
wedge. All measurement conditions were similar to those
described previously.

Fig. 1. (a) Wedge structure of sample No. 1 and (b) sample No. 2.

III.  MEASUREMENT  RESULTS 

Figure 2(a) shows the measured Kerr ellipticity (η_K) as a
function of the film thickness (D) for two different photon
energies. Three different contributions can be observed:

(1) A damped oscillating term originating from the electron
confinement in the thin-film structure. In both samples,
multiple oscillating terms occurred. For most of the photon
energies, a short-period oscillation (3-5 Å) and a long-period
oscillation (7-10 Å) could be observed.

(2) An interface term: extrapolation of the curves to zero film
thickness results in nonzero Kerr ellipticities (see Fig. 2). It
is believed that this MO contribution originates from transitions
from or to states which are located near the interfaces
of the thin-film structure. Three types of states can be
considered.


Fig.  2.  (a)  Measured  Kerr  ellipticity  as  a  function  of  the  film  thickness  for 
several  photon  energies  (sample  No.  1);  (b)  Calculated  ellipticity  as  a  func¬ 
tion  of  the  film  thickness  for  sample  No.  1  (bulk  term  only). 


Fig. 3. Calculated photon energy dependence of the nonlinear bulk term for
sample No. 1 (at 15 Å) and sample No. 2 (at 7.5 Å).

(a) The electronic wave function is limited to the interface;
i.e., the envelope function of this kind of state has an
exponential decay in both the Fe and the Au layers. The
eigenenergy will be in the band gap of both metals.

(b) The electronic wave function describes an extended
state in the Fe layer and penetrates into the Au layer;
only in the Au layer will the envelope function have an
exponential decay. The eigenenergy of such a state
will be in an Fe band but not in a Au band.

(c) The electronic wave function describes an extended
state in Au and an exponentially decaying one in Fe.

It  is  expected  that  the  interface  states  of  type  (a)  and  (b) 
show  exchange  splitting.  As  the  spin  orbit  coupling  in  Au  is 
very  large, large  interface  MO  effects  are  expected  in  the 
Au/Fe  system. 

(3) A bulk term originating from the bulk dielectric properties
of the thin-film structure. Using optical and
magneto-optical experimental data from the literature, we
can calculate the expected thickness dependence of this bulk
term. A calculation scheme similar to that given in Ref. 21
was used. Some results for sample No. 1 are given in Fig.
2(b). For the relevant thickness range (0-20 Å), the ellipticity
can be considered to be linearly proportional to the film
thickness, in agreement with earlier reports [see inset in Fig.
2(b)]. It appears that this linear behavior occurs at all photon
energies except for a small interval around the plasma edge
of the Au buffer layer (2.3-2.7 eV). For these wavelengths,
the absorption in the Fe layer causes large second-order
terms even below a film thickness of 20 Å.

Calculations on sample No. 2 revealed essentially the
same results. Except for the energy interval 3.8-4.2 eV (the
plasma edge of the Ag buffer layer), the second-order term
was negligible compared with the linear term. Figure 3
shows the nonlinear bulk term (equal to the second-order term
at a specified thickness) of sample No. 1 and sample No. 2 at
15 and 7.5 Å, respectively.

The interface term was isolated from the measurement
data by a linear regression algorithm. For sample No. 1
(sample No. 2) the data from film thicknesses between 3 and
20 Å (4 and 10 Å) were used. For lower thicknesses, the
amplitudes of the finite-size oscillations are very large and an
accurate determination of the interface term becomes difficult.
The results of this analysis are given in Fig. 4. Figure
4(a) gives the spectrum of the calculated bulk term. The
shape of both spectra is in agreement with the results of the
calculations. The structures around 2.5 and 4 eV originate
from a sudden change of the optical properties of the
corresponding buffer layers. A first-order linear approximation
of the relation between dielectric properties and observable
complex Kerr rotation is given by

$$\Phi_K = \phi_K + i\eta_K = \frac{4\pi i D}{\lambda}\,\frac{\epsilon_{xy}}{1-\epsilon_s}, \tag{1}$$

in which $\epsilon_s = \epsilon'_s + i\epsilon''_s$ is the dielectric constant of the buffer
layer, $\epsilon_{xy} = \epsilon'_{xy} + i\epsilon''_{xy}$ the nondiagonal component of the
dielectric tensor of the thin Fe layer, $\lambda$ the wavelength of the
light, $D$ the film thickness, $\Phi_K$ the complex Kerr rotation,
$\phi_K$ the Kerr rotation, and $\eta_K$ the Kerr ellipticity.

Fig. 4. Measured bulk (a) and interface term (b) as a function of the photon
energy for sample No. 1 and sample No. 2.

At the plasma edge, the complex dielectric constant ($\epsilon_s$)
changes from mainly real to mainly imaginary; we can conclude
from Eq. (1) that below the plasma edge the ellipticity
spectrum mainly represents the spectrum of $\epsilon'_{xy}$, the real part
of the nondiagonal component, while above the plasma edge its
imaginary part $\epsilon''_{xy}$ will have the largest contribution to the
ellipticity.
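
The two limits discussed above can be checked numerically. The following is a minimal sketch, assuming Eq. (1) has the thin-film form $\Phi_K = (4\pi i D/\lambda)\,\epsilon_{xy}/(1-\epsilon_s)$; the function names and the values chosen for $\epsilon_s$, $D$, and $\lambda$ are hypothetical and serve only as an illustration, not as measured data.

```python
import math


def kerr_complex_rotation(eps_xy, eps_s, thickness, wavelength):
    """Complex Kerr rotation phi_K + i*eta_K in the assumed form of Eq. (1)."""
    prefactor = 4j * math.pi * thickness / wavelength
    return prefactor * eps_xy / (1 - eps_s)


def ellipticity_weights(eps_s, thickness=10.0, wavelength=5000.0):
    """Return |eta_K| produced by a unit real and a unit imaginary eps_xy.

    thickness and wavelength are in the same length unit (illustrative values).
    """
    from_real = kerr_complex_rotation(1 + 0j, eps_s, thickness, wavelength).imag
    from_imag = kerr_complex_rotation(0 + 1j, eps_s, thickness, wavelength).imag
    return abs(from_real), abs(from_imag)


# Hypothetical buffer-layer dielectric constants on either side of a plasma edge.
below_edge = ellipticity_weights(-10 + 0j)  # eps_s purely real
above_edge = ellipticity_weights(0 + 10j)   # eps_s purely imaginary

print("below edge (real part, imaginary part):", below_edge)
print("above edge (real part, imaginary part):", above_edge)
```

With a real $\epsilon_s$ the ellipticity picks up only the real part of $\epsilon_{xy}$; with an imaginary $\epsilon_s$ the imaginary part dominates.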

In Fig. 4(b) the calculated interface MO contribution is
presented. The spectrum is completely different from that of
the bulk term [Fig. 4(a)]. The sign change at 2.5 eV in the
spectrum of sample No. 1 can be attributed to the aforementioned
plasma edge effect of the buffer layer. As the spectrum of sample
No. 2 changes sign at the same photon energy, it can also be
concluded that the nondiagonal component of the dielectric
tensor ($\epsilon_{xy}$) changes drastically around 2.5 eV. The same
can be said about the structure at 4 eV in the spectrum of
sample No. 2.

Fig. 5. Band structure model of the Fe-interface atoms in the Au-sandwiched
Fe(001) structure.

The nonlinearities of the bulk term shown in Fig. 3 could
give rise to a virtual interface term. The magnitude of this
virtual interface term is one half the value shown in Fig. 3,
and is thus negligible except for values around the plasma
edges of the buffer layers. This nonlinear term is also shown
in the error bars of Fig. 4(b). Thus, it can be concluded that
the interface ellipticity spectra of Fe/Au (001) display peaks
around 3 and 5.7 eV.
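
The separation of bulk and interface terms by linear regression, as described in this section, can be sketched as follows. This is an illustration under stated assumptions rather than the analysis code used for the measurements: the ellipticity is taken to be linear in thickness, the intercept at zero thickness is read as the interface term and the slope as the bulk term, and the data points are synthetic.

```python
def linear_fit(thickness, ellipticity):
    """Ordinary least-squares straight line; returns (intercept, slope)."""
    n = len(thickness)
    mean_x = sum(thickness) / n
    mean_y = sum(ellipticity) / n
    sxx = sum((x - mean_x) ** 2 for x in thickness)
    sxy = sum((x - mean_x) * (y - mean_y)
              for x, y in zip(thickness, ellipticity))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return intercept, slope


# Synthetic example: thicknesses from 3 to 20 angstrom, as used for sample No. 1.
thickness = [3.0 + i for i in range(18)]
assumed_interface = 0.02   # hypothetical intercept (arbitrary units)
assumed_bulk_slope = 0.005  # hypothetical slope per angstrom
ellipticity = [assumed_interface + assumed_bulk_slope * d for d in thickness]

interface_term, bulk_slope = linear_fit(thickness, ellipticity)
print(interface_term, bulk_slope)
```

Restricting the fit to thicknesses above the first few angstroms, as the text does, keeps the large finite-size oscillations out of the intercept.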

IV.  DISCUSSION 

The large difference in work function between Au and Fe
(5.46 and 4.67 eV) suggests that charge transfer will occur
from the Fe to the Au layer. Because of the large density of
states around the Fermi level of both materials, the charge
redistribution will be limited to the interface, i.e., the dipole
layer will be thinner than a monolayer (ML). In this case, it
is not clear whether it is possible to approximate the
band structure of the total system by bending of the bulk
bands of the materials involved, and exact band calculations
need to be performed. Wang et al. and Ohnishi et al. performed
an ab initio self-consistent spin-polarized energy
band study of a 7 ML Fe thin film. They found narrowing of
the d bands near the surface, which resulted in a decrease of
the density of states of the majority spin electrons. Because
of the existence of surface states around the Fermi level in
the minority spin d band, the minority spin density was
largely enhanced near the surface. A similar behavior was
found by Li et al. for a monolayer of Fe on Au (001): a
narrow majority spin band completely under the Fermi level
and a large minority spin density at the Fermi level.

Furthermore, Li et al. found that the Au interface atoms
have a net magnetic moment of 0.03 $\mu_B$ (Bohr magneton). A
closer look at Fig. 2 of their publication shows that the spin
polarization of the Au interface atoms is clearly inhomogeneous.
Surface states occur along all symmetry directions.
Similar results are expected for a real Fe/Au (001) interface.
Figure 5 gives a schematic diagram of the electronic structure
of the Fe-interface atoms. The bulk density of states for
the d band is given for both spin-up and spin-down electrons.
The dashed curves give the modifications for the interface
area (we neglect the effect of band narrowing). The bold
solid curves at the bottom and the top of the figure represent
the dispersion of the sp band. The solid arrows show the
main optical transitions of bulk Fe. The MO effect arises
from the combination of spin-orbit interaction and exchange
splitting, which results in different absorption coefficients for
right-circularly polarized and left-circularly polarized light.
Bulk and interface values of both spin-orbit interaction and
exchange splitting will be different. The results found in
Ref. 3, however, indicate that, to obtain an impression of the
global shape of the spectra, it is sufficient to consider only the
change in exchange splitting near the interface. So a global
idea of the shape of the MO interface spectrum, analogous to
Ref. 38, can be obtained from Fig. 5. The band structure
suggests that, near the surface, transitions will occur to and
from the surface states. Transitions from the surface states in
the majority spin band to the sp band will cause a peak near
3.2 eV. Transitions from the deep sp band to the surface
states in the minority spin band will cause a structure near
5.2 eV in the MO interface spectrum. This picture is in basic
agreement with the experimental results of Fig. 4(b), which
show a low- and a high-energy interface peak.

By using the data of Ref. 32 and Ref. 39, a picture similar
to Fig. 5 can be constructed for the Au-interface atoms. The
calculation of Li et al. shows a polarization of the density
of states 2 eV below the Fermi level. Because of the large
number of possible optical transitions in Au, however, a
line of argument similar to that used above for the Fe
interface would lead to the prediction of many peaks in the
interface spectrum.

V.  CONCLUSION 

MO measurements on a Au-sandwiched Fe(001) wedge
show the existence of a low- and a high-energy peak in the
interface ellipticity spectra. Although the changed electronic
structure of the Fe interface can explain the existence of
these peaks, the large spin-orbit interaction and the exchange
splitting of the Au suggest a contribution from the
Au-interface atoms. Experiments on thin Fe films sandwiched
between other noble metals (Ag, Cu, etc.) could
separate both contributions. Furthermore, theoretical calculations
on the electronic and magneto-optical properties of the
Fe/Au interface could reveal the exact origin of the presented
measurement results.
Ultrathin Co films were epitaxially grown on a Cu(001) surface with different initial roughness
created by Ar ion sputtering. The roughness of the Cu substrate and the Co film was characterized
by high resolution low energy electron diffraction. The measured angular profiles were compared
with a diffraction theory for rough surfaces and the roughness parameters were extracted
quantitatively. Magnetic hysteresis loops of these characterized films were measured by surface
magneto-optic Kerr effect. The hysteresis loop shape and loop area can be related to the nanoscale
roughness in the Co films. For the roughest film, whose interface width w is expressed in units of
the single atomic step height t, the magnetization is reduced severalfold compared with that of
smooth films with smaller interface width. Also, the coercivity in the roughest film is the highest
and there exists a wide range of nucleation centers and coercive fields for magnetic domain
reversals. These are related to the high step density in the rough substrate, which provides the
pinning centers. The hysteresis loop changes its shape and area under a sinusoidal external
magnetic field as functions of frequency and field amplitude for all films. For the smooth films in
the low frequency and low field regimes the loop area shows a 2/3 power law scaling behavior. The
2/3 value of the scaling exponents is consistent with the prediction of a dynamic mean field theory
with a double-well energy barrier. For a film with the same interface width but different step
density and lateral correlation length the scaling exponents deviate from the 2/3 value
drastically. © 1996 American Vacuum Society.


I. INTRODUCTION

The advancement of magnetic characterization in
magnetospectroscopy, magnetic imaging (magnetic force microscopy,
scanning electron microscopy with polarization analysis
(SEMPA), spin polarized scanning tunneling microscopy,
spin polarized low energy electron microscopy, x-ray dichroism
photoemission microscopy), electron diffraction techniques
(spin polarized electron diffraction, spin polarized
photoelectron diffraction), material growth and characterization,
and large scale computations has provided new opportunities
for fundamental research in surface, ultrathin, and
thin film magnetism as well as applications to magnetic
devices. Issues such as proximity effects at the interface
between a magnetic film and a nonmagnetic substrate, the
exchange coupling across interfaces, coupling through
nonmagnetic layers in trilayers, multilayers, and superlattices,
and magnetic domain structures have been actively
studied. Applications of the giant magnetoresistance
(GMR) effect in devices such as hard disk drive read heads
are being researched and developed. Activities in
magnetoelectronics have also been initiated. The understanding of
the magnetic properties of nanoscale structures has grown
exponentially in the past decade.

In this article, we present the results of our study of
hysteresis loops of heteroepitaxial ultrathin Co films grown on a
Cu(001) surface using the surface magneto-optic Kerr effect
(SMOKE). The SMOKE technique was pioneered by Moog
and Bader in a study of Fe on a Au(001) surface and has
become one of the simplest yet most powerful techniques used
in ultrathin magnetic film characterization. The hysteresis
loop provides information on magnetization, coercivity, and
anisotropy. Although the ultrathin film is one of the simplest
systems being studied, many variables such as substrate
roughness, film roughness, film thickness, film temperature,
applied field direction, and the amplitude and frequency of a
periodic applied magnetic field can be changed. These variations
allow the study of equilibrium magnetic phase transitions,
in-plane or out-of-plane anisotropy, effects of dimensionality
on magnetic properties, effects of roughness on
magnetic properties, dynamic magnetic phase transitions,
etc.

In a heteroepitaxial ultrathin film, the deposited film material
may have a symmetry and lattice constant different from
those of the substrate. This mismatch in lattice constants may
induce strain at the interface and roughness at the surface of
films. Often the surface free energy of the film is different
from that of the substrate. This may cause compositional
heterogeneities or intermixing at the interface. It is not
unusual to form a metastable epitaxial film or an alloyed film.
These possible microstructures may be examined using a
structural technique such as high resolution low energy electron
diffraction (HRLEED) developed by Henzler's group.
This technique is sensitive to atomic arrangements in the
surface and ultrathin film regime and is also quantitative in
the regime from subnanometer to hundreds of nanometers.

One expects that the deviation of structural properties in
the epitaxial film from those of the bulk material might
change the magnetic moments, remanent magnetization,
magnetic domain structure, anisotropy, coercivity, etc. Indeed,
many prior results of structures and magnetic domain
structures exist. In this article we will present the magnetic
properties of Co/Cu(001) obtained from the hysteresis
loops measured by SMOKE under a sinusoidal external magnetic
field with various frequencies and field amplitudes. We
found that the area and shape of a hysteresis loop depend on
how the film is prepared. Different preparations of initial Cu
substrates for the growth of films result in different nanoscale
roughness in the films. Three kinds of Co films were
grown: on a clean smooth Cu substrate, on a lightly sputtered
Cu substrate, and on a heavily sputtered Cu surface. The goal
of this study is to find out the effect of initial substrate
imperfection, which causes interface imperfection and film
roughness, on magnetic properties. The quantitative characterization
of structural imperfection is obtained from the angular
profile measurements of diffraction beams using HRLEED.
The correlation between hysteresis loop shape and
nanoscale roughness will be presented.

In addition, the area of a hysteresis loop is measured as a
function of either the amplitude or the frequency of an applied
sinusoidal magnetic field for all films. The frequency-
and amplitude-dependent loops obtained from the Co film on
the lightly sputtered surface are qualitatively the same as those
of the smooth film grown on the smooth surface but differ
quantitatively in resonance frequency, coercive field, and scaling
exponents. For the smooth film, there exists a dynamic scaling
behavior of power law form with well-defined exponents
in the low field and low frequency regime. A dynamic magnetic
phase transition is observed at high frequencies. The
values of the scaling exponents indicate that the dynamic
behavior of Co belongs to the Ising universality class.
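
A power-law scaling exponent of this kind is commonly read off as the slope of a log-log plot. The sketch below assumes a loop area of the form A ∝ x^exponent, where x is either the field amplitude or the frequency with the other held fixed; the function name is hypothetical and the data are synthetic, generated with an exponent set by hand rather than taken from the measurements.

```python
import math


def power_law_exponent(x_values, areas):
    """Least-squares slope of log(area) versus log(x)."""
    log_x = [math.log(x) for x in x_values]
    log_a = [math.log(a) for a in areas]
    n = len(log_x)
    mean_x = sum(log_x) / n
    mean_a = sum(log_a) / n
    sxx = sum((lx - mean_x) ** 2 for lx in log_x)
    sxa = sum((lx - mean_x) * (la - mean_a) for lx, la in zip(log_x, log_a))
    return sxa / sxx


# Synthetic loop areas versus frequency (arbitrary units), exponent chosen by hand.
frequencies = [0.5, 1.0, 2.0, 4.0, 8.0]
chosen_exponent = 2.0 / 3.0
areas = [3.0 * f ** chosen_exponent for f in frequencies]

fitted_exponent = power_law_exponent(frequencies, areas)
print(fitted_exponent)
```

On real data the fit should be restricted to the low field and low frequency regime, where the text reports that the power law holds.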

II.  EXPERIMENT 

All experiments were performed in an ultrahigh vacuum
(UHV) chamber. The Cu(001) surface was sputtered by Ar+
(~2.0 μA/cm² or 1.0X10““^ Torr at 500 eV) at room temperature
and then annealed at 600 °C for 30 min. The HRLEED
pattern showed a 1×1 pattern after this cycling process.
The cleanliness was checked by Auger electron
spectroscopy. This is the ordered smooth surface. For the
lightly sputtered surface, Ar+ sputtering was applied at ~0.2
μA/cm² or 6.0X10”^ Torr partial pressure at 500 eV for 10
min with no annealing. For the heavily sputtered surface
(HSS), the sputtering was held for 30 min at an Ar+ pressure of
l.bXlO”"^ Torr or ~2.2 μA/cm² at 500 eV. The surface
perfection was then characterized by the analysis of measured
angular profiles of the (00) and (01) diffraction beams.

The Co was evaporated from an electron-beam-bombarded
Co foil. The rate can be varied from 0.01 to 1.0
ML/min by adjusting the temperature of the foil and the
electron bombardment voltage. The deposition rates for all
films grown on the ordered smooth surface, lightly sputtered
surface, and heavily sputtered surface were fixed at 0.08
ML/min to minimize possible changes due to different rates. All
Co films grown on a Cu(001) surface held at room temperature
were epitaxial fcc films, as evidenced by the HRLEED
diffraction patterns. The pressure during evaporation was
<5.0X10"^^ Torr. The angular profiles of integer order
diffraction beams, (00) and (01), as a function of energy were
measured after each monolayer deposition at room temperature.
The roughness parameters such as average terrace
width, interface width, lateral correlation length, and roughness
exponent can be extracted from the analysis of the
energy-dependent diffraction profiles.

After a certain thickness of Co film was grown at room
temperature, magnetic hysteresis loops were measured at
room temperature as functions of applied magnetic field
amplitude and frequency using a UHV SMOKE setup. The
maximum magnetic field strength of the UHV electromagnet can
reach 1000 Oe at a very low frequency. All the hysteresis loops
presented have been saturated before the maximum field
amplitude is reached. The magnetic field can be applied along
either the [100] parallel or the [001] perpendicular direction
of the Cu(001) surface by rotating the magnet inside the UHV.
Note that the easy axis direction in bulk Co is along the [011]
direction. When measuring hysteresis loops from the Co film,
the eddy current effect from the Cu substrate needs to be
considered. The major eddy current effect comes from the
magnetic core. All the calculations of loop areas have been
corrected by the frequency-dependent field strength calibration
factor $g(\Omega)$, where $\Omega = 2\pi f$ and $f$ is the frequency of the
applied sinusoidal magnetic field.
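
As an illustration of the loop-area evaluation mentioned above, the following minimal sketch computes the area enclosed by a sampled hysteresis loop with the shoelace formula. It is not the authors' procedure: the function names are hypothetical, and applying the calibration as a single multiplicative factor on the field axis is an assumption made here, since the functional form of $g(\Omega)$ is not given in the text.

```python
def loop_area(field, magnetization):
    """Area enclosed by a closed M-H loop sampled in order (shoelace formula)."""
    n = len(field)
    twice_area = 0.0
    for i in range(n):
        j = (i + 1) % n  # wrap around to close the loop
        twice_area += field[i] * magnetization[j] - field[j] * magnetization[i]
    return abs(twice_area) / 2.0


def corrected_loop_area(field, magnetization, calibration):
    """Loop area after scaling the field by a calibration factor (assumed form)."""
    scaled_field = [calibration * h for h in field]
    return loop_area(scaled_field, magnetization)


# Idealized square loop: field in Oe, magnetization normalized to saturation.
field = [-100.0, 100.0, 100.0, -100.0]
magnetization = [-1.0, -1.0, 1.0, 1.0]

raw_area = loop_area(field, magnetization)
scaled_area = corrected_loop_area(field, magnetization, calibration=0.5)
print(raw_area, scaled_area)
```

The points must be supplied in the order in which the loop is traversed; an unordered set of samples would not give the enclosed area.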

III.  RESULTS  AND  DISCUSSION 

A.  Growth  of  Co  films  and  characterization  of  film 
roughness 

1.  Roughness  parameters 

To characterize a rough surface, one can employ several
parameters: average terrace width $\eta$, interface width $w$,
lateral correlation length $\xi$, local slope $\rho$, and roughness
exponent $\alpha$. The average terrace width is the average distance
between two adjacent steps. The interface width $w$ is defined
as $w = \sqrt{\langle [h(\mathbf{r}) - \langle h \rangle]^2 \rangle}$, where $h(\mathbf{r})$ is the surface
height at position $\mathbf{r}$ and $\langle h \rangle$ is the average height. $w$ is the
"amplitude" of surface fluctuations (rms roughness) and thus
describes the property of the interface at a large distance. The
lateral correlation length $\xi$ describes the "wavelength" of
surface fluctuations. Two surfaces having the same interface
width $w$ and the same lateral correlation length $\xi$ may possess
very different local roughness structures, which are described
by the local slope $\rho$ and roughness exponent $\alpha$, where
$\rho \sim w/\xi^{\alpha}$. Details of how to extract these parameters from
diffraction profiles can be found elsewhere. We briefly
summarize them in the next subsection.
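
The definition of the interface width can be illustrated with a short sketch. It assumes a height profile sampled at discrete positions and expressed in units of the single atomic step height t; the function name and the sample profile are hypothetical, and the value t = 1.76 Å is the step height quoted later in the text.

```python
import math

STEP_HEIGHT_ANGSTROM = 1.76  # single atomic step height t quoted in the text


def interface_width(heights):
    """rms deviation of the surface height from its mean value."""
    mean_height = sum(heights) / len(heights)
    variance = sum((h - mean_height) ** 2 for h in heights) / len(heights)
    return math.sqrt(variance)


# Hypothetical two-level surface: half of the sites sit one step higher.
profile_in_steps = [0, 1, 0, 1, 0, 1, 0, 1]

w_in_steps = interface_width(profile_in_steps)
w_in_angstrom = w_in_steps * STEP_HEIGHT_ANGSTROM
print(w_in_steps, w_in_angstrom)
```

A two-level surface with equal occupation of both levels gives w = 0.5t, which is the largest width such a surface can have.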

2.  Diffraction  characteristics 

All diffraction profiles contain a diffraction structure factor,
which is related to the morphology of the growth front.
In most cases all the important parameters related to the
growth front, $w$, $\xi$, $\rho$, and $\alpha$, can be determined from the
diffraction profile. In general, the specularly reflected intensity
can be written as

$$I(\mathbf{k}) \sim \exp[-(k_\perp w)^2]\,\delta(\mathbf{k}_\parallel) + I_{\mathrm{diff}}(\mathbf{k}_\parallel, k_\perp),$$

where $\mathbf{k} = \mathbf{k}_{\mathrm{out}} - \mathbf{k}_{\mathrm{in}}$ is the momentum transfer
with $\mathbf{k}_{\mathrm{out}}$ and $\mathbf{k}_{\mathrm{in}}$ being the outgoing and incoming wave vectors,
and $\mathbf{k}_\parallel$ and $k_\perp$ are the momentum transfers parallel and
perpendicular to the surface, respectively. The central sharp
intensity is the $\delta$ component, which depends on the interface
width and is used for the determination of $w$. The lateral
correlation length $\xi$ is inversely proportional to the full width
at half maximum (FWHM) of the diffuse part of the intensity,
$I_{\mathrm{diff}}(\mathbf{k}_\parallel, k_\perp)$, when $k_\perp$ is small (or near the in-phase
condition). For large $k_\perp$ conditions (or near the out-of-phase
conditions), the FWHM of the diffuse part of the intensity
(diffuse line shape) is proportional to $(\rho k_\perp)^{1/\alpha}$. One therefore is
able to determine both $\alpha$ and $\rho$ by plotting the FWHM as a
function of $k_\perp$. The FWHM at the out-of-phase condition is
inversely proportional to the average terrace width $\eta$, and the
local slope or step density $\rho$ is equal to $1/\eta$.

Table I. Roughness parameters for three Co films with different initial
roughness in Cu(001).

Surface            Initial clean Cu      Co film terrace   Interface     Lateral correlation
                   terrace width η (Å)   width η (Å)       width w       length ξ (Å)
OS                 300±10                                                408±10
Co/OS (3.0 ML)                           31.4±2            ≈0.54t        285±7
LSS                44±2                                    0.50±0.02t    218±5
Co/Cu (3.0 ML)                           25±2              0.50±0.02t    190±5
HSS                10±1                                    1.81±0.05t    96±4
Co/HSS (2.5 ML)                          12.5±1            1.18±0.05t    27±2


3. Angular profiles from ultrathin Co films grown on
Cu surfaces with three different initial
roughnesses in Cu and decomposition of profiles

Co films were grown on a Cu(001) substrate with three
different initial roughnesses: an ordered smooth surface (OS)
with a large average terrace width ≈300 Å, a lightly sputtered
surface (LSS) with an average terrace width ≈44 Å,
and a heavily sputtered surface (HSS) with an average terrace
width ≈10 Å. These numbers are listed in Table I. The
in situ growth of Co films was monitored by measuring the
angular profiles of the (00) and (01) diffraction beams at the
near out-of-phase condition of E = 150 and 116 eV, respectively.
Figures 1(a), 1(b), and 1(c) show the measured angular
intensity profiles of the (00) beam at both in-phase (108
eV) and out-of-phase conditions (150 eV) for about 3-ML-thick
Co films on three Cu surfaces with different roughnesses.

For the Co/OS shown in Fig. 1(a), the in-phase profile is
basically the instrument function. The near out-of-phase profile
is typical of a two-level system, i.e., the profile has a
central spike and a broad profile. The profile has been
decomposed into a narrow instrument function (1D Lorentzian)
and a broad profile (2D Lorentzian) plus a constant background,
shown as solid curves in Fig. 1(a). Note that the peak
intensity of the in-phase profile is a factor of 6 higher than
the out-of-phase peak intensity.

For the Co/LSS, Fig. 1(b) shows the profiles near in-phase
and out-of-phase conditions. The best fit indicates that the
in-phase profile contains not only the instrument function but
also an extra profile. This is an indication of reduced lateral
correlation length $\xi$ after sputtering compared with that of
the Co/OS. The decomposed profiles near the out-of-phase
condition consist of a broader profile with higher intensity
compared with that of Fig. 1(a) and a very small component of
the central narrow profile. This implies a multilevel film
rather than a two-level film. The peak intensity ratio between
the in-phase and out-of-phase data is about a factor of 5. The
overall peak intensity of the Co film on a lightly sputtered
surface is reduced by about a factor of 2 compared with that
of the Co film on the ordered smooth surface.

For the Co/HSS, Fig. 1(c) shows two profiles. The in-phase
profile broadens considerably and the peak intensity is
reduced by a factor of 6 compared with that of Fig. 1(b). The
central $\delta$ peak has almost disappeared. This implies that the
surface after heavy sputtering has a very small lateral
correlation length. The FWHM of the broad profile at the
out-of-phase condition is further increased compared to the lightly
sputtered surface. Again, the central narrow profile is almost
gone, an indication of multiple levels in the Co film. The
peak intensities for both in-phase and out-of-phase conditions
have the same order of magnitude.


4. Roughness parameters obtained from the analysis
of diffraction profiles

Figures 2(a) and 2(b) show the peak intensity and the
FWHM of broad diffuse profiles of the (10) beam versus Co
coverage in the few-monolayer (ML) regime for the three
different initial substrates. The coverage is determined from the
periodic oscillation of the peak intensity and FWHM versus
deposition time for Co/OS. The FWHM oscillates periodically
for the Co/OS but it has only one very obvious oscillation
for both Co/LSS and Co/HSS. The absolute values of
the FWHM around 3 ML are ≈0.20, ≈0.25, and ≈0.50 Å⁻¹
for Co/OS, Co/LSS, and Co/HSS, respectively. If the FWHM
is inversely proportional to the average terrace width in the
Co film at the out-of-phase condition, then at about 3 ML,
the average terrace widths listed in Table I for the Co films
prepared on OS, LSS, and HSS are ≈31.4, ≈25, and ≈12.5
Å, respectively. This decrease is due to the loss of long range
order as more sputtering is applied to the Cu substrate. The (10)
beam peak intensity versus coverage for both Co/LSS and Co/HSS
has one obvious oscillation at the beginning, but the intensity
oscillation is not obvious thereafter, in contrast to the regular
periodic oscillations usually observed in the case of
layer-by-layer growth. The loss of periodic peak intensity
oscillation and the broadening of the diffuse profile are consistent
with the buildup of a second layer before the completion
of the first layer, which results in multilevel films.
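
The conversion from FWHM to average terrace width quoted above can be reproduced with a one-line calculation. The sketch below assumes the proportionality constant is 2π, i.e., terrace width = 2π/FWHM; this constant is inferred here from the quoted numbers and is not stated explicitly in the text, and the function name is hypothetical.

```python
import math


def terrace_width_from_fwhm(fwhm_inverse_angstrom):
    """Average terrace width in angstrom, assuming width = 2*pi / FWHM."""
    return 2.0 * math.pi / fwhm_inverse_angstrom


# FWHM values near 3 ML quoted in the text for Co/OS, Co/LSS, and Co/HSS.
fwhm_values = [0.20, 0.25, 0.50]
widths = [round(terrace_width_from_fwhm(f), 1) for f in fwhm_values]
print(widths)  # [31.4, 25.1, 12.6]
```

These values agree with the ≈31.4, ≈25, and ≈12.5 Å listed in Table I to within the stated uncertainties.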
Fig. 1. Angular intensity distributions of the (00) diffraction beam measured
near in-phase and near out-of-phase conditions for Co films grown on
Cu(001) surfaces: (a) 3.0 ML on an ordered smooth (OS) surface; (b) 3.0
ML on a lightly sputtered surface (LSS); (c) 2.5 ML on a heavily sputtered
surface (HSS). The solid curves in each profile are decomposed components
representing a narrow delta function (1D Lorentzian), a diffuse profile (2D
Lorentzian), and a constant background. (The $S_\parallel$ is equivalent to $k_\parallel$ in the
text.)


Assumptions made in the derivation of diffraction profiles
from a real space rough surface are that the roughness is
random and the surface height distribution is a Gaussian. For
the ultrathin Co films grown on an OS, the roughness is not
random and there are multilevels. For the Co/LSS, the films
are rougher but barely satisfy the assumptions. For the film
grown on the HSS, the assumptions may be valid. If we
take the assumptions about the roughness to be
valid on the HSS and approximately valid on the LSS, then
the interface width is 1.81t after heavy sputtering and is
1.18t after about 2.5 ML Co film was deposited on the HSS.
Here t is the single atomic step height of 1.76 Å. This 1.18t
value is in contrast to the value around 0.5t obtained on the LSS.
The roughness exponents are around 0.6 for Co/HSS and
Co/LSS.

Fig. 2. (a) The peak intensity and (b) the FWHM of the (10) diffraction beam as a function of Co film thickness measured near the out-of-phase condition for
Co films grown on ordered smooth, lightly sputtered, and heavily sputtered Cu(001) surfaces. The data for Co/LSS and Co/HSS in (a) are offset from zero by
4000 and 8000 cps, respectively. The data for Co/LSS and Co/HSS in (b) are offset from zero by 0.3 and 0.6 Å⁻¹, respectively. The solid curve through the
peak intensity data for Co/OS is a fit using the formula $\exp\{-(k_\perp w)^2[1+\cos(2\pi d)]\}$.

If the peak intensity oscillation is layer-by-layer-like, one can use a
recently developed analytical formula,
$I_{\mathrm{peak}} \sim \exp\{-(\pi w)^2[1+\cos(2\pi d)]\}$, where d is
the thickness in units of monolayers, to extract w from the decay of the
peak oscillations. Figure 2(a) shows the peak intensity of the (10) beam
from the Co/OS film; the solid curve is the fit. The value of the
interface width obtained from the fit changes from about 0.2t to 0.7t in
about 10 oscillations. The w obtained from the Co/OS at 3 ML is ≈0.54t
and is listed in Table I.
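
As a minimal sketch of how this formula can be used, the snippet below evaluates the intensity $\exp\{-(\pi w)^2[1+\cos(2\pi d)]\}$ and inverts it at integer d to recover w. It takes the formula at face value, with w expressed in units of the step height t; the function names are illustrative and are not taken from the original analysis.

```python
import math

def peak_intensity(d, w):
    """Relative peak intensity exp{-(pi*w)^2 * [1 + cos(2*pi*d)]}.

    d: film thickness in monolayers.
    w: interface width in units of the step height t.
    """
    return math.exp(-(math.pi * w) ** 2 * (1.0 + math.cos(2.0 * math.pi * d)))

def width_from_ratio(ratio):
    """Invert the formula at integer d, where the bracket equals 2.

    ratio: intensity at integer d divided by intensity at half-integer d,
    i.e. exp(-2 * (pi*w)^2). Returns w in units of t.
    """
    return math.sqrt(-math.log(ratio) / 2.0) / math.pi

# Round trip for an assumed width of 0.2 t.
w_assumed = 0.2
ratio = peak_intensity(1.0, w_assumed) / peak_intensity(0.5, w_assumed)
w_recovered = width_from_ratio(ratio)
```

In a real fit w itself grows with thickness, so this inversion would be applied oscillation by oscillation rather than once.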

The surface height distribution can be obtained either from a real space
imaging height profile scan or from a G factor analysis of diffraction
profiles. At the present time, we have not obtained the real space
images. The G factor analysis was done on Co/Cu(001) films, which show
obvious peak intensity oscillations. The vertical distribution, or the
probability of finding a surface atom in level i, decreases as the level
number increases.

The chemical inhomogeneity at the interface can be obtained from the
analysis of angular profiles if the atomic scattering factors of the two
elements differ in strength by at least a factor of 2. Examples have
been demonstrated for Co silicide, Ni silicide, Si/SiO2, and Fe/Au(001).
For the Co/Cu, the atomic scattering factors of Co (Z=27) and Cu (Z=29)
are not much different, so an analysis of inhomogeneity is not
attempted.

B.  Hysteresis  loops  and  analysis 

An ultrathin ferromagnetic film exhibits a change in magnetization under
an applied magnetic field. A closed magnetization curve under one cycle
of applied magnetic field is called the hysteresis loop. It takes work
to magnetize a ferromagnetic film. The energy supplied by the work is
partly stored as potential energy and partly lost as heat in the film.
Because at the end of a cycle of the loop the potential energy returns
to its original value, the net work must be consumed as heat in the
film. This is called hysteresis loss. The area enclosed by the
hysteresis loop is $A=\oint M\,dH$. The remanent magnetization $M_r$ is
the magnetization at H=0. The squareness is the ratio of $M_r$ to
$M_s$. The value of $M_r$, and therefore the squareness, depends on the
saturation magnetization $M_s$ and also on how well the preferred axes
of the domains are aligned with the direction in which the magnetization
is being measured. For a loop symmetrical about the origin (H=0), the
coercive force is defined as the value of H at which M is zero. If the
field strength is high enough to magnetize the sample to saturation,
then the coercive force becomes the coercivity $H_c$. Soft magnetic
materials such as soft iron and Permalloy usually have low $H_c$, small
hysteresis loss, and high permeability. This is different from hard
magnetic materials such as permanent magnets, which have the opposite
magnetic properties.

Fig. 3. Hysteresis loops for 3.0-ML-thick Co films grown on OS, LSS, and
HSS Cu(001) surfaces at room temperature. The amplifier gains for the
Kerr intensity were the same in the three measurements.
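
The loop quantities defined in this subsection (enclosed area, remanence, coercivity, squareness) can be extracted from a digitized loop along the following lines. This is only a sketch under stated assumptions: the loop is given as ordered (H, M) samples around one cycle, the area is taken with the shoelace formula, and zero crossings are found by linear interpolation. The function names and the ideal rectangular test loop are illustrative, not the authors' procedure.

```python
def loop_area(H, M):
    """Area enclosed by a closed loop, |closed integral of M dH| (shoelace formula)."""
    n = len(H)
    total = 0.0
    for i in range(n):
        j = (i + 1) % n  # wrap around to close the loop
        total += H[i] * M[j] - H[j] * M[i]
    return abs(total) / 2.0

def zero_crossing(x, y):
    """|x| at the first sign change of y along the samples (linear interpolation)."""
    for i in range(len(y) - 1):
        if y[i] == 0.0:
            return abs(x[i])
        if y[i] * y[i + 1] < 0.0:
            frac = y[i] / (y[i] - y[i + 1])
            return abs(x[i] + frac * (x[i + 1] - x[i]))
    raise ValueError("no sign change found")

def loop_metrics(H, M):
    """Return area, remanence M_r, coercivity H_c and squareness M_r/M_s."""
    Ms = max(abs(m) for m in M)   # saturation taken as the largest |M| sampled
    Hc = zero_crossing(H, M)      # field at which M = 0
    Mr = zero_crossing(M, H)      # magnetization at which H = 0
    return {"area": loop_area(H, M), "Mr": Mr, "Hc": Hc, "squareness": Mr / Ms}

# Ideal rectangular loop (hypothetical numbers): M_s = 1, H_c = 25 Oe.
H = [-100.0, 25.0, 25.0, 100.0, -25.0, -25.0]
M = [-1.0, -1.0, 1.0, 1.0, 1.0, -1.0]
metrics = loop_metrics(H, M)
```

For the ideal rectangular loop the area equals 4 M_r H_c, which gives a quick sanity check on any digitized data set before real loops are analyzed.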

1.  Hysteresis  loop  shapes  from  3-ML-thick  Co  films 
grown  on  Cu  substrates  with  three  different 
initial  surface  roughnesses 

The in-plane hysteresis loops measured for Co films of around 3 ML grown
on three kinds of substrates are shown in Fig. 3. All loops, including
that of Co/HSS, have been saturated. The loop areas and shapes are very
different for Co/OS, Co/LSS, and Co/HSS. The squareness ($M_r/M_s$) for
Co/LSS and for Co/HSS is degraded from that of Co/OS (≈0.99). A rounded
loop shape was also observed on a clean single crystal Fe(001) surface
50 Å deep after heavy sputtering.

a. Kerr intensity or magnetization. When the loops were measured, the
gain in the photodetector and all experimental conditions were identical
except for the preparation of films. Because the remanent magnetization
is proportional to the Kerr intensity, for films with equivalent
thickness one can compare the relative change in Kerr intensity as the
relative change in magnetization, as shown in Fig. 3. The values of
$M_r$ in arbitrary units are 57, 42, and 8.5 for Co/OS, Co/LSS, and
Co/HSS, respectively. Expressed as ratios, the reduction of
magnetization is 57/42=1.36 for Co/LSS and 57/8.5=6.7 for Co/HSS. From
the average terrace width listed in Table I, one can estimate the step
density, which is highest in the Co/HSS. A higher step density means
there are more edge Co atoms associated with the steps. It is believed
that an edge atom has a lower coordination number and that its magnetic
moment may change from its ideal value. If we take the ratio of step
densities as an indication of the increased number of edge atoms, we can
estimate the ratio of increase in edge atoms. Using the average terrace
widths in Table I, we take the ratio of terrace widths between Co/OS and
Co/LSS and between Co/OS and Co/HSS and then square the ratios, assuming
isotropic terrace widths, to get the factors of increase in step
density. That is, (31.4/25)^2=1.59 for Co/OS vs Co/LSS and
(31.4/12.5)^2=6.3 for Co/OS vs Co/HSS. These ratios are consistent with
the reduction of magnetization from Co/OS to Co/LSS and to Co/HSS.
Experimentally, there is no Kerr intensity for smooth Co films thinner
than 1.3±0.3 ML, i.e., the thickness of a magnetic dead layer. For a
rough Co film, the thickness is not uniform and there exist many edge
and step atoms. In some areas of the film the thickness may be less than
1.3±0.3 ML, so a reduction of magnetization (due to the dead layer) in a
rough film is likely to occur. This result may indicate that the edge
atoms dominate the reduction of magnetization. A reduction in
magnetization is also observed in other epitaxial systems such as
ultrathin Ni films on a rough Cu(001) surface.

b. Coercivity. The values of $H_c$ are about 25, 45, and 160 Oe for
3-ML-thick Co/OS, Co/LSS, and Co/HSS, respectively (see Fig. 3). The
increase of $H_c$ for rougher films grown on LSS and HSS may be due to
the increased number of pinning centers, such as steps, which pin the
magnetic domain walls. A SEMPA study of ultrathin Co films ranging from
3 to 19.5 ML grown on a smooth Cu(001) surface found a multidomain
structure with irregular shapes, in which the domain boundaries are bent
or extremely rough. We believe there exist many more domains in Co/LSS
and Co/HSS than in Co/OS. It has been estimated that for a smooth Co
film, the configurational entropy at 300 K exceeds the wall energy when
the thickness is less than 2 ML. This results in rough domain walls as
the state of lower free energy. The domain walls in the rough film could
be even more irregular because of sputtering-induced roughness. A SMOKE
study of Co on a terraced Cu(1 1 13) surface finds a step-induced
anisotropy favoring magnetization parallel to the steps. Because
sputtering creates a high density of steps with random directions and
height levels in our Cu surface, the magnetization may also be oriented
along random step directions at different height levels. To reverse the
multidomains with irregular domain walls and anisotropic alignment of
magnetization, a larger field is needed. If we take the ratios of the
$H_c$ values of Co/LSS over Co/OS and of Co/HSS over Co/OS, we find
45/25=1.8 and 160/25=6.4, respectively. These ratios are consistent with
the ratios found for the change of step density from OS to LSS (1.6) and
to HSS (6.3) mentioned in the last section.
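
The three sets of ratios compared in the last two subsections can be reproduced with a few lines of arithmetic. The sketch below uses only the numbers quoted in the text; the dictionary and function names are illustrative. Direct evaluation of (31.4/25)^2 gives 1.58 rather than the quoted 1.59, a difference that presumably comes from rounding of the terrace widths.

```python
# Values quoted in the text for Co films on the three substrates.
remanence = {"OS": 57.0, "LSS": 42.0, "HSS": 8.5}      # M_r, arbitrary units
coercivity = {"OS": 25.0, "LSS": 45.0, "HSS": 160.0}   # H_c, Oe
terrace = {"OS": 31.4, "LSS": 25.0, "HSS": 12.5}       # average terrace width, Angstrom

def ratios_vs_smooth(surface):
    """Ratios of a rough-substrate film relative to the Co/OS film."""
    return {
        # step density scales as 1/width; squared for isotropic terraces
        "step_density": (terrace["OS"] / terrace[surface]) ** 2,
        "magnetization_drop": remanence["OS"] / remanence[surface],
        "coercivity_rise": coercivity[surface] / coercivity["OS"],
    }

for surface in ("LSS", "HSS"):
    rounded = {k: round(v, 2) for k, v in ratios_vs_smooth(surface).items()}
    print(surface, rounded)
```

Laying the numbers side by side makes the claimed consistency easy to judge: about 1.6, 1.4, and 1.8 for Co/LSS, and about 6.3, 6.7, and 6.4 for Co/HSS.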

Other heteroepitaxial film studies also indicate that film imperfections
can act as hard pinning centers for domains and walls. In that case the
domain structure is directly determined by the film morphology. Similar
results for the substrate-roughness-dependent coercivity were observed
in other clean ferromagnetic surfaces, in epitaxial systems, and in
multilayer thin films of Co/Pt on glass, Si, Si nitride, etc.

Fig. 4. Hysteresis loop areas as a function of applied magnetic field
frequency for 3.0-, 3.0-, and 5.0-ML-thick Co films grown on OS, LSS,
and HSS Cu(001) surfaces at room temperature, respectively. The β values
obtained from the low frequency scaling regime are 0.66±0.03 and
0.72±0.03 for Co/OS and Co/LSS, respectively. No β value is obtained for
Co/HSS due to noisy data from a rough film. Because the value of the
applied magnetic field $H_0$ depends on the magnitude of the current and
is frequency dependent, we set the field strengths $H_0$ [...] Oe for
Co/OS, Co/LSS, and Co/HSS, respectively. The effective field strength at
a given frequency is [...], where $g(\Omega)$ is the frequency-dependent
field strength calibration factor. The effective field strengths at the
resonance frequency for Co/OS, Co/LSS, and Co/HSS are 49, 74, and 246
Oe, respectively.

2. Frequency Ω-dependent hysteresis loop area

All  the  hysteresis  loop  data  presented  so  far  were  taken  at 
low  frequencies  and  low  fields.  In  the  following  we  will 
present  the  loop  area  over  a  wider  range  of  frequency. 

a. Scaling law in low frequency for smooth Co/OS films. The hysteresis
loop area A as a function of frequency Ω for 3.0-, 3.0-, and
5.0-ML-thick Co films grown on the three substrates OS, LSS, and HSS,
respectively, is shown in Fig. 4. Data for a 5.0-ML-thick Co film are
plotted for HSS because the Kerr intensity at 3.0 ML is too low for
frequency-dependent measurements over a wide range of Ω. The area
increases at low frequencies and reaches a maximum at a resonance
frequency $\Omega_r$. As the frequency increases beyond $\Omega_r$, the
area decreases. This behavior can be understood using mean field
dynamics with a double well barrier as follows: when Ω→∞ but [...],
where $H_t$ is the threshold field, the system's magnetization is unable
to respond to the fast alternating driving field and the magnetization
stays in one state, i.e., the average magnetization
$\langle M(t)\rangle$ takes one of two values of opposite sign. When
Ω→0, the magnetization can be trapped in a metastable state where
$\langle M(t)\rangle\neq 0$ while $H_0<H_t$, if the fluctuations are
suppressed. The system can return to its equilibrium state
$\langle M(t)\rangle=0$ when the field is larger than the threshold
field $H_t$. This results in a finite loop area. When Ω increases, the
magnetization cannot respond to the rapidly varying field, which results
in a delay. Such a delay increases the loop area A, and a scaling
relation $A\propto\Omega^{\beta}$ with β=2/3 is found for $H_0>H_t$ and
[...]. Typical loop shapes at different frequencies have been shown
previously. At low frequencies and low fields, the area of the
hysteresis loops for Co/OS shows scaling behavior. The details have been
published and will not be repeated here.

But is there scaling behavior for Co films grown on LSS or on HSS? We
can apply a fit procedure similar to the one used for Co on OS. We found
that the value of β increases from 0.66 to 0.72 for Co/LSS. For Co/HSS,
the loop area data are noisy and a fit may not be meaningful.
Physically, the domains' reversal barriers are not as uniform as those
of Co/OS, and one cannot use a single double well potential as the
barrier. Therefore, the scaling may not exist, and even if it exists one
needs a more complicated potential barrier to describe the rough films.
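
The fit procedure itself is not spelled out here. One common way to extract a scaling exponent such as β, assumed for this sketch, is a straight-line least-squares fit on log-log axes. The data below are synthetic, generated with an exponent of 2/3, and are not the measured loop areas; the function name is illustrative.

```python
import math

def fit_power_law(x, y):
    """Least-squares fit of y = c * x**p on log-log axes; returns (p, c)."""
    lx = [math.log(v) for v in x]
    ly = [math.log(v) for v in y]
    n = len(lx)
    mean_x = sum(lx) / n
    mean_y = sum(ly) / n
    sxx = sum((a - mean_x) ** 2 for a in lx)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(lx, ly))
    p = sxy / sxx                        # slope = scaling exponent
    c = math.exp(mean_y - p * mean_x)    # intercept = prefactor
    return p, c

# Synthetic (not measured) loop areas: area = 3.0 * frequency**(2/3).
freqs = [0.5, 1.0, 2.0, 4.0, 8.0, 16.0]
areas = [3.0 * f ** (2.0 / 3.0) for f in freqs]
beta, prefactor = fit_power_law(freqs, areas)
```

With noisy data such as the Co/HSS loop areas, the fitted slope depends strongly on which frequency window is treated as the scaling regime, which is one reason a fit there may not be meaningful.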

b. Roughness-dependent resonance frequency $\Omega_r$. It is shown in
Fig. 4 that $\Omega_r$ is the highest for the Co film grown on OS. The
$\Omega_r$ decreases as the film becomes rougher. The resonance
frequency can be identified with the inverse of the characteristic time
for the particle to slide from an unstable maximum to one of the minima
of the effective free energy in a double well potential barrier. The
resonance frequency $\Omega_r$ as a function of film thickness also
shows a decrease due to the buildup of roughness as the film becomes
thicker; this will be published elsewhere. A recent simulation of
submonolayer magnetic islands indicates that the magnetic moments of
edge atoms, which have a lower coordination number, can affect the
response of the entire magnetic system, especially when the surface to
volume ratio is large. The prediction is that for a fixed submonolayer
coverage and fixed total island number density, the $\Omega_r$ increases
monotonically as the fractal character of each island increases. Our
results from ultrathin films shown in Fig. 4 are not consistent with
this prediction. But the scaled area versus frequency plots of Luse and
Zangwill are consistent with the Co/OS and Co/LSS data shown in Fig. 4.
The drastic change in the hysteresis loop shape from the OS and LSS data
to the HSS data suggests that a new physical mechanism is needed in the
HSS case. For the Co/OS film, we observed that the resonance frequency
depends on the applied field strength; this result will be published
elsewhere.


3. Field strength $H_0$-dependent hysteresis loop area

The hysteresis loop area as a function of applied field strength $H_0$
at a fixed frequency (4 Hz) for 3.0-, 3.0-, and 5.0-ML-thick Co films
grown on OS, LSS, and HSS is shown in Fig. 5. The field range over which
the loop area increases gradually extends over about 150 Oe for Co/HSS,
compared with 30 Oe for Co/OS and Co/LSS. The threshold field $H_t$,
where a more rapid increase of area occurs, is rounded between 200 and
400 Oe for Co/HSS. For Co/LSS and Co/OS, the values are narrowed down to
45 and 25 Oe, respectively. Again, this is consistent with the fact that
there exists a wide range of nucleation fields for domain reversals in
rough films.

Fig. 5. Hysteresis loop areas as a function of applied magnetic field
strength for 3.0-, 3.0-, and 5.0-ML-thick Co films grown on OS, LSS, and
HSS Cu(001) surfaces at room temperature, respectively. The α values
obtained from the low field scaling regime are 0.67±0.01 and 1.03±0.02
for Co/OS and Co/LSS, respectively. No α value is obtained for Co/HSS
due to the uncertainty in determining the scaling regime. The data for
Co/LSS and Co/OS are offset by 12 000 and 24 000 arbitrary units from
zero.

a. Scaling law in low field strength for smooth Co/OS films. For
$H_0>H_t$ and [...], it has been observed that the loop area A exhibits
a dynamic scaling law in the low field strength regime, i.e.,
$A\propto H_0^{\alpha}$ with α=0.67±0.01≈2/3, for a smooth Co film on a
Cu(001) surface and for Fe/Au(001). The details can be found elsewhere.
That the α value for $H_0$ is the same as the β value for Ω can be
understood from the result of mean field dynamics, where the hysteresis
loop area is proportional to the product of Ω and $H_0$ with the same
dimension. From the experimental point of view, increasing $H_0$ at a
fixed Ω means increasing the sweep rate $dH_0/dt$. The increase of
$dH_0/dt$ can also be achieved by increasing Ω at a fixed $H_0$.

Judging from the data in Fig. 5, the value of α for Co/LSS and Co/HSS
definitely deviates from 2/3. If we apply a fit to Co/LSS as we did for
Co/OS, we obtain α=1.03±0.02. This change is expected because the
interface roughness changes the dynamic magnetic response of the whole
film.

b. Field strength $H_0$-dependent Kerr intensity $I_k$. From the dynamic
mean field model, it is derived that the loop area is
$A\approx 4I_kH_c$. We obtained the loop area by integrating
$\oint I_k\,dH$ using the digitized data of $I_k$ vs H(t). In Figs. 6(a)
and 6(b) we plot $I_k$ vs $H_0$ and $H_c$ vs $H_0$, respectively. This
allows us to understand the field dependent loop area in more detail.

Figure 6(a) shows $I_k$ vs $H_0$ for Co/OS and Co/LSS. Below 40 Oe, the
Kerr $I_k$ increases rapidly in Co/OS films, indicating that stronger
fields align the magnetic moments in better order. For $H_0>40$ Oe,
$I_k$ is nearly constant up to 160 Oe (slope $dI_k/dH_0=0.0016$) for
Co/OS, which implies that a uniform alignment of magnetic moments in the
domains has been reached. However, for Co/LSS, it takes a larger field
of about 80 Oe to align the majority of the magnetic moments. To align
even more magnetic moments in-plane, a field strength beyond 80 Oe is
needed, as seen from the slight positive slope ($dI_k/dH_0=0.0373$) up
to 240 Oe in the $I_k$ vs $H_0$ plot.

Fig. 6. (a) Kerr intensity $I_k$ as a function of applied magnetic field
strength $H_0$ for 3.0-ML-thick Co films grown on OS and LSS Cu(001)
surfaces at room temperature. In the low field regime, [...] Oe, the
Kerr intensity is nearly a constant for Co/OS, whereas the Kerr
intensity increases for Co/LSS. (b) Coercivity $H_c$ as a function of
applied magnetic field strength $H_0$ for 3.0-ML-thick Co films grown on
OS and LSS Cu(001) surfaces at room temperature. In the low field
regime, the values of the scaling exponent α obtained from fits are 0.68
and 1.02 for Co/OS and Co/LSS, respectively. These values are consistent
with the fit from loop area vs field strength.

The step densities (inverse average terrace widths) listed in Table I
for 3.0-ML-thick Co/OS and Co/LSS are 1/31.4 Å (≈0.03 Å$^{-1}$) and
1/25 Å (=0.04 Å$^{-1}$), respectively. Also, the lateral correlation
lengths are 285 and 190 Å, respectively. These numbers indicate that the
atoms in the Co/LSS film are less correlated than those of the Co/OS
film, although the interface width is about the same (≈0.5t) in both
films. This is seen in Fig. 6(a), where a stronger field is needed to
align the majority of the magnetic moments in the same in-plane
direction, and in Fig. 3, where the lower Kerr intensity is observed.

For the Co/OS film, we notice that the Kerr intensity increases with a
larger slope beyond the scaling regime (40 Oe<$H_0$<160 Oe). This may be
related to a change of domain switching mechanism at higher fields, for
which we do not have an explanation.

c. Field strength $H_0$-dependent coercivity $H_c$. Figure 6(b) shows
the coercivity $H_c$ versus field strength $H_0$ for Co/OS and Co/LSS.
We have learned from Sec. III B 3 a that for Co/OS,
$A\propto H_0^{\alpha}$ with α=0.67±0.01≈2/3. Because
$A\approx 4I_kH_c$ and $I_k$ is nearly constant between 40 and 120 Oe,
we expect $H_c$ to show the scaling behavior $H_c\propto H_0^{\alpha}$
for Co/OS. By fitting the Co/OS data, we obtained α=0.68 [see Fig.
6(b)]. This tells us that the area scaling in the low field regime at a
fixed low frequency is actually the coercivity scaling with $H_0$.

For Co/LSS, we know from Sec. III B 3 b and Fig. 6(a) that $I_k$ vs
$H_0$ is not a constant in the low field regime. In fact, the slope is
positive. Therefore, a fit of $H_c$ vs $H_0$ is not meaningful. If we do
apply a scaling fit similar to that for Co/OS, as shown in Fig. 6(b),
the α value obtained is 1.02.

IV.  CONCLUSION 

SMOKE and HRLEED have been combined to study the hysteresis loops and
nanoscale roughness at the interface of Co and Cu(001) and in the
epitaxial Co films. Three different degrees of roughness in a Cu(001)
substrate were created by Ar$^+$ sputtering. The initial Cu(001)
substrate roughness determines the hysteresis loop shape and area of the
Co films. A severalfold increase of step density and an interface width
of less than 2 ML in a rough Co film grown on a rough Cu(001) substrate
result in a several times higher coercivity with a wide range of
coercive fields, and a severalfold reduced magnetization, as compared
with those of a smooth Co film grown on a smooth Cu(001) substrate.
These changes are related to the pinning of Co magnetic domains by
sputtering-induced steps. For a smooth 3.0-ML-thick Co film, the dynamic
hysteresis loop area as a function of the frequency and of the field
amplitude of a sinusoidal magnetic field shows power law scaling
behavior at low frequencies and low amplitudes. The value of the scaling
exponents (≈2/3) is consistent with the dynamic mean field Ising model
assuming a double well potential barrier. For rougher Co films with the
same interface width but a few percent increase in step density and a
30% decrease in lateral correlation length as compared with a smooth
film, the scaling exponents deviate from 2/3 drastically. These results
demonstrate that stringent structural perfection in ultrathin films down
to the nanoscale may be required when desired magnetic properties of
ultrathin films are to be used in magnetic devices. They also caution us
when experimental data are to be compared with microscopic model
predictions in order to better understand the fundamental magnetic
properties of ultrathin magnetic films.

We set out to compare the ferromagnetic behavior of Fe$_x$Ni$_{1-x}$ and
Co$_x$Ni$_{1-x}$ films of varying stoichiometry epitaxially grown on
Cu(100). The thickness chosen was 5 ML over a wide alloy composition
range. Using a scaling law for the thickness vs Curie temperature, we
extrapolate to the bulk fcc film thickness and plot the bulk fcc Curie
temperature as a function of composition. The results suggest (a) that
the Invar effect is quenched in these ultrathin pseudomorphic fcc films
of Fe$_x$Ni$_{1-x}$ alloys and (b) that the magnetic phase of Fe atoms
is the low-spin ferromagnetic phase. The Co$_x$Ni$_{1-x}$ films show the
expected monotonic decrease in $T_C$ with increasing Ni content. © 1996
American Vacuum Society.


I. INTRODUCTION

Recently, the study of the electronic and magnetic properties of fcc Fe,
Co, and Ni films and their alloys has attracted a lot of attention. The
electronic band structures of pure Cu, Ni, Co, and Fe films with (100)
surface orientation have been described in many publications.
Preliminary measurements of the Curie temperatures of ultrathin
Fe$_x$Ni$_{1-x}$ and Co$_x$Ni$_{1-x}$ alloy films deposited on Cu(100)
have been reported in an earlier publication. These measurements show
that for film thicknesses of a few monolayers, the Curie temperature
increases with increasing Fe or Co content and the thickness dependence
can be expressed by an empirical finite-size scaling formula. Meanwhile,
these ultrathin alloy films with (100) surface orientation have
different magnetic anisotropy behavior. Fe deposited on various
substrates shows a reorientation of the easy axis of magnetization from
perpendicular to in-plane as a function of film thickness (a few
monolayers) and growth temperature (at a fixed film thickness). Ni films
have a transition from parallel to perpendicular magnetization at a
thickness of a few monolayers and a transition back to in-plane at
greater thickness. Pure Co films grown on Cu(100) substrates show an
in-plane magnetic anisotropy for all thicknesses. For Fe$_x$Ni$_{1-x}$
alloys, our measurements show that the anisotropy behavior depends on
the alloy composition, with Fe$_{0.75}$Ni$_{0.25}$ and
Fe$_{0.5}$Ni$_{0.5}$ showing a reorientation of the preferred
magnetization from perpendicular to parallel at 4 ML, while the higher
Ni content alloys Fe$_{0.25}$Ni$_{0.75}$ and Fe$_{0.1}$Ni$_{0.9}$ have
the magnetization lying in-plane for all the film thicknesses in the
range that we have investigated (to be published). On the other hand,
Co$_x$Ni$_{1-x}$ alloys with varying composition all show a preferred
magnetization in plane.

Face-centered-cubic Fe can exist in three different states: high-spin
ferromagnetic, low-spin ferromagnetic, and antiferromagnetic, depending
on the lattice constant. At the Cu lattice constant, the energy
differences between these states become very small, and this results in
a structural instability of Fe/Cu(100) films, which in turn explains the
different magnetic properties of Fe/Cu(100) reported in the literature.
In bulk Fe$_x$Ni$_{1-x}$, there is an Invar effect observed at x=65%,
where the Curie temperature collapses and the structure is a mixture of
fcc and bcc. If we increase the Fe content further, a transition from
fcc to bcc structure occurs. As we go from pure Ni to Co or Fe, the
electron number is reduced. At 8.4 3d electrons we are at the maximum in
the magnetic moment curve. However, the basic physics describing this
behavior is somewhat clouded by the change in crystal structure as we go
from fcc Ni to hcp Co to bcc Fe. The structural change introduces a
difference due to the orbital moment. In addition to the crystal
structure change, the lattice constant of the bulk material also
changes. This has a greater effect because as the lattice constant is
reduced the overlap between the orbitals of adjacent atoms becomes
greater, and this will result in a reduced moment and ultimately produce
a paramagnetic phase. In this article we will show that we have
stabilized an fcc low-spin ferromagnetic Fe phase in the
Fe$_x$Ni$_{1-x}$ alloy films produced on a Cu(100) substrate, and that
the Invar "collapse" effect was not observed in these metastable
ultrathin films, which is confirmed in a separate study.

II.  EXPERIMENT 

In the ultrathin film limit, the higher surface free energy of the
ferromagnetic film versus the Cu substrate prevents the film from
wetting the substrate completely. The study of the initial stages of
growth of these systems has been the subject of numerous publications.
Briefly, for low temperature growth (around 100 K), the mobility of the
surface atoms is reduced enough to eliminate the possibility of Cu atoms
diffusing from step edges and partially covering the film surface. This
has the advantage of completely eliminating segregation, but the film
continuously roughens with increasing thickness, with the roughness
increasing according to a scaling law. For such a film, the thickness is
no longer a well-defined quantity, since areas of the substrate are
covered with islands of varying size and thickness. If the growth
temperature is below the temperature where bulk diffusion becomes
significant while still allowing for surface diffusion, Cu will
segregate onto the film surface from step edges (defect sites) within a
distance defined by the surface diffusion rate and the flux of the
evaporation source. If the flux is high enough, the amount of Cu
covering the film can be reduced to a few percent of a monolayer. It is
our contention that it is more important to have a uniform film than it
is to eliminate Cu segregation near defect sites.

Fig. 1. The oscillations of RHEED intensity during the growth of 5 ML
Fe$_{0.75}$Ni$_{0.25}$ with an electron beam energy of 3.0 kV. They
reveal layer-by-layer growth.

Thin films of Fe, Co, Ni, and Fe-Ni and Co-Ni alloys were produced on a
Cu substrate with (100) surface orientation in ultrahigh vacuum using
molecular beam epitaxy (MBE). The growth temperature was 400 K for
Fe$_x$Ni$_{1-x}$ and 300 K for Co$_x$Ni$_{1-x}$ alloy films, high enough
to produce smooth films while low enough to limit interdiffusion at the
film/substrate interface to the first few layers. The deposition rate
was controlled to be 1 ML/min, with the flux of the sources calibrated
with reflection high energy electron diffraction (RHEED) oscillations.
The films were then cooled to 200 K; the magnetization was measured by a
surface magneto-optic Kerr effect (SMOKE) magnetometer and the electron
band structure was studied by ultraviolet photoemission spectroscopy
(UPS) with 21.2 eV photons. The incident angle was 45° from the surface
normal and the energy distribution curve of the photoemitted electrons
was collected along the surface normal.

Figure 1 shows the oscillations of RHEED intensity of the (0,0) beam during the growth of 5 ML Fe0.75Ni0.25 with an electron beam energy of 3.0 kV. Similar oscillations were observed for all the Fe_xNi_{1-x} and Co_xNi_{1-x} alloy films, which confirms good epitaxial layer-by-layer growth; the relatively low intensity of the first peak indicates a tendency toward initial bilayer growth. In addition, low energy electron diffraction (LEED) on the grown films showed that the surface structure is fcc, which further verifies pseudomorphic growth of these alloy film systems.

Fig. 2. Gaussian fits of the energy distribution curves between −2 and 0 eV (Fermi energy level) with a photon energy of 21.2 eV for both 5 ML Ni and 5 ML Fe0.5Ni0.5 on Cu(100).

III.  RESULTS  AND  DISCUSSION 

The electronic band structures for occupied states of Ni, Co, and Fe films with (100) surface orientation have previously been measured in detail using spin-resolved photoemission and inverse photoemission. The ferromagnetic exchange splittings of the Δ5-symmetry 3d bands are estimated to be ΔE_ex = 0.3, 1.2, and 1.2 eV for Ni(100), Co(100), and Fe(100), respectively. For Fe_xNi_{1-x} and Co_xNi_{1-x} alloys, we expect the 3d band filling and the exchange splitting to vary continuously from that of pure Ni to that of Co or Fe.

At 21.2 eV photon energy in normal emission, we are exploring the transitions about halfway from the Brillouin zone center to the zone edge. Only transitions from initial states of Δ1 or Δ5 symmetry are allowed, with emission from the Δ5 symmetry band approximately three times as intense in our experimental arrangement. In addition, the energy of the minority-spin band near the Fermi energy is most easily extracted, since peak widths increase linearly for transitions farther from E_F.

Figure 2 gives a sample of the Gaussian fits of the energy distribution curves between −2 and 0 eV (Fermi energy level) for 5 ML Ni and Fe0.5Ni0.5 films after the Shirley background intensity has been subtracted from the raw data. In Fig. 3, UPS measurements of 5-ML-thick alloy films of varying composition are presented. The positions of the minority- (down arrows) and majority- (up arrows) spin bands in these fcc structures are indicated. The spectra for the pure films are consistent with other work, with the main peak coming from an initial Δ5 minority-spin state. For Co_xNi_{1-x} alloys, the trend is as expected: the peak energy gradually increases as the Co concentration increases. For the Fe_xNi_{1-x} alloys, a different behavior is observed. At 50% Fe concentration, a peak is observed at −0.9 eV, much lower than in the pure Co film, which has the same number of valence electrons. However, without spin discrimination we cannot identify either the spin character or the symmetry of this transition. For 75% Fe concentration, there are no clearly resolved peaks, suggesting that the minority-spin band is above E_F, as in pure Fe.

Fig. 3. Energy distribution curves measured with ultraviolet photoemission spectroscopy on 5 ML Co_xNi_{1-x} and Fe_xNi_{1-x} alloy films as a function of composition x. The up and down arrows indicate the energies of the majority and minority spin Δ5 bands for the pure films.
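
The Shirley background subtraction applied to the energy distribution curves before Gaussian fitting can be illustrated with a generic iterative scheme. The text does not describe the authors' implementation, so the following is only a minimal sketch: the function name, the convergence settings, and the synthetic spectrum are all assumptions made for illustration.

```python
import math


def shirley_background(energy, intensity, max_iter=50, tol=1e-6):
    """Iterative Shirley background on an ascending energy grid.

    The background at each point is proportional to the background-subtracted
    area at higher energy (toward the Fermi level) and is pinned to the
    measured intensity at both ends of the window.
    """
    n = len(energy)
    i_low, i_high = intensity[0], intensity[-1]
    background = [i_high] * n
    for _ in range(max_iter):
        # Trapezoidal area of (intensity - background) from point i to the
        # high-energy end of the window.
        area = [0.0] * n
        for i in range(n - 2, -1, -1):
            step = energy[i + 1] - energy[i]
            left = intensity[i] - background[i]
            right = intensity[i + 1] - background[i + 1]
            area[i] = area[i + 1] + 0.5 * step * (left + right)
        total = area[0]
        if total == 0:
            break
        updated = [i_high + (i_low - i_high) * a / total for a in area]
        change = max(abs(u - b) for u, b in zip(updated, background))
        background = updated
        if change < tol:
            break
    return background


if __name__ == "__main__":
    # Synthetic spectrum (illustrative only): one Gaussian peak at -0.5 eV
    # on a step-like inelastic background, sampled between -2 and 0 eV.
    energy = [-2.0 + 0.01 * i for i in range(201)]
    intensity = [
        10.0
        + 100.0 * math.exp(-0.5 * ((e + 0.5) / 0.15) ** 2)
        + 2.5 * math.erfc((e + 0.5) / (0.15 * math.sqrt(2.0)))
        for e in energy
    ]
    background = shirley_background(energy, intensity)
    peak_only = [i - b for i, b in zip(intensity, background)]
    print(f"background at -2 eV: {background[0]:.3f}")
    print(f"background at  0 eV: {background[-1]:.3f}")
    print(f"max of subtracted spectrum: {max(peak_only):.2f}")
```

The subtracted spectrum `peak_only` is what a Gaussian fit would then be applied to.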

Our previous measurements of the Curie temperature of thin films have shown that T_c can be expressed by a finite-size empirical scaling law:

[Scaling-law equation relating T_c(n) to T_c(∞); not legible in the source.]

with T_c(n) the Curie temperature for ultrathin films with n ML thickness, and T_c(∞) the bulk Curie temperature. From our previous study of Co_xNi_{1-x} alloys, we found that n' is around 1, which means that the Curie temperature reaches zero for all films of 1 ML thickness, and that k is a constant equal to 1.3. We have kept these parameters constant and used this scaling law to extrapolate the bulk Curie temperature T_c(∞) for the Fe_xNi_{1-x} and Co_xNi_{1-x} alloy systems. In Fig. 4 we have plotted the normalized Curie temperature T_c(n)/T_c(∞) as a function of film thickness for Co_xNi_{1-x} alloy films with varying composition. All the data points for these fcc films indeed follow this scaling-law curve, which intersects the horizontal axis at 1 ML thickness with T_c = 0 K.


Fig. 4. The normalized Curie temperature T_c(n)/T_c(∞) as a function of film thickness for Co_xNi_{1-x} alloy films with varying composition.

In the mean-field approximation, the Curie temperature is proportional to Js(s+1), where J is an effective exchange parameter and s is the spin moment in Bohr magnetons μ_B. The solid lines in Fig. 5 are the plots of bulk Curie temperatures for Co_xNi_{1-x} and Fe_xNi_{1-x} alloys as a function of Co or Fe atomic concentration x that we calculated by using the method proposed by Vonsovskii of correlating the Curie temperature with the interactions between different types of atoms in alloys. The diamond and round data points are from our experimental measurements of the Curie temperature as a function of film thickness, extrapolated to the bulk value using the above scaling law. The round data points are from a recent closer look at concentrations around Fe0.65Ni0.35, where the Invar effect is commonly observed in bulk material. These measurements have confirmed that the structure of these alloys is fcc in this thickness range; there is no structural transition from fcc to bcc as observed in bulk. Comparison of these results with the theoretical results strongly suggests that the Fe atoms in Fe_xNi_{1-x} alloy films exist in the low-spin ferromagnetic phase. In addition, no sudden drop in Curie temperature has been observed at and above 65% Fe concentration, so the Invar effect has been quenched in these thin film systems. Further measurements of thicker Fe_xNi_{1-x} films with around 65% Fe content have suggested a structural relaxation or spin reduction occurring later.

Fig. 5. Bulk Curie temperatures for Co_xNi_{1-x} (left) and Fe_xNi_{1-x} (right) alloys as a function of Co or Fe atomic concentration x. For Fe_xNi_{1-x} alloys, Fe atoms can exist in three phases: high-spin ferromagnetically coupled Fe (HS FM Fe), low-spin ferromagnetically coupled Fe (LS FM Fe), and antiferromagnetically coupled Fe (AFM Fe). The diamond data points are our experimental results extrapolated from the thin film Curie temperature measurements. The round points are from a closer look at concentrations near the Invar composition, Fe0.65Ni0.35. Our data show that Fe is in the low-spin ferromagnetic phase in all the alloy films we produced.

IV.  SUMMARY 

By tuning the elemental composition, we are able to achieve Fe_xNi_{1-x} and Co_xNi_{1-x} films with varying electron number in random substitutional alloys of 3d ferromagnets. Our combined study of growth, structure, electron band structure, and magnetic measurements has confirmed fcc pseudomorphic growth of these alloy films in the ultrathin thickness range. The Fe atoms in these metastable Fe_xNi_{1-x} alloys appear to exist in the low-spin ferromagnetic phase. In addition, no structural transition from fcc to bcc is observed, and the Invar “collapse” effect appears to have been quenched in these ultrathin films near 65% Fe concentration.

The structure and composition of Fe films grown on As-terminated GaAs(001)-2×4 surfaces at 175 °C have been studied in situ with scanning tunneling microscopy (STM), photoelectron diffraction (PED), and x-ray photoelectron spectroscopy (XPS). The GaAs surfaces were prepared by molecular beam epitaxy (MBE) and exhibited large atomically well-ordered terraces. We find that the 2×4 reconstruction has a significant impact on the Fe nucleation and growth, with initial nucleation occurring at As-dimer sites. STM reveals that the first half-monolayer of Fe forms small two-dimensional islands along the As-dimer rows before growing onto the adjacent Ga-rich rows, with no evidence of substrate disruption. PED indicates that the growth is predominantly layer by layer, with the growth front for the nth deposited layer limited to the (n+1)th layer. XPS spectra show that the Fe films include a concentration gradient of Ga and As out-diffused from the interface, with some of the As segregating to the Fe surface, similar to previous results obtained for growth on non-MBE-prepared GaAs surfaces. Possible mechanisms for the film growth and the origins of the intermixing are discussed. © 1996 American Vacuum Society.


I.  INTRODUCTION 

The behavior of magnetic thin films on semiconductor substrates is of great practical interest due to the potential for integrating such films into novel semiconductor heterostructure devices. For very thin films, the magnetic and electronic properties of the heterostructure may depend strongly on the nature of the metal/semiconductor interface. This dependence can arise from the atomic-scale structure of the interface, the occurrence of interdiffusion, or the nature of the initial film growth. Before magnetic heterostructure devices can be widely realized, the characteristics of the metal/semiconductor interface must be well understood.

One magnetic system of special interest is Fe on GaAs, which is considered to have one of the most well-ordered and abrupt metal/semiconductor interfaces. Fe grows epitaxially (bcc) on GaAs (110) and (001), with the film crystallographic axes coincident with those of the substrate, due in part to the fact that the bcc Fe lattice constant is approximately half that of zinc-blende GaAs (2a_Fe/a_GaAs = 1.013). Although the films are epitaxial, studies of Fe deposition on both cleaved GaAs(110) and sputter-annealed (001)-c(8×2) surfaces have found the growth to be three-dimensional (3D) and observed Ga and As intermixed in the first ~50 Å of Fe. Magnetic measurements of Fe films on GaAs(001) revealed anomalous behavior, with films as thick as 200 Å exhibiting substantially lower magnetization than expected from bulk properties. Furthermore, such films often have an in-plane uniaxial component to the magnetic anisotropy, although an ideal bcc Fe(001) film should have fourfold symmetry.

One factor that may contribute to the magnetic anisotropy is the substrate surface reconstruction, which is thought to influence the Fe nucleation and growth; no studies have yet focused on this effect, however. A more complete understanding of the magnetic behavior requires a detailed description of both the physical and compositional structure of the Fe films. In this article we describe the structure and composition of Fe films grown on the As-terminated GaAs(001)-2×4 surface, which is the most commonly utilized surface for the growth of compound semiconductor device structures. The samples were prepared by molecular beam epitaxy (MBE) and characterized in situ with scanning tunneling microscopy (STM), photoelectron diffraction (PED), and x-ray photoelectron spectroscopy (XPS). We find that the 2×4 surface reconstruction has a significant impact on the Fe nucleation and film growth, with initial nucleation occurring at As-dimer sites.

II.  EXPERIMENT 

The experiments were performed in a multichamber ultrahigh vacuum (UHV) molecular beam epitaxy and surface analysis facility consisting of two MBE chambers, a PED/XPS chamber, and an STM chamber, all connected via a UHV sample transfer system. The GaAs(001) surface was prepared via homoepitaxial MBE growth on a silicon-doped substrate oriented within 0.1° of (001). After growing a 1-μm-thick buffer layer of Si-doped GaAs (n ~ 10^[?]/cm^3) at 580 °C, a 300 Å layer of undoped GaAs was grown at a reduced rate combined with growth interrupts to eliminate Si contamination of the surface and produce large well-ordered terraces. The substrate was then cooled to 550 °C under an As flux from a valved cracker source, at which point the As source was both valved and shuttered. The sample was cooled further to 200 °C under no flux prior to transfer to another chamber. This procedure consistently produced an atomically well-ordered, As-dimer terminated (2×4)-reconstructed surface as indicated by reflection high energy electron diffraction (RHEED) and confirmed by STM.

Fe deposition was performed primarily in a second MBE chamber using a high-temperature Knudsen cell source at a rate of 0.05 monolayer/s. We define 1 monolayer (ML) ≡ 1.22×10^15 atoms/cm^2, corresponding to the atomic density of the bcc Fe(001) plane, which is approximately twice the atomic density of the bulk terminated GaAs(001) surface (and would produce a film ~1.4 Å thick if deposited uniformly). For the XPS core level measurements, the evaporations were performed in the PED/XPS chamber at a rate of 0.025 ML/s with an electron-beam heated rod evaporator. The substrate temperature was 175 °C for all Fe depositions. The Fe deposition sources were regularly calibrated by x-ray fluorescence measurements performed on thick films (~50 Å). Relative Fe coverages were also confirmed by XPS.

Fig. 1. (a) Filled state STM image (3 V) of 0.1 ML Fe on GaAs(001)-2×4. Image size 380 Å × 380 Å. A v-shaped step edge, 2.8 Å high, is visible in the lower part of the image. (b) Filled state image (3 V) of 0.5 ML Fe. The surface corrugation is 1-2 Å. (c) Model of the 2×4 reconstructed surface, and the bcc (001) surface.
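
The monolayer definition above can be cross-checked from lattice constants. The following is a minimal sketch, assuming standard room-temperature values of 2.866 Å for bcc Fe and 5.653 Å for GaAs; neither value is quoted in the text, and the function names are illustrative.

```python
# Cross-check of the monolayer definition from lattice constants.
# Both lattice constants are assumed textbook values, not taken from the text.
A_FE = 2.866      # bcc Fe lattice constant, angstrom (assumed)
A_GAAS = 5.653    # zinc-blende GaAs lattice constant, angstrom (assumed)
ANGSTROM_TO_CM = 1e-8


def fe001_density():
    """Atoms/cm^2 in a bcc Fe(001) plane: one atom per a x a square."""
    return 1.0 / (A_FE * ANGSTROM_TO_CM) ** 2


def gaas001_density():
    """Atoms/cm^2 in a bulk-terminated GaAs(001) plane: two atoms per a x a square."""
    return 2.0 / (A_GAAS * ANGSTROM_TO_CM) ** 2


def fe_layer_spacing():
    """Spacing between adjacent bcc Fe(001) planes, in angstrom."""
    return A_FE / 2.0


if __name__ == "__main__":
    print(f"Fe(001) plane density  : {fe001_density():.3e} atoms/cm^2")
    print(f"GaAs(001) plane density: {gaas001_density():.3e} atoms/cm^2")
    print(f"density ratio Fe/GaAs  : {fe001_density() / gaas001_density():.2f}")
    print(f"thickness per Fe ML    : {fe_layer_spacing():.2f} angstrom")
```

With these assumed inputs, the Fe(001) plane density, the roughly twofold ratio to the GaAs(001) plane, and the thickness per monolayer agree with the values quoted in the text.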

After Fe growth the sample was cooled to room temperature and transferred to one of the other chambers for study. All data were collected with the sample at or slightly above room temperature. PED and XPS spectra were recorded using a monochromatized Al Kα source and a 120 mm hemispherical analyzer with an angular resolution of ±3° (PED) and ±15° (XPS). Sample motion was computer-controlled via a stepper motor-driven two-axis goniometer, with the angle between the x-ray source and analyzer fixed at 70.8°. High resolution core level spectra (ΔE = 0.6 eV) were recorded at an emission angle of 15° from the surface to maximize surface sensitivity. STM images were acquired with a constant current of 0.1 nA and a sample bias of −3.0 V and are displayed in gray scale.


III.  RESULTS  AND  DISCUSSION 

A filled-state STM image of a surface covered with 0.1 ML of Fe, the lowest coverage examined, is displayed in Fig. 1(a). At this coverage most of the surface is unperturbed GaAs(001)-2×4, with bright rows oriented along the [-110] direction (spaced 16 Å apart) and a small corrugation visible along each row (every 8 Å). As illustrated in Fig. 1(c), the structure of this surface is believed to consist of rows of As-dimer pairs separated by Ga-terminated rows (where the As-dimer pairs are missing); the bright rows observed in the STM image are associated with the rows of As dimers. The deposited Fe gives rise to the bright “bumps” that occur almost exclusively on the As-dimer rows, typically covering one dimer pair. The association of these features with Fe (and not surface defects, for instance) was confirmed by multibias imaging: whereas the Fe-related features become much more prominent at lower bias voltages, the appearance of typical defects observed on an Fe-free surface has little bias dependence. These results indicate that the Fe adatoms preferentially adsorb on the As-dimer rows, forming 2D islands with an average diameter of 8 Å. It is important to note that the 2×4 surface reconstruction remains intact on the sites adjacent to the Fe islands, demonstrating that any disruption of this surface is limited to atomic-scale reactions between the Fe and As dimers in the immediate 2×4 unit cell.

The STM image obtained after 0.5 ML of Fe deposition is shown in Fig. 1(b). Although the surface is almost completely covered by 2D Fe islands, the rowlike structure of the (2×4) substrate reconstruction is still evident in the general alignment of the islands: the islands are elongated along the [110] direction with an average size of 15 Å by 25 Å as determined from the 2D autocorrelation function. The islands are now typically wider than the As-dimer rows, indicating that Fe adsorption, initially confined to the dimer rows, has now taken place on the missing dimer (Ga-terminated) rows. The fact that the islands appear flat suggests that they are at least two Fe layers thick in these Ga-terminated areas. Although most Fe islands are approximately the same height, a small fraction of them appear noticeably higher, consistent with some additional Fe growth on the predominantly flat islands.

Whereas STM provides an excellent view of the atomic-scale surface topography following Fe deposition, PED and XPS provide complementary information regarding the structure and composition within the Fe overlayers. The integrated Fe 2p emission intensity as a function of polar emission angle for a range of Fe coverages between 0.5 and 31 ML is shown in Fig. 2. At this kinetic energy (772 eV) the photoelectrons are in the forward scattering regime, so that intensity maxima are expected along the directions of near-neighbor atoms (i.e., along low-index directions) [Fig. 2(c)]. In particular, the low-index directions within bulklike crystals produce well-defined peaks. Each peak in the PED spectrum for the 31-ML-thick Fe film, for example, can be identified with a low-index direction within a bcc crystal, as labeled in Fig. 2.

Since forward scattering can be observed only when atoms are above the plane of the emitting atom, the information provided by PED can be utilized to determine early growth morphology. If the angle-dependent emission of an adsorbate is measured for a series of coverages, the onset of forward-scattering peaks at angles which characterize the structure signals the occupation of second, third, or fourth layer sites. A single atomically flat monolayer, for example, would produce no forward-scattering peaks, while 2 ML of layer-by-layer bcc growth would result in a single peak in the [111] direction (for the [110] azimuth). If the absolute growth rate is known, multilayer growth or clustering can be identified during deposition of the first few monolayers by the occurrence of forward-scattering peaks corresponding to the occupation of (n+1)th layer sites well before completion of the nth monolayer.
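
The correspondence between forward-scattering peaks and layer occupation follows from bcc geometry. The following is a minimal sketch, assuming an ideal unrelaxed bcc (001) film and simple single forward scattering; the function names are illustrative and not from the text.

```python
import math


def polar_angle_deg(direction):
    """Angle between a [hkl] direction and the [001] surface normal, in degrees."""
    h, k, l = direction
    return math.degrees(math.acos(l / math.sqrt(h * h + k * k + l * l)))


def layers_required(direction):
    """Minimum number of bcc (001) layers giving a forward-scattering peak along [hkl].

    In units of a/2, bcc lattice vectors have integer components that are all
    even or all odd. The nearest scatterer along [hkl] is the shortest such
    vector m*(h, k, l); it lies m*l layers above the emitter, so m*l + 1
    layers (including the emitter's own layer) must be occupied.
    """
    h, k, l = direction
    if l <= 0:
        raise ValueError("direction must point out of the surface (l > 0)")
    g = math.gcd(math.gcd(abs(h), abs(k)), l)
    h, k, l = h // g, k // g, l // g
    for m in (1, 2):
        if len({(m * c) % 2 for c in (h, k, l)}) == 1:
            return m * l + 1


if __name__ == "__main__":
    # [111], [113], [112] lie in the [110] azimuth; [101], [102] in the [100]
    # azimuth; [001] is the surface normal and appears in both.
    for d in [(1, 1, 1), (1, 0, 1), (0, 0, 1), (1, 1, 3), (1, 1, 2), (1, 0, 2)]:
        label = "[" + "".join(str(c) for c in d) + "]"
        print(f"{label}: polar angle {polar_angle_deg(d):5.1f} deg, "
              f"needs {layers_required(d)} layers")
```

In this idealized picture the [111] peak requires two layers, [101] and [001] three, [113] four, and [112] and [102] five.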

The polar scan along the [110] azimuth for 0.5 ML of Fe on GaAs(001)-2×4 clearly shows a maximum in the [111] direction [Fig. 2(a)], indicating that some second layer sites are already occupied. Although the PED spectra do not reveal the identity of the second layer scatterers (e.g., Fe versus As), spectra along the [110] and [-110] azimuths are equivalent, demonstrating that the occupied sites have a bcc-like structure. (Note that this would not be the case if the emitting Fe atoms were substitutionally embedded in the zinc-blende substrate.) These data indicate that the Fe adatoms are bonding either above or within the surface plane of the As dimers, with minimal disruption of the substrate surface, concurrent with some occupation of second Fe monolayer sites before completion of the first layer. These results are consistent with the STM results, which indicated that some bilayer Fe growth may occur both between the As-dimer rows and on top of the initial 2D islands. Note that within the resolution of the PED spectra, no evidence for third layer occupation is yet observed.

Fig. 2. (a) Fe 2p photoelectron polar diffraction spectra in the [110] azimuth (electron kinetic energy = 772 eV) as a function of Fe coverage on GaAs(001)-2×4. The spectra are vertically offset for clarity. (b) Spectra in the [100] azimuth. (c) An illustration of forward-scattering conditions in the [110] and [100] projections for a multilayer bcc (001) film. Note that the observation of specific forward-scattering peaks can be correlated with occupation of particular layer sites within the film.

The PED data obtained at a coverage of 1.25 ML exhibit a strong peak along [111] for scans in the [110] azimuth, and weaker peaks along [101] in the [100] azimuth and along [001] for polar scans in both azimuths (Fig. 2). The [111] peak signals the expected occupation of second layer sites, whereas the appearance of the [101] peak suggests occupation of some third layer bcc-like sites. Although the weak [001] peak is also indicative of third layer occupation in a simple forward-scattering picture, single-scattering calculations show that this peak has contributions from higher order diffraction effects associated with the first two layers. Since this [001] peak appears together with the [101] peak, we interpret it here as further evidence for slight third layer site occupation.

Fig. 3. XPS spectra of As and Ga 3d core levels as a function of Fe coverage on GaAs(001)-2×4. Each spectrum has been normalized to its peak intensity and vertically offset for clarity. Core levels shown for clean GaAs have been shifted 0.15 eV to lower binding energy to compensate for band bending.

Fig. 4. Normalized Ga and As 3d core level intensities as a function of Fe coverage. The attenuation of the substrate levels expected for ideal Fe layer-by-layer growth (without interdiffusion) is shown for comparison as the solid line, calculated for an emission angle of 15° from the surface and an electron mean free path of 15 Å.

As the third equivalent monolayer of Fe is deposited, the PED polar scans begin to exhibit most of the features typical of bulklike films, as can be seen by comparing the 2.5 and 31 ML data of Fig. 2. For 2.5 ML, peaks along [001] and [101] are pronounced due to the formation of the third layer, as expected, and some fourth layer site occupation is indicated by the appearance of a peak in the [113] direction. However, the absence of peaks in the [112] and [102] directions implies that there is no significant fifth layer site occupation during growth of the third ML. We thus observe that Fe grows in a predominantly layer-by-layer fashion, with the growth front for the nth deposited layer limited to the (n+1)th layer. Although sites in the (n+1)th layer are occupied before the nth layer is completed, extreme 3D growth such as that observed on the Fe/GaAs(110) system can be ruled out for Fe/GaAs(100)-2×4 from our PED and STM results. For coverages ≲3 ML, the PED and STM data generally show the formation of predominantly flat islands with some multilayer growth occurring, but little or no tendency towards 3D clustering or significant disruption of the substrate.

The elemental composition of the Fe films is elucidated by XPS measurements as a function of Fe coverage. As shown in Figs. 3 and 4, both Ga and As can be detected even for the highest coverages studied (41 ML). Since the STM data show the GaAs surface to be completely covered by a uniform Fe overlayer after the first few monolayers and the mean free path for the As and Ga 3d photoelectrons is ~15 Å, we attribute the persistence of these signals to out-diffusion of Ga and As from the interface. The nature of these out-diffused atoms is revealed somewhat by the 3d core level spectra, which evolve considerably with increasing Fe coverage (Fig. 3). Two regimes are evident: (i) coverages less than ~2 ML, where the levels show little change; and (ii) higher coverages, where distinctly shifted core levels appear. In the first regime, which we associate with the bonding of Fe to the GaAs surface, the As level broadens slightly with little change in binding energy, and the Ga level shows a slight shift in spectral weight towards lower binding energy. If this behavior is interpreted simply in terms of charge transfer accompanying bonding with the Fe adatoms, it reflects an increase in charge density on the Ga atoms but little change on the As. These features are consistent with interfacial bonding between Fe and As and concomitant debonding between As and Ga. For coverages of a few monolayers, the core levels give no indication of Fe-induced substrate surface disruption, consistent with the STM and PED results.

As the Fe coverage is increased (>2 ML), a second regime of behavior is observed in which the substratelike Ga and As 3d features are replaced by peaks at lower binding energy that show a considerable evolution in character with coverage. Within the resolution of our experiment, at least one new peak is observed for both the As and Ga levels, as seen in the spectra for a 6-ML-thick film. While the new As 3d feature is relatively constant in binding energy as a function of coverage, the Ga 3d spectra exhibit a more complex coverage dependence, with the new peak(s) shifting continuously to lower binding energy. For a 41-ML-thick Fe film, the net shift is 0.45 eV for the new As 3d doublet and 1.15 eV for the Ga, and the peaks associated with bulk GaAs are no longer observed. (The signal-to-noise ratio of the spectra is poor at higher coverages, and the small-amplitude oscillations at higher binding energy are residues of the smoothing and background subtraction and are not considered significant.) The abrupt change in character of the 3d levels suggests that both Ga and As atoms are being displaced from the interface after the formation of a few monolayers of Fe, i.e., after the Fe film has assumed a metallic character. Similar core level shifts observed for Fe on GaAs(110) have been associated with Ga and As atoms multiply coordinated with Fe atoms in an increasingly metallic environment.

The persistence of As and Ga 3d signals for thicker Fe films is illustrated in Fig. 4. The reduced intensity, defined as ln[I(t)/I(0)] where t is the Fe film thickness, is plotted for the As 3d and Ga 3d levels as a function of Fe coverage. The behavior expected for layer-by-layer growth without interdiffusion is included for comparison as the solid line. At coverages up to 2 ML, the experimental data are in reasonable agreement with this ideal growth. At coverages above 2 ML, however, the data depart significantly from this ideal behavior due to As and Ga out-diffusion. Although the Ga intensity continues to decrease monotonically, it does so much more slowly than expected. The As signal also initially decreases, but then plateaus at ~10% of its initial intensity, suggestive of As surface segregation. This behavior is qualitatively similar to that observed previously and demonstrates that while Ga out-diffusion and incorporation is limited to the interface region, some As continuously segregates to the Fe surface.
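
The ideal attenuation used for comparison can be estimated with a simple exponential model. The following is a minimal sketch, assuming a continuum approximation ln[I(t)/I(0)] = −t/(λ sin θ) with the mean free path and emission angle quoted in the Fig. 4 caption and ~1.4 Å per monolayer; the authors' calculation may instead treat discrete layers, and the function name is illustrative.

```python
import math

MEAN_FREE_PATH = 15.0   # angstrom, as quoted in the Fig. 4 caption
EMISSION_ANGLE = 15.0   # degrees from the surface, as quoted in the Fig. 4 caption
ML_THICKNESS = 1.4      # angstrom per Fe monolayer (approximate value from Sec. II)


def reduced_intensity(coverage_ml):
    """ln[I(t)/I(0)] of a substrate level under a uniform, non-intermixed Fe film.

    Continuum approximation (an assumption): the signal decays exponentially
    with the path length through the overlayer, t / sin(theta).
    """
    thickness = coverage_ml * ML_THICKNESS
    escape_depth = MEAN_FREE_PATH * math.sin(math.radians(EMISSION_ANGLE))
    return -thickness / escape_depth


if __name__ == "__main__":
    for coverage in (0.5, 1.25, 2.0, 6.0, 41.0):
        print(f"{coverage:5.2f} ML: ln[I/I0] = {reduced_intensity(coverage):7.2f}")
    # For comparison, a plateau at ~10% of the initial intensity corresponds to:
    print(f"10% plateau: ln[I/I0] = {math.log(0.10):.2f}")
```

Under these assumptions the substrate signal from a buried, non-intermixed interface would be negligible at 41 ML, which is why the persistent Ga and As intensities point to out-diffusion.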

The exact location of the out-diffused As atoms is expected to play an important role in understanding the modified magnetic behavior of the Fe film. Chambers et al. proposed an interstitial face-centered As site within the Fe film based on an anomalous feature in electron-stimulated Fe LMM Auger electron diffraction spectra from bcc Fe films grown on a sputter-annealed GaAs(001)-c(8×2) surface. To determine the sites of the out-diffused As in our samples, we have performed polar scans of the intensity of the photon-stimulated As L3M45M45 Auger transition (kinetic energy of 1225 eV) as shown in Fig. 5. It is important to consider contributions to the emission from substrate As atoms, which are significant at near-normal angles. For polar angles >45°, however, we believe that substrate contributions become small relative to near-surface As emission. The features in both azimuths at large polar angles suggest significant As occupation of subsurface bcc sites [see Fig. 2(c)]. The overall increase in intensity at larger polar angles suggests that some of the As is also adsorbed on the surface of the Fe film, as observed for growth on the c(8×2) surface. We observe no indication of an fcc interstitial As site.

The observed out-diffusion of Ga and As at higher coverages
must be reconciled with the fact that the STM images
show no evidence of surface disruption during the initial
nucleation of the Fe overlayer. Although the STM images
provide little insight regarding the composition of the
quasi-ordered 2D islands observed along the As-dimer rows, one
would expect any exchange reactions with the surface atoms
to result in a more disordered surface topography than is
observed. In addition, the core level data indicate that
significant disruption of the substrate surface occurs only after
deposition of 2-3 ML of Fe. A possible scenario that
accounts for the origin of the out-diffused Ga and As and the
corresponding driving mechanism is as follows. We
speculate that the Fe would ideally bond to an
As-terminated bulklike (1×1) surface. One pathway to such a
surface on GaAs(001)-2×4 is for the Fe to initially bond to
the As-dimer rows and then, with increasing coverage, to
bond to less favorable Ga atoms between the dimer rows (in
the second GaAs layer). As the film grows, Fe at some point
(~2 ML) displaces the Ga atoms in this substrate layer,
including those below the As-dimer rows. This results in an
Fe/As interface and frees one layer of Ga and a half layer of
As (from the dimer rows) for out-diffusion into the growing
Fe film. Given that the onset of out-diffusion appears to be
correlated with the onset of metallicity, we further speculate
that the displacement of the second layer (substrate) Ga
atoms and first layer As is driven by the accompanying change
in chemical potential at the interface. The liberated Ga and
As atoms are incorporated within the growing film, while a
fraction of the As segregates to the surface.

Fig. 5. Photoelectron diffraction spectra of the photon-stimulated As
L3M45M45 Auger transition (kinetic energy = 1225 eV) for the [110] and
[100] azimuths. Note that the spectra have many of the features
characteristic of a bcc structure [see Fig. 2(c)].


IV.  CONCLUSIONS 

We have studied the nucleation and growth of Fe on
MBE-grown GaAs(001)-2×4 at 175 °C using in situ STM,
PED, and XPS. STM reveals that Fe initially forms small 2D
islands along the As-dimer rows before growing onto the
adjacent Ga-rich rows; the images show no evidence of
substrate disruption. PED shows that the Fe grows in a
predominantly layer-by-layer mode, with the growth front for the nth
deposited layer limited to the (n+1)th layer. As previously
observed for Fe grown on GaAs(001)-c(8×2) and
GaAs(110), the films include a gradient of Ga and As
concentrations, with some of the As segregating to the Fe
surface. We speculate that As in the (2×4) dimer rows and the
second substrate layer Ga are displaced by the Fe in order to
make an Fe/As interface, and that this disruption is driven by
a chemical potential gradient created when the Fe film
becomes metallic with increasing thickness.
The Mn 3s multiplet splitting for a series of Mn overlayers on Ru(001) has been measured with
synchrotron radiation photoemission spectroscopy. These overlayers were prepared by vapor
deposition of Mn onto a Ru(001) substrate at a monolayer/minute dosing rate at room temperature.
Under these conditions, the Mn overlayers grow pseudomorphically to multilayer coverages. It is
found that this system exhibits a noticeable increase in the Mn 3s multiplet splitting and a positive
binding energy shift as the coverage increases from monolayer to multilayer (12 ML). These results
are attributed to the increase in the atomic volume, and hence the local magnetic moment at the Mn
site on the average, with increasing coverage. © 1996 American Vacuum Society.


I.  INTRODUCTION 

Fabrication  and  characterization  of  magnetic  thin  films 
have  attracted  considerable  attention  in  recent  years  partly 
because  of  the  need  to  develop  better  materials  for  magnetic 
memory  and  related  technology  and  partly  because  of  the 
development  of  new  and  better  tools,  such  as  ultrahigh 
vacuum  (UHV)  surface  science  techniques  and  synchrotron 
radiation.  These  developments  have  led  to  more  precise  thin 
film  engineering  and  characterization. 

Mn, and its neighboring 3d transition metals, continue to
be the subject of investigation, since these metals possess
unpaired electrons in their 3d shells and their magnetic thin
films can be prepared under well-defined conditions using
modern surface science technology. The electronic structure
of Mn on Ru(001) has been investigated in several
studies.

It has been generally recognized that at modest deposition
rates (~1/2 ML/min), Mn grows pseudomorphically on
Ru(001) at room temperature up to ~3 ML. It exhibits a
(1×1) low energy electron diffraction (LEED) pattern
indistinguishable from that of clean Ru except for the less intense
diffraction spots. Above this coverage, Mn forms a
(√3×√3)R30° LEED pattern that has been associated with the
Laves phase. The √3 structure can grow to multilayers
depending on the conditions, and at higher coverage, the Mn
overlayer becomes disordered. Both the (√3×√3)R30° and
the disordered structure are metastable; upon annealing, Mn
aggregates to form three-dimensional (3D) islands exposing
a large area of Ru surface. More recently, we reported that
at a higher dosing rate (~1 ML/min), the (1×1) pseudomorphic
structure persists at multilayer coverages at room
temperature.

The pseudomorphic growth of Mn on Ru has been very
interesting in that Mn must adopt a structure with a Mn-Mn
distance of 2.65 Å. This is considerably larger than the
closest Mn-Mn distance of 2.24 Å in α-Mn. Thus,
pseudomorphic Mn on Ru has an expanded structure in which the
atomic volume of Mn increases. This behavior may lead to
a higher magnetic moment at the Mn site compared to its bulk
or close-packed counterparts (such as the Laves phase).
Considerable work has been done along this direction. The
electronic structure and chemical properties of this system
have also been investigated.

One of the methods for measuring the magnetic moment
at the transition metal site is conventional XPS (x-ray
photoelectron spectroscopy), in which the multiplet splitting of
the 3p and 3s levels of the 3d transition elements, for
example, is investigated. The Mn 3s multiplet splitting
arises from the exchange interaction in the final state
involving the 3s and the 3d electrons (no unpaired 3d electron, no
multiplet splitting), and the 3s spectrum appears as a doublet
with predominantly majority (↑) and minority (↓)
photoelectron spin components. The magnitude of the splitting in the
case of the 3s level of 3d transition metals, for example, is
proportional to the number of unpaired electrons in the 3d
orbital (multiplicity) and the exchange interaction between
the 3s and the 3d electrons. The origin of the splitting in
terms of exchange interaction has been confirmed by spin
polarized photoemission studies, although quantitative
agreement is not always obtained because (i) the
exchange interaction calculated from the atomic model
overestimates the splitting and (ii) configuration interactions
lead to satellite structures. Despite these complications,
the 3s multiplet splitting is still very useful for the
elucidation of the systematics of the magnetic moment. In general,
the larger the splitting, the larger the magnetic moment. More
recently, the magnetic properties of transition elements have
also been investigated with x-ray absorption magnetic
circular dichroism measurements at the 3d transition metal L3,2
edges.

In this article, we report photoemission measurements of
the Mn 3s multiplet splitting for a series of Mn overlayers on
Ru(001). In comparison with previous studies, our results
have the following new features: (i) the Mn overlayers were
prepared with a higher dosing rate (~1 ML/min) that allows
for stepwise preparation of pseudomorphic films up to 12
ML thick; (ii) the photoemission experiments were carried
out with optimized photon energy (hence, cross section) and
with energy resolution better than conventional XPS. The
experimental procedure is described in Sec. II, followed by
results and discussion in Sec. III, and summary and
conclusions in Sec. IV.

Fig. 1. Mn 3s photoelectron spectrum of Mn on Ru(001) with 180 eV
photon energy: (a) 0 ML, pure Ru(001); (b) 1; (c) 2; (d) 4; (e) 6; (f) 8; (g)
10; (h) 12 ML.

II.  EXPERIMENTAL  PROCEDURE 

Photoemission measurements were carried out at the U7
beamline of the National Synchrotron Light Source (NSLS)
using a plane grating monochromator. This beamline is
equipped with a UHV chamber that has angle-integrated
capabilities in a VSW 100 mm hemispherical electron energy
analyzer. Measurements were carried out at 180 eV photon
energy with the analyzer operating at 25 eV pass energy. The
overall experimental resolution was ~0.4 eV. The Ru(001)
crystal was positioned at 45° relative to the incoming beam
and the axis of the analyzer. Mn overlayers on Ru(001) were
prepared in situ with a temperature-controlled doser
(ceramic-coated tungsten basket) operating at 700 °C, with the
Ru crystal at room temperature. The dosing rate under these
conditions was ~1 ML/min as calibrated from thermal
desorption measurements on the same Ru(001) substrate. The
Mn overlayers were prepared sequentially in steps of 2 ML
at a time after the initial depositions of the first two
monolayers. The sample preparation procedure has been discussed
elsewhere.
Fig. 2. Close inspection of 2, 4, and 6 ML spectra showing the asymmetric
broadening. The positions for the predominantly majority (↑) and minority
(↓) photoelectron spin components are noted.


III.  RESULTS  AND  DISCUSSION 

Figure 1 shows the photoemission spectra for a series of
coverages from 1 to 12 ML of Mn on Ru(001). Independent
experiments carried out in a separate chamber under identical
conditions indicate that at 1 ML/min, the Mn films exhibit a
(1×1) LEED pattern for the range of coverages studied here.

Several features from Fig. 1 are immediately apparent.
First, there are three peaks at monolayer coverage, and at
least four, perhaps five, peaks at higher coverages. The
three-peak feature at monolayer coverage corresponds to Ru 4s
(calibrated to 75.0 eV) and the multiplet split Mn 3s with the
minority (↓) and majority (↑) spin peaks at binding energies of
~82.6 and ~87.3 eV, respectively. As expected, the Mn 3s
peaks become more intense relative to the Ru 4s signal as
the Mn coverage increases. Second, there appears to be a
broad peak at ~66 eV that also becomes more intense with
increased Mn coverage. This peak is one of the satellite
peaks originating from the photoemission of the Mn 3p
level; it was too weak to be detected at low coverage.
Third, the Mn 3s doublet exhibits a noticeable increase in
the apparent splitting as the coverage increases; this is
accompanied by a positive binding energy shift.

Close inspection of Fig. 1 reveals the following: (i) the 1
and 2 ML spectra have well-defined doublets with the same
multiplet splitting; (ii) the 4 and 6 ML spectra exhibit a
noticeable asymmetric broadening (to higher binding energy) in
both components; the higher energy component is poorly
defined; and (iii) the 8, 10, and 12 ML spectra exhibit similar
doublets with the largest splitting. The components are
slightly broader than those for lower coverage and are shifted
to higher binding energy. Figure 2 shows the evolution of the
Mn 3s multiplets from 2 to 6 ML. It can be seen that the
doublets broaden and shift progressively towards higher
binding energy as coverage increases. The broadening is
attributed to surface inhomogeneity due to the presence of Mn
atoms with slightly larger magnetic moment. Figure 3
compares the two extreme cases (1 and 12 ML); the increase in
splitting and accompanying binding energy shift at high
coverages are clearly visible.

The spectra shown in Fig. 1 have also been analyzed by
fitting. The main consideration here is that the linewidths of
the two doublet components are not required to be the same, as was
assumed previously in some analyses. This consideration comes
about from the spin-resolved photoemission studies of the
Fe 3s multiplets. In the Fe study, it was clearly shown
that the majority spin photoelectron peak is considerably
broader than that of the minority spin. This linewidth
difference between the two components is related to the lifetime of
the core hole and many-body effects. A couple of
configuration interaction satellites related to the majority spin are
known to be present at higher binding energies. The presence
of the satellites reduces the intensity of the majority spin
peak.

We fitted all the spectra with a Doniach-Sunjic lineshape
for the Ru 4s, and Gaussians for the Mn doublets and the
Mn 3p satellites. The choice of Gaussians for the latter is
motivated by the surface inhomogeneity that is expected. We
consistently obtained reasonable fits with the linewidth of the
majority spin peak being ~50% broader than the minority
spin peak, which has a width of ~3-3.4 eV. The results of
multiplet splitting and binding energy shift (minority spin
peak referenced to Ru 4s) as a function of coverage are
summarized in Fig. 4.

Fig. 4. A plot of multiplet splitting and binding energy shift (Mn 3s
minority spin-Ru 4s) vs. Mn coverage (ML).
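The fitting model described above can be sketched in a few lines. This is an illustrative sketch only, not the fitting code used for the spectra: it assumes the standard closed form of the Doniach-Sunjic lineshape and plain Gaussians, with the sign convention chosen so that the asymmetric tail points to higher binding energy. The peak positions are the monolayer values quoted in the text; all heights, the Ru 4s width `ru_gamma`, and the asymmetry `ru_alpha` are hypothetical placeholders.

```python
import math

def doniach_sunjic(e, e0, gamma, alpha):
    """Unnormalized Doniach-Sunjic lineshape on a binding-energy axis.

    e0 is the peak position, gamma the Lorentzian half-width and alpha the
    asymmetry parameter (0 <= alpha < 1). For alpha = 0 this reduces to a
    Lorentzian, gamma / ((e - e0)**2 + gamma**2).
    """
    x = (e0 - e) / gamma  # sign chosen so the tail extends to higher binding energy
    phase = math.pi * alpha / 2.0 + (1.0 - alpha) * math.atan(x)
    denom = (gamma * gamma * (1.0 + x * x)) ** ((1.0 - alpha) / 2.0)
    return math.gamma(1.0 - alpha) * math.cos(phase) / denom

def gaussian(e, e0, fwhm, height=1.0):
    """Gaussian peak specified by its full width at half maximum."""
    sigma = fwhm / (2.0 * math.sqrt(2.0 * math.log(2.0)))
    return height * math.exp(-0.5 * ((e - e0) / sigma) ** 2)

def model(e, p):
    """Ru 4s (Doniach-Sunjic) plus the Mn 3s doublet (two Gaussians)."""
    return (p["ru_height"] * doniach_sunjic(e, p["ru_e0"], p["ru_gamma"], p["ru_alpha"])
            + gaussian(e, p["min_e0"], p["min_fwhm"], p["min_height"])
            + gaussian(e, p["maj_e0"], p["maj_fwhm"], p["maj_height"]))

if __name__ == "__main__":
    params = {
        "ru_e0": 75.0, "ru_gamma": 0.5, "ru_alpha": 0.1, "ru_height": 1.0,  # shape values assumed
        "min_e0": 82.6, "min_fwhm": 3.0, "min_height": 0.6,                 # height assumed
        "maj_e0": 87.3, "maj_fwhm": 4.5, "maj_height": 0.3,                 # ~50% broader
    }
    for e in (70.0, 75.0, 82.6, 87.3, 92.0):
        print(f"{e:5.1f} eV  model = {model(e, params):.4f}")
```

In an actual fit these parameters would be varied by a least-squares routine; allowing `min_fwhm` and `maj_fwhm` to differ is the point made in the text.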

It is quite clear from these observations that the multiplet
splitting, and hence the average magnetic moment at the Mn
site in these overlayers, increases from 4.7 to 5.5 eV. It is
interesting to compare our results with some previous work.
McFeely et al. showed that α-Mn, which has a magnetic
moment of 3.5 μB, exhibits a splitting of ~4 eV, while ionic
Mn compounds, which have a magnetic moment of 5 μB,
exhibit an ~6 eV splitting. Heinrich et al. reported splittings
of 4.30 and 4.45 eV for the Laves phase and the expanded
structure of Mn(2 ML)/Ru(001), respectively. Our 2 ML
result is in good agreement with the result of Heinrich et al.
The much larger splitting observed in our work confirms that
the Mn overlayer remains pseudomorphic, consistent with
our other measurements. This observation means that the
atomic volume of Mn, on the average, increases with
coverage in these overlayers. By scaling with the results of
McFeely et al., we obtain magnetic moments of ~3.9 μB and
~4.6 μB at 1 and 12 ML coverage, respectively. It is apparent
that at a higher dosing rate (~1 ML/min), the thermally
unstable, expanded (1×1) pseudomorphic structure is
preserved at room temperature. This observation is in contrast to
a recent observation of Mn on Ir, where it was found that the
multiplet splitting of the expanded structure (~2 ML Mn,
pseudomorphic on Ir) is larger than that of a thicker layer (10
ML on Ir). The discrepancy between the Mn/Ir results and our
results may be related to the difference in the morphology of
the overlayers.
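The scaling step can be made concrete with a short calculation. The text does not state the exact scaling procedure, so the sketch below assumes a simple proportional scaling anchored to the ionic Mn reference quoted above (5 μB at ~6 eV); this assumption is used here only because it gives values close to the quoted ~3.9 and ~4.6 μB. The function name and defaults are illustrative.

```python
def moment_from_splitting(splitting_ev, ref_splitting_ev=6.0, ref_moment_mub=5.0):
    """Estimate the local moment (in Bohr magnetons) from the 3s splitting.

    Assumes the moment is proportional to the splitting, anchored to a
    reference point (default: ionic Mn, 5 mu_B at ~6 eV). This is an
    assumed procedure, not one stated in the text.
    """
    return splitting_ev * ref_moment_mub / ref_splitting_ev

if __name__ == "__main__":
    for coverage_ml, splitting_ev in ((1, 4.7), (12, 5.5)):
        moment = moment_from_splitting(splitting_ev)
        print(f"{coverage_ml:>2} ML: splitting {splitting_ev} eV -> ~{moment:.1f} mu_B")
```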

It should be noted that the observed binding energy shift
(Fig. 4) can be attributed to the effect of volume-induced
rehybridization. It is well known that the condensation of
free atoms of transition metals favors s→d charge
rehybridization. Thus, a lattice expansion would most likely induce a
3d→4s charge transfer in Mn. Since the binding energy
shift is primarily determined by the Coulomb integral, F(3s,
valence), and for Mn, F(3s,3d) > F(3s,4s), a 3d→4s
electron redistribution would lead to a positive binding energy
shift as observed. More work is needed to substantiate these
effects.

IV.  SUMMARY  AND  CONCLUSION 

We have presented Mn 3s multiplet splitting systematics
for a series of Mn overlayers adsorbed on Ru(001) at room
temperature. The trend of the multiplet splitting indicates an
increase in magnetic moment at the Mn site (from 3.9 to 4.6 μB)
as the coverage increases from 1 to 12 ML. We conclude
that this trend is associated with an increase of atomic
volume of Mn; this observation is consistent with
pseudomorphic growth. The binding energy shift with increasing
coverage is also noted and interpreted in terms of the volume
effect on 4s-3d charge rehybridization.
The temperature dependence of the uniaxial surface anisotropy, K_s(T), in ultrathin Fe films (4-10
atomic layers) grown on Cu(100) is determined by comparing the dipole and anisotropy energies at
the spin reorientation temperature for films of varying thickness. It is observed that the uniaxial
anisotropy has a weak temperature dependence compared to the bulk constants for Fe. The measured
exponent of Γ_s = 2.6(0.5) agrees well with the l(l+1)/2 law, which is obtained from a spin
fluctuation model where l = 2 at the surface. This shows that the spin reorientation transition can be
understood as being driven by thermal spin fluctuations. © 1996 American Vacuum Society.


Ferromagnetism in films of transition metals a few atomic
layers (ALs) thick has become well established since the
development of modern vacuum equipment has allowed the
preparation of high quality films. The most striking behavior
of some of these films, e.g., Fe grown on Cu(100) or
Ag(100), is the presence of magnetic anisotropies large
enough to overcome the dipole energy of the film and pull
the magnetization perpendicular to the plane of the film.
The presence of magnetic anisotropy is very important to the
magnetic ordering of these systems because it has been
shown that the critical behavior of the magnetism of very
thin films comes very close to that of a true two-dimensional
system. Since an isotropic two-dimensional system at finite
temperature has no long range order, it is very important to
understand how anisotropy enters into the problem. The
Hamiltonian for a Heisenberg spin system can be written as a
sum of exchange energy, spin-spin (dipole) coupling, and
spin-orbit terms (with no external field),

$$\mathcal{H} = \mathcal{H}_{\mathrm{ex}} + \mathcal{H}_{\mathrm{dip}} + \mathcal{H}_{\mathrm{so}}. \tag{1}$$

If the exchange term is taken to be isotropic, e.g., using the
Heisenberg model, then the anisotropy originates either in
the dipole or the spin-orbit terms.

For a constant saturation magnetization M_s, the classical
dipole energy/unit area of the film is obtained from

[Eq. (2) is not recoverable from the source text]    (2)

where a_0 is the lattice constant. The local field at the
point r_i exerted by a collection of point dipoles p_j at
r_j is given by

$$\mathbf{H}_i = \sum_{j \neq i} \left[ \frac{3\,\mathbf{r}_{ij}\,(\mathbf{p}_j \cdot \mathbf{r}_{ij})}{r_{ij}^{5}} - \frac{\mathbf{p}_j}{r_{ij}^{3}} \right], \qquad \mathbf{r}_{ij} = \mathbf{r}_i - \mathbf{r}_j. \tag{3}$$

Evaluating this sum for perpendicular and in-plane
magnetizations gives rise to a demagnetizing field 4πM_s in the plane
of the film. Therefore, the total demagnetizing energy of the
film (shape anisotropy energy) is

$$E_D = 2\pi M_s^{2}, \tag{4}$$

which always favors magnetization in the plane of the film. It
has been shown that the presence of a dipole term alone is
sufficient to cause spontaneous magnetization.

The spin-orbit term originates from the interaction of the
electrons with the crystal lattice. In bulk bcc-Fe, for
example, the spin-orbit term is anisotropic, favoring
magnetization in the [100]-like directions. The bulk anisotropy
energy/unit volume in Fe is about an order of magnitude
smaller than the shape anisotropy energy density in a thin
film (10⁶ vs 10⁷ erg/cm³) and, hence, is unlikely to be a
dominant contribution in Eq. (1). The breaking of symmetry
at a surface can also cause the spin-orbit interaction to be
anisotropic. In cubic materials the anisotropy vanishes to
fourth order; at the surface, however, the broken symmetry
brings the second order anisotropy term into play. Néel has
discussed the surface anisotropy with a phenomenological
theory. Using the classical form of the pairwise interactions
between neighboring atoms, he predicted that a strong
anisotropy perpendicular to the plane of the film is a natural
consequence of the reduced symmetry at, e.g., the (100) face
of a cubic lattice. The contribution of this term can be
included in the Hamiltonian as

$$\mathcal{H}_{\mathrm{so}} = -2K_s \sum_i (\mathbf{S}_i \cdot \mathbf{n})^{2}, \tag{5}$$

where K_s is a uniaxial surface anisotropy energy/unit area
and the factor of 2 includes both the top and bottom surfaces.
Here, a positive value of K_s will minimize the energy, thus
favoring magnetization perpendicular to the film. The energy
density associated with this surface anisotropy term then
becomes

$$E_{\mathrm{surf}} = \frac{2K_s}{d}, \tag{6}$$

with d being the film thickness. The first efforts to evaluate
the direction and strength of K_s were made by Néel using
typical elastic and magnetostriction constants. He predicted
that K_s at the (100) face would favor magnetization
perpendicular to the film and be on the order of 1 erg/cm². Thus, for
a film of thickness less than about 10 Å the surface
anisotropy energy/unit area, of order 1 erg/cm², would be large
enough to overcome the dipole energy. However, it is
necessary to do a fully relativistic band structure calculation in
order to predict from first principles the strength and sign of
the magnetocrystalline anisotropy in transition metals. Ab
initio electronic structure calculations which include the
spin-orbit interaction correctly predicted that the moments
of some thin transition metal films orient themselves
perpendicular to the film; however, it is clear that the difference
between perpendicular and in-plane energies is small, and
the calculations are pushing the limits of present
computational techniques.

a) Electronic mail: DPAPPAS@CABELL.VCU.EDU
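The estimate that a film thinner than about 10 Å should be perpendicularly magnetized can be checked with a short calculation. The sketch below assumes that the crossover occurs where a surface term of the form 2K_s/d balances the shape anisotropy 2πM_s², and it uses an approximate bulk-Fe saturation magnetization of 1700 emu/cm³, which is an assumed input rather than a value given in the text. Units are CGS throughout.

```python
import math

def shape_anisotropy_density(ms):
    """Shape anisotropy energy density 2*pi*M_s^2 in erg/cm^3 (M_s in emu/cm^3)."""
    return 2.0 * math.pi * ms ** 2

def critical_thickness_angstrom(ks, ms):
    """Thickness at which 2*K_s/d equals 2*pi*M_s^2, returned in angstroms.

    ks is the surface anisotropy in erg/cm^2 and ms the saturation
    magnetization in emu/cm^3. Below this thickness the (assumed) surface
    term exceeds the dipole term.
    """
    d_cm = ks / (math.pi * ms ** 2)
    return d_cm * 1.0e8

if __name__ == "__main__":
    ms_fe = 1700.0  # emu/cm^3, approximate bulk Fe value (assumption)
    ks = 1.0        # erg/cm^2, the order of magnitude quoted in the text
    print(f"2*pi*Ms^2 = {shape_anisotropy_density(ms_fe):.2e} erg/cm^3")
    print(f"critical thickness = {critical_thickness_angstrom(ks, ms_fe):.1f} angstrom")
```

With these inputs the crossover falls near 11 Å, in line with the "about 10 Å" figure quoted above.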

These theoretical considerations, in combination with
experimental studies that confirmed that at low thickness the
magnetization is perpendicular to the plane of the film, have
sparked much work over the past few years on this topic. In
particular, it has been observed that films thinner than 5-6
AL of Fe grown on Ag(100) are perpendicularly
magnetized, and become magnetized in the plane of the film above
this critical thickness. A similar spin reorientation behavior
for Fe grown on Cu(100) is observed with a critical thickness
of about 6 AL. These observations agree with expectations
from ferromagnetic resonance results, in which the uniaxial
anisotropy for Fe/Ag(100) was measured to be on the order
of 1 erg/cm². Contributions to K_s may also arise from
epitaxial strain; however, it is interesting to note that the spin
reorientation transition thickness is very close for Fe grown
on Ag(100) and Cu(100), even though the structures are
entirely different (bcc and fct, respectively).

The first indications of a temperature dependent spin
reorientation from perpendicular at low T to in-plane at high T
were observed by Jonker et al. using spin resolved
photoemission on Fe films grown on Ag(001), and were then
verified by Volkening et al. using the conversion electron
Mössbauer effect. Subsequent studies of Fe/Cu(100) grown
at low temperature showed that the temperature dependent
spin reorientation transition is reversible and occurs over a
narrow temperature and thickness range, with an
accompanying loss of magnetic signal as the reorientation occurs for
both Fe/Ag(100) and Fe/Cu(100). This loss of signal near
T_r has been investigated both from an experimental and a
theoretical perspective, and has been shown to be due to
the formation of microscopic stripe domains in the region of
thickness and temperature where the reorientation occurs. This is important
because it shows that the higher order anisotropy terms are
sufficient to maintain short range magnetic order.

Observation of a temperature dependent spin reorientation
leads to the consideration of the temperature dependence of
the anisotropy constants, with the conclusion that the surface
anisotropy energy, E_surf, decreases faster than the dipole
energy, E_D, as the temperature is increased. This behavior can
be expected because the origin of E_surf is the same as that of
the bulk magnetocrystalline anisotropy, i.e., the spin-orbit
interaction. Callen and Callen have discussed the
temperature dependence of the bulk anisotropy constants using a
spin fluctuation model. By allowing each spin to sample a
local temperature independent anisotropy K(0) and then
allowing the spins to deviate from their equilibrium positions,
the following relation is obtained:

$$\frac{K(T)}{K(0)} = \left[\frac{M_s(T)}{M_s(0)}\right]^{\Gamma}, \tag{7}$$

where l is the order of the anisotropy and Γ = l(l+1)/2.
Good agreement with experiment is obtained using this
expression for bulk bcc-Fe with an exponent of Γ_B = 10, where
l = 4 for a cubic system. For the surface, however, an
exponent of Γ_s = 3 is expected because the anisotropy comes in at
second order, i.e., l = 2. Spin fluctuation theory has been
used successfully to understand the spin reorientation
transition; however, most emphasis has been placed on
the nature of the domain formation at the reorientation
transition rather than the cause of the temperature dependence of
the uniaxial anisotropy. It has also never been shown
experimentally that spin fluctuations can explain this effect without
including thermal stress and band structure changes in the
problem. In the present work, the exponent Γ is obtained for
ultrathin Fe films grown on Cu(100) by comparing the
saturation magnetization and thickness at which the spin
reorientation occurs. This is achieved by finding the normalized
magnetization [M_s(T)]/[M_s(0)] at the spin reorientation
temperature T_r for a given thickness, and equating the
dipole energy density to the surface energy density at that
thickness. Good agreement with the spin fluctuation model is
found.
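The l(l+1)/2 law and the way an exponent can be read from thickness data can be sketched as follows. This is a minimal illustration, assuming that the dipole energy density scales as the square of the normalized magnetization m and the surface term as m^Γ divided by the thickness d, so that the reorientation thickness scales as m^(Γ-2). The function names and the numerical inputs in the usage lines are hypothetical and are not the measured data.

```python
import math

def callen_callen_exponent(l):
    """Exponent l(l+1)/2 for an anisotropy term of order l."""
    return l * (l + 1) // 2

def anisotropy_ratio(m_ratio, l):
    """K(T)/K(0) for a normalized magnetization m_ratio = M_s(T)/M_s(0)."""
    return m_ratio ** callen_callen_exponent(l)

def gamma_from_loglog_slope(slope):
    """Exponent implied by the slope of log(d) versus log(m).

    Assumes d is proportional to m**(gamma - 2), which follows from equating
    a dipole term proportional to m**2 with a surface term proportional to
    m**gamma / d.
    """
    return slope + 2.0

if __name__ == "__main__":
    print("bulk cubic (l = 4):", callen_callen_exponent(4))
    print("surface    (l = 2):", callen_callen_exponent(2))
    # Illustrative only: a 10% drop in magnetization and the resulting drop
    # in a second-order (surface) anisotropy.
    print(f"K(T)/K(0) at m = 0.9, l = 2: {anisotropy_ratio(0.9, 2):.3f}")
    # Illustrative only: a log-log slope of 0.6 would correspond to 2.6.
    print(f"exponent for slope 0.6: {gamma_from_loglog_slope(0.6):.1f}")
```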

In  order  to  find  [M^(r)]/[M^(0)]  it  is  necessary  to  fit  the 
dependence  of  with  respect  to  temperature  in  the  range 
where  the  transition  occurs  because  the  long  range  order  is 
reduced  at  Figure  1  illustrates  the  temperature  and 

thickness  dependence  of  the  magnetization  in  thin  Fe  films 
which  were  grown  at  125  K  and  annealed  while  the  second¬ 
ary  electron  polarization  was  monitored. The  reduction  of 
signal  at  7;^  is  demonstrated  in  the  center  panel  for  the  5.4 
AL  film  and  has  been  shown  to  be  due  to  the  formation  of 
microscopic  domains  in  this  temperature  range.^^"^^  The  4 
and  10  AL  films  show  only  remanent  magnetization  perpen¬ 
dicular  and  in-plane,  respectively.  These  results  match  with 
magneto-optical  Kerr  effect  (MOKE)  studies  on  Fe/Cu(100), 
where  the  border  between  perpendicular  and  in-plane  mag¬ 
netization  was  observed  at  ^6  AL.^^’^  The  relevant  aspect  of 
these  curves  for  this  work  is  the  decrease  of  the  magnetiza¬ 
tion  with  temperature  up  to  about  300  K.  In  the  spin  wave 
regime,  i.e.,  for  T<TJ2,  it  is  well  known  that  the  bulk  and 
surface  magnetization  can  be  fit  using  the  Bloch  law  7^^^ 
law.^’^"^’^^  For  a  thin  film,  however,  it  can  be  shown^^  that  the 
behavior  of  the  magnetization  is  more  appropriately  fit  with  a 
function  of  the  form 


$$\frac{M_s(T)}{M_s(0)} = 1 - kT\ln T. \qquad (8)$$
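As a rough illustration of how a prefactor k could be extracted from Eq. (8), the sketch below performs a one-parameter least-squares fit with the intercept fixed at M_s(T)/M_s(0) = 1. The data are synthetic: the value of k used to generate them, the temperature grid, and the use of the natural logarithm with T in kelvin are assumptions made only for illustration and are not taken from the measurements.

```python
import math


def m_normalized(temp_k, k):
    """Eq. (8): M_s(T)/M_s(0) = 1 - k*T*ln(T), with T in kelvin (assumed)."""
    return 1.0 - k * temp_k * math.log(temp_k)


def fit_prefactor(temps_k, m_norm):
    """Least-squares estimate of k with the intercept fixed at 1.

    With y = 1 - M_s(T)/M_s(0) and x = T*ln(T), Eq. (8) reads y = k*x,
    whose least-squares solution is k = sum(x*y) / sum(x*x).
    """
    xs = [t * math.log(t) for t in temps_k]
    ys = [1.0 - m for m in m_norm]
    return sum(x * y for x, y in zip(xs, ys)) / sum(x * x for x in xs)


# Hypothetical inputs (not the measured polarization data).
K_ASSUMED = 2.0e-4
temps = [50.0, 75.0, 100.0, 125.0, 150.0]
synthetic = [m_normalized(t, K_ASSUMED) for t in temps]

k_fit = fit_prefactor(temps, synthetic)
print(f"recovered k = {k_fit:.6e}")
```

Fixing the intercept reflects the normalization to M_s(0); a two-parameter fit would be the alternative if the zero-temperature polarization were not known independently.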


Fig. 1. Temperature dependence of the spin polarization of secondary
electrons from a 4 AL (top), 5.6 AL (center), and a 10 AL film (bottom panel).

The low temperature fit of the polarization data of the 5.6 AL
film to Eq. (8) is shown in Fig. 2. Since it has been shown
that the films are in a single domain state in this temperature
region, this fit can be used to determine the magnetization
at the spin reorientation temperature. A prefactor of
k = 0.000 68 is obtained for the 5.6 AL film. Using this fit, it
is now possible to determine the relation between M_s(T) and
K_u(T) for the first time from the data of Ref. 17. This can be
obtained by observing that at the transition temperature T_R
the dipole energy/unit volume of the film is equal to the
energy due to the uniaxial anisotropy. From Eq. (4)
and Eq. (6) for a given thickness we obtain
$$\ldots\; 2K_u(T_R) \;\ldots \qquad (9)$$

Fig. 2. Fit of the data from Fig. 1 to find the coefficient k of the equation
M_s(T) ≈ M_s(0)[1 − kT ln T] in the spin wave regime for the 5.6 AL film
(horizontal axis: T log T).

Fig. 3. Log-log plot of the thickness vs magnetization at the
spin reorientation transition temperature. The data in this plot are obtained
from the same data set as Ref. 17, where the corrected thickness calibration
(see Ref. 18) is used.


Here, the normalized anisotropy and magnetization can be
used, and the relation of Eq. (7) substituted for the value of
[K_u(T)]/[K_u(0)] to find

and thus

$$d_R \propto \left[\frac{M_s(T_R)}{M_s(0)}\right]^{\Gamma_u-2}. \qquad (11)$$


A log-log plot of d_R vs [M_s(T_R)]/[M_s(0)] will thus give a
slope of Γ_u − 2. Therefore, for each of seven films that
demonstrated a spin reorientation transition the normalized
magnetization is determined at T_R using Eq. (8). The results are
plotted in Fig. 3. The linear least squares fit shown gives a
slope of 0.6±0.5, and therefore Γ_u = 2.6±0.5.
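The step from a log-log slope to the exponent can be sketched numerically. The example below generates synthetic thickness values that follow a power law in the normalized magnetization, fits a straight line in log-log space, and adds 2 to the slope to recover the exponent. The seven magnetization ratios, the prefactor, and the exponent used to build the data are hypothetical and are not the values plotted in Fig. 3.

```python
import math


def loglog_slope(xs, ys):
    """Ordinary least-squares slope of log10(y) against log10(x)."""
    lx = [math.log10(x) for x in xs]
    ly = [math.log10(y) for y in ys]
    n = len(lx)
    mean_x = sum(lx) / n
    mean_y = sum(ly) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(lx, ly))
    sxx = sum((a - mean_x) ** 2 for a in lx)
    return sxy / sxx


# Hypothetical inputs: seven films obeying d = C * m**(Gamma - 2).
GAMMA_ASSUMED = 3.0
PREFACTOR_ASSUMED = 8.0  # arbitrary thickness scale
m_ratio = [0.50, 0.55, 0.60, 0.65, 0.70, 0.75, 0.80]
thickness = [PREFACTOR_ASSUMED * m ** (GAMMA_ASSUMED - 2.0) for m in m_ratio]

slope = loglog_slope(m_ratio, thickness)
gamma_est = slope + 2.0
print(f"slope = {slope:.3f}, exponent = {gamma_est:.3f}")
```

With noise-free synthetic data the assumed exponent is recovered exactly; with measured data the uncertainty of the slope carries over unchanged to the exponent, which is why the quoted error bar on the exponent equals that of the slope.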

In this work, the magnetic properties of Fe grown on
Cu(100) substrates at low temperature are studied. The
uniaxial anisotropy is observed to fall off faster than the bulk
anisotropy as the magnetization decreases. A reduced exponent
relative to the bulk is determined that agrees well with
the spin fluctuation model exponent of l(l+1)/2 = 3, where
l = 2 at the surface. This demonstrates that thermal fluctuations
have a much smaller effect on the anisotropy at the
surface than in the bulk of a cubic material because the next
closest energy minimum is 180° away versus only 90° in the
bulk. This shows that the magnetic anisotropy can be understood
using a thermal spin fluctuation model. However, this
does not rule out the possibility that thermal stress and/or
band structure changes affect the magnetic properties of the
films.
Distinguishing the close-packed hexagonal and face centered cubic phases
of the metallization of diamond by polarization-dependent extended
x-ray absorption fine structure
Polarization-dependent extended x-ray absorption fine structure (PD-EXAFS) measurements have
been made on a single-crystal 1000-Å-thick Co film deposited on (100) diamond. By comparing the
EXAFS signals corresponding to the in-plane and out-of-plane structure relative to the film plane,
we clearly determine that the film does establish the face centered cubic (fcc) phase. Distinguishing
between the hcp and fcc phases can thus be performed by qualitatively noting the polarization
dependence of the Fourier transform peak amplitudes corresponding to the third and fourth
coordination shells. This EXAFS study shows that the Co film studied here is unambiguously in the
fcc phase. The Fourier transform peaks corresponding to the first nearest neighbors, parallel and
perpendicular to the film, have been compared. Results indicate less than a 0.0075 Å difference
between in-plane and out-of-plane bondlengths, no significant change in coordination number, and
less than a 0.001 Å² difference in EXAFS Debye-Waller factors. All of these results indicate no
tetragonal distortion of the fcc Co crystal lattice. This technique should prove valuable in future
studies when trying to determine the crystallinity of thin films. © 1996 American Vacuum Society.


I.  INTRODUCTION

The interest in the theoretical and experimental investigations
of metastable phases and their unique characteristics
has steadily increased during the past few years. By using
lattice-matched substrates, the growth of a wide variety of
exotic phases of common materials has been performed.
Similarly, the application of these new materials in the field
of magnetoelectronics has also provided exciting results.

Among the 3d metals, Co has generated renewed interest
because, after having been stabilized in the body centered cubic
(bcc) and the thermodynamically stable close-packed hexagonal
(hcp) phases, recent results seemed to indicate that
it had now been grown in the metastable face centered cubic
(fcc) phase. Because of the unique behavior of metastable
fcc Co, a new field of applications seems to be very
promising.

II.  EXPERIMENT 

Polarization-dependent extended x-ray absorption fine
structure (PD-EXAFS) measurements have been made on a
single-crystal 1000-Å-thick Co film deposited on (100) diamond.
The growth (molecular beam epitaxy) and characterization
(reflection high energy electron diffraction, x-ray diffraction,
vibrating sample magnetometry, quantum
interference magnetometry, and ferromagnetic resonance) of
this film have been documented elsewhere. X-ray absorption
spectra were collected in electron yield mode utilizing a low
temperature total electron yield (TEY) detector that kept
the sample temperature at 90±3 K during the data acquisition.
The data were collected near the boiling point of liquid
nitrogen to reduce effects of any thermal Debye-Waller
terms that damp the amplitude of the EXAFS signal. A
reduction in the thermal Debye-Waller terms increases the
range of useful data that can be Fourier transformed, and
thus increases the resolution of the Fourier transform peaks.
Measurements were made on the National Institute of Standards
and Technology EXAFS beamline X23A2 at the National
Synchrotron Light Source (Brookhaven National
Laboratory, Upton, NY). This beamline is equipped with a
fixed exit double crystal monochromator that uses (311) Si
crystals. Due to the use of the (311) Si crystals, no harmonic
rejection was required. These experiments were performed
with stored beam currents of 190-230 mA and beam energy
of 2.54 GeV. The initial x-ray intensity was monitored with a
35.4-cm-long ionization chamber filled with free flowing
nitrogen gas.

The extended fine structure appearing above the Co K
absorption edge (7709 eV) was measured with the incident
x-ray beam at a glancing angle of incidence of less than 10°
with respect to the film surface and the x-ray beam's electric
field vector parallel and then perpendicular to the film surface.
Because the EXAFS signal has a cos² angular dependence
on the angle between the electric-field vector of
the incident radiation and the bond between the absorbing
and backscattering atom, these orientations allow preferential
sampling of the local in-plane and out-of-plane structure,
respectively. At least two successive energy scans were taken
at a minimum of three slightly different azimuthal angles so
as to differentiate between the EXAFS oscillations and the
Bragg reflections from the diamond substrate, which were
later deglitched. A different azimuthal angle was obtained
by rotating the sample about an axis normal to the sample by
approximately 2°.
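The effect of the cos² weighting can be illustrated with a short calculation of the polarization-weighted first-shell coordination number. The sketch below assumes the standard single-scattering K-edge expression N_eff = 3 Σ cos²θ_j, an ideal undistorted fcc lattice, and a film normal along [001]; the function names are hypothetical and multiple scattering is ignored.

```python
import math
from itertools import permutations, product


def fcc_first_shell():
    """Unit vectors to the 12 nearest neighbours of an fcc site (<110>)."""
    directions = set()
    for sx, sy in product((1, -1), repeat=2):
        for perm in permutations((sx, sy, 0)):
            directions.add(perm)
    norm = math.sqrt(2.0)
    return [tuple(c / norm for c in d) for d in sorted(directions)]


def effective_coordination(polarization, bonds):
    """N_eff = 3 * sum(cos^2 theta_j) for unit bond vectors `bonds`."""
    p_norm = math.sqrt(sum(c * c for c in polarization))
    total = 0.0
    for bond in bonds:
        cos_theta = sum(p * b for p, b in zip(polarization, bond)) / p_norm
        total += cos_theta ** 2
    return 3.0 * total


shell = fcc_first_shell()
n_in_plane = effective_coordination((1.0, 0.0, 0.0), shell)      # E in the film plane
n_out_of_plane = effective_coordination((0.0, 0.0, 1.0), shell)  # E along the film normal
print(len(shell), n_in_plane, n_out_of_plane)
```

For a cubic shell the weighted sum is the same for every polarization direction, so any measured in-plane versus out-of-plane difference in the first shell would point to a distortion of the lattice rather than to the geometry of the measurement.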

The experimentally measured x-ray absorption cross section,
as a function of the x-ray photon energy E, is proportional
to I_e/I_0 for TEY data, I_0 and I_e being the incident
x-ray and total electron yield intensities, respectively. The
raw TEY data corresponding to the average in-plane (solid
line) and out-of-plane (dashed line) structure surrounding the
Co atoms in this film are shown in Fig. 1. The pre-edge
background absorption μ_p is determined from a quadratic fit
to the data roughly 200-50 eV below the edge energy and
then extrapolated over the entire range of the spectrum. The
K-edge absorption μ is then isolated by subtracting μ_p from
the measured absorption. The smoothly varying atomic absorption
is determined by fitting the postedge data of μ with a cubic-spline
function. The absorption spectra are then step normalized by
dividing by the amplitude of the absorption edge. This value
is determined by making two linear fits to the data: typically
-200 to -50 eV and 100 to 300 eV relative to the edge. The
two fits are extrapolated to absorption threshold, and their
difference is taken to be the amplitude of the absorption
edge. The zero energy was assigned to be the first maximum
of the first derivative of the TEY data after the onset of
absorption. The normalized oscillations [χ(E)] beginning at
about 20-50 eV and extending hundreds of eV (and in this
case 1500 eV) above the edge are defined to be the extended
x-ray absorption fine-structure region of the XAFS. The normalized
oscillations are then converted to units of k (Å⁻¹) according to the
equation k = [2m(E − E_0)]^{1/2}(2π)/h. Four successive
normalized χ(k) scans were averaged and the standard deviation
of each data point was calculated. The normalized, k-weighted χ(k)
data corresponding to the average in-plane (solid line) and
out-of-plane (dashed line) structure surrounding the Co atoms
in this film are shown in Fig. 2. The data in Fig. 2, for k
values of 3.0-16.0 Å⁻¹, were Fourier transformed with a 0.5
Å⁻¹ modified Hanning window function to reduce truncation
ripple. The resulting Fourier transformed data corresponding
to the average in-plane (solid line) and out-of-plane
(dashed line) structure surrounding the Co atoms in this film
are shown in Fig. 3.

Fig. 1. Raw XAFS data for the average in-plane (solid line) and out-of-plane
(dashed line) local structure surrounding the Co atoms in a 1000-Å-thick
Co film deposited on diamond.
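The conversion from photoelectron energy to wave number is the free-electron relation k = [2m(E − E_0)]^{1/2}/ħ. A minimal sketch of that conversion follows; the function names are hypothetical, the physical constants are the CODATA 2018 values, and the energies passed in are examples chosen to match the k range quoted above rather than values read from the data files.

```python
import math

HBAR = 1.054571817e-34         # reduced Planck constant, J s
ELECTRON_MASS = 9.1093837015e-31  # kg
EV = 1.602176634e-19           # joules per electron volt


def wavenumber_inv_angstrom(energy_above_edge_ev):
    """Photoelectron wave number k (1/angstrom) for E - E_0 given in eV."""
    if energy_above_edge_ev < 0.0:
        raise ValueError("k is only defined above the edge (E >= E_0)")
    k_per_metre = math.sqrt(2.0 * ELECTRON_MASS * energy_above_edge_ev * EV) / HBAR
    return k_per_metre * 1.0e-10


def energy_above_edge_ev(k_inv_angstrom):
    """Inverse relation: E - E_0 in eV for k given in 1/angstrom."""
    k_per_metre = k_inv_angstrom * 1.0e10
    return (HBAR * k_per_metre) ** 2 / (2.0 * ELECTRON_MASS) / EV


# Energy window corresponding to the transform range 3.0-16.0 1/angstrom.
e_low = energy_above_edge_ev(3.0)
e_high = energy_above_edge_ev(16.0)
print(f"k = 3.0 to 16.0 1/A spans about {e_low:.0f} to {e_high:.0f} eV above the edge")
```

The same relation is often written as k ≈ 0.5123 (E − E_0)^{1/2} with E in eV and k in Å⁻¹, which is convenient for quick checks of a transform range.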

Inspection of Fig. 3 shows very little anisotropy between
the in-plane and out-of-plane structure. Specifically, there is
very little difference between the in-plane and out-of-plane
amplitudes for the third and fourth peaks of the Fourier
transform (3.5-5.0 Å). These peak amplitudes arise primarily
from collinear multiple scattering of the outgoing
and backscattered photoelectron wave. This focusing
effect increases these peak amplitudes above that expected
for a single, unfocused outgoing and backscattering event.
Amplification of these peaks is typical of a close-packed
crystal phase.

Fig. 2. Normalized k-weighted χ(k) data corresponding to the average in-plane
(solid line) and out-of-plane (dashed line) structure surrounding the Co
atoms in a 1000-Å-thick Co film deposited on diamond.

To illustrate this, we have used the theoretical codes
developed by Rehr and co-workers to simulate the Fourier
transforms of Co in bcc, fcc, and hcp structures. Further, we
have also utilized the ability of the FEFF6 code to incorporate
(1) the orientation of the electric field vector of the
incoming x-ray beam relative to the film surface and (2) the
temperature of the sample during data acquisition. The
results of the theoretical simulations for the average in-plane
and out-of-plane local environments of Co in bcc, fcc, and
hcp (relative to the basal plane) structures at 90 K are shown
in Figs. 4(a)-4(c). These figures resulted from the theoretically
generated data being Fourier transformed in
the same manner as the experimental data described
above. Due to the cubic symmetry of the bcc and fcc crystal
structures, no polarization dependence is seen in the theoretically
generated Fourier transforms. Comparison of
polarization-dependent experimental data in Fig. 3 to the
theoretical data in Figs. 4(a)-4(c) unequivocally shows that
the Co film studied here exists in the fcc structure.

The Fourier transform data representing the in-plane and
out-of-plane first coordination shell (1.52-2.84 Å) were
filtered and back-transformed with a modified Hanning window
function of 0.25 Å to reduce truncation ripple. These
filtered data were compared to each other using the EXAFS
ratio analysis technique to determine any relative differences
in the coordination number, radial distance, or EXAFS
Debye-Waller factors. Results from this analysis indicate
less than a 0.0075 Å difference between in-plane and
out-of-plane bondlengths, less than a difference of 1.3 atoms in
coordination number, and less than a 0.001 Å² difference in
EXAFS Debye-Waller factors. These differences represent a
doubling of the residual during the fitting procedure and are
typically much larger than the magnitude of experimental
error. These values are much smaller than typical differences
from EXAFS experiments because the EXAFS spectra for
two different polarizations are being compared to each other
and not to another standard. This way, the errors that would
be introduced by chemical shifts and different background
removals are eliminated. Therefore, within the limits of this
EXAFS experiment, no tetragonal distortion of the fcc Co
crystal lattice is seen. This result indicates that the majority
of the 1000-Å-thick film is relaxed and not anisotropically
distorted by pseudomorphic growth.

Fig. 3. Fourier transformed experimental data with the x-ray beam's electric
field vector in the plane of the film (solid line) and perpendicular to the
plane of the film (dashed line).

Fig. 4. (a) Fourier transform of the theoretically generated bcc crystal
structure Co data; (b) Fourier transform of the theoretically generated fcc
crystal structure Co data; (c) Fourier transform of the theoretically generated
hcp crystal structure Co data parallel (solid line) and perpendicular (dashed
line) to the basal plane.
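The amplitude part of a ratio analysis can be sketched as follows. For two filtered single-shell signals with the same backscatterer and bond length, the logarithm of the amplitude ratio is linear in k², ln[A_a(k)/A_b(k)] = ln(N_a/N_b) − 2k²(σ_a² − σ_b²), so a straight-line fit yields the coordination-number ratio and the difference in Debye-Waller factors. The code below is only an illustration under those assumptions: the amplitudes are synthetic, the common envelope 1/k is arbitrary, and all names and parameter values are hypothetical.

```python
import math


def linear_fit(xs, ys):
    """Ordinary least-squares slope and intercept of y against x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def ratio_method(ks, amp_a, amp_b):
    """Return (N_a/N_b, sigma_a^2 - sigma_b^2) from two amplitude envelopes."""
    k_squared = [k * k for k in ks]
    log_ratio = [math.log(a / b) for a, b in zip(amp_a, amp_b)]
    slope, intercept = linear_fit(k_squared, log_ratio)
    return math.exp(intercept), -slope / 2.0


def synthetic_amplitude(k, n_coord, sigma_sq):
    """Toy envelope: N * exp(-2 sigma^2 k^2) times a common 1/k factor."""
    return n_coord * math.exp(-2.0 * sigma_sq * k * k) / k


# Hypothetical in-plane (a) and out-of-plane (b) shells.
ks = [3.0 + 0.5 * i for i in range(27)]  # 3.0 to 16.0 1/angstrom
amp_in = [synthetic_amplitude(k, 12.0, 0.0040) for k in ks]
amp_out = [synthetic_amplitude(k, 12.0, 0.0035) for k in ks]

n_ratio, delta_sigma_sq = ratio_method(ks, amp_in, amp_out)
print(f"N_a/N_b = {n_ratio:.4f}, delta sigma^2 = {delta_sigma_sq:.5f} A^2")
```

Because the common envelope cancels in the ratio, no theoretical or external standard enters the comparison, which is the reason the polarization-to-polarization comparison can resolve small differences.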

III.  SUMMARY 

We have employed the polarization-dependent properties
of EXAFS to probe the average local environment in a
1000-Å-thick Co film deposited on diamond. By qualitatively
noting the polarization dependence of the third and fourth
Fourier transform peak amplitudes, this EXAFS study shows that
the Co film studied here is unambiguously in the fcc phase.
The Fourier transform peaks corresponding to the first nearest
neighbors, parallel and perpendicular to the film, have
been compared. Results indicate no tetragonal distortion of
the fcc Co crystal lattice, therefore indicating that the majority
of the film is relaxed and not pseudomorphic. This technique
should prove valuable in future studies when trying to
determine the crystallinity of thin films.
constant rate of loading as the
sample moves at a constant rate of
travel. Force is increased until the
coating catastrophically tears.

The force at which tearing occurs,
up to the substrate crushing force,
is a measure of relative
adherence, for samples of similar
character and thickness.

Typically,  highly  adherent 
coatings  display  micro-cracks 
normal  to  the  diamond  travel 
direction.  These  occur  when  the 
diamond  drag  force  exceeds  the 
tensile  strength  of  the  coating.  As 
the  force  increases,  the  cracks 
become  more  tightly  spaced  until 
the  coating  tears  or  the  substrate 
mechanically  fails. 

It  is  believed  that  the  relative 
value  of  coating  strength  and 
elasticity  will  eventually  be 
deduced  for  coatings  of  similar 
friction  coefficient. 


TOOLING 

A  breadbox  sized  enclosure 
contains  an  IBM  compatible 
computer,  high  speed  analog  to 
digital  converter,  power  supplies, 
signal  processing,  force 
application  and  control  systems. 

The  scratch  assembly,  shown  in 
the  photo  insert,  contains  the 
movable  diamond  stylus  and  the 
constant  rate  of  travel  sample 
stage. 
An acoustic transducer, mounted
on the stylus, is used to identify
the point of initiation of
microcracking and to sense the coating or
substrate failure. Acoustic energy,
applied diamond force, the force
required to move the sample, and
effective friction between the
diamond and the coating are
plotted as a function of travel
distance.